SlideShare uma empresa Scribd logo
1 de 24
Baixar para ler offline
Business Analytics For All
BA4All – Insight Session April 29th 2014
Guy Van der Sande – Vincent Greslebin
Unlocking the Value within
the Data Vault
Fifthplay : Architecture
Smart Homes Platform Data Warehouse
Data
Vault
Data
mart
Gebruikers
Marketing &
SSC
Utility Portal
ETL
Dag - 1
- Controle data kwaliteit
- Toepassing business
rules
- Aggregatie
- Filtering
Facility Portal
Fifthplay : Why Data Vault ?
• Pattern based design which allows agility to take place
• Easy to add new data sources making it future proof. This allows
Fifthplay to stay innovative
• Large volume of data
• Build up history that is not available in the operational system
• Possibility of performing analysis on raw data (cfr quality checks)
• Development speed (Pilot : 37 working days)
Data Vault ?
Data Vault ?
Data Vault ?
The Data Vault is a detail oriented, historical tracking and
uniquely linked set of normalized tables that support one or
more functional areas of business. It is a hybrid approach
encompassing the best of breed between 3rd normal form
(3NF) and star schema.
The design is flexible, scalable, consistent and adaptable to
the needs of the enterprise.
Standard architecture
 The centerpiece of the Enterprise Data Warehouse
 History is build-up
 Granularity as ‘detailed’ as possible
 No use of business rules
 Use of business keys that are horizontal in nature and
provide visibility across lines of business
 A new layer which has the benefits of the RAW Data
Vault, but with the business data embedded
 In the Business Data Vault the data has been altered,
cleansed and changed to meet the business rules
 Downstream of the raw data vault
 Starting point for Master Data Management
 Metadata is absolutely vital
The Data Vault Model exists of 3 basic entity types
• Hubs : contains a unique list of business keys
• Links : associations across or between business keys
• Satellites : holds descriptive data (about the business key) over time
Component parts of the Data Vault model
• Represents a Core Business Concept
• Is formed around the Business Key of this concept
• Is established the first time a new instance of that
business key is introduced
• Must be 1:1 with a single instance
• Consists of the business key, a sequence id, a load
date/time stamp and a record source.
Component parts - Hub
• Represents a natural business relationship between business keys
• Is established the first time this new unique association is presented
• Can represent an association between several Hubs and sometimes other
Links.
• maintains a 1:1 relationship with the unique and specific business defined
association between that set of keys.
• Consists of the sequence ids from the Hubs and Links
• Contains sequence id, a load date/time stamp and a
• record source.
Component parts - Link
• The Satellite contains the descriptive information
(context) for a business key.
• A Satellite can only describe one key (Hub or a Link).
• The Satellite is the only construct that manages time
slice data (data warehouse historical tracking of
values over time).
Component parts - Satellite
Fact
Dimension 1
Dimension 3
Dimension 2
Dimension 4
Data Vault – Why ?
Fact
Dimension 1
Dimension 3
Dimension 2
Dimension 4
Data Vault – Why ?
Fact
Dimension 1
Dimension 3
Dimension 2
Dimension 4
Fact
Dimension 5
Data Vault – Why ?
Data Vault – Why ?
DV
DM
DV
DM
S
S
S S
H
S
L
H
H
H
Data Vault – Why ?
DV
DM
S
S
S S
H
S
L
H
H
H
Dimension Fact
Data Vault – Why ?
Data Vault – How did we do
it with Fifthplay ?
HubServicePartner HubCustomer HubHomeAreaManager
HubSmartPlug
HubDeviceGroup
HubEnergyLogType
LinkServicePartnerCustomer LinkCustomerHomeAreaManager
LinkHomeAreaManagerSmartPlug
LinkCustomerDeviceGroup
LinkDeviceGroupSmartPlug
LinkDeviceSubGroupSmartPlug
LinkSmartPlugApplianceEnergyLogT
ype
HubCityLinkHomeAreaManagerCity
HubCountry
LinkCountryCity
HubSatServicePartner
HubSatCustomer
HubSatHomeAreaManager
LinkSatHomeAreaManagerCity
LinkSatCountryCity
HubSatCountry
HubSatDeviceGroup
HubSatSmartPlug
HubAppliance
HubSatAppliance
LinkSatSmartPlugApplianceEnergyL
ogType
HubSatHomeAreaManagerAddress
SeqServicePartnerPK
ServicePartnerID
LoadDateTime
RecordSource
SeqCustomerPK
CustomerID
LoadDateTime
RecordSource
SeqHomeAreaManagerPK
HomeAreaManagerNumber
LoadDateTime
RecordSource
SeqSmartPlugPK
SmartPlugID
LoadDateTime
RecordSource
SeqDeviceGroupPK
DeviceGroupID
LoadDateTime
RecordSource
SeqEnergyLogTypePK
EnergyLogName
LoadDateTime
RecordSource
SeqServicePartnerCustomerPK
SeqCustomer
LoadDateTime
RecordSource
SeqServicePartner
SeqCustomerHomeAreaMan
ager
PK
SeqCustomer
LoadDateTime
RecordSource
SeqHomeAreaManager
SeqHomeAreaManagerSmar
tPlug
PK
SeqHomeAreaManager
LoadDateTime
RecordSource
SeqSmartPlug
SeqCustomerDeviceGroupPK
SeqCustomer
LoadDateTime
RecordSource
SeqDeviceGroup
LoadDateTime
SeqDeviceGroupSmartPlugPK
LoadDateTime
RecordSource
SeqDeviceGroup
SeqDeviceSubGroupSmartPl
ug
PK
LoadDateTime
RecordSource
SeqDeviceGroup
SeqSmartPlug
SeqSmartPlug
SeqSmartPlugApplianceEner
gyLogType
PK
SeqEnergyLogType
LoadDateTime
RecordSource
SeqSmartPlug
SeqCityPK
CityPostalCode
LoadDateTime
RecordSource
CityName
SeqHomeAreaManagerCityPK
SeqCity
LoadDateTime
RecordSource
SeqHomeAreaManager
SeqCountryPK
CountryIsoCode
LoadDateTime
RecordSource
SeqCountryCityPK
SeqCity
LoadDateTime
RecordSource
SeqCountry
SeqSatServicePartnerPK
SeqServicePartner
LoadDateTime
RecordSource
LoadEndDateTime
ServicePartnerCode
ServiucePartnerEmail
ServicePartnerCustomerCon
tact
SeqSatCustomerPK
SeqCustomer
LoadDateTime
RecordSource
LoadEndDateTime
CustomerEmail
CustomerFirstName
CustomerLastName
CustomerLanguage
SeqSatHomeAreaManagerPK
SeqHomeAreaManager
LoadDateTime
RecordSource
LoadEndDateTime
HomeAreaManagerMode
HomeAreaManagerArchitec
ture
SeqSatHomeAreaManagerCi
ty
PK
SeqHomeAreaManagerCity
LoadDateTime
RecordSource
LoadEndDateTime
HAMCityAddressLine1
HAMCityPhoneNumber
HAMCityAddressLine2
SeqSatCountryCityPK
SeqCountryCity
LoadDateTime
RecordSource
LoadEndDateTime
CountryCityRegion
CountryCityState
SeqSatCountryPK
SeqCountry
LoadDateTime
RecordSource
LoadEndDateTime
CountryName
SeqSatDeviceGroupPK
SeqDeviceGroup
LoadDateTime
RecordSource
LoadEndDateTime
DeviceGroupName
DeviceGroupDescription
SeqSatSmartPlugPK
SeqSmartPlug
LoadDateTime
RecordSource
LoadEndDateTime
SmartPlugDisplayName
SmartPlugManufacturer
SmartPlugModel
SmartPlugIsGenerator
SmartPlugHasChildren
SmartPlugHasSchedule
SeqAppliancePK
ApplianceID
LoadDateTime
RecordSource
SeqSatAppliancePK
SeqAppliance
LoadDateTime
RecordSource
LoadEndDateTime
ApplianceCategory
SeqSatSmartPlugApplianceE
nergyLogType
PK
SeqSmartPlugApplianceEner
gyLogType
LoadDateTime
RecordSource
LoadEndDateTime
EnergyLogDateTime
EnergyLogValue
SeqAppliance
EnergyLogValueUnit
Legend
Hub
Link
Satellite
ServicePartnerWebPage
SeqSatHomeAreaManagerA
ddress
PK
SeqHomeAreaManager
LoadDateTime
RecordSource
LoadEndDateTime
HomeAreaManagerAddress
Line1
HomeAreaManagerPostalCo
de
HomeAreaManagerAddress
Line2
HomeAreaManagerCityNam
e
HomeAreaManagerProvince
HomeAreaManagerState
HomeAreaManagerCountry
Fifthplay Raw Data Vault Architecture
Fifthplay Raw Data Vault Architecture
HubSmartPlug
HubEnergyLogType
LinkSmartPlugApplianceEnergyLogT
ype
HubAppliance
HubSatAppliance
LinkSatSmartPlugApplianceEnergyL
ogType
SeqSmartPlugPK
SmartPlugID
LoadDateTime
RecordSource
SeqEnergyLogTypePK
EnergyLogName
LoadDateTime
RecordSource
SeqSmartPlugApplianceEner
gyLogType
PK
SeqEnergyLogType
LoadDateTime
RecordSource
SeqSmartPlug
SeqAppliancePK
ApplianceID
LoadDateTime
RecordSource
SeqSatAppliancePK
SeqAppliance
LoadDateTime
RecordSource
LoadEndDateTime
ApplianceCategory
SeqSatSmartPlugApplianceE
nergyLogType
PK
SeqSmartPlugApplianceEner
gyLogType
LoadDateTime
RecordSource
LoadEndDateTime
EnergyLogDateTime
EnergyLogValue
SeqAppliance
EnergyLogValueUnit
Legend
Hub
Link
Satellite
Fifthplay : Data Vault – lessons learned
• Don’t stop with data vault; A combination with classic
dimensional Kimball-methodology is advised
• Be creative; get out of your comfort zone, dare to walk
the thine line
• While setting up the data vault, operational issues
where discovered early in the process
• ETL-development goes very quickly because of the
typical pattern design of the data vault;
Data Vault – What’s next ?
2013 : Dan Linstedt
releases Data Vault
2.0 specs
History and what’s next ?
Relational modeling
(E.F.Codd)
Bill Inmon began
discussing Data
Warehousing
• Barry Devlin and
Dr Kimball
release
“Business Data
Warehouse”
• Bill Inmon
popularizes Data
Warehousing
• Dr Kimball
popularizes Star
Schema
Dan Linstedt begins
R&D on Data Vault
Modeling
Dan Linstedt
releases first 5
articles on Data
Vault Modeling
2012 : Dan Linstedt
announces Data
Vault 2.0
1960 1970 1980 1990 2000 2010
Thank You
BI@USGICT.be
http://www.linkedin.com/company/usgprofessionalsbe
+32 3 231 94 84 www.usgict.be
https://www.facebook.com/usgictbe
“In the Data Warehousing/BI world, we
should store the data as it stands on the
source system and interpret it on the
way out to the data marts. This is
absolutely critical to remember.”
Dan Linstedt
@BICC_at_USG

Mais conteúdo relacionado

Mais procurados

Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...
Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...
Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...Databricks
 
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)Roland Bouman
 
Lessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudLessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudDataWorks Summit
 
Manage tracability with Apache Atlas, a flexible metadata repository
Manage tracability with Apache Atlas, a flexible metadata repositoryManage tracability with Apache Atlas, a flexible metadata repository
Manage tracability with Apache Atlas, a flexible metadata repositorySynaltic Group
 
Hybrid Data Architecture: Integrating Hadoop with a Data Warehouse
Hybrid Data Architecture: Integrating Hadoop with a Data WarehouseHybrid Data Architecture: Integrating Hadoop with a Data Warehouse
Hybrid Data Architecture: Integrating Hadoop with a Data WarehouseDataWorks Summit
 
Munich Re: Driving a Big Data Transformation
Munich Re: Driving a Big Data TransformationMunich Re: Driving a Big Data Transformation
Munich Re: Driving a Big Data TransformationDataWorks Summit
 
Data in Motion - Data at Rest - Hortonworks a Modern Architecture
Data in Motion - Data at Rest - Hortonworks a Modern ArchitectureData in Motion - Data at Rest - Hortonworks a Modern Architecture
Data in Motion - Data at Rest - Hortonworks a Modern ArchitectureMats Johansson
 
Migrating legacy ERP data into Hadoop
Migrating legacy ERP data into HadoopMigrating legacy ERP data into Hadoop
Migrating legacy ERP data into HadoopDataWorks Summit
 
Data lake analytics for the admin
Data lake analytics for the adminData lake analytics for the admin
Data lake analytics for the adminTillmann Eitelberg
 
Solving Performance Problems on Hadoop
Solving Performance Problems on HadoopSolving Performance Problems on Hadoop
Solving Performance Problems on HadoopTyler Mitchell
 
Securing and governing a multi-tenant data lake within the financial industry
Securing and governing a multi-tenant data lake within the financial industrySecuring and governing a multi-tenant data lake within the financial industry
Securing and governing a multi-tenant data lake within the financial industryDataWorks Summit
 
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudBring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudDataWorks Summit/Hadoop Summit
 
The Rise of Big Data Governance: Insight on this Emerging Trend from Active O...
The Rise of Big Data Governance: Insight on this Emerging Trend from Active O...The Rise of Big Data Governance: Insight on this Emerging Trend from Active O...
The Rise of Big Data Governance: Insight on this Emerging Trend from Active O...DataWorks Summit
 
Reaching scale limits on a Hadoop platform: issues and errors created by spee...
Reaching scale limits on a Hadoop platform: issues and errors created by spee...Reaching scale limits on a Hadoop platform: issues and errors created by spee...
Reaching scale limits on a Hadoop platform: issues and errors created by spee...DataWorks Summit
 
Big data at United Airlines
Big data at United AirlinesBig data at United Airlines
Big data at United AirlinesDataWorks Summit
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesDenodo
 
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...DataWorks Summit
 
TopNotch: Systematically Quality Controlling Big Data by David Durst
TopNotch: Systematically Quality Controlling Big Data by David DurstTopNotch: Systematically Quality Controlling Big Data by David Durst
TopNotch: Systematically Quality Controlling Big Data by David DurstSpark Summit
 

Mais procurados (20)

Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...
Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...
Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...
 
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
Moving and Transforming Data with Pentaho Data Integration 5.0 CE (aka Kettle)
 
Lessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudLessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloud
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Manage tracability with Apache Atlas, a flexible metadata repository
Manage tracability with Apache Atlas, a flexible metadata repositoryManage tracability with Apache Atlas, a flexible metadata repository
Manage tracability with Apache Atlas, a flexible metadata repository
 
Hybrid Data Architecture: Integrating Hadoop with a Data Warehouse
Hybrid Data Architecture: Integrating Hadoop with a Data WarehouseHybrid Data Architecture: Integrating Hadoop with a Data Warehouse
Hybrid Data Architecture: Integrating Hadoop with a Data Warehouse
 
Munich Re: Driving a Big Data Transformation
Munich Re: Driving a Big Data TransformationMunich Re: Driving a Big Data Transformation
Munich Re: Driving a Big Data Transformation
 
Data in Motion - Data at Rest - Hortonworks a Modern Architecture
Data in Motion - Data at Rest - Hortonworks a Modern ArchitectureData in Motion - Data at Rest - Hortonworks a Modern Architecture
Data in Motion - Data at Rest - Hortonworks a Modern Architecture
 
Migrating legacy ERP data into Hadoop
Migrating legacy ERP data into HadoopMigrating legacy ERP data into Hadoop
Migrating legacy ERP data into Hadoop
 
Data lake analytics for the admin
Data lake analytics for the adminData lake analytics for the admin
Data lake analytics for the admin
 
Solving Performance Problems on Hadoop
Solving Performance Problems on HadoopSolving Performance Problems on Hadoop
Solving Performance Problems on Hadoop
 
Securing and governing a multi-tenant data lake within the financial industry
Securing and governing a multi-tenant data lake within the financial industrySecuring and governing a multi-tenant data lake within the financial industry
Securing and governing a multi-tenant data lake within the financial industry
 
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudBring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
 
Using Hadoop for Cognitive Analytics
Using Hadoop for Cognitive AnalyticsUsing Hadoop for Cognitive Analytics
Using Hadoop for Cognitive Analytics
 
The Rise of Big Data Governance: Insight on this Emerging Trend from Active O...
The Rise of Big Data Governance: Insight on this Emerging Trend from Active O...The Rise of Big Data Governance: Insight on this Emerging Trend from Active O...
The Rise of Big Data Governance: Insight on this Emerging Trend from Active O...
 
Reaching scale limits on a Hadoop platform: issues and errors created by spee...
Reaching scale limits on a Hadoop platform: issues and errors created by spee...Reaching scale limits on a Hadoop platform: issues and errors created by spee...
Reaching scale limits on a Hadoop platform: issues and errors created by spee...
 
Big data at United Airlines
Big data at United AirlinesBig data at United Airlines
Big data at United Airlines
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & Bénéfices
 
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
 
TopNotch: Systematically Quality Controlling Big Data by David Durst
TopNotch: Systematically Quality Controlling Big Data by David DurstTopNotch: Systematically Quality Controlling Big Data by David Durst
TopNotch: Systematically Quality Controlling Big Data by David Durst
 

Destaque

Data Vault: What is it? Where does it fit? SQL Saturday #249
Data Vault: What is it?  Where does it fit?  SQL Saturday #249Data Vault: What is it?  Where does it fit?  SQL Saturday #249
Data Vault: What is it? Where does it fit? SQL Saturday #249Daniel Upton
 
INDICADORES SOCIALES DE GESTIÓN COMUNITARIA
INDICADORES SOCIALES DE GESTIÓN COMUNITARIA INDICADORES SOCIALES DE GESTIÓN COMUNITARIA
INDICADORES SOCIALES DE GESTIÓN COMUNITARIA ESCOPROL
 
Muestra Con Tipo De Secc Final
Muestra Con  Tipo De Secc FinalMuestra Con  Tipo De Secc Final
Muestra Con Tipo De Secc Finalguest2e28de
 
Collision Management-June-2014-Full Magazine Article
Collision Management-June-2014-Full Magazine ArticleCollision Management-June-2014-Full Magazine Article
Collision Management-June-2014-Full Magazine ArticleVenessa Di Vito
 
Informatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyInformatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyNeil Swainston
 
Development of Indian Coast
Development of Indian CoastDevelopment of Indian Coast
Development of Indian CoastAmber Bhaumik
 
APROFE NYC Survey Results
APROFE NYC Survey ResultsAPROFE NYC Survey Results
APROFE NYC Survey Resultsmvanbusk
 
7 Trends to Watch in 2012
7 Trends to Watch in 20127 Trends to Watch in 2012
7 Trends to Watch in 2012HavasPR
 
Punto vive digital plus
Punto vive digital plusPunto vive digital plus
Punto vive digital plusandres pabon
 
FAQ de firma electrónica de Security Data
FAQ de firma electrónica de Security DataFAQ de firma electrónica de Security Data
FAQ de firma electrónica de Security DataSecurity Data
 
Multilenguaje en Drupal 8 y la iniciativa D8MI
Multilenguaje en Drupal 8 y la iniciativa D8MIMultilenguaje en Drupal 8 y la iniciativa D8MI
Multilenguaje en Drupal 8 y la iniciativa D8MIChristian López Espínola
 
Emotional Webdesign am Internet Briefing 2013 in Bern
Emotional Webdesign am Internet Briefing 2013 in BernEmotional Webdesign am Internet Briefing 2013 in Bern
Emotional Webdesign am Internet Briefing 2013 in BernRemy Blaettler
 
Sameer Desai speaks about Cannes at the RAMA meet
Sameer Desai speaks about Cannes at the RAMA meetSameer Desai speaks about Cannes at the RAMA meet
Sameer Desai speaks about Cannes at the RAMA meetseagulladvertising
 
Ultimo intercambio de regalos 2002
Ultimo intercambio de regalos 2002Ultimo intercambio de regalos 2002
Ultimo intercambio de regalos 2002ZAPHIRY BONETTO
 
Sustainability Assessment Report – Klean Kanteen
Sustainability Assessment Report – Klean Kanteen Sustainability Assessment Report – Klean Kanteen
Sustainability Assessment Report – Klean Kanteen Connie Kwan
 
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...Roland Bouman
 

Destaque (20)

Data Vault: What is it? Where does it fit? SQL Saturday #249
Data Vault: What is it?  Where does it fit?  SQL Saturday #249Data Vault: What is it?  Where does it fit?  SQL Saturday #249
Data Vault: What is it? Where does it fit? SQL Saturday #249
 
INDICADORES SOCIALES DE GESTIÓN COMUNITARIA
INDICADORES SOCIALES DE GESTIÓN COMUNITARIA INDICADORES SOCIALES DE GESTIÓN COMUNITARIA
INDICADORES SOCIALES DE GESTIÓN COMUNITARIA
 
Muestra Con Tipo De Secc Final
Muestra Con  Tipo De Secc FinalMuestra Con  Tipo De Secc Final
Muestra Con Tipo De Secc Final
 
Collision Management-June-2014-Full Magazine Article
Collision Management-June-2014-Full Magazine ArticleCollision Management-June-2014-Full Magazine Article
Collision Management-June-2014-Full Magazine Article
 
Informatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyInformatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems Biology
 
Development of Indian Coast
Development of Indian CoastDevelopment of Indian Coast
Development of Indian Coast
 
APROFE NYC Survey Results
APROFE NYC Survey ResultsAPROFE NYC Survey Results
APROFE NYC Survey Results
 
7 Trends to Watch in 2012
7 Trends to Watch in 20127 Trends to Watch in 2012
7 Trends to Watch in 2012
 
Punto vive digital plus
Punto vive digital plusPunto vive digital plus
Punto vive digital plus
 
FAQ de firma electrónica de Security Data
FAQ de firma electrónica de Security DataFAQ de firma electrónica de Security Data
FAQ de firma electrónica de Security Data
 
Multilenguaje en Drupal 8 y la iniciativa D8MI
Multilenguaje en Drupal 8 y la iniciativa D8MIMultilenguaje en Drupal 8 y la iniciativa D8MI
Multilenguaje en Drupal 8 y la iniciativa D8MI
 
Ttt 1&2
Ttt 1&2Ttt 1&2
Ttt 1&2
 
Emotional Webdesign am Internet Briefing 2013 in Bern
Emotional Webdesign am Internet Briefing 2013 in BernEmotional Webdesign am Internet Briefing 2013 in Bern
Emotional Webdesign am Internet Briefing 2013 in Bern
 
Sameer Desai speaks about Cannes at the RAMA meet
Sameer Desai speaks about Cannes at the RAMA meetSameer Desai speaks about Cannes at the RAMA meet
Sameer Desai speaks about Cannes at the RAMA meet
 
Motivasi bab2
Motivasi bab2Motivasi bab2
Motivasi bab2
 
Ultimo intercambio de regalos 2002
Ultimo intercambio de regalos 2002Ultimo intercambio de regalos 2002
Ultimo intercambio de regalos 2002
 
Sustainability Assessment Report – Klean Kanteen
Sustainability Assessment Report – Klean Kanteen Sustainability Assessment Report – Klean Kanteen
Sustainability Assessment Report – Klean Kanteen
 
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
 
2.componentes de html5
2.componentes de html52.componentes de html5
2.componentes de html5
 
Induction lights vs led lights green tech fixtures
Induction lights vs led lights   green tech fixturesInduction lights vs led lights   green tech fixtures
Induction lights vs led lights green tech fixtures
 

Semelhante a Experiences from a Data Vault Pilot Exploiting the Internet of Things

Real-life Customer Cases using Data Vault and Data Warehouse Automation
Real-life Customer Cases using Data Vault and Data Warehouse AutomationReal-life Customer Cases using Data Vault and Data Warehouse Automation
Real-life Customer Cases using Data Vault and Data Warehouse AutomationPatrick Van Renterghem
 
introduction to datawarehouse
introduction to datawarehouseintroduction to datawarehouse
introduction to datawarehousekiran14360
 
Alten calsoft labs analytics service offerings
Alten calsoft labs   analytics service offeringsAlten calsoft labs   analytics service offerings
Alten calsoft labs analytics service offeringsSandeep Vyas
 
LinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchLinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchSheetal Pratik
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?RTTS
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...DATAVERSITY
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesAmazon Web Services
 
ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWSACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWSAWS User Group Kochi
 
What_to_expect_from_oracle_database_12c
What_to_expect_from_oracle_database_12cWhat_to_expect_from_oracle_database_12c
What_to_expect_from_oracle_database_12cMaria Colgan
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Denodo
 
3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.pptBsMath3rdsem
 
Creating Your Data Governance Dashboard
Creating Your Data Governance DashboardCreating Your Data Governance Dashboard
Creating Your Data Governance DashboardTrillium Software
 
20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.pptSumathiG8
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningProvectus
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesAmazon Web Services
 

Semelhante a Experiences from a Data Vault Pilot Exploiting the Internet of Things (20)

Real-life Customer Cases using Data Vault and Data Warehouse Automation
Real-life Customer Cases using Data Vault and Data Warehouse AutomationReal-life Customer Cases using Data Vault and Data Warehouse Automation
Real-life Customer Cases using Data Vault and Data Warehouse Automation
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
introduction to datawarehouse
introduction to datawarehouseintroduction to datawarehouse
introduction to datawarehouse
 
DWBASIC.ppt
DWBASIC.pptDWBASIC.ppt
DWBASIC.ppt
 
Alten calsoft labs analytics service offerings
Alten calsoft labs   analytics service offeringsAlten calsoft labs   analytics service offerings
Alten calsoft labs analytics service offerings
 
LinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchLinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbench
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Info sphere overview
Info sphere overviewInfo sphere overview
Info sphere overview
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business Outcomes
 
ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWSACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
 
An Introduction To BI
An Introduction To BIAn Introduction To BI
An Introduction To BI
 
What_to_expect_from_oracle_database_12c
What_to_expect_from_oracle_database_12cWhat_to_expect_from_oracle_database_12c
What_to_expect_from_oracle_database_12c
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
 
3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt
 
Creating Your Data Governance Dashboard
Creating Your Data Governance DashboardCreating Your Data Governance Dashboard
Creating Your Data Governance Dashboard
 
20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business Outcomes
 

Último

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 

Último (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 

Experiences from a Data Vault Pilot Exploiting the Internet of Things

  • 1. Business Analytics For All BA4All – Insight Session April 29th 2014 Guy Van der Sande – Vincent Greslebin Unlocking the Value within the Data Vault
  • 2. Fifthplay : Architecture Smart Homes Platform Data Warehouse Data Vault Data mart Gebruikers Marketing & SSC Utility Portal ETL Dag - 1 - Controle data kwaliteit - Toepassing business rules - Aggregatie - Filtering Facility Portal
  • 3. Fifthplay : Why Data Vault ? • Pattern based design which allows agility to take place • Easy to add new data sources making it future proof. This allows Fifthplay to stay innovative • Large volume of data • Build up history that is not available in the operational system • Possibility of performing analysis on raw data (cfr quality checks) • Development speed (Pilot : 37 working days)
  • 6. Data Vault ? The Data Vault is a detail oriented, historical tracking and uniquely linked set of normalized tables that support one or more functional areas of business. It is a hybrid approach encompassing the best of breed between 3rd normal form (3NF) and star schema. The design is flexible, scalable, consistent and adaptable to the needs of the enterprise.
  • 7. Standard architecture  The centerpiece of the Enterprise Data Warehouse  History is build-up  Granularity as ‘detailed’ as possible  No use of business rules  Use of business keys that are horizontal in nature and provide visibility across lines of business  A new layer which has the benefits of the RAW Data Vault, but with the business data embedded  In the Business Data Vault the data has been altered, cleansed and changed to meet the business rules  Downstream of the raw data vault  Starting point for Master Data Management  Metadata is absolutely vital
  • 8. The Data Vault Model exists of 3 basic entity types • Hubs : contains a unique list of business keys • Links : associations across or between business keys • Satellites : holds descriptive data (about the business key) over time Component parts of the Data Vault model
  • 9. • Represents a Core Business Concept • Is formed around the Business Key of this concept • Is established the first time a new instance of that business key is introduced • Must be 1:1 with a single instance • Consists of the business key, a sequence id, a load date/time stamp and a record source. Component parts - Hub
  • 10. • Represents a natural business relationship between business keys • Is established the first time this new unique association is presented • Can represent an association between several Hubs and sometimes other Links. • maintains a 1:1 relationship with the unique and specific business defined association between that set of keys. • Consists of the sequence ids from the Hubs and Links • Contains sequence id, a load date/time stamp and a • record source. Component parts - Link
  • 11. • The Satellite contains the descriptive information (context) for a business key. • A Satellite can only describe one key (Hub or a Link). • The Satellite is the only construct that manages time slice data (data warehouse historical tracking of values over time). Component parts - Satellite
  • 12. Fact Dimension 1 Dimension 3 Dimension 2 Dimension 4 Data Vault – Why ?
  • 13. Fact Dimension 1 Dimension 3 Dimension 2 Dimension 4 Data Vault – Why ?
  • 14. Fact Dimension 1 Dimension 3 Dimension 2 Dimension 4 Fact Dimension 5 Data Vault – Why ?
  • 15. Data Vault – Why ? DV DM
  • 18. Data Vault – How did we do it with Fifthplay ?
  • 19. HubServicePartner HubCustomer HubHomeAreaManager HubSmartPlug HubDeviceGroup HubEnergyLogType LinkServicePartnerCustomer LinkCustomerHomeAreaManager LinkHomeAreaManagerSmartPlug LinkCustomerDeviceGroup LinkDeviceGroupSmartPlug LinkDeviceSubGroupSmartPlug LinkSmartPlugApplianceEnergyLogT ype HubCityLinkHomeAreaManagerCity HubCountry LinkCountryCity HubSatServicePartner HubSatCustomer HubSatHomeAreaManager LinkSatHomeAreaManagerCity LinkSatCountryCity HubSatCountry HubSatDeviceGroup HubSatSmartPlug HubAppliance HubSatAppliance LinkSatSmartPlugApplianceEnergyL ogType HubSatHomeAreaManagerAddress SeqServicePartnerPK ServicePartnerID LoadDateTime RecordSource SeqCustomerPK CustomerID LoadDateTime RecordSource SeqHomeAreaManagerPK HomeAreaManagerNumber LoadDateTime RecordSource SeqSmartPlugPK SmartPlugID LoadDateTime RecordSource SeqDeviceGroupPK DeviceGroupID LoadDateTime RecordSource SeqEnergyLogTypePK EnergyLogName LoadDateTime RecordSource SeqServicePartnerCustomerPK SeqCustomer LoadDateTime RecordSource SeqServicePartner SeqCustomerHomeAreaMan ager PK SeqCustomer LoadDateTime RecordSource SeqHomeAreaManager SeqHomeAreaManagerSmar tPlug PK SeqHomeAreaManager LoadDateTime RecordSource SeqSmartPlug SeqCustomerDeviceGroupPK SeqCustomer LoadDateTime RecordSource SeqDeviceGroup LoadDateTime SeqDeviceGroupSmartPlugPK LoadDateTime RecordSource SeqDeviceGroup SeqDeviceSubGroupSmartPl ug PK LoadDateTime RecordSource SeqDeviceGroup SeqSmartPlug SeqSmartPlug SeqSmartPlugApplianceEner gyLogType PK SeqEnergyLogType LoadDateTime RecordSource SeqSmartPlug SeqCityPK CityPostalCode LoadDateTime RecordSource CityName SeqHomeAreaManagerCityPK SeqCity LoadDateTime RecordSource SeqHomeAreaManager SeqCountryPK CountryIsoCode LoadDateTime RecordSource SeqCountryCityPK SeqCity LoadDateTime RecordSource SeqCountry SeqSatServicePartnerPK SeqServicePartner LoadDateTime RecordSource LoadEndDateTime ServicePartnerCode ServiucePartnerEmail ServicePartnerCustomerCon tact SeqSatCustomerPK SeqCustomer LoadDateTime RecordSource LoadEndDateTime CustomerEmail CustomerFirstName CustomerLastName CustomerLanguage SeqSatHomeAreaManagerPK SeqHomeAreaManager LoadDateTime RecordSource LoadEndDateTime HomeAreaManagerMode HomeAreaManagerArchitec ture SeqSatHomeAreaManagerCi ty PK SeqHomeAreaManagerCity LoadDateTime RecordSource LoadEndDateTime HAMCityAddressLine1 HAMCityPhoneNumber HAMCityAddressLine2 SeqSatCountryCityPK SeqCountryCity LoadDateTime RecordSource LoadEndDateTime CountryCityRegion CountryCityState SeqSatCountryPK SeqCountry LoadDateTime RecordSource LoadEndDateTime CountryName SeqSatDeviceGroupPK SeqDeviceGroup LoadDateTime RecordSource LoadEndDateTime DeviceGroupName DeviceGroupDescription SeqSatSmartPlugPK SeqSmartPlug LoadDateTime RecordSource LoadEndDateTime SmartPlugDisplayName SmartPlugManufacturer SmartPlugModel SmartPlugIsGenerator SmartPlugHasChildren SmartPlugHasSchedule SeqAppliancePK ApplianceID LoadDateTime RecordSource SeqSatAppliancePK SeqAppliance LoadDateTime RecordSource LoadEndDateTime ApplianceCategory SeqSatSmartPlugApplianceE nergyLogType PK SeqSmartPlugApplianceEner gyLogType LoadDateTime RecordSource LoadEndDateTime EnergyLogDateTime EnergyLogValue SeqAppliance EnergyLogValueUnit Legend Hub Link Satellite ServicePartnerWebPage SeqSatHomeAreaManagerA ddress PK SeqHomeAreaManager LoadDateTime RecordSource LoadEndDateTime HomeAreaManagerAddress Line1 HomeAreaManagerPostalCo de HomeAreaManagerAddress Line2 HomeAreaManagerCityNam e HomeAreaManagerProvince HomeAreaManagerState HomeAreaManagerCountry Fifthplay Raw Data Vault Architecture
  • 20. Fifthplay Raw Data Vault Architecture HubSmartPlug HubEnergyLogType LinkSmartPlugApplianceEnergyLogT ype HubAppliance HubSatAppliance LinkSatSmartPlugApplianceEnergyL ogType SeqSmartPlugPK SmartPlugID LoadDateTime RecordSource SeqEnergyLogTypePK EnergyLogName LoadDateTime RecordSource SeqSmartPlugApplianceEner gyLogType PK SeqEnergyLogType LoadDateTime RecordSource SeqSmartPlug SeqAppliancePK ApplianceID LoadDateTime RecordSource SeqSatAppliancePK SeqAppliance LoadDateTime RecordSource LoadEndDateTime ApplianceCategory SeqSatSmartPlugApplianceE nergyLogType PK SeqSmartPlugApplianceEner gyLogType LoadDateTime RecordSource LoadEndDateTime EnergyLogDateTime EnergyLogValue SeqAppliance EnergyLogValueUnit Legend Hub Link Satellite
  • 21. Fifthplay : Data Vault – lessons learned • Don’t stop with data vault; A combination with classic dimensional Kimball-methodology is advised • Be creative; get out of your comfort zone, dare to walk the thine line • While setting up the data vault, operational issues where discovered early in the process • ETL-development goes very quickly because of the typical pattern design of the data vault;
  • 22. Data Vault – What’s next ?
  • 23. 2013 : Dan Linstedt releases Data Vault 2.0 specs History and what’s next ? Relational modeling (E.F.Codd) Bill Inmon began discussing Data Warehousing • Barry Devlin and Dr Kimball release “Business Data Warehouse” • Bill Inmon popularizes Data Warehousing • Dr Kimball popularizes Star Schema Dan Linstedt begins R&D on Data Vault Modeling Dan Linstedt releases first 5 articles on Data Vault Modeling 2012 : Dan Linstedt announces Data Vault 2.0 1960 1970 1980 1990 2000 2010
  • 24. Thank You BI@USGICT.be http://www.linkedin.com/company/usgprofessionalsbe +32 3 231 94 84 www.usgict.be https://www.facebook.com/usgictbe “In the Data Warehousing/BI world, we should store the data as it stands on the source system and interpret it on the way out to the data marts. This is absolutely critical to remember.” Dan Linstedt @BICC_at_USG