SlideShare uma empresa Scribd logo
1 de 33
Agile Methods and Data Warehousing:
How to Deliver Faster
Kent Graziano
Data Warrior LLC
Twitter @KentGraziano
Agenda
 My Bio
 Why Agile & DW
 Agile Manifesto
 12 Agile Principles
 Agile Concepts
 Two weeks?
 Getting to Agile/RAD
 Data Vault
 Conclusion
(C) 2005-2014 Kent Graziano
My Bio
 Oracle ACE Director
 Certified Data Vault Master and DV 2.0 Architect
 Member: Boulder BI Brain Trust
 Data Architecture and Data Warehouse Specialist
● 30+ years in IT
● 25+ years of Oracle-related work
● 20+ years of data warehousing experience
 Co-Author of
● The Business of Data Vault Modeling
● The Data Model Resource Book (1st Edition)
 Past-President of ODTUG and Rocky Mountain Oracle
User Group
(C) 2005-2014 Kent Graziano
Why Agile & DW?
 Perceptions
● DW too slow to produce results
● DW projects “fail”
● DW projects fail to adapt to business changes
 Goal
● Change perceptions
● Deliver value
● Be more adaptable and flexible
(C) 2005-2014 Kent Graziano
Objectives
 Understand what is meant by “agile”
 Try to apply some agile ideas to data
warehouse and business intelligence efforts
 Answer the question – can we deliver results
faster?
(C) 2005-2014 Kent Graziano
Manifesto for Agile Software Development
http://agilemanifesto.org
We are uncovering better ways of developing
software by doing it and helping others do it.
Through this work we have come to value:
Individuals and interactions over processes and
tools
Working software over comprehensive
documentation
Customer collaboration over contract negotiation
Responding to change over following a plan
That is, while there is value in the items on
the right, we value the items on the left more
Kent Beck
Mike Beedle
Arie van Bennekum
Alistair Cockburn
Ward Cunningham
Martin Fowler
James Grenning
Jim Highsmith
Andrew Hunt
Ron Jeffries
Jon Kern
Brian Marick
Robert C. Martin
Steve Mellor
Ken Schwaber
Jeff Sutherland
Dave Thomas
© 2001, the above authors
this declaration may be freely copied in any form,
but only in its entirety through this notice.
Principle #1
 Our highest priority is to satisfy the
customer through early and continuous
delivery of valuable software.
●Who is the customer?
●What is “valuable software” in data
warehousing?
● BI reports
● Dashboard interface
● Working ETL code?
● In the context of the customer!
(C) 2005-2014 Kent Graziano
Principle #2
 Welcome changing requirements, even
late in development. Agile processes
harness change for the customer's
competitive advantage.
● Must be flexible and adaptable in thinking and
design
● Use code generators (more on this later)
● Start with normalized models
(C) 2005-2014 Kent Graziano
Principle #3
 Deliver working software frequently,
from a couple of weeks to a couple of
months, with a preference to the
shorter timescale.
●Need good scope control!
●One subject area at a time
● What is a subject area?
●Think Data Vault (more on this later)
(C) 2005-2014 Kent Graziano
Principle #4
 Business people and developers
must work together daily throughout
the project.
● DW MUST have the business involved
● One of the Top 10 reasons for failure
● This applies for BI reports
● Daily interaction would be great!
● But – politics and priorities may interfere!
● At HP GBI/EDW – we used “war” room
(C) 2005-2014 Kent Graziano
Principle #5
 Build projects around motivated
individuals. Give them the environment
and support they need, and trust them to
get the job done.
● Need people who WANT to be on the project
● Get training if needed
● Keep units of work small to create an
atmosphere of success
● Don’t try a Big Bang EDW
(C) 2005-2014 Kent Graziano
Principle #6
 The most efficient and effective method of
conveying information to and within a
development team is face-to-face
conversation.
● Daily team huddles
● Co-located work space
● While face-to-face is efficient, still need some
documentation (or meta-data) for later
● Use a tool like JIRA
(C) 2005-2014 Kent Graziano
Principle #7
 Working software is the primary measure
of progress.
●Applied to DW:
● What is “working software?”
● BI reports
● Tables definitions and working ETL code
● Think more broadly – it is not just a data
entry screen
(C) 2005-2014 Kent Graziano
Principle #8
 Agile processes promote sustainable development. The
sponsors, developers, and users should be able to
maintain a constant pace indefinitely.
● DW Programs last a long time – don’t burn the team
out with unreasonable deadlines
● See P#5 – Motivated individuals
● Good planning and scope control
● No all nighters!
● Smallest valuable unit of work possible
● Keep it moving like a production line
● Pick (or develop) a standard, repeatable
methodology
● Study the Agile methods and adopt what works for your team
● Data Vault Modeling Methodology
(C) 2005-2014 Kent Graziano
Principle #9
 Continuous attention to technical
excellence and good design enhances
agility.
● Bad design + bad architecture = trouble
● Symptom: can’t build a requested data mart
● Frequent design reviews a must
● Improves team skills – provides cross training
● Over time – better designs, shorter review cycles
● Faster delivery
(C) 2005-2014 Kent Graziano
Principle #10
 Simplicity--the art of maximizing the
amount of work not done--is essential.
● KISS – Keep it Simple Stupid
● Write less code by hand
● Use code generators! (No syntax errors – ever)
● Oracle SDDM, Oracle Warehouse Builder
● ERWin
● AnalytxDS (for DV)
● WhereScape RED (also DV)
● Talend Open Profiler
● Modifications are easier – just regenerate the code
(C) 2005-2014 Kent Graziano
Principle #11
 The best architectures, requirements, and
designs emerge from self-organizing teams.
● Team of smart, motivated people = success
● We succeed (or fail) as a TEAM
● Don’t micro manage or pigeon-hole staff
● Encourage team work and team thinking
● Staff will gravitate to roles based on skills, interest,
and personality
● Then they have more buy-in to the process
● Eliminates delays and bottlenecks by having
shared responsibilities (no single point of failure)
(C) 2005-2014 Kent Graziano
Principle #12
 At regular intervals, the team reflects on how
to become more effective, then tunes and
adjusts its behavior accordingly.
● The Decision Model
● Debate Mode
● Check Points
● Related to self-organizing teams
● Make finding the solution to a problem the team’s
problem
● More buy-in to the solution
● Retrospectives are a MUST!
(C) 2005-2014 Kent Graziano
Decision Model in Action
Plan
Debate Decision

Check
Point
Questions
?
?
?
?
Answers
Mini-Debate
(Cause a
Slight Change
in Direction)
Iterate
Courtesy of Dr. Ed Freeman, CIO/CTO, Denver Public Schools
Agile Concepts for DW
 Team Huddles (Morning Scrum)
 Extreme Programming
 Pair Programming
 Domain Neutral Components
●Domain Archetypes
(C) 2005-2014 Kent Graziano
Team Huddles
 Daily Standup Meeting
● AKA Scum
● Morning Roll Call (FDD)
 Short meeting (< 15 minutes)
● Every morning, mandatory attendance
● Review assignments, accomplishments, backlogs
 Immediate feedback and assistance
● Keeps team motivated and on track (P #5)
● Identifies constraints and bottlenecks early in the
process
● Eliminates backlogs more quickly via re-assignments
 Improves team work
 Supports self-organizing teams (P #11)
(C) 2005-2014 Kent Graziano
Extreme Programming (XP)
 Programmer works directly with the end user
● At HP used Virtual Classroom or NetMeeting
 In DW:
● Best with developing BI reports
● DW or data mart must already be populated
● Reports developed using BI tool
● With the user in the room (or virtual)
● With constant user reviews and input using a web reporting tool
● Also applies to developing a dashboard or portal interface
● Works for ETL as well!
● Used war room with business to get near instant validation of
ETL changes
(C) 2005-2014 Kent Graziano
Pair Programming
 Part of XP
 Programmers work side-by-side
● One terminal
● One codes, the other reviews
● Two terminals, one cube
● One programming, one documenting
● Could also be done virtually!
 In DW:
● Writing ETL Code
● Pair data modeling
(C) 2005-2014 Kent Graziano
Two week iterations?
 Goal is really a few weeks to a few months
(see P#3)
 What is the deliverable?
● A fact table for a star schema
● A dimension table
● A complete star (fact and all dimensions)
● One piece of ETL code that populates a fact
table
● A function needed by the ETL code
● A new report or query
 Who is the customer?
● BI programmer?
● Knowledge worker?
● ETL programmer?
(C) 2005-2014 Kent Graziano
HP EDW Examples
 Business found missing report elements
 Solution: modify 3 tables to add 5 new columns in
reporting model (star schema)
 Tasks:
● Document requirements and ETL specs
● Modify Logical & Physical model (w/peer review)
● Rebuild tables in development
● Develop and test ETL
● MTI (Move To Integration) tables and code
● Execute and test ETL
● Modify report in UAT environment & test
 Result: Revised report ready in 18 hours, 44 minutes
● Less than 1 business day
 2nd case: 6 tables, 16 new columns
● Ready for UAT in 72 hours
(C) 2005-2014 Kent Graziano
Getting to Agile/RAD
 “better than average expertise”
● Expert consulting and mentoring
● Do the work (OTJ)
 At Denver Public Schools – took two years
before we could try being more “agile”
● Needed experience in DW, Oracle Designer, OWB,
and the “process” of building, deploying, and
maintaining and Oracle DW
 At HP GBI/EDW it took about a year
● Needed the right team and the right management
support
● Also the right project with willing business users
 At McKesson over a year so far setting standards
and training staff – but some success already
(C) 2005-2014 Kent Graziano
Data Vault – How it fits
 Data modeling technique for enterprise data
warehouse design
● See Data Vault white papers at www.danlinstedt.com
● The book: Super Charge Your Data Warehouse
 Allows modeling EDW in small chunks
● Develop model, build tables, build ETL, populate,
repeat (often)
● Key: prioritize the data requirements
● Think User Stories ala SCRUM
(C) 2005-2014 Kent Graziano
References
 Agile Management for Software Engineering:
Applying the Theory of Constraints for Business
Results by David J. Anderson
 CASE Method Fast-track: A RAD Approach by
Richard Barker & Dai Clegg
 The Goal by Eliyahu M. Goldratt
 The Business of Data Vault Modeling by Dan
Linstedt, Kent Graziano, & Hans Hultgren
 Super Charge Your Data Warehouse by Dan
Linstedt
(C) 2005-2014 Kent Graziano
Conclusion
 Agile concepts can be applied to data
warehouse and BI projects
● Not a purist definition!
● Try to apply the principles – be creative
 Suggested approaches
● Data Vault 2.0!
● Use team huddles
● Use universal models as template
● Use pair programming to increase quality and cross
training
● Use code generators like SDDM & OWB
● Use the Data Vault modeling approach
● Read about Agile Methods (XP & FDD)
● Read Oracle CASE Method Fast-Track
● Be flexible and give it a try
(C) 2005-2014 Kent Graziano
Super Charge Your Data Warehouse
Available on Amazon.com
Soft Cover or Kindle Format
Now also available in PDF at
LearnDataVault.com
Hint: Kent is the Technical
Editor
(C) 2005-2014 Kent Graziano
Data Vault References
www.learndatavault.com
www.danlinstedt.com
On YouTube:
www.youtube.com/LearnDataVault
On Facebook:
www.facebook.com/learndatavault
(C) 2005-2014 Kent Graziano
Contact Information
Kent Graziano
The Oracle Data Warrior
Data Warrior LLC
Kent.graziano@att.net
Visit my blog at
http://kentgraziano.com
(C) 2005-2014 Kent Graziano

Mais conteúdo relacionado

Mais procurados

Accenture informatica interview question answers
Accenture informatica interview question answersAccenture informatica interview question answers
Accenture informatica interview question answers
Sweta Singh
 

Mais procurados (20)

Data catalog
Data catalogData catalog
Data catalog
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
 
InfluxDB IOx Tech Talks: Intro to the InfluxDB IOx Read Buffer - A Read-Optim...
InfluxDB IOx Tech Talks: Intro to the InfluxDB IOx Read Buffer - A Read-Optim...InfluxDB IOx Tech Talks: Intro to the InfluxDB IOx Read Buffer - A Read-Optim...
InfluxDB IOx Tech Talks: Intro to the InfluxDB IOx Read Buffer - A Read-Optim...
 
What is Enterprise Architecture?
What is Enterprise Architecture?What is Enterprise Architecture?
What is Enterprise Architecture?
 
Testing Microservices @DevoxxBE 23.pdf
Testing Microservices @DevoxxBE 23.pdfTesting Microservices @DevoxxBE 23.pdf
Testing Microservices @DevoxxBE 23.pdf
 
EDW Webinar: Managing Change for Successful Data Governance
EDW Webinar: Managing Change for Successful Data GovernanceEDW Webinar: Managing Change for Successful Data Governance
EDW Webinar: Managing Change for Successful Data Governance
 
Introduction to Enterprise Architecture
Introduction to Enterprise Architecture Introduction to Enterprise Architecture
Introduction to Enterprise Architecture
 
The Forgotten People - Super Users Are Key
The Forgotten People - Super Users Are KeyThe Forgotten People - Super Users Are Key
The Forgotten People - Super Users Are Key
 
OOW15 - Oracle E-Business Suite Integration Best Practices
OOW15 - Oracle E-Business Suite Integration Best PracticesOOW15 - Oracle E-Business Suite Integration Best Practices
OOW15 - Oracle E-Business Suite Integration Best Practices
 
SharePoint Document Management
SharePoint Document ManagementSharePoint Document Management
SharePoint Document Management
 
Accenture informatica interview question answers
Accenture informatica interview question answersAccenture informatica interview question answers
Accenture informatica interview question answers
 
Backup and recovery in oracle
Backup and recovery in oracleBackup and recovery in oracle
Backup and recovery in oracle
 
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
 
Inside Parquet Format
Inside Parquet FormatInside Parquet Format
Inside Parquet Format
 
Master Data Management - Aligning Data, Process, and Governance
Master Data Management - Aligning Data, Process, and GovernanceMaster Data Management - Aligning Data, Process, and Governance
Master Data Management - Aligning Data, Process, and Governance
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
 
Catalogs - Turning a Set of Parquet Files into a Data Set
Catalogs - Turning a Set of Parquet Files into a Data SetCatalogs - Turning a Set of Parquet Files into a Data Set
Catalogs - Turning a Set of Parquet Files into a Data Set
 
Master Data Management methodology
Master Data Management methodologyMaster Data Management methodology
Master Data Management methodology
 
Data Profiling, Data Catalogs and Metadata Harmonisation
Data Profiling, Data Catalogs and Metadata HarmonisationData Profiling, Data Catalogs and Metadata Harmonisation
Data Profiling, Data Catalogs and Metadata Harmonisation
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 

Destaque

Destaque (13)

Extreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 DimensionsExtreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
 
Worst Practices in Data Warehouse Design
Worst Practices in Data Warehouse DesignWorst Practices in Data Warehouse Design
Worst Practices in Data Warehouse Design
 
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data ModelingAgile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
 
Agile Data Warehousing: Using SDDM to Build a Virtualized ODS
Agile Data Warehousing: Using SDDM to Build a Virtualized ODSAgile Data Warehousing: Using SDDM to Build a Virtualized ODS
Agile Data Warehousing: Using SDDM to Build a Virtualized ODS
 
Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016
 
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureData Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
 
Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)
 
Top Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data ModelerTop Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data Modeler
 
Why Data Vault?
Why Data Vault? Why Data Vault?
Why Data Vault?
 
Agile Methods and Data Warehousing (2016 update)
Agile Methods and Data Warehousing (2016 update)Agile Methods and Data Warehousing (2016 update)
Agile Methods and Data Warehousing (2016 update)
 
Introduction to Data Vault Modeling
Introduction to Data Vault ModelingIntroduction to Data Vault Modeling
Introduction to Data Vault Modeling
 
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachUsing OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
 

Semelhante a Agile Methods and Data Warehousing

Amrutha_Resume[1_2]
Amrutha_Resume[1_2]Amrutha_Resume[1_2]
Amrutha_Resume[1_2]
Amrutha T
 
Sankar Gopal_Business Intelligence_Reporting_Analyst
Sankar Gopal_Business Intelligence_Reporting_AnalystSankar Gopal_Business Intelligence_Reporting_Analyst
Sankar Gopal_Business Intelligence_Reporting_Analyst
Sankar Narayanan
 
Mohd_Shaukath_5_Exp_Datastage
Mohd_Shaukath_5_Exp_DatastageMohd_Shaukath_5_Exp_Datastage
Mohd_Shaukath_5_Exp_Datastage
Mohammed Shaukath
 
Developer Productivity Engineering with Gradle
Developer Productivity Engineering with GradleDeveloper Productivity Engineering with Gradle
Developer Productivity Engineering with Gradle
All Things Open
 

Semelhante a Agile Methods and Data Warehousing (20)

Agile methods and dw mha
Agile methods and dw mhaAgile methods and dw mha
Agile methods and dw mha
 
Embedding a Shift Left Culture in your Enterprise
Embedding a Shift Left Culture in your EnterpriseEmbedding a Shift Left Culture in your Enterprise
Embedding a Shift Left Culture in your Enterprise
 
Managing software projects & teams effectively
Managing software projects & teams effectivelyManaging software projects & teams effectively
Managing software projects & teams effectively
 
Amrutha_Resume[1_2]
Amrutha_Resume[1_2]Amrutha_Resume[1_2]
Amrutha_Resume[1_2]
 
Resume
ResumeResume
Resume
 
OOW15 - Customer Success Stories: Upgrading to Oracle E-Business Suite 12.2
OOW15 - Customer Success Stories: Upgrading to Oracle E-Business Suite 12.2 OOW15 - Customer Success Stories: Upgrading to Oracle E-Business Suite 12.2
OOW15 - Customer Success Stories: Upgrading to Oracle E-Business Suite 12.2
 
Architecting for analytics
Architecting for analyticsArchitecting for analytics
Architecting for analytics
 
Excalibur: best practices for virtual desktop operations leveraging Citrix Di...
Excalibur: best practices for virtual desktop operations leveraging Citrix Di...Excalibur: best practices for virtual desktop operations leveraging Citrix Di...
Excalibur: best practices for virtual desktop operations leveraging Citrix Di...
 
Why we should consider Open Hybrid Cloud.pdf
Why we should  consider Open Hybrid Cloud.pdfWhy we should  consider Open Hybrid Cloud.pdf
Why we should consider Open Hybrid Cloud.pdf
 
Sankar Gopal_Business Intelligence_Reporting_Analyst
Sankar Gopal_Business Intelligence_Reporting_AnalystSankar Gopal_Business Intelligence_Reporting_Analyst
Sankar Gopal_Business Intelligence_Reporting_Analyst
 
Agile software development compfest 13
Agile software development compfest 13Agile software development compfest 13
Agile software development compfest 13
 
Mohd_Shaukath_5_Exp_Datastage
Mohd_Shaukath_5_Exp_DatastageMohd_Shaukath_5_Exp_Datastage
Mohd_Shaukath_5_Exp_Datastage
 
Nivi_Resume
Nivi_ResumeNivi_Resume
Nivi_Resume
 
Legacy code - Taming The Beast
Legacy code  - Taming The BeastLegacy code  - Taming The Beast
Legacy code - Taming The Beast
 
Project manager with 10+ years of IT experience.
Project manager with 10+ years of IT experience.Project manager with 10+ years of IT experience.
Project manager with 10+ years of IT experience.
 
Developer Productivity Engineering with Gradle
Developer Productivity Engineering with GradleDeveloper Productivity Engineering with Gradle
Developer Productivity Engineering with Gradle
 
Measuring the Productivity of Your Engineering Organisation - the Good, the B...
Measuring the Productivity of Your Engineering Organisation - the Good, the B...Measuring the Productivity of Your Engineering Organisation - the Good, the B...
Measuring the Productivity of Your Engineering Organisation - the Good, the B...
 
Rsqrd AI: From R&D to ROI of AI
Rsqrd AI: From R&D to ROI of AIRsqrd AI: From R&D to ROI of AI
Rsqrd AI: From R&D to ROI of AI
 
Extreme Programming 1st.pdf
Extreme Programming 1st.pdfExtreme Programming 1st.pdf
Extreme Programming 1st.pdf
 
DevOps, SAFe and critical information bearers: A practical approach for plann...
DevOps, SAFe and critical information bearers: A practical approach for plann...DevOps, SAFe and critical information bearers: A practical approach for plann...
DevOps, SAFe and critical information bearers: A practical approach for plann...
 

Mais de Kent Graziano

Mais de Kent Graziano (10)

Balance agility and governance with #TrueDataOps and The Data Cloud
Balance agility and governance with #TrueDataOps and The Data CloudBalance agility and governance with #TrueDataOps and The Data Cloud
Balance agility and governance with #TrueDataOps and The Data Cloud
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for Dinner
 
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on Snowflake
 
Rise of the Data Cloud
Rise of the Data CloudRise of the Data Cloud
Rise of the Data Cloud
 
Delivering Data Democratization in the Cloud with Snowflake
Delivering Data Democratization in the Cloud with SnowflakeDelivering Data Democratization in the Cloud with Snowflake
Delivering Data Democratization in the Cloud with Snowflake
 
Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)
 
Making Sense of Schema on Read
Making Sense of Schema on ReadMaking Sense of Schema on Read
Making Sense of Schema on Read
 
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
 
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWDemystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFW
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

Agile Methods and Data Warehousing

  • 1. Agile Methods and Data Warehousing: How to Deliver Faster Kent Graziano Data Warrior LLC Twitter @KentGraziano
  • 2. Agenda  My Bio  Why Agile & DW  Agile Manifesto  12 Agile Principles  Agile Concepts  Two weeks?  Getting to Agile/RAD  Data Vault  Conclusion (C) 2005-2014 Kent Graziano
  • 3. My Bio  Oracle ACE Director  Certified Data Vault Master and DV 2.0 Architect  Member: Boulder BI Brain Trust  Data Architecture and Data Warehouse Specialist ● 30+ years in IT ● 25+ years of Oracle-related work ● 20+ years of data warehousing experience  Co-Author of ● The Business of Data Vault Modeling ● The Data Model Resource Book (1st Edition)  Past-President of ODTUG and Rocky Mountain Oracle User Group (C) 2005-2014 Kent Graziano
  • 4. Why Agile & DW?  Perceptions ● DW too slow to produce results ● DW projects “fail” ● DW projects fail to adapt to business changes  Goal ● Change perceptions ● Deliver value ● Be more adaptable and flexible (C) 2005-2014 Kent Graziano
  • 5. Objectives  Understand what is meant by “agile”  Try to apply some agile ideas to data warehouse and business intelligence efforts  Answer the question – can we deliver results faster? (C) 2005-2014 Kent Graziano
  • 6. Manifesto for Agile Software Development http://agilemanifesto.org We are uncovering better ways of developing software by doing it and helping others do it. Through this work we have come to value: Individuals and interactions over processes and tools Working software over comprehensive documentation Customer collaboration over contract negotiation Responding to change over following a plan That is, while there is value in the items on the right, we value the items on the left more Kent Beck Mike Beedle Arie van Bennekum Alistair Cockburn Ward Cunningham Martin Fowler James Grenning Jim Highsmith Andrew Hunt Ron Jeffries Jon Kern Brian Marick Robert C. Martin Steve Mellor Ken Schwaber Jeff Sutherland Dave Thomas © 2001, the above authors this declaration may be freely copied in any form, but only in its entirety through this notice.
  • 7. Principle #1  Our highest priority is to satisfy the customer through early and continuous delivery of valuable software. ●Who is the customer? ●What is “valuable software” in data warehousing? ● BI reports ● Dashboard interface ● Working ETL code? ● In the context of the customer! (C) 2005-2014 Kent Graziano
  • 8. Principle #2  Welcome changing requirements, even late in development. Agile processes harness change for the customer's competitive advantage. ● Must be flexible and adaptable in thinking and design ● Use code generators (more on this later) ● Start with normalized models (C) 2005-2014 Kent Graziano
  • 9. Principle #3  Deliver working software frequently, from a couple of weeks to a couple of months, with a preference to the shorter timescale. ●Need good scope control! ●One subject area at a time ● What is a subject area? ●Think Data Vault (more on this later) (C) 2005-2014 Kent Graziano
  • 10. Principle #4  Business people and developers must work together daily throughout the project. ● DW MUST have the business involved ● One of the Top 10 reasons for failure ● This applies for BI reports ● Daily interaction would be great! ● But – politics and priorities may interfere! ● At HP GBI/EDW – we used “war” room (C) 2005-2014 Kent Graziano
  • 11. Principle #5  Build projects around motivated individuals. Give them the environment and support they need, and trust them to get the job done. ● Need people who WANT to be on the project ● Get training if needed ● Keep units of work small to create an atmosphere of success ● Don’t try a Big Bang EDW (C) 2005-2014 Kent Graziano
  • 12. Principle #6  The most efficient and effective method of conveying information to and within a development team is face-to-face conversation. ● Daily team huddles ● Co-located work space ● While face-to-face is efficient, still need some documentation (or meta-data) for later ● Use a tool like JIRA (C) 2005-2014 Kent Graziano
  • 13. Principle #7  Working software is the primary measure of progress. ●Applied to DW: ● What is “working software?” ● BI reports ● Tables definitions and working ETL code ● Think more broadly – it is not just a data entry screen (C) 2005-2014 Kent Graziano
  • 14. Principle #8  Agile processes promote sustainable development. The sponsors, developers, and users should be able to maintain a constant pace indefinitely. ● DW Programs last a long time – don’t burn the team out with unreasonable deadlines ● See P#5 – Motivated individuals ● Good planning and scope control ● No all nighters! ● Smallest valuable unit of work possible ● Keep it moving like a production line ● Pick (or develop) a standard, repeatable methodology ● Study the Agile methods and adopt what works for your team ● Data Vault Modeling Methodology (C) 2005-2014 Kent Graziano
  • 15. Principle #9  Continuous attention to technical excellence and good design enhances agility. ● Bad design + bad architecture = trouble ● Symptom: can’t build a requested data mart ● Frequent design reviews a must ● Improves team skills – provides cross training ● Over time – better designs, shorter review cycles ● Faster delivery (C) 2005-2014 Kent Graziano
  • 16. Principle #10  Simplicity--the art of maximizing the amount of work not done--is essential. ● KISS – Keep it Simple Stupid ● Write less code by hand ● Use code generators! (No syntax errors – ever) ● Oracle SDDM, Oracle Warehouse Builder ● ERWin ● AnalytxDS (for DV) ● WhereScape RED (also DV) ● Talend Open Profiler ● Modifications are easier – just regenerate the code (C) 2005-2014 Kent Graziano
  • 17. Principle #11  The best architectures, requirements, and designs emerge from self-organizing teams. ● Team of smart, motivated people = success ● We succeed (or fail) as a TEAM ● Don’t micro manage or pigeon-hole staff ● Encourage team work and team thinking ● Staff will gravitate to roles based on skills, interest, and personality ● Then they have more buy-in to the process ● Eliminates delays and bottlenecks by having shared responsibilities (no single point of failure) (C) 2005-2014 Kent Graziano
  • 18. Principle #12  At regular intervals, the team reflects on how to become more effective, then tunes and adjusts its behavior accordingly. ● The Decision Model ● Debate Mode ● Check Points ● Related to self-organizing teams ● Make finding the solution to a problem the team’s problem ● More buy-in to the solution ● Retrospectives are a MUST! (C) 2005-2014 Kent Graziano
  • 19. Decision Model in Action Plan Debate Decision  Check Point Questions ? ? ? ? Answers Mini-Debate (Cause a Slight Change in Direction) Iterate Courtesy of Dr. Ed Freeman, CIO/CTO, Denver Public Schools
  • 20. Agile Concepts for DW  Team Huddles (Morning Scrum)  Extreme Programming  Pair Programming  Domain Neutral Components ●Domain Archetypes (C) 2005-2014 Kent Graziano
  • 21. Team Huddles  Daily Standup Meeting ● AKA Scum ● Morning Roll Call (FDD)  Short meeting (< 15 minutes) ● Every morning, mandatory attendance ● Review assignments, accomplishments, backlogs  Immediate feedback and assistance ● Keeps team motivated and on track (P #5) ● Identifies constraints and bottlenecks early in the process ● Eliminates backlogs more quickly via re-assignments  Improves team work  Supports self-organizing teams (P #11) (C) 2005-2014 Kent Graziano
  • 22. Extreme Programming (XP)  Programmer works directly with the end user ● At HP used Virtual Classroom or NetMeeting  In DW: ● Best with developing BI reports ● DW or data mart must already be populated ● Reports developed using BI tool ● With the user in the room (or virtual) ● With constant user reviews and input using a web reporting tool ● Also applies to developing a dashboard or portal interface ● Works for ETL as well! ● Used war room with business to get near instant validation of ETL changes (C) 2005-2014 Kent Graziano
  • 23. Pair Programming  Part of XP  Programmers work side-by-side ● One terminal ● One codes, the other reviews ● Two terminals, one cube ● One programming, one documenting ● Could also be done virtually!  In DW: ● Writing ETL Code ● Pair data modeling (C) 2005-2014 Kent Graziano
  • 24. Two week iterations?  Goal is really a few weeks to a few months (see P#3)  What is the deliverable? ● A fact table for a star schema ● A dimension table ● A complete star (fact and all dimensions) ● One piece of ETL code that populates a fact table ● A function needed by the ETL code ● A new report or query  Who is the customer? ● BI programmer? ● Knowledge worker? ● ETL programmer? (C) 2005-2014 Kent Graziano
  • 25. HP EDW Examples  Business found missing report elements  Solution: modify 3 tables to add 5 new columns in reporting model (star schema)  Tasks: ● Document requirements and ETL specs ● Modify Logical & Physical model (w/peer review) ● Rebuild tables in development ● Develop and test ETL ● MTI (Move To Integration) tables and code ● Execute and test ETL ● Modify report in UAT environment & test  Result: Revised report ready in 18 hours, 44 minutes ● Less than 1 business day  2nd case: 6 tables, 16 new columns ● Ready for UAT in 72 hours (C) 2005-2014 Kent Graziano
  • 26. Getting to Agile/RAD  “better than average expertise” ● Expert consulting and mentoring ● Do the work (OTJ)  At Denver Public Schools – took two years before we could try being more “agile” ● Needed experience in DW, Oracle Designer, OWB, and the “process” of building, deploying, and maintaining and Oracle DW  At HP GBI/EDW it took about a year ● Needed the right team and the right management support ● Also the right project with willing business users  At McKesson over a year so far setting standards and training staff – but some success already (C) 2005-2014 Kent Graziano
  • 27. Data Vault – How it fits  Data modeling technique for enterprise data warehouse design ● See Data Vault white papers at www.danlinstedt.com ● The book: Super Charge Your Data Warehouse  Allows modeling EDW in small chunks ● Develop model, build tables, build ETL, populate, repeat (often) ● Key: prioritize the data requirements ● Think User Stories ala SCRUM (C) 2005-2014 Kent Graziano
  • 28. References  Agile Management for Software Engineering: Applying the Theory of Constraints for Business Results by David J. Anderson  CASE Method Fast-track: A RAD Approach by Richard Barker & Dai Clegg  The Goal by Eliyahu M. Goldratt  The Business of Data Vault Modeling by Dan Linstedt, Kent Graziano, & Hans Hultgren  Super Charge Your Data Warehouse by Dan Linstedt (C) 2005-2014 Kent Graziano
  • 29. Conclusion  Agile concepts can be applied to data warehouse and BI projects ● Not a purist definition! ● Try to apply the principles – be creative  Suggested approaches ● Data Vault 2.0! ● Use team huddles ● Use universal models as template ● Use pair programming to increase quality and cross training ● Use code generators like SDDM & OWB ● Use the Data Vault modeling approach ● Read about Agile Methods (XP & FDD) ● Read Oracle CASE Method Fast-Track ● Be flexible and give it a try (C) 2005-2014 Kent Graziano
  • 30. Super Charge Your Data Warehouse Available on Amazon.com Soft Cover or Kindle Format Now also available in PDF at LearnDataVault.com Hint: Kent is the Technical Editor (C) 2005-2014 Kent Graziano
  • 31. Data Vault References www.learndatavault.com www.danlinstedt.com On YouTube: www.youtube.com/LearnDataVault On Facebook: www.facebook.com/learndatavault (C) 2005-2014 Kent Graziano
  • 32.
  • 33. Contact Information Kent Graziano The Oracle Data Warrior Data Warrior LLC Kent.graziano@att.net Visit my blog at http://kentgraziano.com (C) 2005-2014 Kent Graziano

Notas do Editor

  1. This is your opening slide.