SlideShare uma empresa Scribd logo
1 de 58
Baixar para ler offline
Data structures enable you to store and organize

data so that it can be used efficiently. But how do

you know to apply the correct one? There is a

difference between structuring master data,

reference data and analytics data. This webinar 

will discuss the various data structures available 

and when to use each one. We will show how 

data structures should support your organizational

strategy and how each method can contribute to

business value.
Learning Objectives:
• Application of correct data structures to fit business needs
• How different structures create different business value 



Date: July 8, 2014

Time: 2:00 PM ET

Presented by: Dave Marsh & Peter Aiken
Copyright 2013 by Data Blueprint
Welcome: Design/Manage Data Structures
1
Copyright 2013 by Data Blueprint
Get Social With Us!
Like Us on Facebook
www.facebook.com/
datablueprint
Post questions and comments
Find industry news, insightful
content
and event updates.
Join the Group
Data Management &
Business Intelligence
Ask questions, gain insights
and collaborate with fellow
data management
professionals
Live Twitter Feed
Join the conversation!
Follow us:
@datablueprint
@paiken
Ask questions and submit your
comments: #dataed
3
Presented by Dave Marsh & Peter Aiken, Ph.D.
Design & Manage Data Structures
Marco Level
Copyright 2013 by Data Blueprint
Your Presenters
Dave Marsh
• Lead Data 

Consultant, 

Data Blueprint
• 30+ Years experience
designing and building
solutions for the private and
public sectors.
• Architecture/Design
experience in:
- Transactional processing
- Shop floor automation
- Data Warehousing
- Identity Management
- Mobile
Peter Aiken
• 30+ years DM 

experience
• 9 books/many articles
• Experienced with 500+ data
management practices
• Multi-year immersions: US
DoD, Nokia, Deutsche
Bank, Wells Fargo, &
Commonwealth of VA
4
Copyright 2013 by Data Blueprint
• Context: Data Management/DAMA/DM BoK/CDMP?
• What is a data structure?
• Structured data storage, a bit of history and context
• Why are data structures important?
• Data Personas/Usage (interest over time)
• Data Topology and alignment to the data audience
• Internal data structures to fit the needs
• Q & A?
Outline: Design/Manage Data Structures
6
• Context: Data Management/DAMA/DM BoK/CDMP?
• What is a data structure?
• Structured data storage, a bit of history and context
• Why are data structures important?
• Data Personas/Usage (interest over time)
• Data Topology and alignment to the data audience
• Internal data structures to fit the needs
• Q & A?
Maslow's Hierarchy of Needs
Copyright 2013 by Data Blueprint
7
You can accomplish
Advanced Data Practices
without becoming
proficient in the Basic
Data Management
Practices however this
will:
• Take longer
• Cost more
• Deliver less
• Present 

greater

risk
Data Management Practices Hierarchy
Basic Data Management Practices
Advanced 

Data 

Practices
• MDM
• Mining
• Big Data
• Analytics
• Warehousing
• SOA
Data Program Management
Data Stewardship Data Development
Data Support Operations
Organizational Data Integration
Copyright 2013 by Data Blueprint
8
Data Program 

Coordination
Feedback
Data

Development
Copyright 2013 by Data Blueprint
Standard

Data
Data Management is an Integrated System of Five Practice Areas
Organizational Strategies
Goals
Business

Data
Business Value
Application 

Models &
Designs
Implementation
Direction
Guidance
9
Organizational

Data Integration
Data

Stewardship
Data Support

Operations
Data 

Asset Use
Integrated

Models
Leverage data in organizational activities
Data management
processes and
infrastructure
Combining multiple
assets to produce
extra value
Organizational-entity
subject area data
integration
Provide reliable
data access
Achieve sharing of data
within a business area
Copyright 2013 by Data Blueprint
Five Integrated DM Practice Areas
10
Manage data coherently.
Share data across boundaries.
Assign responsibilities for data.
Engineer data delivery systems.
Maintain data availability.
Data Program 

Coordination
Data

Development
Organizational
Data Integration
Data

Stewardship
Data Support

Operations
Copyright 2013 by Data Blueprint
DAMA DM BoK & CDMP Data Management Functions
• Published by DAMA International
– The professional association for Data
Managers (40 chapters worldwide)
– DMBoK organized around
– Primary data management functions focused
around data delivery to the organization (more
at dama.org)
– Organized around several environmental
elements
• CDMP
– Certified Data Management Professional
– DAMA International and ICCP
– Membership in a distinct group made up of
your fellow professionals
– Recognition for your specialized knowledge in
a choice of 17 specialty areas
– Series of 3 exams
– For more information, please visit:
• http://www.dama.org/i4a/pages/index.cfm?pageid=3399
• http://iccp.org/certification/designations/cdmp
11
Copyright 2013 by Data Blueprint
• Context: Data Management/DAMA/DM BoK/CDMP?
• What is a data structure?
• Structured data storage, a bit of history and context
• Why are data structures important?
• Data Personas/Usage (interest over time)
• Data Topology and alignment to the data audience
• Internal data structures to fit the needs
• Q & A?
Outline
13
Copyright 2013 by Data Blueprint
What is a data structure?
• "An organization of information, usually in memory, for better
algorithm efficiency, such as queue, stack, linked list, heap, dictionary,
and tree, or conceptual unity, such as the name and address of a
person. It may include redundant information, such as length of the
list or number of nodes in a subtree."
• Some data structure characteristics
– Grammar for data objects
• Grammar is the principles 

or rules of an art, science, 

or technique "a grammar 

of the theater"
– Constraints for data 

objects
– Sequential order
– Uniqueness
– Arrangement
• Hierarchical, relational, 

network, other
– Balance
– Optimality
http://www.nist.gov/dads/HTML/datastructur.html
14
Copyright 2013 by Data Blueprint
How are data structures expressed as architectures?
• Details are
organized into 

larger
components
• Larger
components are
organized into
models
• Models are
organized into
architectures
A B
C D
A B
C D
A
D
C
B
15
Copyright 2013 by Data Blueprint
How are data structures expressed as architectures?
• Attributes are organized into 

entities/objects
– Attributes are characteristics of "things"
– Entitles/objects are "things" whose 

information is managed in support of strategy
– Examples
• Entities/objects are organized into models
– Combinations of attributes and entities are 

structured to represent information requirements
– Poorly structured data, constrains organizational information delivery
capabilities
– Examples
• Models are organized into architectures
– When building new systems, architectures are used to plan development
– More often, data managers do not know what existing architectures are and -
therefore - cannot make use of them in support of strategy implementation
– Why no examples?
16
Copyright 2013 by Data Blueprint
Data
Data
Data
Information
Fact Meaning
Request
A Model Specifying Relationships Among Important Terms
[Built on definition by Dan Appleton 1983]
Intelligence
Strategic Use
1. Each FACT combines with one or more MEANINGS.
2. Each specific FACT and MEANING combination is referred to as a DATUM.
3. An INFORMATION is one or more DATA that are returned in response to a specific REQUEST
4. INFORMATION REUSE is enabled when one FACT is combined with more than one
MEANING.
5. INTELLIGENCE is INFORMATION associated with its USES.
Wisdom & knowledge are 

often used synonymously
Data
Data
Data Data
17
Copyright 2013 by Data Blueprint
• Context: Data Management/DAMA/DM BoK/CDMP?
• What is a data structure?
• Structured data storage, a bit of history and context
• Why are data structures important?
• Data Personas/Usage (interest over time)
• Data Topology and alignment to the data audience
• Internal data structures to fit the needs
• Q & A?
Outline
19
Copyright 2013 by Data Blueprint
History (such as it is)
• Automate existing manual 

processing
• Data management was:
– Running millions of punched 

cards through banks of sorting, 

collating & tabulating machines
– Results printed on paper or 

punched onto more cards
– Data management meant physically storing and hauling around
punched cards
• Tasks (check signing, calculating, and machine control)
were implemented to provide automated support for
departmental-based processing
• Creating information silos
• Data Processing Manager
20
Copyright 2013 by Data Blueprint
Chief Information Officer
21
Copyright 2013 by Data Blueprint
CFO Necessary Prerequisites/Qualifications
• CPA
• CMA
• Masters of Accountancy
• Other recognized
degrees/certifications
• These are necessary
but insufficient
prerequisites/
qualifications
22
Copyright 2013 by Data Blueprint
CIO Qualifications
• No specific qualifications
• Typically technological fields:
– Computer science
– Software engineering
– Information systems
• Business
– Master of Business Administration
– Master of Science in Management
• Business acumen and strategic perspectives have taken
precedence over technical skills.
– CIOs appointed from the business side of the organization
• Especially if they have project management skills.
23
Copyright 2013 by Data Blueprint
What do we teach knowledge workers about data?
What percentage of the deal with it daily?
24
Copyright 2013 by Data Blueprint
• Context: Data Management/DAMA/DM BoK/CDMP?
• What is a data structure?
• Structured data storage, a bit of history and context
• Why are data structures important?
• Data Personas/Usage (interest over time)
• Data Topology and alignment to the data audience
• Internal data structures to fit the needs
• Q & A?
Outline
26
Copyright 2013 by Data Blueprint
Data Leverage
Less ROT
Technologies
Process
People
• Permits organizations to better manage their sole non-
depleteable, non-degrading, durable, strategic asset - data
– within the organization, and
– with organizational data exchange partners
• Leverage
– Obtained by implementation of data-centric technologies, processes, and
human skill sets
– Increased by elimination of data ROT (redundant, obsolete, or trivial)
• The bigger the organization, the greater potential leverage exists
• Treating data more asset-like simultaneously
1. lowers organizational IT costs and
2. increases organizational knowledge worker productivity
27
Copyright 2013 by Data Blueprint
Data Structure Questions
Program F
Program E
Program D
Program G
Program H
Program I
Application
domain 2Application
domain 3
• Who makes decisions about the range and scope of
common data usage?
28
Copyright 2013 by Data Blueprint
Running Query
29
Copyright 2013 by Data Blueprint
Optimized Query
30
Copyright 2013 by Data Blueprint
Repeat 100s, thousands, millions of times ...
31
Death by 1000 Cuts
Copyright 2013 by Data Blueprint
32
Copyright 2013 by Data Blueprint
5 Basic Data Structures
Indexed Sequential File: Built-in index permits location of
records of persons with last names starting with "T"
Index
Program: Where is the record for person
"Townsend?"
Index: Start looking here where the
"Ts" are stored
Relational Database: Records are related to
each other using relationships describable using relational
algebra
Flat File: Records are typically sorted
according to some criteria and must be
searched from the beginning for each access
Program: Must start at the beginning
and read each record when looking for
person "Townsend?"
Network Database: Records are related to each
other using arranged master records associated with
multiple detail records using linked lists and pointers Associative
Concept-oriented
Multi-dimensional
XML database

3NF

Star schema

Data Vault
Hierarchical Database: Records are related to each other
hierarchically using 'parent child' relationships
33
Copyright 2013 by Data Blueprint
Data structures organized into an Architecture
• How do data structures support
organizational strategy?
• Consider the opposite question?
– Were your systems explicitly designed to be
integrated or otherwise work together?
– If not, then what is the likelihood that they will
work well together?
– In all likelihood your organization is spending
between 20-40% of its IT budget compensating
for poor data structure integration
– They cannot be helpful as long as their structure
is unknown
• Two answers/two separate strategies
– Achieving efficiency and 

effectiveness goals
– Providing organizational dexterity for rapid
implementation
34
Copyright 2013 by Data Blueprint
Single
Data Store
No Single Data Store
• The thought of a single monolithic data
store which can service all of an
organization’s information needs has long
since been abandoned. In the modern
data management topology, multiple data
stores are created to service specific
processing needs and user groups within
the organization.
• Implications:
• The needs characteristics of the multitude
of the audiences served by the data
structures
• Data lifecycle
• The design styles (old and new) utilized to
organize the data to service the audiences
• A breakdown of the various stores
• The resultant store characteristics
35
Copyright 2013 by Data Blueprint
Conclusions
• 1 is not enough
• Most
organizations
have far to many
different data
structures and
they become
barriers to
progress and
integration
• Not much
expertise to figure
out these
challenges
36
Copyright 2013 by Data Blueprint
• Context: Data Management/DAMA/DM BoK/CDMP?
• What is a data structure?
• Structured data storage, a bit of history and context
• Why are data structures important?
• Data Personas/Usage (interest over time)
• Data Topology and alignment to the data audience
• Internal data structures to fit the needs
• Q & A?
Outline
38
Copyright 2013 by Data Blueprint
Data Personas (The Requirements)
Operational
Performer
Interested in alerts,
notifications and
reporting based on
current values (real-
time) data. They use the
information to make
decisions and changes
in the transactional
systems. These
changes are targeted to
improve the
organizations ability to
deliver in the short term.
Operational Analyst
(Manager)
Interested in aggregated
real-time data for their
domain of responsibility.
The data is displayed
using visualization
techniques of
scorecards, charts and
reports, preferably within
a single dashboard. The
searching is for
favorable/unfavorable
trends to indicate
adjustments are needed
in the staff & resource
allocations.
Data Analyst
Responsible to support
detailed and typically
complex analysis
requests from business
users/consumers of
data. The analyst role
span both the
operational and
historical time windows
and thus they need to be
versed in both the
operational and analytic
environments.
Data Miner/
Scientist
Responsible for using
statistical and machine
learning techniques to
identify patterns from
the data. These patterns
are correlated into
insights and actions for
better business
outcomes. The miner
may use operational
and historical data for
research.
Executive Consumer
Receives the data
through summary
dashboards with drill
down/through
capabilities. Request
detailed analysis and
reporting on High Value
Question from the Data
Analyst and Data
Miners. These
consumers are looking
at the data to make
short and long term
decisions to improve the
organizational efficiency
and customer
experience.
Operational Analytic
39
• Operational interest is high when data is introduced to the
operational stores. This interest wanes over time.
• Analytic interest is low when data is first introduced. The
interest increases as the data is collected and combined
with other enterprise data.
Copyright 2013 by Data Blueprint
Persona Data Interest
Operational
Interest
Analytic
Interest
Interest
Time
40
Copyright 2013 by Data Blueprint
• Context: Data Management/DAMA/DM BoK/CDMP?
• What is a data structure?
• Structured data storage, a bit of history and context
• Why are data structures important?
• Data Personas/Usage (interest over time)
• Data Topology and alignment to the data audience
• Internal data structures to fit the needs
• Q & A?
Outline
42
Copyright 2013 by Data Blueprint
Data Topology Today
43
Copyright 2013 by Data Blueprint
Data Store Purpose a review of the Data Topology
• Master Data
– Master Data is the term used to describe the data domains that drive
business activities. Master data is the data that must first be in
place before business transactions can occur. Master data is often
shared across the organizational business units and it is typically at
the center of business strategies. The transaction defines the
business/process event (order, dispatch, sales) while the Master
Data describes the ‘who’ (customers, drivers, account reps), the
‘what’ (load), the ‘when’ (date, time) and the ‘where’ (origin and
destination location).
• Online Transaction Processing (OLTP)
– “Transactional data” is the term used to describe the data involved in
the execution of the business activities. Transactional data
associates master data (i.e. customers and products) to a business
activity that often represents a unit or work, such as the creation of
an order.
• The Master Data and OLTP stores are where data is initially created and
persisted within the organization’s data and thus carry a special
classification of System of Record (SOR). They are created to capture
the transactional data as it arrives and makes the data available for the
processes and services. The data arrives into these databases through
manual entry or automated feeds. These data stores are logically (and
sometimes physically) separated by the transactional subject area they
are created to serve.
OLTP1
OLTP2
OLTPn...
Master
Data
44
Copyright 2013 by Data Blueprint
Data Store Purpose a review of the Data Topology
• Operational Data Store (ODS)
– An Operational Data Store (ODS) is created to integrate data from
two or more SORs for the purposes of data integration. The ODS is
normally created to satisfy reporting needs across functional SOR
boundaries. The ODS should hold very little historical information and
should focus on maintaining the most up-to-date data needed by the
organization for daily operations. Depending on the application
requirements, the ODS may institute a near real-time data feed from
the source applications. The ODS is expected to be technically
accurate and is considered to be an Authoritative Source. The data it
contains can be used for non-critical needs instead of having to
access the SOR. The more frequently the data is pushed into the
ODS environment, the less reliance there will be on direct access to
SORs for data reporting needs.
• Enterprise Data Warehouse (EDW)
– An Enterprise Data Warehouse (EDW) is responsible for collection
and integration of data from either SORs or from the Operational
Data Store. An EDW has an enterprise scope as it will pull from many
(if not all) SORs. The focus of the data warehouse is to be historical
in nature and in many instances is loaded with a latency (every 24
hours). The data warehouse is created to support historical analytics.
The expectation of the data warehouse is to be exhaustive in the data
it collects with a focus being on collecting and storing of the data.
EnterpriseData
Warehouse
(EDW)
Operational
DataStore
(ODS)
45
Copyright 2013 by Data Blueprint
Data Store Purpose a review of the Data Topology
• Data Marts
– A Data Mart is a subset of a data warehouse, it
is created to address specific questions and/or
subject area of questions. A Data Mart is built
and tuned to deliver the data to the end users,
it exists to get the data out from the data
warehouse.
Data Mart
46
Copyright 2013 by Data Blueprint
Data Store Purpose a review of the Data Topology
• Event Data Store
– Is the data store which logs, stores and reports the
discrete business and technical events which occur within
the process. This data store is a critical, and often
overlooked data domain for managing, controlling and
creating transparency into the business processes. The
events are used to report out the overall health of the
processes in both business and technical terms. This
consolidated solution is key to obtaining a 360 view of the
processes.
• Metadata Store
– Metadata is a broad term which includes descriptive
elements in both business and technical terms. It covers:
business terms, data elements descriptions, element
display formats, element valid values, element quality
targets, etc. Metadata is critical to an organization as it
describes the organization’s business and processing
infrastructure in detail. Metadata is entertainingly defined
as “data about the data”. That is, Metadata characterizes
other data and makes it easier to retrieve, interpret and
use information.
Technical
Metadata
Metadata
Store
Business
Metadata
Event
Data
Store
BusOPS
Events
TechOPS
Events
47
Operational i

n 



c

o

n

t

r

a

s

t 



w

i

t

h
Analytic
Subject-Oriented
Databases which are focused on
a single or small set of business
functions
Integrated
Collecting and semantically
aligning data from disparate
sources to achieve a
homogeneous viewVolatile
Data which may change
frequently
Non-Volatile
Data for which entered into the
database will not change
Atomic
Low grain data, each transaction,
each order with all of the
attributes
Aggregate
A summary of multiple orders or
transactions performed to
transform the atomic detail into
more comprehensible information
Current Valued: The data and
the system represents what is
current in this moment; not
yesterday, not last week --- now
Time Variant Data: is marked
and stored with a date/time
element where questions of what
was it yesterday and last week
can be answered
Copyright 2013 by Data Blueprint
Data Store Characteristics
48
Copyright 2013 by Data Blueprint
• Context: Data Management/DAMA/DM BoK/CDMP?
• What is a data structure?
• Structured data storage, a bit of history and context
• Why are data structures important?
• Data Personas/Usage (interest over time)
• Data Topology and alignment to the data audience
• Internal data structures to fit the needs
• Q & A?
Outline
50
• 3rd Normal Form (3NF)
– Inmon
• Dimensional
– Kimball
• Data Vault
– Lindstad
Copyright 2013 by Data Blueprint
Data Structure Design Styles
51
• 3rd Normal Form Modeling
• A mathematical data design 

technique founded in the early 

70s by E.F. Codd.
• Organizes data in simple rows 

and columns - Entities
• Creates connections between the 

entities called relationships to show how the data is inter-related
• It is purest form 3NF removes all data redundancies – a piece of
data is stored only once
• 3NF is based on mathematics, give the same facts to different
modelers; the model should be the same.
• Creates a visual (Entity Relation Diagram - ERD) which may be
understood by less technical personnel
• 3NF is the modeling style most popularly used for operationally
focused data stores.
Copyright 2013 by Data Blueprint
Design Styles – 3NF
52
Copyright 2013 by Data Blueprint
Design Styles – Dimensional Modeling
• A data design approach create and
refined by Ralph Kimball in the 80s
• Organizes data in Facts and
Dimensions
– Fact tables record the events (what)
within the 

business domain
– Dimension tables describing who,
when, how and where
• Created to exploit the capabilities
of the relational database to
retrieve and report against large
volumes of data.
• There are 2 variations to
Dimensional Modeling:
– Star Schema
– Snowflake
53
Copyright 2013 by Data Blueprint
Design Styles – Data Vault
• Newest of the relational database modeling techniques.
• Conceived in the 1990s by Dan Linstedt
• Focuses on linking the data from multiple disparate
locations without forcing the data to be semantically
aligned
NOTE:
There is a Data Ed presentation schedule for 14 October
2014 to cover the details of Data Vault designs!
54
DATA STORE AUDIENCE SERVED BUILD CHARACTERISTICS DESIGN STYLE
O

P

E

R

A

T

I

O

N

A

L
Master Data
OLTP
ODS
Event
A

N

A

L

Y

T

I

C
Data Warehouse
Data Mart
Copyright 2013 by Data Blueprint
Summary/Take Aways
DATA STORE AUDIENCE SERVED BUILD CHARACTERISTICS DESIGN STYLE
O

P

E

R

A

T

I

O

N

A

L
Master Data
Operations Manager
Operational Analyst
Subject Oriented
Volatile
Atomic
Current Valued
3NF
OLTP
Operational Performer
Operations Manager
Subject Oriented
Volatile
Atomic
Current Valued
3NF
ODS
Operational Manager
Operational Analyst
Executive Consumer
Integrated
Volatile
Atomic
Current Valued
3NF
Event All Personas
Integrated
Volatile
Atomic
Current Valued
3NF
A

N

A

L

Y

T

I

C
Data Warehouse Data Miner/Scientist
Integrated
Non-volatile
Atomic
Time Variant
3NF trending to
Data Vault
Data Mart
Operational Analyst
Data Analyst
Executive Consumer
Subject Oriented
Non-volatile
Atomic -or- Aggregated
Time Variant
Dimensional
55
Copyright 2013 by Data Blueprint
• Context: Data Management/DAMA/DM BoK/CDMP?
• What is a data structure?
• Structured data storage, a bit of history and context
• Why are data structures important?
• Data Personas/Usage (interest over time)
• Data Topology and alignment to the data audience
• Internal data structures to fit the needs
• Q & A?
Outline: Design/Manage Data Structures
56
Copyright 2013 by Data Blueprint
Questions to Ask
• Are you ready for a data
warehouse?
• Foundational Practices
• Is the business environment
constantly evolving?
• Will you get it right the first time?
• Do you have an agreed upon
enterprise-wide vocabulary
• Is your data warehouse intended to
be the enterprise audit-able system
of record?
• Extract, Transform and Load
• Data Transformations
• How fast do you need results?
• Performance of inserts vs reads
• Project deliverables
57
Copyright 2013 by Data Blueprint
Upcoming Events
August Webinar:
Data Management Maturity
August 12, 2014 @ 2:00 PM ET/11:00 AM PT
!
Sign up here:
• www.datablueprint.com/webinar-schedule
• www.Dataversity.net
!
!
!
!
!
!
!
!
!
!
!
Brought to you by:
58
Copyright 2013 by Data Blueprint
Questions?
+ =
59
Copyright 2013 by Data Blueprint
Why Architectural Models?
• Would you build a house without an architecture sketch?
• Would you like to have an estimate how much your new house is going to cost?
• If you hired a set of constructors from all over the world to build your house, would you
like them to have a common language?
• Would you like to verify the proposals of the construction team before the work gets
started?
• If it was a great house, would you like to build something rather similar again, in
another place?
• Would you drill into a wall of your house without a map of the plumbing and electric
lines?
• Model is the sketch of the system to be built in a project.
• Your model gives you a very good idea of how demanding the implementation work is
going to be!
• Model is the common language for the project team.
• Models can be reviewed before thousands of hours of implementation work will be
done.
• It is possible to implement the system to various platforms using the same model.
• Models document the system built in a project. This makes life easier for the support
and maintenance!
Would you build a house without an architecture
sketch?
Model is the sketch of the system to be built in a
project.
Would you like to have an estimate how much your
new house is going to cost?
Your model gives you a very good idea of how
demanding the implementation work is going to be!
If you hired a set of constructors from all over the
world to build your house, would you like them to
have a common language?
Model is the common language for the project team.
Would you like to verify the proposals of the
construction team before the work gets started?
Models can be reviewed before thousands of hours
of implementation work will be done.
If it was a great house, would you like to build
something rather similar again, in another place?
It is possible to implement the system to various
platforms using the same model.
Would you drill into a wall of your house without a
map of the plumbing and electric lines?
Models document the system built in a project. This
makes life easier for the support and maintenance!
60
Copyright 2013 by Data Blueprint
Inmon Implementation
61
Copyright 2013 by Data Blueprint
Kimball Implementation
62
Copyright 2013 by Data Blueprint
Data Vault Implementation
63
Copyright 2013 by Data Blueprint
Hybrid Approach
• (http://www.kimballgroup.com/2004/03/03/differences-of-opinion/)
• Learn Data Vault – “dv-in-kimball-bus-architecture”
64
10124 W. Broad Street, Suite C
Glen Allen, Virginia 23060
804.521.4056

Mais conteúdo relacionado

Mais procurados

Data Quality Strategies
Data Quality StrategiesData Quality Strategies
Data Quality StrategiesDATAVERSITY
 
Everybody is a Data Steward – Get Over It!
Everybody is a Data Steward – Get Over It!Everybody is a Data Steward – Get Over It!
Everybody is a Data Steward – Get Over It!DATAVERSITY
 
DataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data GovernanceDataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data GovernanceDATAVERSITY
 
Convincing Stakeholders Data Governance Is Essential
Convincing Stakeholders Data Governance Is EssentialConvincing Stakeholders Data Governance Is Essential
Convincing Stakeholders Data Governance Is EssentialDATAVERSITY
 
Lead Your Data Revolution - How to Build a Foundation of Trust and Data Gover...
Lead Your Data Revolution - How to Build a Foundation of Trust and Data Gover...Lead Your Data Revolution - How to Build a Foundation of Trust and Data Gover...
Lead Your Data Revolution - How to Build a Foundation of Trust and Data Gover...DATAVERSITY
 
Essential Metadata Strategies
Essential Metadata StrategiesEssential Metadata Strategies
Essential Metadata StrategiesDATAVERSITY
 
Data Architecture Strategies
Data Architecture StrategiesData Architecture Strategies
Data Architecture StrategiesDATAVERSITY
 
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesData-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesDATAVERSITY
 
Data Governance Strategies - With Great Power Comes Great Accountability
Data Governance Strategies - With Great Power Comes Great AccountabilityData Governance Strategies - With Great Power Comes Great Accountability
Data Governance Strategies - With Great Power Comes Great AccountabilityDATAVERSITY
 
DataEd Slides: Approaching Data Governance Strategically
DataEd Slides: Approaching Data Governance StrategicallyDataEd Slides: Approaching Data Governance Strategically
DataEd Slides: Approaching Data Governance StrategicallyDATAVERSITY
 
Business Value Through Reference and Master Data Strategies
Business Value Through Reference and Master Data StrategiesBusiness Value Through Reference and Master Data Strategies
Business Value Through Reference and Master Data StrategiesDATAVERSITY
 
Data Architecture vs Data Modeling
Data Architecture vs Data ModelingData Architecture vs Data Modeling
Data Architecture vs Data ModelingDATAVERSITY
 
Master Data Management - Practical Strategies for Integrating into Your Data ...
Master Data Management - Practical Strategies for Integrating into Your Data ...Master Data Management - Practical Strategies for Integrating into Your Data ...
Master Data Management - Practical Strategies for Integrating into Your Data ...DATAVERSITY
 
Data Management vs Data Strategy
Data Management vs Data StrategyData Management vs Data Strategy
Data Management vs Data StrategyDATAVERSITY
 
2011 digital trends webinar presentation
2011 digital trends webinar presentation2011 digital trends webinar presentation
2011 digital trends webinar presentationEconsultancy
 
RWDG Slides: The Stewardship Approach to Data Governance
RWDG Slides: The Stewardship Approach to Data GovernanceRWDG Slides: The Stewardship Approach to Data Governance
RWDG Slides: The Stewardship Approach to Data GovernanceDATAVERSITY
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
 
The future of bi isn't a bi tool
The future of bi isn't a bi toolThe future of bi isn't a bi tool
The future of bi isn't a bi toolDATAVERSITY
 
DataEd Slides: Growing Practical Data Governance Programs
DataEd Slides: Growing Practical Data Governance ProgramsDataEd Slides: Growing Practical Data Governance Programs
DataEd Slides: Growing Practical Data Governance ProgramsDATAVERSITY
 

Mais procurados (20)

Data Quality Strategies
Data Quality StrategiesData Quality Strategies
Data Quality Strategies
 
Everybody is a Data Steward – Get Over It!
Everybody is a Data Steward – Get Over It!Everybody is a Data Steward – Get Over It!
Everybody is a Data Steward – Get Over It!
 
DataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data GovernanceDataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data Governance
 
Convincing Stakeholders Data Governance Is Essential
Convincing Stakeholders Data Governance Is EssentialConvincing Stakeholders Data Governance Is Essential
Convincing Stakeholders Data Governance Is Essential
 
Lead Your Data Revolution - How to Build a Foundation of Trust and Data Gover...
Lead Your Data Revolution - How to Build a Foundation of Trust and Data Gover...Lead Your Data Revolution - How to Build a Foundation of Trust and Data Gover...
Lead Your Data Revolution - How to Build a Foundation of Trust and Data Gover...
 
Essential Metadata Strategies
Essential Metadata StrategiesEssential Metadata Strategies
Essential Metadata Strategies
 
Data Architecture Strategies
Data Architecture StrategiesData Architecture Strategies
Data Architecture Strategies
 
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesData-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance Strategies
 
Data Governance Strategies - With Great Power Comes Great Accountability
Data Governance Strategies - With Great Power Comes Great AccountabilityData Governance Strategies - With Great Power Comes Great Accountability
Data Governance Strategies - With Great Power Comes Great Accountability
 
DataEd Slides: Approaching Data Governance Strategically
DataEd Slides: Approaching Data Governance StrategicallyDataEd Slides: Approaching Data Governance Strategically
DataEd Slides: Approaching Data Governance Strategically
 
Business Value Through Reference and Master Data Strategies
Business Value Through Reference and Master Data StrategiesBusiness Value Through Reference and Master Data Strategies
Business Value Through Reference and Master Data Strategies
 
Data Architecture vs Data Modeling
Data Architecture vs Data ModelingData Architecture vs Data Modeling
Data Architecture vs Data Modeling
 
Master Data Management - Practical Strategies for Integrating into Your Data ...
Master Data Management - Practical Strategies for Integrating into Your Data ...Master Data Management - Practical Strategies for Integrating into Your Data ...
Master Data Management - Practical Strategies for Integrating into Your Data ...
 
Data Management vs Data Strategy
Data Management vs Data StrategyData Management vs Data Strategy
Data Management vs Data Strategy
 
2011 digital trends webinar presentation
2011 digital trends webinar presentation2011 digital trends webinar presentation
2011 digital trends webinar presentation
 
RWDG Slides: The Stewardship Approach to Data Governance
RWDG Slides: The Stewardship Approach to Data GovernanceRWDG Slides: The Stewardship Approach to Data Governance
RWDG Slides: The Stewardship Approach to Data Governance
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
The future of bi isn't a bi tool
The future of bi isn't a bi toolThe future of bi isn't a bi tool
The future of bi isn't a bi tool
 
DataEd Slides: Growing Practical Data Governance Programs
DataEd Slides: Growing Practical Data Governance ProgramsDataEd Slides: Growing Practical Data Governance Programs
DataEd Slides: Growing Practical Data Governance Programs
 

Semelhante a Data-Ed Webinar: Design & Manage Data Structures

Data Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s HomeData Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s HomeDATAVERSITY
 
Data Systems Integration & Business Value Pt. 1: Metadata
Data Systems Integration & Business Value Pt. 1: MetadataData Systems Integration & Business Value Pt. 1: Metadata
Data Systems Integration & Business Value Pt. 1: MetadataDATAVERSITY
 
Data-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: MetadataData-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: MetadataData Blueprint
 
DataEd Slides: Data Modeling is Fundamental
DataEd Slides:  Data Modeling is FundamentalDataEd Slides:  Data Modeling is Fundamental
DataEd Slides: Data Modeling is FundamentalDATAVERSITY
 
Data Systems Integration & Business Value Pt. 2: Cloud
Data Systems Integration & Business Value Pt. 2: CloudData Systems Integration & Business Value Pt. 2: Cloud
Data Systems Integration & Business Value Pt. 2: CloudData Blueprint
 
Data Systems Integration & Business Value Pt. 2: Cloud
Data Systems Integration & Business Value Pt. 2: CloudData Systems Integration & Business Value Pt. 2: Cloud
Data Systems Integration & Business Value Pt. 2: CloudDATAVERSITY
 
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data GardenData-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data GardenDATAVERSITY
 
Data-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringData-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringDATAVERSITY
 
Data-Ed Online Webinar: Data Architecture Requirements
Data-Ed Online Webinar: Data Architecture RequirementsData-Ed Online Webinar: Data Architecture Requirements
Data-Ed Online Webinar: Data Architecture RequirementsDATAVERSITY
 
Data-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsData-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsDATAVERSITY
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements  Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements Data Blueprint
 
DataEd Slides: Data Architecture versus Data Modeling
DataEd Slides:  Data Architecture versus Data ModelingDataEd Slides:  Data Architecture versus Data Modeling
DataEd Slides: Data Architecture versus Data ModelingDATAVERSITY
 
Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data Blueprint
 
Data-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMData-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMDATAVERSITY
 
The Importance of MDM - Eternal Management of the Data Mind
The Importance of MDM - Eternal Management of the Data MindThe Importance of MDM - Eternal Management of the Data Mind
The Importance of MDM - Eternal Management of the Data MindDATAVERSITY
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements Data Blueprint
 
Data-Ed Online: Data Architecture Requirements
Data-Ed Online: Data Architecture RequirementsData-Ed Online: Data Architecture Requirements
Data-Ed Online: Data Architecture RequirementsDATAVERSITY
 
Data Systems Integration & Business Value PT. 3: Warehousing
Data Systems Integration & Business Value PT. 3: Warehousing Data Systems Integration & Business Value PT. 3: Warehousing
Data Systems Integration & Business Value PT. 3: Warehousing Data Blueprint
 
Data-Ed: Data Systems Integration & Business Value Pt. 3: Warehousing
Data-Ed: Data Systems Integration & Business Value Pt. 3: WarehousingData-Ed: Data Systems Integration & Business Value Pt. 3: Warehousing
Data-Ed: Data Systems Integration & Business Value Pt. 3: WarehousingDATAVERSITY
 

Semelhante a Data-Ed Webinar: Design & Manage Data Structures (20)

Data Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s HomeData Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s Home
 
Data Systems Integration & Business Value Pt. 1: Metadata
Data Systems Integration & Business Value Pt. 1: MetadataData Systems Integration & Business Value Pt. 1: Metadata
Data Systems Integration & Business Value Pt. 1: Metadata
 
Data-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: MetadataData-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: Metadata
 
DataEd Slides: Data Modeling is Fundamental
DataEd Slides:  Data Modeling is FundamentalDataEd Slides:  Data Modeling is Fundamental
DataEd Slides: Data Modeling is Fundamental
 
Data Systems Integration & Business Value Pt. 2: Cloud
Data Systems Integration & Business Value Pt. 2: CloudData Systems Integration & Business Value Pt. 2: Cloud
Data Systems Integration & Business Value Pt. 2: Cloud
 
Data Systems Integration & Business Value Pt. 2: Cloud
Data Systems Integration & Business Value Pt. 2: CloudData Systems Integration & Business Value Pt. 2: Cloud
Data Systems Integration & Business Value Pt. 2: Cloud
 
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data GardenData-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
 
Data-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringData-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality Engineering
 
2014 dqe handouts
2014 dqe handouts2014 dqe handouts
2014 dqe handouts
 
Data-Ed Online Webinar: Data Architecture Requirements
Data-Ed Online Webinar: Data Architecture RequirementsData-Ed Online Webinar: Data Architecture Requirements
Data-Ed Online Webinar: Data Architecture Requirements
 
Data-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsData-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture Requirements
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements  Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements
 
DataEd Slides: Data Architecture versus Data Modeling
DataEd Slides:  Data Architecture versus Data ModelingDataEd Slides:  Data Architecture versus Data Modeling
DataEd Slides: Data Architecture versus Data Modeling
 
Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM
 
Data-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMData-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDM
 
The Importance of MDM - Eternal Management of the Data Mind
The Importance of MDM - Eternal Management of the Data MindThe Importance of MDM - Eternal Management of the Data Mind
The Importance of MDM - Eternal Management of the Data Mind
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements
 
Data-Ed Online: Data Architecture Requirements
Data-Ed Online: Data Architecture RequirementsData-Ed Online: Data Architecture Requirements
Data-Ed Online: Data Architecture Requirements
 
Data Systems Integration & Business Value PT. 3: Warehousing
Data Systems Integration & Business Value PT. 3: Warehousing Data Systems Integration & Business Value PT. 3: Warehousing
Data Systems Integration & Business Value PT. 3: Warehousing
 
Data-Ed: Data Systems Integration & Business Value Pt. 3: Warehousing
Data-Ed: Data Systems Integration & Business Value Pt. 3: WarehousingData-Ed: Data Systems Integration & Business Value Pt. 3: Warehousing
Data-Ed: Data Systems Integration & Business Value Pt. 3: Warehousing
 

Mais de DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 

Mais de DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 

Último

Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 

Último (20)

Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 

Data-Ed Webinar: Design & Manage Data Structures

  • 1. Data structures enable you to store and organize
 data so that it can be used efficiently. But how do
 you know to apply the correct one? There is a
 difference between structuring master data,
 reference data and analytics data. This webinar 
 will discuss the various data structures available 
 and when to use each one. We will show how 
 data structures should support your organizational
 strategy and how each method can contribute to
 business value. Learning Objectives: • Application of correct data structures to fit business needs • How different structures create different business value 
 
 Date: July 8, 2014
 Time: 2:00 PM ET
 Presented by: Dave Marsh & Peter Aiken Copyright 2013 by Data Blueprint Welcome: Design/Manage Data Structures 1
  • 2. Copyright 2013 by Data Blueprint Get Social With Us! Like Us on Facebook www.facebook.com/ datablueprint Post questions and comments Find industry news, insightful content and event updates. Join the Group Data Management & Business Intelligence Ask questions, gain insights and collaborate with fellow data management professionals Live Twitter Feed Join the conversation! Follow us: @datablueprint @paiken Ask questions and submit your comments: #dataed 3
  • 3. Presented by Dave Marsh & Peter Aiken, Ph.D. Design & Manage Data Structures Marco Level
  • 4. Copyright 2013 by Data Blueprint Your Presenters Dave Marsh • Lead Data 
 Consultant, 
 Data Blueprint • 30+ Years experience designing and building solutions for the private and public sectors. • Architecture/Design experience in: - Transactional processing - Shop floor automation - Data Warehousing - Identity Management - Mobile Peter Aiken • 30+ years DM 
 experience • 9 books/many articles • Experienced with 500+ data management practices • Multi-year immersions: US DoD, Nokia, Deutsche Bank, Wells Fargo, & Commonwealth of VA 4
  • 5. Copyright 2013 by Data Blueprint • Context: Data Management/DAMA/DM BoK/CDMP? • What is a data structure? • Structured data storage, a bit of history and context • Why are data structures important? • Data Personas/Usage (interest over time) • Data Topology and alignment to the data audience • Internal data structures to fit the needs • Q & A? Outline: Design/Manage Data Structures 6 • Context: Data Management/DAMA/DM BoK/CDMP? • What is a data structure? • Structured data storage, a bit of history and context • Why are data structures important? • Data Personas/Usage (interest over time) • Data Topology and alignment to the data audience • Internal data structures to fit the needs • Q & A?
  • 6. Maslow's Hierarchy of Needs Copyright 2013 by Data Blueprint 7
  • 7. You can accomplish Advanced Data Practices without becoming proficient in the Basic Data Management Practices however this will: • Take longer • Cost more • Deliver less • Present 
 greater
 risk Data Management Practices Hierarchy Basic Data Management Practices Advanced 
 Data 
 Practices • MDM • Mining • Big Data • Analytics • Warehousing • SOA Data Program Management Data Stewardship Data Development Data Support Operations Organizational Data Integration Copyright 2013 by Data Blueprint 8
  • 8. Data Program 
 Coordination Feedback Data
 Development Copyright 2013 by Data Blueprint Standard
 Data Data Management is an Integrated System of Five Practice Areas Organizational Strategies Goals Business
 Data Business Value Application 
 Models & Designs Implementation Direction Guidance 9 Organizational
 Data Integration Data
 Stewardship Data Support
 Operations Data 
 Asset Use Integrated
 Models Leverage data in organizational activities Data management processes and infrastructure Combining multiple assets to produce extra value Organizational-entity subject area data integration Provide reliable data access Achieve sharing of data within a business area
  • 9. Copyright 2013 by Data Blueprint Five Integrated DM Practice Areas 10 Manage data coherently. Share data across boundaries. Assign responsibilities for data. Engineer data delivery systems. Maintain data availability. Data Program 
 Coordination Data
 Development Organizational Data Integration Data
 Stewardship Data Support
 Operations
  • 10. Copyright 2013 by Data Blueprint DAMA DM BoK & CDMP Data Management Functions • Published by DAMA International – The professional association for Data Managers (40 chapters worldwide) – DMBoK organized around – Primary data management functions focused around data delivery to the organization (more at dama.org) – Organized around several environmental elements • CDMP – Certified Data Management Professional – DAMA International and ICCP – Membership in a distinct group made up of your fellow professionals – Recognition for your specialized knowledge in a choice of 17 specialty areas – Series of 3 exams – For more information, please visit: • http://www.dama.org/i4a/pages/index.cfm?pageid=3399 • http://iccp.org/certification/designations/cdmp 11
  • 11. Copyright 2013 by Data Blueprint • Context: Data Management/DAMA/DM BoK/CDMP? • What is a data structure? • Structured data storage, a bit of history and context • Why are data structures important? • Data Personas/Usage (interest over time) • Data Topology and alignment to the data audience • Internal data structures to fit the needs • Q & A? Outline 13
  • 12. Copyright 2013 by Data Blueprint What is a data structure? • "An organization of information, usually in memory, for better algorithm efficiency, such as queue, stack, linked list, heap, dictionary, and tree, or conceptual unity, such as the name and address of a person. It may include redundant information, such as length of the list or number of nodes in a subtree." • Some data structure characteristics – Grammar for data objects • Grammar is the principles 
 or rules of an art, science, 
 or technique "a grammar 
 of the theater" – Constraints for data 
 objects – Sequential order – Uniqueness – Arrangement • Hierarchical, relational, 
 network, other – Balance – Optimality http://www.nist.gov/dads/HTML/datastructur.html 14
  • 13. Copyright 2013 by Data Blueprint How are data structures expressed as architectures? • Details are organized into 
 larger components • Larger components are organized into models • Models are organized into architectures A B C D A B C D A D C B 15
  • 14. Copyright 2013 by Data Blueprint How are data structures expressed as architectures? • Attributes are organized into 
 entities/objects – Attributes are characteristics of "things" – Entitles/objects are "things" whose 
 information is managed in support of strategy – Examples • Entities/objects are organized into models – Combinations of attributes and entities are 
 structured to represent information requirements – Poorly structured data, constrains organizational information delivery capabilities – Examples • Models are organized into architectures – When building new systems, architectures are used to plan development – More often, data managers do not know what existing architectures are and - therefore - cannot make use of them in support of strategy implementation – Why no examples? 16
  • 15. Copyright 2013 by Data Blueprint Data Data Data Information Fact Meaning Request A Model Specifying Relationships Among Important Terms [Built on definition by Dan Appleton 1983] Intelligence Strategic Use 1. Each FACT combines with one or more MEANINGS. 2. Each specific FACT and MEANING combination is referred to as a DATUM. 3. An INFORMATION is one or more DATA that are returned in response to a specific REQUEST 4. INFORMATION REUSE is enabled when one FACT is combined with more than one MEANING. 5. INTELLIGENCE is INFORMATION associated with its USES. Wisdom & knowledge are 
 often used synonymously Data Data Data Data 17
  • 16. Copyright 2013 by Data Blueprint • Context: Data Management/DAMA/DM BoK/CDMP? • What is a data structure? • Structured data storage, a bit of history and context • Why are data structures important? • Data Personas/Usage (interest over time) • Data Topology and alignment to the data audience • Internal data structures to fit the needs • Q & A? Outline 19
  • 17. Copyright 2013 by Data Blueprint History (such as it is) • Automate existing manual 
 processing • Data management was: – Running millions of punched 
 cards through banks of sorting, 
 collating & tabulating machines – Results printed on paper or 
 punched onto more cards – Data management meant physically storing and hauling around punched cards • Tasks (check signing, calculating, and machine control) were implemented to provide automated support for departmental-based processing • Creating information silos • Data Processing Manager 20
  • 18. Copyright 2013 by Data Blueprint Chief Information Officer 21
  • 19. Copyright 2013 by Data Blueprint CFO Necessary Prerequisites/Qualifications • CPA • CMA • Masters of Accountancy • Other recognized degrees/certifications • These are necessary but insufficient prerequisites/ qualifications 22
  • 20. Copyright 2013 by Data Blueprint CIO Qualifications • No specific qualifications • Typically technological fields: – Computer science – Software engineering – Information systems • Business – Master of Business Administration – Master of Science in Management • Business acumen and strategic perspectives have taken precedence over technical skills. – CIOs appointed from the business side of the organization • Especially if they have project management skills. 23
  • 21. Copyright 2013 by Data Blueprint What do we teach knowledge workers about data? What percentage of the deal with it daily? 24
  • 22. Copyright 2013 by Data Blueprint • Context: Data Management/DAMA/DM BoK/CDMP? • What is a data structure? • Structured data storage, a bit of history and context • Why are data structures important? • Data Personas/Usage (interest over time) • Data Topology and alignment to the data audience • Internal data structures to fit the needs • Q & A? Outline 26
  • 23. Copyright 2013 by Data Blueprint Data Leverage Less ROT Technologies Process People • Permits organizations to better manage their sole non- depleteable, non-degrading, durable, strategic asset - data – within the organization, and – with organizational data exchange partners • Leverage – Obtained by implementation of data-centric technologies, processes, and human skill sets – Increased by elimination of data ROT (redundant, obsolete, or trivial) • The bigger the organization, the greater potential leverage exists • Treating data more asset-like simultaneously 1. lowers organizational IT costs and 2. increases organizational knowledge worker productivity 27
  • 24. Copyright 2013 by Data Blueprint Data Structure Questions Program F Program E Program D Program G Program H Program I Application domain 2Application domain 3 • Who makes decisions about the range and scope of common data usage? 28
  • 25. Copyright 2013 by Data Blueprint Running Query 29
  • 26. Copyright 2013 by Data Blueprint Optimized Query 30
  • 27. Copyright 2013 by Data Blueprint Repeat 100s, thousands, millions of times ... 31
  • 28. Death by 1000 Cuts Copyright 2013 by Data Blueprint 32
  • 29. Copyright 2013 by Data Blueprint 5 Basic Data Structures Indexed Sequential File: Built-in index permits location of records of persons with last names starting with "T" Index Program: Where is the record for person "Townsend?" Index: Start looking here where the "Ts" are stored Relational Database: Records are related to each other using relationships describable using relational algebra Flat File: Records are typically sorted according to some criteria and must be searched from the beginning for each access Program: Must start at the beginning and read each record when looking for person "Townsend?" Network Database: Records are related to each other using arranged master records associated with multiple detail records using linked lists and pointers Associative Concept-oriented Multi-dimensional XML database
 3NF
 Star schema
 Data Vault Hierarchical Database: Records are related to each other hierarchically using 'parent child' relationships 33
  • 30. Copyright 2013 by Data Blueprint Data structures organized into an Architecture • How do data structures support organizational strategy? • Consider the opposite question? – Were your systems explicitly designed to be integrated or otherwise work together? – If not, then what is the likelihood that they will work well together? – In all likelihood your organization is spending between 20-40% of its IT budget compensating for poor data structure integration – They cannot be helpful as long as their structure is unknown • Two answers/two separate strategies – Achieving efficiency and 
 effectiveness goals – Providing organizational dexterity for rapid implementation 34
  • 31. Copyright 2013 by Data Blueprint Single Data Store No Single Data Store • The thought of a single monolithic data store which can service all of an organization’s information needs has long since been abandoned. In the modern data management topology, multiple data stores are created to service specific processing needs and user groups within the organization. • Implications: • The needs characteristics of the multitude of the audiences served by the data structures • Data lifecycle • The design styles (old and new) utilized to organize the data to service the audiences • A breakdown of the various stores • The resultant store characteristics 35
  • 32. Copyright 2013 by Data Blueprint Conclusions • 1 is not enough • Most organizations have far to many different data structures and they become barriers to progress and integration • Not much expertise to figure out these challenges 36
  • 33. Copyright 2013 by Data Blueprint • Context: Data Management/DAMA/DM BoK/CDMP? • What is a data structure? • Structured data storage, a bit of history and context • Why are data structures important? • Data Personas/Usage (interest over time) • Data Topology and alignment to the data audience • Internal data structures to fit the needs • Q & A? Outline 38
  • 34. Copyright 2013 by Data Blueprint Data Personas (The Requirements) Operational Performer Interested in alerts, notifications and reporting based on current values (real- time) data. They use the information to make decisions and changes in the transactional systems. These changes are targeted to improve the organizations ability to deliver in the short term. Operational Analyst (Manager) Interested in aggregated real-time data for their domain of responsibility. The data is displayed using visualization techniques of scorecards, charts and reports, preferably within a single dashboard. The searching is for favorable/unfavorable trends to indicate adjustments are needed in the staff & resource allocations. Data Analyst Responsible to support detailed and typically complex analysis requests from business users/consumers of data. The analyst role span both the operational and historical time windows and thus they need to be versed in both the operational and analytic environments. Data Miner/ Scientist Responsible for using statistical and machine learning techniques to identify patterns from the data. These patterns are correlated into insights and actions for better business outcomes. The miner may use operational and historical data for research. Executive Consumer Receives the data through summary dashboards with drill down/through capabilities. Request detailed analysis and reporting on High Value Question from the Data Analyst and Data Miners. These consumers are looking at the data to make short and long term decisions to improve the organizational efficiency and customer experience. Operational Analytic 39
  • 35. • Operational interest is high when data is introduced to the operational stores. This interest wanes over time. • Analytic interest is low when data is first introduced. The interest increases as the data is collected and combined with other enterprise data. Copyright 2013 by Data Blueprint Persona Data Interest Operational Interest Analytic Interest Interest Time 40
  • 36. Copyright 2013 by Data Blueprint • Context: Data Management/DAMA/DM BoK/CDMP? • What is a data structure? • Structured data storage, a bit of history and context • Why are data structures important? • Data Personas/Usage (interest over time) • Data Topology and alignment to the data audience • Internal data structures to fit the needs • Q & A? Outline 42
  • 37. Copyright 2013 by Data Blueprint Data Topology Today 43
  • 38. Copyright 2013 by Data Blueprint Data Store Purpose a review of the Data Topology • Master Data – Master Data is the term used to describe the data domains that drive business activities. Master data is the data that must first be in place before business transactions can occur. Master data is often shared across the organizational business units and it is typically at the center of business strategies. The transaction defines the business/process event (order, dispatch, sales) while the Master Data describes the ‘who’ (customers, drivers, account reps), the ‘what’ (load), the ‘when’ (date, time) and the ‘where’ (origin and destination location). • Online Transaction Processing (OLTP) – “Transactional data” is the term used to describe the data involved in the execution of the business activities. Transactional data associates master data (i.e. customers and products) to a business activity that often represents a unit or work, such as the creation of an order. • The Master Data and OLTP stores are where data is initially created and persisted within the organization’s data and thus carry a special classification of System of Record (SOR). They are created to capture the transactional data as it arrives and makes the data available for the processes and services. The data arrives into these databases through manual entry or automated feeds. These data stores are logically (and sometimes physically) separated by the transactional subject area they are created to serve. OLTP1 OLTP2 OLTPn... Master Data 44
  • 39. Copyright 2013 by Data Blueprint Data Store Purpose a review of the Data Topology • Operational Data Store (ODS) – An Operational Data Store (ODS) is created to integrate data from two or more SORs for the purposes of data integration. The ODS is normally created to satisfy reporting needs across functional SOR boundaries. The ODS should hold very little historical information and should focus on maintaining the most up-to-date data needed by the organization for daily operations. Depending on the application requirements, the ODS may institute a near real-time data feed from the source applications. The ODS is expected to be technically accurate and is considered to be an Authoritative Source. The data it contains can be used for non-critical needs instead of having to access the SOR. The more frequently the data is pushed into the ODS environment, the less reliance there will be on direct access to SORs for data reporting needs. • Enterprise Data Warehouse (EDW) – An Enterprise Data Warehouse (EDW) is responsible for collection and integration of data from either SORs or from the Operational Data Store. An EDW has an enterprise scope as it will pull from many (if not all) SORs. The focus of the data warehouse is to be historical in nature and in many instances is loaded with a latency (every 24 hours). The data warehouse is created to support historical analytics. The expectation of the data warehouse is to be exhaustive in the data it collects with a focus being on collecting and storing of the data. EnterpriseData Warehouse (EDW) Operational DataStore (ODS) 45
  • 40. Copyright 2013 by Data Blueprint Data Store Purpose a review of the Data Topology • Data Marts – A Data Mart is a subset of a data warehouse, it is created to address specific questions and/or subject area of questions. A Data Mart is built and tuned to deliver the data to the end users, it exists to get the data out from the data warehouse. Data Mart 46
  • 41. Copyright 2013 by Data Blueprint Data Store Purpose a review of the Data Topology • Event Data Store – Is the data store which logs, stores and reports the discrete business and technical events which occur within the process. This data store is a critical, and often overlooked data domain for managing, controlling and creating transparency into the business processes. The events are used to report out the overall health of the processes in both business and technical terms. This consolidated solution is key to obtaining a 360 view of the processes. • Metadata Store – Metadata is a broad term which includes descriptive elements in both business and technical terms. It covers: business terms, data elements descriptions, element display formats, element valid values, element quality targets, etc. Metadata is critical to an organization as it describes the organization’s business and processing infrastructure in detail. Metadata is entertainingly defined as “data about the data”. That is, Metadata characterizes other data and makes it easier to retrieve, interpret and use information. Technical Metadata Metadata Store Business Metadata Event Data Store BusOPS Events TechOPS Events 47
  • 42. Operational i
 n 
 
 c
 o
 n
 t
 r
 a
 s
 t 
 
 w
 i
 t
 h Analytic Subject-Oriented Databases which are focused on a single or small set of business functions Integrated Collecting and semantically aligning data from disparate sources to achieve a homogeneous viewVolatile Data which may change frequently Non-Volatile Data for which entered into the database will not change Atomic Low grain data, each transaction, each order with all of the attributes Aggregate A summary of multiple orders or transactions performed to transform the atomic detail into more comprehensible information Current Valued: The data and the system represents what is current in this moment; not yesterday, not last week --- now Time Variant Data: is marked and stored with a date/time element where questions of what was it yesterday and last week can be answered Copyright 2013 by Data Blueprint Data Store Characteristics 48
  • 43. Copyright 2013 by Data Blueprint • Context: Data Management/DAMA/DM BoK/CDMP? • What is a data structure? • Structured data storage, a bit of history and context • Why are data structures important? • Data Personas/Usage (interest over time) • Data Topology and alignment to the data audience • Internal data structures to fit the needs • Q & A? Outline 50
  • 44. • 3rd Normal Form (3NF) – Inmon • Dimensional – Kimball • Data Vault – Lindstad Copyright 2013 by Data Blueprint Data Structure Design Styles 51
  • 45. • 3rd Normal Form Modeling • A mathematical data design 
 technique founded in the early 
 70s by E.F. Codd. • Organizes data in simple rows 
 and columns - Entities • Creates connections between the 
 entities called relationships to show how the data is inter-related • It is purest form 3NF removes all data redundancies – a piece of data is stored only once • 3NF is based on mathematics, give the same facts to different modelers; the model should be the same. • Creates a visual (Entity Relation Diagram - ERD) which may be understood by less technical personnel • 3NF is the modeling style most popularly used for operationally focused data stores. Copyright 2013 by Data Blueprint Design Styles – 3NF 52
  • 46. Copyright 2013 by Data Blueprint Design Styles – Dimensional Modeling • A data design approach create and refined by Ralph Kimball in the 80s • Organizes data in Facts and Dimensions – Fact tables record the events (what) within the 
 business domain – Dimension tables describing who, when, how and where • Created to exploit the capabilities of the relational database to retrieve and report against large volumes of data. • There are 2 variations to Dimensional Modeling: – Star Schema – Snowflake 53
  • 47. Copyright 2013 by Data Blueprint Design Styles – Data Vault • Newest of the relational database modeling techniques. • Conceived in the 1990s by Dan Linstedt • Focuses on linking the data from multiple disparate locations without forcing the data to be semantically aligned NOTE: There is a Data Ed presentation schedule for 14 October 2014 to cover the details of Data Vault designs! 54
  • 48. DATA STORE AUDIENCE SERVED BUILD CHARACTERISTICS DESIGN STYLE O
 P
 E
 R
 A
 T
 I
 O
 N
 A
 L Master Data OLTP ODS Event A
 N
 A
 L
 Y
 T
 I
 C Data Warehouse Data Mart Copyright 2013 by Data Blueprint Summary/Take Aways DATA STORE AUDIENCE SERVED BUILD CHARACTERISTICS DESIGN STYLE O
 P
 E
 R
 A
 T
 I
 O
 N
 A
 L Master Data Operations Manager Operational Analyst Subject Oriented Volatile Atomic Current Valued 3NF OLTP Operational Performer Operations Manager Subject Oriented Volatile Atomic Current Valued 3NF ODS Operational Manager Operational Analyst Executive Consumer Integrated Volatile Atomic Current Valued 3NF Event All Personas Integrated Volatile Atomic Current Valued 3NF A
 N
 A
 L
 Y
 T
 I
 C Data Warehouse Data Miner/Scientist Integrated Non-volatile Atomic Time Variant 3NF trending to Data Vault Data Mart Operational Analyst Data Analyst Executive Consumer Subject Oriented Non-volatile Atomic -or- Aggregated Time Variant Dimensional 55
  • 49. Copyright 2013 by Data Blueprint • Context: Data Management/DAMA/DM BoK/CDMP? • What is a data structure? • Structured data storage, a bit of history and context • Why are data structures important? • Data Personas/Usage (interest over time) • Data Topology and alignment to the data audience • Internal data structures to fit the needs • Q & A? Outline: Design/Manage Data Structures 56
  • 50. Copyright 2013 by Data Blueprint Questions to Ask • Are you ready for a data warehouse? • Foundational Practices • Is the business environment constantly evolving? • Will you get it right the first time? • Do you have an agreed upon enterprise-wide vocabulary • Is your data warehouse intended to be the enterprise audit-able system of record? • Extract, Transform and Load • Data Transformations • How fast do you need results? • Performance of inserts vs reads • Project deliverables 57
  • 51. Copyright 2013 by Data Blueprint Upcoming Events August Webinar: Data Management Maturity August 12, 2014 @ 2:00 PM ET/11:00 AM PT ! Sign up here: • www.datablueprint.com/webinar-schedule • www.Dataversity.net ! ! ! ! ! ! ! ! ! ! ! Brought to you by: 58
  • 52. Copyright 2013 by Data Blueprint Questions? + = 59
  • 53. Copyright 2013 by Data Blueprint Why Architectural Models? • Would you build a house without an architecture sketch? • Would you like to have an estimate how much your new house is going to cost? • If you hired a set of constructors from all over the world to build your house, would you like them to have a common language? • Would you like to verify the proposals of the construction team before the work gets started? • If it was a great house, would you like to build something rather similar again, in another place? • Would you drill into a wall of your house without a map of the plumbing and electric lines? • Model is the sketch of the system to be built in a project. • Your model gives you a very good idea of how demanding the implementation work is going to be! • Model is the common language for the project team. • Models can be reviewed before thousands of hours of implementation work will be done. • It is possible to implement the system to various platforms using the same model. • Models document the system built in a project. This makes life easier for the support and maintenance! Would you build a house without an architecture sketch? Model is the sketch of the system to be built in a project. Would you like to have an estimate how much your new house is going to cost? Your model gives you a very good idea of how demanding the implementation work is going to be! If you hired a set of constructors from all over the world to build your house, would you like them to have a common language? Model is the common language for the project team. Would you like to verify the proposals of the construction team before the work gets started? Models can be reviewed before thousands of hours of implementation work will be done. If it was a great house, would you like to build something rather similar again, in another place? It is possible to implement the system to various platforms using the same model. Would you drill into a wall of your house without a map of the plumbing and electric lines? Models document the system built in a project. This makes life easier for the support and maintenance! 60
  • 54. Copyright 2013 by Data Blueprint Inmon Implementation 61
  • 55. Copyright 2013 by Data Blueprint Kimball Implementation 62
  • 56. Copyright 2013 by Data Blueprint Data Vault Implementation 63
  • 57. Copyright 2013 by Data Blueprint Hybrid Approach • (http://www.kimballgroup.com/2004/03/03/differences-of-opinion/) • Learn Data Vault – “dv-in-kimball-bus-architecture” 64
  • 58. 10124 W. Broad Street, Suite C Glen Allen, Virginia 23060 804.521.4056