WEB ANALYTICS
CHALLENGES IN DATA MODELING
AGENDA
• Introduction to Web Analytics
• Data Sources, Data Capture
• Vocabulary

• Data Modeling Basics
• Relational vs. Dimensional
• Normalization, De-normalization, Aggregation

• Web Analytics + Data Modeling
• Four-tiered Data Model for Web data
• Challenges

• Q&A
INTRODUCTION
• Anne Marie Macek
• Senior Manager, Data Strategy
• Consumer Insight and Revenue Strategy
• Marriott International

• 30+ years Data Modeling and Reporting
• 14+ years Data Warehousing and Business Intelligence
• 4+ years Web Analytics Data and Reporting
• MBA, Management Information Systems
• BS, Mathematics and Computer Science
EXPERIENCE
• Data Modeling:
• Flat Files, IMS/DB, DB2, Oracle, Netezza
• MS Access, Borland Paradox
• Cognos Powerplay, MS Analysis Services, Cognos 10.2 Dynamic Cubes

• Reporting:
• COBOL, Focus, SAS, Actuate
• Cognos BI Suite

• Business Functions:
• eCommerce, Revenue Management, Sales & Marketing
• Human Resources, Finance
DEFINITION
• Web analytics is the measurement, collection, analysis and reporting of internet data for purposes of understanding and optimizing web usage.

Source: Wikipedia
OBJECTIVES
• Website Performance
• Conversion Rate (# sales / # visits)
• Trends over time
• In Response to Campaigns

• Website Optimization
• Customer Behavior
• Technological Trends

• Integration
• Customer Lifetime Value / Segmentation

• Personalization
• Proactive display of pertinent information
DATA SOURCES
• Click-stream Data
• Search Engine Optimization (SEO)
• Campaign Classification
• Email Campaigns
• Advertising Impressions
• 3rd Party Marketing Data
• IP Geolocation
• Competitive Analysis
• Customer Information
• Multi-channel Analysis
• Outcome Data
CLICKSTREAM COLLECTION
• Web Log Files
• Rudimentary data collected on company’s web server
• Page name, IP address, browser, date/time

• Does not screen out search engine robots

• JavaScript Tagging (Google Analytics, Omniture, WebTrends)
• As page loads, data is sent to 3rd party for collection
• Assigns a cookie to the user
• Can implement custom tags on specific pages
• Does not count pages served from cache

• Packet Sniffers (Cloudmeter Pion, Tealeaf CX Connect)
• Software or hardware layer installed on web servers
• Parsing raw data and ensuring PII is protected can be complex
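
As a rough illustration of the web log approach above, the sketch below parses two made-up lines in the Apache Combined Log Format and drops hits whose user agent looks like a robot. The sample lines, the `ROBOT_MARKERS` list, and the function names are assumptions for illustration only; real robot filtering would need a far more complete list.

```python
import re
from datetime import datetime

# Combined Log Format: ip ident user [time] "request" status bytes "referer" "user-agent"
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<page>\S+) [^"]*" '
    r'(?P<status>\d{3}) \S+ "(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)

# Hypothetical, deliberately short list of robot markers (assumption).
ROBOT_MARKERS = ("bot", "crawler", "spider")


def parse_line(line):
    """Return a dict of fields for one log line, or None if it does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    hit = match.groupdict()
    hit["time"] = datetime.strptime(hit["time"], "%d/%b/%Y:%H:%M:%S %z")
    return hit


def is_robot(hit):
    """Flag a hit whose user agent contains one of the robot markers."""
    agent = hit["agent"].lower()
    return any(marker in agent for marker in ROBOT_MARKERS)


# Made-up sample lines (documentation IP ranges, hypothetical pages).
SAMPLE_LOG = [
    '203.0.113.7 - - [10/Oct/2013:13:55:36 -0700] "GET /rooms HTTP/1.1" 200 2326 "-" "Mozilla/5.0 (Windows NT 6.1)"',
    '198.51.100.9 - - [10/Oct/2013:13:55:40 -0700] "GET /rooms HTTP/1.1" 200 2326 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
]

hits = [parse_line(line) for line in SAMPLE_LOG]
human_hits = [hit for hit in hits if hit is not None and not is_robot(hit)]
print([hit["page"] for hit in human_hits])
```

Only the first sample line survives the filter, which is the screening step that raw web server logs do not perform on their own.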
CLICKSTREAM ANALYSIS
• Number of Visitors
• Total vs. Unique
• New vs. Repeat

• Source of Visit (Session)

• External Link (Campaign Analysis / Attribution)
• Direct

• Searches Performed On Site
• Keywords
• Sort Order of Results

• Page Analysis
• Specific Actions Performed

• Order (Booking)
• Signup for Membership, Credit Card, Event

• Abandonment (Bounce Rate)
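
A minimal sketch of how a few of these measures might be derived from click-level rows. The row layout, the sample data, and the metric definitions are assumptions: a bounce is taken to be a single-page visit, and conversion rate is taken to be the share of visits containing a booking.

```python
from collections import defaultdict

# Hypothetical click rows: (visitor_id, visit_id, page, is_booking)
CLICKS = [
    ("v1", "s1", "/home", False),
    ("v1", "s1", "/rooms", False),
    ("v1", "s1", "/confirm", True),
    ("v2", "s2", "/home", False),
    ("v1", "s3", "/home", False),
    ("v3", "s4", "/offers", False),
    ("v3", "s4", "/rooms", False),
]


def summarize(clicks):
    """Compute visit-level metrics from click-level rows."""
    pages_per_visit = defaultdict(int)
    visitors = set()
    converting_visits = set()
    for visitor_id, visit_id, _page, is_booking in clicks:
        pages_per_visit[visit_id] += 1
        visitors.add(visitor_id)
        if is_booking:
            converting_visits.add(visit_id)
    visits = len(pages_per_visit)
    if visits == 0:
        return {"visits": 0, "unique_visitors": 0,
                "bounce_rate": 0.0, "conversion_rate": 0.0}
    # Assumption: a bounce is a visit with exactly one page view.
    bounces = sum(1 for count in pages_per_visit.values() if count == 1)
    return {
        "visits": visits,
        "unique_visitors": len(visitors),
        "bounce_rate": bounces / visits,
        "conversion_rate": len(converting_visits) / visits,
    }


summary = summarize(CLICKS)
print(summary)
```

Note that visitor v1 appears in two visits, so total visits (4) and unique visitors (3) differ, which is the "Total vs. Unique" distinction in the list above.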
BRINGING CLICKSTREAM IN-HOUSE
• Control/Consolidate Business Rules
• Integration with Corporate Systems of Record
• Single Version of the Truth

• Integration with Other Web Data Sources
• Enable more “intelligent” metrics
• Not all visits are a conversion opportunity

• Shift from “visit analysis” to “customer analysis”
• Enable advanced statistical and predictive modeling
• Multi-touch Attribution
• Pay Per Click (PPC) Keyword Bid Optimization
CLICKSTREAM CHALLENGES
• “Clickstream data … is delightfully complex, ever changing, and full of mysterious occurrences.”
Avinash Kaushik, Web Analytics: An Hour a Day

• Volume
• Cons: It’s big
• Pros: It’s incremental

• Fairly Unstructured
• Exceptions to every rule
• Mobile App vs. Mobile Web vs. Desktop
• Rapidly Changing
• Most queries require trending YTD + 2 years’ history
• Few “natural” metrics; most require count (distinct)
• How do I model this data??
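
The count (distinct) point is easiest to see with a tiny example. The sketch below uses made-up visitor IDs to show that daily unique-visitor counts cannot simply be summed to get a multi-day figure, which is why such metrics are not "natural" additive measures.

```python
# Hypothetical visitor IDs seen on each day (assumption, for illustration).
DAILY_VISITORS = {
    "Mon": {"v1", "v2", "v3"},
    "Tue": {"v2", "v3", "v4"},
    "Wed": {"v1", "v4"},
}

# Adding up the daily distinct counts double-counts returning visitors.
sum_of_daily_uniques = sum(len(ids) for ids in DAILY_VISITORS.values())

# The true figure needs the underlying IDs, not the daily totals.
true_uniques = len(set().union(*DAILY_VISITORS.values()))

print(sum_of_daily_uniques, true_uniques)
```

Here the daily counts add up to 8 while only 4 distinct visitors exist, so any pre-summarized table has to either keep the detail rows or decide in advance which groupings it will support.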
DATA WAREHOUSE APPROACHES
Bill Inmon
• DW is Central Repository of all Enterprise Data
• “Top Down”
• Relational Model (3NF)
• Feeds Functional Data Marts
• Huge Undertaking

Ralph Kimball
• DW is the “Virtual” Integration of Various Functional Data Marts
• “Bottom Up”
• Dimensional Model
• Quicker to Develop
• Silo-ed and Redundant
RELATIONAL MODEL

Source: sqlservercentral.com
DIMENSIONAL MODELS
Star Schema

Snowflake Schema

Source: Wikipedia
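
Since the schema diagrams did not survive as text, here is a small sketch of a star schema using Python's built-in `sqlite3` module. All table names, column names, and values are hypothetical; the point is only the shape, with one fact table keyed to surrounding dimension tables.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension tables: one row per date and per page (hypothetical columns).
CREATE TABLE dim_date (date_key INTEGER PRIMARY KEY, calendar_date TEXT, month TEXT);
CREATE TABLE dim_page (page_key INTEGER PRIMARY KEY, page_name TEXT, page_group TEXT);

-- Fact table: measures plus foreign keys to each dimension.
CREATE TABLE fact_page_view (
    date_key INTEGER REFERENCES dim_date(date_key),
    page_key INTEGER REFERENCES dim_page(page_key),
    page_views INTEGER
);

INSERT INTO dim_date VALUES (1, '2013-10-01', '2013-10'), (2, '2013-10-02', '2013-10');
INSERT INTO dim_page VALUES (1, '/home', 'Landing'), (2, '/rooms', 'Shopping'), (3, '/rates', 'Shopping');
INSERT INTO fact_page_view VALUES (1, 1, 100), (1, 2, 40), (2, 2, 60), (2, 3, 25);
""")

# A typical star-schema query: join the fact to a dimension and aggregate.
rows = conn.execute("""
SELECT p.page_group, SUM(f.page_views)
FROM fact_page_view AS f
JOIN dim_page AS p ON p.page_key = f.page_key
GROUP BY p.page_group
ORDER BY p.page_group
""").fetchall()
print(rows)
```

A snowflake variant of the same sketch would move `page_group` out of `dim_page` into its own table, trading an extra join for less repetition in the dimension.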
NORMALIZATION
• Removes redundancy and dependency from data structures.
• 1NF: Remove Repeating Groups
• 2NF: Remove Partial Key Dependencies
• 3NF: Remove Dependencies Among Non-Key Attributes
• Tutorial: http://phlonx.com/resources/nf3/

• Data Warehouses require some De-Normalization to
improve query performance
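
To make the trade-off concrete, the sketch below takes made-up click rows in which `page_group` depends on `page_name` rather than on the click key (a 3NF violation), splits them into two structures, and then joins them back. The field names and data are assumptions for illustration.

```python
# Hypothetical flat rows: page_group depends on page_name, not on click_id.
FLAT_CLICKS = [
    {"click_id": 1, "page_name": "/rooms", "page_group": "Shopping"},
    {"click_id": 2, "page_name": "/rates", "page_group": "Shopping"},
    {"click_id": 3, "page_name": "/rooms", "page_group": "Shopping"},
]


def normalize(flat_rows):
    """Split flat rows into click rows and a page lookup (3NF-style)."""
    pages = {}
    clicks = []
    for row in flat_rows:
        pages[row["page_name"]] = row["page_group"]
        clicks.append({"click_id": row["click_id"], "page_name": row["page_name"]})
    return clicks, pages


def denormalize(clicks, pages):
    """Pre-join the lookup back onto each click, as a query-friendly copy."""
    return [dict(click, page_group=pages[click["page_name"]]) for click in clicks]


clicks, pages = normalize(FLAT_CLICKS)
rebuilt = denormalize(clicks, pages)
print(pages)
print(rebuilt == FLAT_CLICKS)
```

The normalized form stores each page's group once, so a reclassification is a single update; the de-normalized form repeats it on every row but avoids the lookup at query time.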
ECOMMERCE DATA WAREHOUSE

Native Source Model
Fact Model
BI Model
Aggregate Model
NATIVE SOURCE MODEL
Plus
• In-database copy of the source data
• Stores data elements we are not yet ready to model further
• Maintains details for research purposes
• Prevents repeating historical conversion

Minus
• Huge
• Unstructured
• Not normalized (at all)
• Not useful for analysis or reporting
FACT MODEL
Plus
• “Snow-relational”
• Nearly Normalized (optimized for load)
• Multiple Fact & Extension Tables (manage I/O)
• Granular (click row)
• Contains keys to integrate with enterprise data

Minus
• Complex load including propagation and look-back
• Use requires non-filtered joins of massive tables
• Difficult to use for analysis, cannot be used for reporting
BI MODEL
Plus
• “Star-flake” Model
• De-normalized (optimized for query)
• Pre-joined
• Granular (click row)
• Integrated with enterprise data at load time
• Useful for detailed analysis

Minus
• Complex load process
• It’s still big!
• Corrections to Fact Model data issues require re-build or complex conversion processes
• Difficult to use for reporting
AGGREGATE MODEL
Plus
• Star Schema (simple)
• De-normalized (optimized for query)
• Aggregated
• Fast query performance
• Great for predetermined reports

Minus
• Corrections to Fact Model data issues and embedded dimensions require re-build
• Count distincts only available for predetermined dimensions
• Limited use for analysis
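
A rough sketch of the count-distinct limitation, again using `sqlite3`. The `bi_click` and `agg_daily_device` tables, their columns, and the data are hypothetical stand-ins for a click-grain BI table and an aggregate built on a predetermined pair of dimensions (date and device).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Click-grain table (hypothetical stand-in for the BI model).
CREATE TABLE bi_click (visit_date TEXT, device TEXT, visitor_id TEXT);
INSERT INTO bi_click VALUES
    ('2013-10-01', 'desktop', 'v1'),
    ('2013-10-01', 'desktop', 'v1'),
    ('2013-10-01', 'mobile',  'v1'),
    ('2013-10-01', 'mobile',  'v2'),
    ('2013-10-02', 'desktop', 'v3');

-- Aggregate table: distinct counts are fixed at the date + device grain.
CREATE TABLE agg_daily_device AS
SELECT visit_date, device,
       COUNT(*) AS clicks,
       COUNT(DISTINCT visitor_id) AS unique_visitors
FROM bi_click
GROUP BY visit_date, device;
""")

# Fast and correct at the predetermined grain.
by_device = conn.execute(
    "SELECT visit_date, device, clicks, unique_visitors "
    "FROM agg_daily_device ORDER BY visit_date, device"
).fetchall()

# Rolling up across devices: clicks add correctly, unique visitors do not.
rolled_up = conn.execute(
    "SELECT SUM(clicks), SUM(unique_visitors) FROM agg_daily_device "
    "WHERE visit_date = '2013-10-01'"
).fetchone()

# The exact daily figure still requires the click-grain rows.
exact = conn.execute(
    "SELECT COUNT(DISTINCT visitor_id) FROM bi_click "
    "WHERE visit_date = '2013-10-01'"
).fetchone()[0]

print(by_device, rolled_up, exact)
```

Visitor v1 used both devices on the same day, so summing the aggregate reports 3 unique visitors where the detail rows show 2; additive measures such as clicks roll up without that problem.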
QUESTIONS?
• Thank You!

 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 

Último (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 

Web Analytics: Challenges in Data Modeling

  • 2. AGENDA • Introduction to Web Analytics • Data Sources, Data Capture • Vocabulary • Data Modeling Basics • Relational vs. Dimensional • Normalization, De-normalization, Aggregation • Web Analytics + Data Modeling • Four-tiered Data Model for Web data • Challenges • Q&A
  • 3. INTRODUCTION • Anne Marie Macek • Senior Manager, Data Strategy • Consumer Insight and Revenue Strategy • Marriott International • 30+ years Data Modeling and Reporting • 14+ years Data Warehousing and Business Intelligence • 4+ years Web Analytics Data and Reporting • MBA, Management Information Systems • BS, Mathematics and Computer Science
  • 4. EXPERIENCE • Data Modeling: • Flat Files, IMS/DB, DB2, Oracle, Netezza • MS Access, Borland Paradox • Cognos Powerplay, MS Analysis Services, Cognos 10.2 Dynamic Cubes • Reporting: • COBOL, Focus, SAS, Actuate • Cognos BI Suite • Business Functions: • eCommerce, Revenue Management, Sales & Marketing • Human Resources, Finance
  • 5. DEFINITION • Web analytics is the measurement, collection, analysis and reporting of internet data for purposes of understanding and optimizing web usage. Source: Wikipedia
  • 6. OBJECTIVES • Website Performance • Conversion Rate ($ sales / # visits) • Trends over time • In Response to Campaigns • Website Optimization • Customer Behavior • Technological Trends • Integration • Customer Lifetime Value / Segmentation • Personalization • Proactive display of pertinent information
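As a hedged illustration of the conversion-rate metric named on the Objectives slide ($ sales / # visits), the sketch below computes it from a few made-up visit records. The `Visit` record layout and the `conversion_rate` function name are assumptions for illustration only; they are not part of the presentation.

```python
from dataclasses import dataclass


@dataclass
class Visit:
    visit_id: str
    sales: float  # revenue attributed to this visit; 0.0 if nothing was booked


def conversion_rate(visits):
    """Dollar sales per visit, following the slide's '$ sales / # visits'."""
    if not visits:
        return 0.0  # avoid dividing by zero when there were no visits
    return sum(v.sales for v in visits) / len(visits)


# Hypothetical sample: four visits, two of which produced a sale.
visits = [Visit("v1", 0.0), Visit("v2", 250.0), Visit("v3", 0.0), Visit("v4", 150.0)]
rate = conversion_rate(visits)
print(f"${rate:.2f} per visit")
```

Both the numerator and the denominator depend on business rules (which sales are attributed to a visit, which visits count), which is one reason the later slides argue for controlling those rules in-house.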
  • 7. DATA SOURCES • Click-stream Data • Search Engine Optimization (SEO) • Campaign Classification • Email Campaigns • Advertising Impressions • 3rd Party Marketing Data • IP Geolocation • Competitive Analysis • Customer Information • Multi-channel Analysis • Outcome Data
  • 8. CLICKSTREAM COLLECTION • Web Log Files • Rudimentary data collected on company’s web server • Page name, IP address, browser, date/time • Does not screen out search engine robots • JavaScript Tagging (Google Analytics, Omniture, WebTrends) • As page loads, data is sent to 3rd party for collection • Assigns a cookie to the user • Can implement custom tags on specific pages • Does not count pages served from cache • Packet Sniffers (Cloudmeter Pion, Tealeaf CX Connect) • Software or hardware layer installed on web servers • Parsing raw data, and ensuring PII can be complex
  • 9. CLICKSTREAM ANALYSIS • Number of Visitors • Total vs. Unique • New vs. Repeat • Source of Visit (Session) • External Link (Campaign Analysis / Attribution) • Direct • Searches Performed On Site • Keywords • Sort Order of Results • Page Analysis • Specific Actions Performed • Order (Booking) • Signup for Membership, Credit Card, Event • Abandonment (Bounce Rate)
  • 10. BRINGING CLICKSTREAM IN-HOUSE • Control/Consolidate Business Rules • Integration with Corporate Systems of Record • Single Version of the Truth • Integration with Other Web Data Sources • Enable more “intelligent” metrics • Not all visits are a conversion opportunity • Shift from “visit analysis” to “customer analysis” • Enable advanced statistical and predictive modeling • Multi-touch Attribution • Pay Per Click (PPC) Keyword Bid Optimization
  • 11. CLICKSTREAM CHALLENGES • “Clickstream data … is delightfully complex, ever changing, and full of mysterious occurrences.” Avinash Kaushik, Web Analytics: An Hour a Day • Volume • Cons: It’s big • Pros: It’s incremental • Fairly Unstructured • Exceptions to every rule • Mobile App vs. Mobile Web vs. Desktop • Rapidly Changing • Most queries require trending YTD + 2 years’ history • Few “natural” metrics; most require count (distinct) • How do I model this data??
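The point that most clickstream metrics require a count (distinct) can be made concrete with a small sketch. It shows, on made-up click rows, that daily unique-visitor counts cannot simply be summed to get uniques for a longer period. The row layout `(date, visitor_id)` and all values are hypothetical.

```python
from collections import defaultdict

# Hypothetical click rows: (date, visitor_id)
clicks = [
    ("2014-01-01", "A"), ("2014-01-01", "B"), ("2014-01-01", "A"),
    ("2014-01-02", "A"), ("2014-01-02", "C"),
]

# Collect the set of distinct visitors seen on each day.
visitors_by_day = defaultdict(set)
for day, visitor in clicks:
    visitors_by_day[day].add(visitor)

daily_uniques = {day: len(ids) for day, ids in visitors_by_day.items()}

# Summing the daily figures double-counts visitor "A", who came on both days.
sum_of_daily = sum(daily_uniques.values())

# The correct figure needs a distinct count over the whole period.
true_uniques = len({visitor for _, visitor in clicks})

print(daily_uniques, sum_of_daily, true_uniques)
```

Because the metric is not additive, trending queries either have to go back to click-level rows or rely on distinct counts that were precomputed for a specific grain.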
  • 12. DATA WAREHOUSE APPROACHES • Bill Inmon • DW is Central Repository of all Enterprise Data • “Top Down” • Relational Model (3NF) • Feeds Functional Data Marts • Huge Undertaking • Ralph Kimball • DW is the “Virtual” Integration of Various Functional Data Marts • “Bottom Up” • Dimensional Model • Quicker to Develop • Silo-ed and Redundant
  • 14. DIMENSIONAL MODELS • Star Schema • Snowflake Schema • Source: Wikipedia
  • 15. NORMALIZATION • Removes redundancy and dependency from data structures. • 1NF: Remove Repeating Groups • 2NF: Remove Partial Key Dependencies • 3NF: Remove Dependencies Among Attributes • Tutorial: http://phlonx.com/resources/nf3/ • Data Warehouses require some De-Normalization to improve query performance
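A minimal sketch of the normalization and de-normalization trade-off described on this slide, using plain Python structures in place of database tables. The page and section values are made up; the idea is that `section` depends on `page` rather than on the click key, so a 3NF-style design stores it once, and a query-oriented design joins it back in.

```python
# Hypothetical flat click rows: the page's section is repeated on every click.
flat_clicks = [
    {"click_id": 1, "page": "/rooms", "section": "Booking"},
    {"click_id": 2, "page": "/rooms", "section": "Booking"},
    {"click_id": 3, "page": "/offers", "section": "Marketing"},
]

# Normalize: keep each page's attributes once, keyed by page.
pages = {row["page"]: {"section": row["section"]} for row in flat_clicks}
clicks = [{"click_id": row["click_id"], "page": row["page"]} for row in flat_clicks]

# De-normalize: pre-join the page attributes onto each click so that
# queries do not have to perform the join themselves.
denormalized = [{**click, **pages[click["page"]]} for click in clicks]

print(len(pages), denormalized[0])
```

The normalized form is cheaper to load and correct (one row per page), while the de-normalized form trades storage and rebuild cost for simpler, faster queries.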
  • 16. ECOMMERCE DATA WAREHOUSE • Native Source Model • Fact Model • BI Model • Aggregate Model
  • 17. NATIVE SOURCE MODEL • Plus • In-database copy of the source data • Stores data elements we are not yet ready to model further • Maintains details for research purposes • Prevents repeating historical conversion • Minus • Huge • Unstructured • Not normalized (at all) • Not useful for analysis or reporting
  • 19. FACT MODEL Plus • “Snow-relational” • Nearly Normalized (optimized for load) • Multiple Fact & Extension Tables (manage I/O) • Granular (click row) • Contains keys to integrate with enterprise data Minus • Complex load including propagation and look-back • Use requires nonfiltered joins of massive tables • Difficult to use for analysis, cannot be used for reporting
  • 21. BI MODEL • Plus • “Star-flake” Model • De-normalized (optimized for query) • Pre-joined • Granular (click row) • Integrated with enterprise data at load time • Useful for detailed analysis • Minus • Complex load process • It’s still big! • Corrections to Fact Model data issues require re-build or complex conversion processes • Difficult to use for reporting
  • 23. AGGREGATE MODEL Plus • Star Schema (simple) • De-normalized (optimized for query) • Aggregated • Fast query performance • Great for predetermined reports Minus • Corrections to Fact Model data issues and embedded dimensions require re-build • Count distincts only available for predetermined dimensions • Limited use for analysis
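To illustrate why count distincts in an aggregate model are only available for predetermined dimensions, the sketch below rolls hypothetical click-level rows up to a `(date, channel)` grain. The column names, the chosen grain, and the `build_aggregate` helper are assumptions for illustration, not the presenter's actual design.

```python
from collections import defaultdict

# Hypothetical click-level rows: (date, channel, device, visitor_id)
clicks = [
    ("2014-01-01", "email", "mobile", "A"),
    ("2014-01-01", "email", "desktop", "A"),
    ("2014-01-01", "search", "desktop", "B"),
    ("2014-01-01", "search", "mobile", "C"),
]


def build_aggregate(rows):
    """Roll clicks up to the predetermined (date, channel) grain."""
    click_counts = defaultdict(int)
    visitor_sets = defaultdict(set)
    for date, channel, _device, visitor in rows:
        key = (date, channel)
        click_counts[key] += 1
        visitor_sets[key].add(visitor)
    # Only the counts are stored; the visitor ids themselves are discarded.
    return {
        key: {"clicks": click_counts[key], "unique_visitors": len(visitor_sets[key])}
        for key in click_counts
    }


aggregate = build_aggregate(clicks)
for key, measures in sorted(aggregate.items()):
    print(key, measures)
```

Once the rows are rolled up, the device column and the visitor ids are gone, so a question such as unique visitors by device cannot be answered from this table; it would need a rebuild from the click-level model with a different grain.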

Editor's Notes

  1. How many attendees know Web Analytics? How many know Data Modeling? Attempt to bring everyone to an even playing field, then focus on challenges.
  2. Wikipedia definition
  3. Performance here does not mean uptime or response time; it refers to business results.