SlideShare a Scribd company logo
1 of 12
Download to read offline
CUBE DESIGN
BY HANNES MEYER


OnLine Analytical Processing OLAP
Agenda
  What are cubes?
  Multidimensionality

  Storage of multidimensional data.

  Hierarchies

  Operations

  Demo
What are cubes?
    Multi-dimensional representation of data
What are cubes (cont.)?
  syn: Hypercube, multidimensional database (MDB),
   olap cube
  Cubes can have more than three dimensions
Fact Tables
    Contain numerical measurements of a certain
     business process.
       E.g.   $12.000 sales in NY store on 12-01-08
    Additionally foreign keys to different dimension
     tables
       E.g.   further store/sales person information
    Center in star schema
Dimension Tables
  Contain attributes by which data can be grouped
  e.g. city/region of store, product category

  Linked to the fact table via their primary keys

  Slowly changing dimensions: dimensions which

   change over time. Can be dealt with in 3 ways:
       Overwritingold values
       Add new row to table, distinguish records by versioning

       Add new column (attribute) to existing row
Data Storage Models
    relational databases (ROLAP)
       Datain tables
       Summaries stored in precalculated tables

    multi-dimensional databases (MOLAP)
       Data  in multidimensional arrays
       + Less disk space

       + Better Performance (precalculated aggregates)

       - Time to aggregate & calculate

       - Updates require recalculation

    Hybrid (HOLAP)
Hierarchies
    Grouping of dimensions         e.g. country -> sales
    e.g. month -> semester -        region -> state -> city
     > quartal -> year               -> store
    2008                           Germany
       H1   2008                      Southern    germany
         Q1      2008                   BaWue
               Jan 2008                       Stuttgart
                                                    Store A
               Feb 2008
                                                    Store B
               March 2008
         Q2      2008 …                 Bavaria
                                               Munich
       H2   2008 …                                Store A B C
Operations: Slice
    Slicing is the process of retrieving a block of data
     from a cube by filtering on one dimension
Operations: Dice
    Dicingis the process of retrieving a block of data
     from a cube by filtering on all dimensions
Operations: Drill Up/ Down
  Drilling up: Presenting data at a higher level on the
   hierarchy e.g. Store -> Region
  Drilling Down: Presenting data at a lower level on
   the hierarchy Region -> Store
Building the cube in SSAS
    Preconditions
       Connecting  datasources
       Defining views

       Selecting dimensions

  Define fact & dimension tables & time dimension
  Select measures

  Deploy & query the cube

   Demo

More Related Content

What's hot

What's hot (20)

Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
 
Exadata
ExadataExadata
Exadata
 
Scylla Summit 2022: Making Schema Changes Safe with Raft
Scylla Summit 2022: Making Schema Changes Safe with RaftScylla Summit 2022: Making Schema Changes Safe with Raft
Scylla Summit 2022: Making Schema Changes Safe with Raft
 
The Complete MariaDB Server tutorial
The Complete MariaDB Server tutorialThe Complete MariaDB Server tutorial
The Complete MariaDB Server tutorial
 
Mongo db
Mongo dbMongo db
Mongo db
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
New Analytic Uses of Master Data Management in the Enterprise
New Analytic Uses of Master Data Management in the EnterpriseNew Analytic Uses of Master Data Management in the Enterprise
New Analytic Uses of Master Data Management in the Enterprise
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit...
Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit...Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit...
Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit...
 
Azure Cosmos DB + Gremlin API in Action
Azure Cosmos DB + Gremlin API in ActionAzure Cosmos DB + Gremlin API in Action
Azure Cosmos DB + Gremlin API in Action
 
What is data engineering?
What is data engineering?What is data engineering?
What is data engineering?
 
ETL
ETLETL
ETL
 
Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka
Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | EdurekaData Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka
Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika Kotecha
 
Data Quality: A Raising Data Warehousing Concern
Data Quality: A Raising Data Warehousing ConcernData Quality: A Raising Data Warehousing Concern
Data Quality: A Raising Data Warehousing Concern
 
How to Manage Scale-Out Environments with MariaDB MaxScale
How to Manage Scale-Out Environments with MariaDB MaxScaleHow to Manage Scale-Out Environments with MariaDB MaxScale
How to Manage Scale-Out Environments with MariaDB MaxScale
 
Datawarehouse and OLAP
Datawarehouse and OLAPDatawarehouse and OLAP
Datawarehouse and OLAP
 
Five Connectivity and Security Use Cases for Azure VNets
Five Connectivity and Security Use Cases for Azure VNetsFive Connectivity and Security Use Cases for Azure VNets
Five Connectivity and Security Use Cases for Azure VNets
 
Data Warehouse
Data Warehouse Data Warehouse
Data Warehouse
 

Similar to Olap Cube Design

Using Continuous Etl With Real Time Queries To Eliminate My Sql Bottlenecks
Using Continuous Etl With Real Time Queries To Eliminate My Sql BottlenecksUsing Continuous Etl With Real Time Queries To Eliminate My Sql Bottlenecks
Using Continuous Etl With Real Time Queries To Eliminate My Sql Bottlenecks
MySQLConference
 
The Yahoo Open Stack
The Yahoo Open StackThe Yahoo Open Stack
The Yahoo Open Stack
Megan Eskey
 
Gmr Highload Presentation Revised
Gmr Highload Presentation RevisedGmr Highload Presentation Revised
Gmr Highload Presentation Revised
Ontico
 
Gmr Highload Presentation
Gmr Highload PresentationGmr Highload Presentation
Gmr Highload Presentation
Ontico
 
Internationalisierung Barcampbodensee Share
Internationalisierung Barcampbodensee ShareInternationalisierung Barcampbodensee Share
Internationalisierung Barcampbodensee Share
kindo
 

Similar to Olap Cube Design (10)

Using Continuous Etl With Real Time Queries To Eliminate My Sql Bottlenecks
Using Continuous Etl With Real Time Queries To Eliminate My Sql BottlenecksUsing Continuous Etl With Real Time Queries To Eliminate My Sql Bottlenecks
Using Continuous Etl With Real Time Queries To Eliminate My Sql Bottlenecks
 
The Yahoo Open Stack
The Yahoo Open StackThe Yahoo Open Stack
The Yahoo Open Stack
 
Gmr Highload Presentation Revised
Gmr Highload Presentation RevisedGmr Highload Presentation Revised
Gmr Highload Presentation Revised
 
Gmr Highload Presentation
Gmr Highload PresentationGmr Highload Presentation
Gmr Highload Presentation
 
Enterprise PHP Development (Dutch PHP Conference 2008)
Enterprise PHP Development (Dutch PHP Conference 2008)Enterprise PHP Development (Dutch PHP Conference 2008)
Enterprise PHP Development (Dutch PHP Conference 2008)
 
Architecting a Data Warehouse: A Case Study
Architecting a Data Warehouse: A Case StudyArchitecting a Data Warehouse: A Case Study
Architecting a Data Warehouse: A Case Study
 
Bcm Best Practise & Local Challenges
Bcm Best Practise & Local ChallengesBcm Best Practise & Local Challenges
Bcm Best Practise & Local Challenges
 
Groovy Finance
Groovy FinanceGroovy Finance
Groovy Finance
 
Internationalisierung Barcampbodensee Share
Internationalisierung Barcampbodensee ShareInternationalisierung Barcampbodensee Share
Internationalisierung Barcampbodensee Share
 
From Work To Word
From Work To WordFrom Work To Word
From Work To Word
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Recently uploaded (20)

Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 

Olap Cube Design

  • 1. CUBE DESIGN BY HANNES MEYER OnLine Analytical Processing OLAP
  • 2. Agenda   What are cubes?   Multidimensionality   Storage of multidimensional data.   Hierarchies   Operations   Demo
  • 3. What are cubes?   Multi-dimensional representation of data
  • 4. What are cubes (cont.)?   syn: Hypercube, multidimensional database (MDB), olap cube   Cubes can have more than three dimensions
  • 5. Fact Tables   Contain numerical measurements of a certain business process.   E.g. $12.000 sales in NY store on 12-01-08   Additionally foreign keys to different dimension tables   E.g. further store/sales person information   Center in star schema
  • 6. Dimension Tables   Contain attributes by which data can be grouped   e.g. city/region of store, product category   Linked to the fact table via their primary keys   Slowly changing dimensions: dimensions which change over time. Can be dealt with in 3 ways:   Overwritingold values   Add new row to table, distinguish records by versioning   Add new column (attribute) to existing row
  • 7. Data Storage Models   relational databases (ROLAP)   Datain tables   Summaries stored in precalculated tables   multi-dimensional databases (MOLAP)   Data in multidimensional arrays   + Less disk space   + Better Performance (precalculated aggregates)   - Time to aggregate & calculate   - Updates require recalculation   Hybrid (HOLAP)
  • 8. Hierarchies   Grouping of dimensions   e.g. country -> sales   e.g. month -> semester - region -> state -> city > quartal -> year -> store   2008   Germany   H1 2008   Southern germany   Q1 2008   BaWue   Jan 2008   Stuttgart   Store A   Feb 2008   Store B   March 2008   Q2 2008 …   Bavaria   Munich   H2 2008 …   Store A B C
  • 9. Operations: Slice   Slicing is the process of retrieving a block of data from a cube by filtering on one dimension
  • 10. Operations: Dice   Dicingis the process of retrieving a block of data from a cube by filtering on all dimensions
  • 11. Operations: Drill Up/ Down   Drilling up: Presenting data at a higher level on the hierarchy e.g. Store -> Region   Drilling Down: Presenting data at a lower level on the hierarchy Region -> Store
  • 12. Building the cube in SSAS   Preconditions   Connecting datasources   Defining views   Selecting dimensions   Define fact & dimension tables & time dimension   Select measures   Deploy & query the cube    Demo