SlideShare uma empresa Scribd logo
1 de 20
Building Data WareHouse by
Inmon

Chapter 12: The Really Large Data Warehouse

http://it-slideshares.blogspot.com/
Why   the Rapid Growth?
The Impact of Large Volumes of Data
Disk Storage in the Face of Data Separation
Moving Data from One Environment to
 Another
Inverting the Data Warehouse
Total Cost
Maximum Capacity
Summary
Why the Rapid Growth?
The  data warehouse contains history.
Data warehouses collect data at the most
 granular level
The need to bring lots of different kinds
 of data together
The Impact of Large Volumes of Data
    Basic   Data-Management Activities
     ◦ As data volumes grow large, normal database
       functions require increasingly larger amounts
       of resources.
    The   Cost of Storage
     ◦ The volume of data grows, the cost of the
       data increases dramatically
The Impact of Large Volumes of Data
    The    Real Costs of Storage
     ◦ There are lots of components to disk storage
       aside from the storage device itself
         Disk controller
         Communications lines
         Processor
         Software
The Impact of Large Volumes of Data
    The Usage Pattern of Data in the Face of
     Large Volumes
     ◦ Over time, as the volume of data grows, the
       percentage of data actually used drops
The Impact of Large Volumes of Data
    A   Simple Calculation
     Usage ratio = Actual bytes used / Total data warehouse bytes
     ◦ the volume of data found in your data
       warehouse goes up, the actual percentage
       used goes down
    Two     Classes of Data
     ◦ Infrequently used data is often called dormant
       data or inactive data.
     ◦ Frequently used data is often called actively used
       data.
The Impact of Large Volumes of Data
    Implications   of Separating Data into Two
     Classes
Disk Storage
in the Face of Data Separation
Near-Line       Storage
 ◦ near-line storage, (depending on the vendor) is
   sequential storage
 ◦ Characteristics:
      Robotically controlled
      Inexpensive
      Bulk amounts of data
      Reliable over a long period of time
      Seconds to access first record
Disk Storage
in the Face of Data Separation
Access   Speed and Disk Storage
 ◦ The difference between freely flowing blood
   and blood with many restricting components
Disk Storage
in the Face of Data Separation
Archival   Storage
 ◦ Needs for split storage to manage large
   amount of data
 ◦ Besides disk storage and near-line or bulk
   storage
 ◦ Different with near-line storage
Disk Storage
in the Face of Data Separation
Implications   of Transparency
 ◦ A record or row in the data warehouse is
   identical to a record or row in near-line
   storage.
Moving Data from
One Environment to Another
 Many   ways:
  ◦ have a database administrator manually move data
  ◦ hierarchical storage management (HSM)
  ◦ the cross-media storage management (CMSM) option
Moving Data from
    One Environment to Another
The   CMSM Approach
 ◦ The CMSM technology is fully
   automated.
 ◦ The CMSM is software that makes
   the physical location of the data
   transparent
 ◦ The end user does not need to
   know where data is—in the data
   warehouse or on near-line
   storage.
Moving Data from
One Environment to Another
A   Data Warehouse Usage Monitor
 ◦ Streamline the operations of the CMSM
   environment
 ◦ Two types:
   those that are supplied by the DBMS vendor
   those supplied by third-party monitors
Inverting the Data Warehouse
inverteddata warehouse: Consider a
 normal data warehouse.
To build a data warehouse:
 ◦ Normal way: put data first into disk storage
    (after the data ages) near-line or archival
   storage
 ◦ Alternative way: first enter data into near-line
   storage (not disk storage)  data is “staged”
   from the near-line environment to the disk
   environment (to accessed and analyzed) 
   (after over) returned to near-line storage
Total Cost
With  the introduction of near-line and
 archival storage, the growing costs of a
 data warehouse can be mitigated
Maximum Capacity
“XYZ   machine can handle up to nnn terabytes
 of data.”
Parameters measures the machines capacity:
  Volumes of data
  Number of users
  Workload complexity


The balanced case is where there is a fair
 amount of data, a fair number of users, and a
 reasonably complex workload
Summary
Data  warehouses grow large explosively
The data inside the warehouse separates
 into one of two classes—frequently used
 data or infrequently used data
Without near-line and/or archival
 storage, the costs of the data
 warehouseskyrocket as the data
 warehouse grows large
http://it-slideshares.blogspot.com/

Mais conteúdo relacionado

Mais procurados

Types of databases
Types of databasesTypes of databases
Types of databases
PAQUIAAIZEL
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
Eyad Manna
 
Dataware housing
Dataware housingDataware housing
Dataware housing
work
 
90300 633579030311875000
90300 63357903031187500090300 633579030311875000
90300 633579030311875000
sumit621
 

Mais procurados (20)

Meta Data and it's Type
Meta Data and it's TypeMeta Data and it's Type
Meta Data and it's Type
 
Types of databases
Types of databasesTypes of databases
Types of databases
 
Week 17 slides 1 7 multidimensional, parallel, and distributed database
Week 17 slides 1 7 multidimensional, parallel, and distributed databaseWeek 17 slides 1 7 multidimensional, parallel, and distributed database
Week 17 slides 1 7 multidimensional, parallel, and distributed database
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
 
Teradata
TeradataTeradata
Teradata
 
database and database types
database and database typesdatabase and database types
database and database types
 
Data warehouseing
Data warehouseingData warehouseing
Data warehouseing
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Classification of data mart
Classification of data martClassification of data mart
Classification of data mart
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
2016 SDMX Experts meeting, Building Together
2016 SDMX Experts meeting, Building Together2016 SDMX Experts meeting, Building Together
2016 SDMX Experts meeting, Building Together
 
Data warehousing ppt
Data warehousing pptData warehousing ppt
Data warehousing ppt
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Dataware housing
Dataware housingDataware housing
Dataware housing
 
Database systems
Database systemsDatabase systems
Database systems
 
Data mining 2 - Data warehouse (cheat sheet - printable)
Data mining 2 - Data warehouse (cheat sheet - printable)Data mining 2 - Data warehouse (cheat sheet - printable)
Data mining 2 - Data warehouse (cheat sheet - printable)
 
90300 633579030311875000
90300 63357903031187500090300 633579030311875000
90300 633579030311875000
 
Data mart
Data martData mart
Data mart
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Martin Willcox - What is a Data Lake, Anyway?
Martin Willcox - What is a Data Lake, Anyway?Martin Willcox - What is a Data Lake, Anyway?
Martin Willcox - What is a Data Lake, Anyway?
 

Semelhante a Lecture 12 The Really Large Data Warehouse

03 Data Recovery - Notes
03 Data Recovery - Notes03 Data Recovery - Notes
03 Data Recovery - Notes
Kranthi
 
Introduction to Data Storage and Cloud Computing
Introduction to Data Storage and Cloud ComputingIntroduction to Data Storage and Cloud Computing
Introduction to Data Storage and Cloud Computing
Rutuja751147
 

Semelhante a Lecture 12 The Really Large Data Warehouse (20)

Lecture 05 - The Data Warehouse and Technology
Lecture 05 - The Data Warehouse and TechnologyLecture 05 - The Data Warehouse and Technology
Lecture 05 - The Data Warehouse and Technology
 
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need BothThe Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
03 Data Recovery - Notes
03 Data Recovery - Notes03 Data Recovery - Notes
03 Data Recovery - Notes
 
Mis chapter 5
Mis  chapter 5Mis  chapter 5
Mis chapter 5
 
Deduplication in Open Spurce Cloud
Deduplication in Open Spurce CloudDeduplication in Open Spurce Cloud
Deduplication in Open Spurce Cloud
 
(Lecture 2)Data Warehouse Architecture.pdf
(Lecture 2)Data Warehouse Architecture.pdf(Lecture 2)Data Warehouse Architecture.pdf
(Lecture 2)Data Warehouse Architecture.pdf
 
Digital Media Storage.pptx
Digital Media Storage.pptxDigital Media Storage.pptx
Digital Media Storage.pptx
 
Information Storage and Management notes ssmeena
Information Storage and Management notes ssmeena Information Storage and Management notes ssmeena
Information Storage and Management notes ssmeena
 
Introduction to Data Storage and Cloud Computing
Introduction to Data Storage and Cloud ComputingIntroduction to Data Storage and Cloud Computing
Introduction to Data Storage and Cloud Computing
 
Elastic storage in the cloud session 5224 final v2
Elastic storage in the cloud session 5224 final v2Elastic storage in the cloud session 5224 final v2
Elastic storage in the cloud session 5224 final v2
 
Presentation
PresentationPresentation
Presentation
 
How To Build A Stable And Robust Base For a “Cloud”
How To Build A Stable And Robust Base For a “Cloud”How To Build A Stable And Robust Base For a “Cloud”
How To Build A Stable And Robust Base For a “Cloud”
 
ISM Unit 1.pdf
ISM Unit 1.pdfISM Unit 1.pdf
ISM Unit 1.pdf
 
Strongbox Data Storage Podcast
Strongbox Data Storage PodcastStrongbox Data Storage Podcast
Strongbox Data Storage Podcast
 
What is Data Lake and its Benefits?
What is Data Lake and its Benefits?What is Data Lake and its Benefits?
What is Data Lake and its Benefits?
 
Distributed dbms (ddbms)
Distributed dbms (ddbms)Distributed dbms (ddbms)
Distributed dbms (ddbms)
 
EVALUATE DATABASE COMPRESSION PERFORMANCE AND PARALLEL BACKUP
EVALUATE DATABASE COMPRESSION PERFORMANCE AND PARALLEL BACKUPEVALUATE DATABASE COMPRESSION PERFORMANCE AND PARALLEL BACKUP
EVALUATE DATABASE COMPRESSION PERFORMANCE AND PARALLEL BACKUP
 
Difference between Database vs Data Warehouse vs Data Lake
Difference between Database vs Data Warehouse vs Data LakeDifference between Database vs Data Warehouse vs Data Lake
Difference between Database vs Data Warehouse vs Data Lake
 
04.01 file organization
04.01 file organization04.01 file organization
04.01 file organization
 

Mais de phanleson

Lecture 1 - Getting to know XML
Lecture 1 - Getting to know XMLLecture 1 - Getting to know XML
Lecture 1 - Getting to know XML
phanleson
 

Mais de phanleson (20)

Learning spark ch01 - Introduction to Data Analysis with Spark
Learning spark ch01 - Introduction to Data Analysis with SparkLearning spark ch01 - Introduction to Data Analysis with Spark
Learning spark ch01 - Introduction to Data Analysis with Spark
 
Firewall - Network Defense in Depth Firewalls
Firewall - Network Defense in Depth FirewallsFirewall - Network Defense in Depth Firewalls
Firewall - Network Defense in Depth Firewalls
 
Mobile Security - Wireless hacking
Mobile Security - Wireless hackingMobile Security - Wireless hacking
Mobile Security - Wireless hacking
 
Authentication in wireless - Security in Wireless Protocols
Authentication in wireless - Security in Wireless ProtocolsAuthentication in wireless - Security in Wireless Protocols
Authentication in wireless - Security in Wireless Protocols
 
E-Commerce Security - Application attacks - Server Attacks
E-Commerce Security - Application attacks - Server AttacksE-Commerce Security - Application attacks - Server Attacks
E-Commerce Security - Application attacks - Server Attacks
 
Hacking web applications
Hacking web applicationsHacking web applications
Hacking web applications
 
HBase In Action - Chapter 04: HBase table design
HBase In Action - Chapter 04: HBase table designHBase In Action - Chapter 04: HBase table design
HBase In Action - Chapter 04: HBase table design
 
HBase In Action - Chapter 10 - Operations
HBase In Action - Chapter 10 - OperationsHBase In Action - Chapter 10 - Operations
HBase In Action - Chapter 10 - Operations
 
Hbase in action - Chapter 09: Deploying HBase
Hbase in action - Chapter 09: Deploying HBaseHbase in action - Chapter 09: Deploying HBase
Hbase in action - Chapter 09: Deploying HBase
 
Learning spark ch11 - Machine Learning with MLlib
Learning spark ch11 - Machine Learning with MLlibLearning spark ch11 - Machine Learning with MLlib
Learning spark ch11 - Machine Learning with MLlib
 
Learning spark ch10 - Spark Streaming
Learning spark ch10 - Spark StreamingLearning spark ch10 - Spark Streaming
Learning spark ch10 - Spark Streaming
 
Learning spark ch09 - Spark SQL
Learning spark ch09 - Spark SQLLearning spark ch09 - Spark SQL
Learning spark ch09 - Spark SQL
 
Learning spark ch07 - Running on a Cluster
Learning spark ch07 - Running on a ClusterLearning spark ch07 - Running on a Cluster
Learning spark ch07 - Running on a Cluster
 
Learning spark ch06 - Advanced Spark Programming
Learning spark ch06 - Advanced Spark ProgrammingLearning spark ch06 - Advanced Spark Programming
Learning spark ch06 - Advanced Spark Programming
 
Learning spark ch05 - Loading and Saving Your Data
Learning spark ch05 - Loading and Saving Your DataLearning spark ch05 - Loading and Saving Your Data
Learning spark ch05 - Loading and Saving Your Data
 
Learning spark ch04 - Working with Key/Value Pairs
Learning spark ch04 - Working with Key/Value PairsLearning spark ch04 - Working with Key/Value Pairs
Learning spark ch04 - Working with Key/Value Pairs
 
Learning spark ch01 - Introduction to Data Analysis with Spark
Learning spark ch01 - Introduction to Data Analysis with SparkLearning spark ch01 - Introduction to Data Analysis with Spark
Learning spark ch01 - Introduction to Data Analysis with Spark
 
Hướng Dẫn Đăng Ký LibertaGia - A guide and introduciton about Libertagia
Hướng Dẫn Đăng Ký LibertaGia - A guide and introduciton about LibertagiaHướng Dẫn Đăng Ký LibertaGia - A guide and introduciton about Libertagia
Hướng Dẫn Đăng Ký LibertaGia - A guide and introduciton about Libertagia
 
Lecture 1 - Getting to know XML
Lecture 1 - Getting to know XMLLecture 1 - Getting to know XML
Lecture 1 - Getting to know XML
 
Lecture 4 - Adding XTHML for the Web
Lecture  4 - Adding XTHML for the WebLecture  4 - Adding XTHML for the Web
Lecture 4 - Adding XTHML for the Web
 

Último

Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 

Último (20)

Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 

Lecture 12 The Really Large Data Warehouse

  • 1. Building Data WareHouse by Inmon Chapter 12: The Really Large Data Warehouse http://it-slideshares.blogspot.com/
  • 2. Why the Rapid Growth? The Impact of Large Volumes of Data Disk Storage in the Face of Data Separation Moving Data from One Environment to Another Inverting the Data Warehouse Total Cost Maximum Capacity Summary
  • 3.
  • 4. Why the Rapid Growth? The data warehouse contains history. Data warehouses collect data at the most granular level The need to bring lots of different kinds of data together
  • 5. The Impact of Large Volumes of Data Basic Data-Management Activities ◦ As data volumes grow large, normal database functions require increasingly larger amounts of resources. The Cost of Storage ◦ The volume of data grows, the cost of the data increases dramatically
  • 6. The Impact of Large Volumes of Data The Real Costs of Storage ◦ There are lots of components to disk storage aside from the storage device itself  Disk controller  Communications lines  Processor  Software
  • 7. The Impact of Large Volumes of Data The Usage Pattern of Data in the Face of Large Volumes ◦ Over time, as the volume of data grows, the percentage of data actually used drops
  • 8. The Impact of Large Volumes of Data A Simple Calculation Usage ratio = Actual bytes used / Total data warehouse bytes ◦ the volume of data found in your data warehouse goes up, the actual percentage used goes down Two Classes of Data ◦ Infrequently used data is often called dormant data or inactive data. ◦ Frequently used data is often called actively used data.
  • 9. The Impact of Large Volumes of Data Implications of Separating Data into Two Classes
  • 10. Disk Storage in the Face of Data Separation Near-Line Storage ◦ near-line storage, (depending on the vendor) is sequential storage ◦ Characteristics:  Robotically controlled  Inexpensive  Bulk amounts of data  Reliable over a long period of time  Seconds to access first record
  • 11. Disk Storage in the Face of Data Separation Access Speed and Disk Storage ◦ The difference between freely flowing blood and blood with many restricting components
  • 12. Disk Storage in the Face of Data Separation Archival Storage ◦ Needs for split storage to manage large amount of data ◦ Besides disk storage and near-line or bulk storage ◦ Different with near-line storage
  • 13. Disk Storage in the Face of Data Separation Implications of Transparency ◦ A record or row in the data warehouse is identical to a record or row in near-line storage.
  • 14. Moving Data from One Environment to Another  Many ways: ◦ have a database administrator manually move data ◦ hierarchical storage management (HSM) ◦ the cross-media storage management (CMSM) option
  • 15. Moving Data from One Environment to Another The CMSM Approach ◦ The CMSM technology is fully automated. ◦ The CMSM is software that makes the physical location of the data transparent ◦ The end user does not need to know where data is—in the data warehouse or on near-line storage.
  • 16. Moving Data from One Environment to Another A Data Warehouse Usage Monitor ◦ Streamline the operations of the CMSM environment ◦ Two types:  those that are supplied by the DBMS vendor  those supplied by third-party monitors
  • 17. Inverting the Data Warehouse inverteddata warehouse: Consider a normal data warehouse. To build a data warehouse: ◦ Normal way: put data first into disk storage  (after the data ages) near-line or archival storage ◦ Alternative way: first enter data into near-line storage (not disk storage)  data is “staged” from the near-line environment to the disk environment (to accessed and analyzed)  (after over) returned to near-line storage
  • 18. Total Cost With the introduction of near-line and archival storage, the growing costs of a data warehouse can be mitigated
  • 19. Maximum Capacity “XYZ machine can handle up to nnn terabytes of data.” Parameters measures the machines capacity: Volumes of data Number of users Workload complexity The balanced case is where there is a fair amount of data, a fair number of users, and a reasonably complex workload
  • 20. Summary Data warehouses grow large explosively The data inside the warehouse separates into one of two classes—frequently used data or infrequently used data Without near-line and/or archival storage, the costs of the data warehouseskyrocket as the data warehouse grows large http://it-slideshares.blogspot.com/

Notas do Editor

  1. Historical data _ Detailed data _ Diverse data = Lots of data
  2. Splitting data over multiple storage media based on frequency of usage
  3. Archival storage is very similar to near-line storage , except that in archival storage, the probability of access drops very low. To put the probability of access in perspective, consider the following simple chart: High performance disk storage Access a unit of data once a month Near-line storage Access 0.5 units of data every year Archival storage Access 0.1 units of data every decade. Near-line storage can be thought of as a logical extension of the data warehouse. Archival storage cannot be thought of as a logical extension.
  4. Options for Moving Data: ADVANTAGES Manual Very simple; available immediately; operates at the row level HSM Relatively simple; not too expensive; fully automated CMSM Fully automated; operates at the row level DISADVANTAGES Manual Prone to error; requires human interaction HSM Operates at the data set level CMSM Expensive; complex to implement and operate
  5. third-party monitors are much better because the monitors supplied by the DBMS vendors require far more resources than those supplied The Extension of the Data Warehouse across Different Storage Media: The data warehouse can grow to petabytes (equivalent to a quadrillion bytes) of data and can still be effective and still be managed.
  6. third-party monitors are much better because the monitors supplied by the DBMS vendors require far more resources than those supplied