SlideShare uma empresa Scribd logo
1 de 29
DATA
WAREHOUSING
PRESENTED BY:-
SHRUTI DALELA
MCA V SEM
OUTLINES:-
 WHAT IS A DATA WAREHOUSING
 DATA WAREHOUSING DEFINITION
 HISTORY OF DATA WAREHOUSING
 FACTS ABOUT DATA WAREHOUSING
 CHARACTERISTICS OF DATA
WAREHOUSING
 USAGE AND TRENDS
 ARCHITECTURE OF DATA WAREHOUSE
 DBMS VS DATA WAREHOUSE
What Is A Data Warehouse?
A data warehouse is a powerful database
model
that significantly enhances the user’s ability
to
quickly analyze large, multidimensional data
sets.
It cleanses and organizes data to allow users
to
make business decisions based on facts.
Hence,
the data in the data warehouse must have
Data Warehousing Definition:-
 Date warehousing is an aspect to gather data
from multiple sources into central
repository,called Data warehouse.
 According to William H.Inmon,a leading
architect in the construction of data warehouse
systems,”A data warehouse is a subject –
oriented ,integrated ,time variant and non-
volatile collection of data in support of
management’s decision making process.
 “A data warehouse is simply a single complete,
and consistent store of data obtained from a
variety of sources and made available to end
users in a way they can understand and use it
in a business context.”
-----Barry Devlin,IBM consultant
DATA WAREHOUSES
Data Warehouses:
 Data spread in several databases –
physically located at numerous sites
 Data warehouse – repository of multiple
DBs in single schema; resides at single site.
 Data warehousing processes
1. Data Cleaning 2. Data Integration 3. Data
Transformation
4. Data Loading 5. Periodic data refreshing
Data warehouse diagram
 Data cleaning:-Data Cleaning includes,
filling in missing values, smoothing noisy
data, identifying or removing outliers, and
resolving inconsistencies.
 Data integration:-Data Integration includes
integration of multiple databases, data
cubes, or files.
 Data transformation:-Convert data from
legacy or host format to warehouse format.
 Load :-sort;summarize,consolidate;compute
views; check integrity.Build indices and
partitions.
 Refresh:-Propagates the update from data
sources to the warehouse.
 Data in a data warehouse are organized
around major subjects.
 Data provide information on historical
perspective – summarized on periodic
dimension.
 Eg. Sales of an item for a region in a period
 Data warehouse model – multidimensional
database structure / data cube
 Dimensions – Attributes / set of attributes
 Facts – Aggregated measures (Count /
Sales amount)
History of data warehousing
 The concept of data warehousing
dates back to the late 1980s when
IBM researcher Barry davlin and paul
murphy developed the “the business
data warehouse”.
 In essence, the data warehousing
concept was intended to provide an
architectural model for the flow of data
from opeational systems to decision
support environments.
Facts about data
warehousing:-
 Issues involved in warehousing include
techniques for dealing with errors and
techniques for efficient storage and
indexing of large volumes of data.
 This system is used for reporting and data
analysis.
 It usually contains historical data derived
from transaction data.
 Data warehousing is not meant for current
“live”data.
Components of a data
warehouse
• Sources –Data source interaction
• Data Transformation
• Data warehouse (data storage )
• Reporting (Data presentation )
• Metadata
Complete control over the four main
areas
of data management systems: -
 Clean data
 Query processing: multiple options
 Indexes: multiple types
 Security: data and access
Data Warehouse Advantages
Data Warehousing
Disadvantages
 Adding new data sources takes time and
associated high cost.
 Data owners lose control over their data,
raising ownership, security and privacy
issues.
 Long initial implementation time and
associated high cost.
 Difficult to accommodate changes in data
types and ranges, data source schema,
indexes and queries.
Characteristics of Data
Warehousing:-
 Subject –Oriented:-A data warehouse can
be used to analyze a particular subject
area.
For example:-”sales” can be a particular
subject.
 Integrated:-A data warehouse integrates
data from multiple data sources.
For example:-Source A and source B may
have different ways of identifying a product,
but in a data warehouse,there will be only
a single way of identifying a product.
 Time Variant :-Historical data is kept in a
data warehouse.
For example:-One can retrieve data from 3
months ,6months, 12 months ,or even older
data from a data warehouse.
 Non volatile:-Once data is in the data
warehouse,it will not change.So,historical
data in a data warehouse should never be
altered.
 It must be optimized for access to very
large amount of data.
 It is based on client server architecture.
 It is capable of handling dynamic matrices.
 It maintains transparency.
 It is consitent and flexible.
DATA WAREHOUSE
USAGE:-
• Three kinds of data warehouse
applications
1)Information processing:-Supports querying, basic
statistical analysis, and reporting using crosstabs, tables,
charts and graphs.
2)Analytical processing:-
• Multidimensional analysis of data warehouse data
• Supports basic OLAP operations, slice-dice, drilling,
pivoting
3)Data mining:-
• Knowledge discovery from hidden patterns
• Supports associations, constructing analytical models,
performing classification and prediction, and presenting
the mining results using visualization tools.
 Differences among the three tasks
TRENDS IN DATA
WAREHOUSING
 In the next few years, data warehousing is
expected make big strides in software,
especially for optimizing queries:-
o indexing very large tables
o enhancing SQL
o improving data compression
o expanding dimensional modeling
 Real-Time Data Warehousing
 Multiple Data Types
 Adding Unstructured Data
 Searching Unstructured Data
 Spatial Data
 Data Visualization
 Major Visualization Trends
 Visualization Types
 Advanced Visualization TechniquesChart
Manipulation.
 Drill Down.
 Advanced Interaction
Architecture of data warehouse
fig:- A three tier data warehousing
Top tier:
Middle tier
Bottom tier
Data
warehouse
server
Backend tools
1)Bottom tier:-The bottom tier is a warehouse
database server that is always a relational
database system.
 Back-end tools and utilities are used to feed data
into the bottom tier from operational databases or
other external sources. These tools and utilities
perform data extraction,cleaning and
transformation as well as load and refresh
functions to update the data warehouse.
 The date extracted using application program
interfaces known as gateways.
 Example of gateways are ODBC(open database
connection)and OLEDB(Open Linking and
embedding for database) by microsoft and
jdbc(java database connecton).
 This tier also contains a metadata repository, which
stores information about the data warehouse and
its contents.
2.)Middle tier:- The middle tier is an OLAP
server that is typically implemented using
either:-
a) A relational OLAP (ROLAP) model that
is,an extended relation DBMS that maps
operations.Intermediate server b/w
relational back-end server and client front
end tools.
b) A multidimentional OLAP (MOLAP) model
that is, a special purpose server that
directly implements multidimentional data
and operations. Supports multidimention
views.
3.)Top tier:-The top tier is a front –end client
layer ,which contains query and reporting
tools ,analysis tools,and or data mining
tools.
 Note:-
OLAP – Online Analytical Processing:
 This is the major task of Data Warehousing
System.
 Useful for complex data analysis and
decision making.
 Market oriented –used by
managers,executives and data analyst.
DBMS VS Data
Warehousing
In today’s corporate world ,every
business enterprise ,no matter how
big or small requires a database. The
more the business grows, the more
urgent is the requirement of a
database .The database is required to
keep a check on the growth of a
business in a specific period.
DBMS:-DBMS is at times known as the
database manager although it is the
abbreviated form of database
management system.
 It is basically a repertoire of computer
programs that devoted for the
management of the database of an
organization .
 It is a complete and comprehensive
methodology in use for specific purposes
 Like overall management of digital data-
bases,creation and maintenance of
data,searching and serving other
operations relating to the database.
DATA WAREHOUSE:- A data warehouse
is usually a place where various types’
data -bases are stored mainly for
purpose of security ,archival analysis
and storage.
 The data warehouse consists of either
one or several computer systems that
are networked together form a single
computer system.
 The data warehouse is a database of a
different kind: an OLAP (online analytical
processing) database. A data warehouse
exists as a layer on top of another database
or databases (usually OLTP databases).
 In DBMS,there is OLTP(online
transaction processing )is used.Here we
cannot analysis because data changes
day by day.
 In data warehousing there is
OLAP(online analytical processing .It
maintain historical data.It collects data
from different databases like oracle and
so on.It is used to find analysis and
generate reports.
 The key difference between DBMS and
data warehouse is the fact that a data
warehouse can be treated as a type of
database or a kind of database which
provides special facilities for analysis
and reporting while DBMS is the overall
system which manages a certain
for your attention!

Mais conteúdo relacionado

Mais procurados

Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecturepcherukumalla
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture janani thirupathi
 
Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)Harish Chand
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data WarehouseShanthi Mukkavilli
 
Data mining slides
Data mining slidesData mining slides
Data mining slidessmj
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etlAashish Rathod
 
Dbms Introduction and Basics
Dbms Introduction and BasicsDbms Introduction and Basics
Dbms Introduction and BasicsSHIKHA GAUTAM
 
Application of data mining
Application of data miningApplication of data mining
Application of data miningSHIVANI SONI
 
Knowledge discovery process
Knowledge discovery process Knowledge discovery process
Knowledge discovery process Shuvra Ghosh
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notesMohit Saini
 
Metadata in data warehouse
Metadata in data warehouseMetadata in data warehouse
Metadata in data warehouseSiddique Ibrahim
 
OLAP OnLine Analytical Processing
OLAP OnLine Analytical ProcessingOLAP OnLine Analytical Processing
OLAP OnLine Analytical ProcessingWalid Elbadawy
 
Database Administration
Database AdministrationDatabase Administration
Database AdministrationBilal Arshad
 

Mais procurados (20)

Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
 
Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
 
Data mining slides
Data mining slidesData mining slides
Data mining slides
 
Data mining
Data miningData mining
Data mining
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etl
 
Dbms Introduction and Basics
Dbms Introduction and BasicsDbms Introduction and Basics
Dbms Introduction and Basics
 
Application of data mining
Application of data miningApplication of data mining
Application of data mining
 
Knowledge discovery process
Knowledge discovery process Knowledge discovery process
Knowledge discovery process
 
Distributed database
Distributed databaseDistributed database
Distributed database
 
Data analytics vs. Data analysis
Data analytics vs. Data analysisData analytics vs. Data analysis
Data analytics vs. Data analysis
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notes
 
Metadata ppt
Metadata pptMetadata ppt
Metadata ppt
 
Data Mining
Data MiningData Mining
Data Mining
 
Metadata in data warehouse
Metadata in data warehouseMetadata in data warehouse
Metadata in data warehouse
 
OLAP OnLine Analytical Processing
OLAP OnLine Analytical ProcessingOLAP OnLine Analytical Processing
OLAP OnLine Analytical Processing
 
Database Administration
Database AdministrationDatabase Administration
Database Administration
 

Destaque

Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse conceptsobieefans
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data WarehousingJason S
 
Data warehousing with pentaho
Data warehousing with pentahoData warehousing with pentaho
Data warehousing with pentahoWirabumi Software
 
Agile data warehousing
Agile data warehousingAgile data warehousing
Agile data warehousingSneha Challa
 
Lecture 13
Lecture 13Lecture 13
Lecture 13Shani729
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data WarehousingEyad Manna
 
Lecture 2, c++(complete reference,herbet sheidt)chapter-12
Lecture 2, c++(complete reference,herbet sheidt)chapter-12Lecture 2, c++(complete reference,herbet sheidt)chapter-12
Lecture 2, c++(complete reference,herbet sheidt)chapter-12Abu Saleh
 
Vbmca204821311240
Vbmca204821311240Vbmca204821311240
Vbmca204821311240Ayushi Jain
 
Lecture 5, c++(complete reference,herbet sheidt)chapter-15
Lecture 5, c++(complete reference,herbet sheidt)chapter-15Lecture 5, c++(complete reference,herbet sheidt)chapter-15
Lecture 5, c++(complete reference,herbet sheidt)chapter-15Abu Saleh
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data WarehousingAswathy S Nair
 
Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Conceptsraulmisir
 
Data Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationData Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationSunderland City Council
 
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Data mining: Concepts and Techniques, Chapter12 outlier Analysis Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Data mining: Concepts and Techniques, Chapter12 outlier Analysis Salah Amean
 

Destaque (20)

Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse concepts
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Data warehousing with pentaho
Data warehousing with pentahoData warehousing with pentaho
Data warehousing with pentaho
 
Agile data warehousing
Agile data warehousingAgile data warehousing
Agile data warehousing
 
data warehousing
data warehousingdata warehousing
data warehousing
 
Lecture 13
Lecture 13Lecture 13
Lecture 13
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Lecture 2, c++(complete reference,herbet sheidt)chapter-12
Lecture 2, c++(complete reference,herbet sheidt)chapter-12Lecture 2, c++(complete reference,herbet sheidt)chapter-12
Lecture 2, c++(complete reference,herbet sheidt)chapter-12
 
Vbmca204821311240
Vbmca204821311240Vbmca204821311240
Vbmca204821311240
 
Lecture 5, c++(complete reference,herbet sheidt)chapter-15
Lecture 5, c++(complete reference,herbet sheidt)chapter-15Lecture 5, c++(complete reference,herbet sheidt)chapter-15
Lecture 5, c++(complete reference,herbet sheidt)chapter-15
 
Database and types of databases
Database and types of databasesDatabase and types of databases
Database and types of databases
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data Warehousing
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Types of database
Types of databaseTypes of database
Types of database
 
Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Concepts
 
Data Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationData Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data Visualisation
 
wi-fi ppt
wi-fi pptwi-fi ppt
wi-fi ppt
 
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Data mining: Concepts and Techniques, Chapter12 outlier Analysis Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
 

Semelhante a Data warehousing

11667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect411667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect4ambujm
 
1.4 data warehouse
1.4 data warehouse1.4 data warehouse
1.4 data warehouseKrish_ver2
 
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptxUNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptxshruthisweety4
 
11666 Bitt I 2008 Lect3
11666 Bitt I 2008 Lect311666 Bitt I 2008 Lect3
11666 Bitt I 2008 Lect3ambujm
 
Data warehouse
Data warehouseData warehouse
Data warehouseRajThakuri
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingsumit621
 
Unit-IV-Introduction to Data Warehousing .pptx
Unit-IV-Introduction to Data Warehousing .pptxUnit-IV-Introduction to Data Warehousing .pptx
Unit-IV-Introduction to Data Warehousing .pptxHarsha Patel
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data WarehouseSOMASUNDARAM T
 
Module 1_Data Warehousing Fundamentals.pptx
Module 1_Data Warehousing Fundamentals.pptxModule 1_Data Warehousing Fundamentals.pptx
Module 1_Data Warehousing Fundamentals.pptxnikshaikh786
 
Data Mining Concept & Technique-ch04.ppt
Data Mining Concept & Technique-ch04.pptData Mining Concept & Technique-ch04.ppt
Data Mining Concept & Technique-ch04.pptMutiaSari53
 
DATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forDATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forAyushMeraki1
 
UNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxUNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxDURGADEVIL
 

Semelhante a Data warehousing (20)

11667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect411667 Bitt I 2008 Lect4
11667 Bitt I 2008 Lect4
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Chapter 2
Chapter 2Chapter 2
Chapter 2
 
DMDW 1st module.pdf
DMDW 1st module.pdfDMDW 1st module.pdf
DMDW 1st module.pdf
 
1.4 data warehouse
1.4 data warehouse1.4 data warehouse
1.4 data warehouse
 
Cs1011 dw-dm-1
Cs1011 dw-dm-1Cs1011 dw-dm-1
Cs1011 dw-dm-1
 
Dwbasics
DwbasicsDwbasics
Dwbasics
 
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptxUNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
 
11666 Bitt I 2008 Lect3
11666 Bitt I 2008 Lect311666 Bitt I 2008 Lect3
11666 Bitt I 2008 Lect3
 
Data Warehouse
Data Warehouse Data Warehouse
Data Warehouse
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Unit 1
Unit 1Unit 1
Unit 1
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Unit-IV-Introduction to Data Warehousing .pptx
Unit-IV-Introduction to Data Warehousing .pptxUnit-IV-Introduction to Data Warehousing .pptx
Unit-IV-Introduction to Data Warehousing .pptx
 
DATA WAREHOUSING.2.pptx
DATA WAREHOUSING.2.pptxDATA WAREHOUSING.2.pptx
DATA WAREHOUSING.2.pptx
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
Module 1_Data Warehousing Fundamentals.pptx
Module 1_Data Warehousing Fundamentals.pptxModule 1_Data Warehousing Fundamentals.pptx
Module 1_Data Warehousing Fundamentals.pptx
 
Data Mining Concept & Technique-ch04.ppt
Data Mining Concept & Technique-ch04.pptData Mining Concept & Technique-ch04.ppt
Data Mining Concept & Technique-ch04.ppt
 
DATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forDATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining for
 
UNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docxUNIT-5 DATA WAREHOUSING.docx
UNIT-5 DATA WAREHOUSING.docx
 

Último

ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 

Último (20)

ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 

Data warehousing

  • 2. OUTLINES:-  WHAT IS A DATA WAREHOUSING  DATA WAREHOUSING DEFINITION  HISTORY OF DATA WAREHOUSING  FACTS ABOUT DATA WAREHOUSING  CHARACTERISTICS OF DATA WAREHOUSING  USAGE AND TRENDS  ARCHITECTURE OF DATA WAREHOUSE  DBMS VS DATA WAREHOUSE
  • 3. What Is A Data Warehouse? A data warehouse is a powerful database model that significantly enhances the user’s ability to quickly analyze large, multidimensional data sets. It cleanses and organizes data to allow users to make business decisions based on facts. Hence, the data in the data warehouse must have
  • 4. Data Warehousing Definition:-  Date warehousing is an aspect to gather data from multiple sources into central repository,called Data warehouse.  According to William H.Inmon,a leading architect in the construction of data warehouse systems,”A data warehouse is a subject – oriented ,integrated ,time variant and non- volatile collection of data in support of management’s decision making process.  “A data warehouse is simply a single complete, and consistent store of data obtained from a variety of sources and made available to end users in a way they can understand and use it in a business context.” -----Barry Devlin,IBM consultant
  • 5. DATA WAREHOUSES Data Warehouses:  Data spread in several databases – physically located at numerous sites  Data warehouse – repository of multiple DBs in single schema; resides at single site.  Data warehousing processes 1. Data Cleaning 2. Data Integration 3. Data Transformation 4. Data Loading 5. Periodic data refreshing
  • 7.  Data cleaning:-Data Cleaning includes, filling in missing values, smoothing noisy data, identifying or removing outliers, and resolving inconsistencies.  Data integration:-Data Integration includes integration of multiple databases, data cubes, or files.  Data transformation:-Convert data from legacy or host format to warehouse format.  Load :-sort;summarize,consolidate;compute views; check integrity.Build indices and partitions.  Refresh:-Propagates the update from data sources to the warehouse.
  • 8.
  • 9.  Data in a data warehouse are organized around major subjects.  Data provide information on historical perspective – summarized on periodic dimension.  Eg. Sales of an item for a region in a period  Data warehouse model – multidimensional database structure / data cube  Dimensions – Attributes / set of attributes  Facts – Aggregated measures (Count / Sales amount)
  • 10. History of data warehousing  The concept of data warehousing dates back to the late 1980s when IBM researcher Barry davlin and paul murphy developed the “the business data warehouse”.  In essence, the data warehousing concept was intended to provide an architectural model for the flow of data from opeational systems to decision support environments.
  • 11. Facts about data warehousing:-  Issues involved in warehousing include techniques for dealing with errors and techniques for efficient storage and indexing of large volumes of data.  This system is used for reporting and data analysis.  It usually contains historical data derived from transaction data.  Data warehousing is not meant for current “live”data.
  • 12. Components of a data warehouse • Sources –Data source interaction • Data Transformation • Data warehouse (data storage ) • Reporting (Data presentation ) • Metadata
  • 13.
  • 14. Complete control over the four main areas of data management systems: -  Clean data  Query processing: multiple options  Indexes: multiple types  Security: data and access Data Warehouse Advantages
  • 15. Data Warehousing Disadvantages  Adding new data sources takes time and associated high cost.  Data owners lose control over their data, raising ownership, security and privacy issues.  Long initial implementation time and associated high cost.  Difficult to accommodate changes in data types and ranges, data source schema, indexes and queries.
  • 16. Characteristics of Data Warehousing:-  Subject –Oriented:-A data warehouse can be used to analyze a particular subject area. For example:-”sales” can be a particular subject.  Integrated:-A data warehouse integrates data from multiple data sources. For example:-Source A and source B may have different ways of identifying a product, but in a data warehouse,there will be only a single way of identifying a product.  Time Variant :-Historical data is kept in a data warehouse.
  • 17. For example:-One can retrieve data from 3 months ,6months, 12 months ,or even older data from a data warehouse.  Non volatile:-Once data is in the data warehouse,it will not change.So,historical data in a data warehouse should never be altered.  It must be optimized for access to very large amount of data.  It is based on client server architecture.  It is capable of handling dynamic matrices.  It maintains transparency.  It is consitent and flexible.
  • 18. DATA WAREHOUSE USAGE:- • Three kinds of data warehouse applications 1)Information processing:-Supports querying, basic statistical analysis, and reporting using crosstabs, tables, charts and graphs. 2)Analytical processing:- • Multidimensional analysis of data warehouse data • Supports basic OLAP operations, slice-dice, drilling, pivoting 3)Data mining:- • Knowledge discovery from hidden patterns • Supports associations, constructing analytical models, performing classification and prediction, and presenting the mining results using visualization tools.  Differences among the three tasks
  • 19. TRENDS IN DATA WAREHOUSING  In the next few years, data warehousing is expected make big strides in software, especially for optimizing queries:- o indexing very large tables o enhancing SQL o improving data compression o expanding dimensional modeling
  • 20.  Real-Time Data Warehousing  Multiple Data Types  Adding Unstructured Data  Searching Unstructured Data  Spatial Data  Data Visualization  Major Visualization Trends  Visualization Types  Advanced Visualization TechniquesChart Manipulation.  Drill Down.  Advanced Interaction
  • 21. Architecture of data warehouse fig:- A three tier data warehousing Top tier: Middle tier Bottom tier Data warehouse server Backend tools
  • 22. 1)Bottom tier:-The bottom tier is a warehouse database server that is always a relational database system.  Back-end tools and utilities are used to feed data into the bottom tier from operational databases or other external sources. These tools and utilities perform data extraction,cleaning and transformation as well as load and refresh functions to update the data warehouse.  The date extracted using application program interfaces known as gateways.  Example of gateways are ODBC(open database connection)and OLEDB(Open Linking and embedding for database) by microsoft and jdbc(java database connecton).  This tier also contains a metadata repository, which stores information about the data warehouse and its contents.
  • 23. 2.)Middle tier:- The middle tier is an OLAP server that is typically implemented using either:- a) A relational OLAP (ROLAP) model that is,an extended relation DBMS that maps operations.Intermediate server b/w relational back-end server and client front end tools. b) A multidimentional OLAP (MOLAP) model that is, a special purpose server that directly implements multidimentional data and operations. Supports multidimention views.
  • 24. 3.)Top tier:-The top tier is a front –end client layer ,which contains query and reporting tools ,analysis tools,and or data mining tools.  Note:- OLAP – Online Analytical Processing:  This is the major task of Data Warehousing System.  Useful for complex data analysis and decision making.  Market oriented –used by managers,executives and data analyst.
  • 25. DBMS VS Data Warehousing In today’s corporate world ,every business enterprise ,no matter how big or small requires a database. The more the business grows, the more urgent is the requirement of a database .The database is required to keep a check on the growth of a business in a specific period.
  • 26. DBMS:-DBMS is at times known as the database manager although it is the abbreviated form of database management system.  It is basically a repertoire of computer programs that devoted for the management of the database of an organization .  It is a complete and comprehensive methodology in use for specific purposes  Like overall management of digital data- bases,creation and maintenance of data,searching and serving other operations relating to the database.
  • 27. DATA WAREHOUSE:- A data warehouse is usually a place where various types’ data -bases are stored mainly for purpose of security ,archival analysis and storage.  The data warehouse consists of either one or several computer systems that are networked together form a single computer system.  The data warehouse is a database of a different kind: an OLAP (online analytical processing) database. A data warehouse exists as a layer on top of another database or databases (usually OLTP databases).
  • 28.  In DBMS,there is OLTP(online transaction processing )is used.Here we cannot analysis because data changes day by day.  In data warehousing there is OLAP(online analytical processing .It maintain historical data.It collects data from different databases like oracle and so on.It is used to find analysis and generate reports.  The key difference between DBMS and data warehouse is the fact that a data warehouse can be treated as a type of database or a kind of database which provides special facilities for analysis and reporting while DBMS is the overall system which manages a certain