SlideShare uma empresa Scribd logo
1 de 20
PRESENTATION
     ON
    DATA
WAREHOUSING



        Presented By:
        Jagnesh Chawla
        Manpreet Singh
        Mintu
CONTENTS:
 Meaning Of data warehousing
 Benefit of data warehousing

 Problems

 Architecture of data warehouse

 Main components

 Data flows

 Tools and technologies

 Data Mart
MEANING:
   Data warehouse is data management and data
    analysis




   Goal: is to integrate enterprise wide corporate
    data into a single reository from which users can
    easily run queries
BENEFITS:
   The major benefit of data warehousing are high
    returns on investment.




   Increased productivity of corporate decision-
    makers
PROBLEMS:
 Underestimation of resources for data loading
 Hidden problems with source systems
 Required data not captured
 Increased end-user demands
 Data homogenization
 High demand for resources
 Data ownership
 High maintenance
 Long-duration projects
 Complexity of integration
ARCHITECTURE:

   Operational                                                                       Reporting, query,
   data source1                                                                      application
                                                                                     development,
                                                                 High                and EIS(executive
                                Meta-data                    summarized data         information system)
   Operational                                                                 Query Manage
                                                                                     tools
  data source 2                                   Lightly
                    Load Manager                summarized
                                                   data


  Operational
  data source n                 Detailed data                    DBMS
                                                                                   OLAP(online analytical
                                                                                   processing) tools

  Operational
                                    Warehouse Manager
 data store (ods)



ational data store (ODS)
                                                                                         Data mining

                                      Archive/backup
                                           data
                                                                                         End-user
                       Typical architecture of a data warehouse                          access tools
MAIN COMPONENTS:
 Operational data sourcesfor the DW is
  supplied from mainframe operational data held in
  first generation hierarchical and network databases,
  departmental data held in proprietary file systems,
  private data held on workstaions and private serves
  and external systems such as the Internet,
  commercially available DB, or DB assoicated with
  and organization’s suppliers or customers
 Operational datastore(ODS)is a
  repository of current and integrated operational data
  used for analysis. It is often structured and supplied
  with data in the same way as the data warehouse, but
  may in fact simply act as a staging area for data to be
  moved into the warehouse
MAIN COMPONENTS:
 query   manageralso called backend
 component, it performs all the operations
 associated with the management of user queries.
 The operations performed by this component
 include directing queries to the appropriate
 tables and scheduling the execution of queries
 end-user   access toolscan be categorized into
 five main groups: data reporting and query tools,
 application development tools, executive
 information system (EIS) tools, online analytical
 processing (OLAP) tools, and data mining tools
DATA FLOW:
 Inflow- The processes associated with the
  extraction, cleansing, and loading of the data
  from the source systems into the data warehouse.
 upflow- The process associated with adding value
  to the data in the warehouse through
  summarizing, packaging , packaging, and
  distribution of the data
 downflow- The processes associated with
  archiving and backing-up of data in the
  warehouse
DATA FLOW:
   outflow- The process associated with making the
    data availabe to the end-users.




   Meta-flow- The processes associated with the
    management of the meta-data
Warehouse Manager
   Operational
   data source1


                                                 Meta-flow
                                Meta-data                                High
                                                                     summarized data

Inflow                                                                                 Outflow
                                                       Lightly
                   Load                              summarized
                                                        data
                   Manager
                                                                  Upflow           Query Manage
 Operational
                                                                           DBMS
 data source n                  Detailed data

                                                Warehouse Manager


 Operational
data store (ods)
                                                                                                  Data mining
                                                                                                  tools
                                                                                                   End-user
                                                                   Downflow                        access tools

                                            Archive/backup
                                                 data


                        Information flows of a data warehouse
TOOLS AND TECHNOLOGIES:
   The critical steps in the construction of a data
    warehouse:


a. Extraction

b. Cleansing

c. Transformation
TOOLS AND TECHNOLOGIES:
   after the critical steps, loading the results into
    target system can be carried out either by
    separate products, or by a single, categories:

   code generators

   database data replication tools

   dynamic transformation engines
MANAGEMENT TOOLS:
   For the various types of meta-data and the day-
    to-day operations of the data warehouse, the
    administration and management tools must be
    capable of supporting those tasks:

   Monitoring data loading from multiple sources

   Data quality and integrity checks

   Managing and updating meta-data

   Monitoring database performance to ensure efficient query
    response times and resource utilization
 Auditing data warehouse usage to provide user
  chargeback information
 Replicating, subsetting, and distributing data

 Maintaining effient data storage management

 Purging data;

 Archiving and backing-up data

 Implementing recovery following failure

 Security management
DATA MART:
   Data mart a subset of a data warehouse that
    supports the requirements of particular
    department or business function

   The characteristics that differentiate data marts
    and data warehouses include:


   A data mart focuses on only the requirements of
    users associated with one department or business
    function
Warehouse Manager
        Operational
        data source1



                                                                          High
                                     Meta-data
                                                                      summarized data


       Operational
      data source 2                                        Lightly                                      Query
                         Load                            summarized
                                                            data                                        Manage
                         Manager

      Operational
                                                                                 DBMS
                                    Detailed data
      data source n

                                                    Warehouse Manager


      Operational
     data store (ods)


                                                    (First Tier)
                                                                                                                      (Third Tier)
Operational data store
(ODS)
                                                    Archive/backup                                                     End-user
                                                         data                                                          access tools

                                                                              Data Mart

                                                                                  summarized
                                                                            data(Relational database)




                                                                           Summarized data
                                                                       (Multi-dimension database)           (Second Tier)

                               Typical data warehouse adn data mart architecture
DATA MART ISSUES:
   Data mart functionalitythe capabilities of data marts
    have increased with the growth in their popularity


   Data mart sizethe performance deteriorates as data
    marts grow in size, so need to reduce the size of data marts
    to gain improvements in performance


   Data mart load performancetwo critical components:
    end-user response time and data loading performanceto
    increment DB updating so that only cells affected by the
    change are updated and not the entire MDDB structure
REFERENCES:
 Book of DBMS
 Google.com

 Wikipedia, the free encyclopedia

 InformIT.com

 Allfree-stuff.com
data warehousing

Mais conteúdo relacionado

Mais procurados

Data Warehouse Modeling
Data Warehouse ModelingData Warehouse Modeling
Data Warehouse Modelingvivekjv
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data WarehousingEyad Manna
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureJames Serra
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecturepcherukumalla
 
Modern data warehouse presentation
Modern data warehouse presentationModern data warehouse presentation
Modern data warehouse presentationDavid Rice
 
Data Warehouse Fundamentals
Data Warehouse FundamentalsData Warehouse Fundamentals
Data Warehouse FundamentalsRashmi Bhat
 
Data Virtualization: An Essential Component of a Cloud Data Lake
Data Virtualization: An Essential Component of a Cloud Data LakeData Virtualization: An Essential Component of a Cloud Data Lake
Data Virtualization: An Essential Component of a Cloud Data LakeDenodo
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSINGKing Julian
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data WarehousingEdureka!
 
DATA Warehousing & Data Mining
DATA Warehousing & Data MiningDATA Warehousing & Data Mining
DATA Warehousing & Data Miningcpjcollege
 
Snowflake Best Practices for Elastic Data Warehousing
Snowflake Best Practices for Elastic Data WarehousingSnowflake Best Practices for Elastic Data Warehousing
Snowflake Best Practices for Elastic Data WarehousingAmazon Web Services
 
The oracle database architecture
The oracle database architectureThe oracle database architecture
The oracle database architectureAkash Pramanik
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationDenodo
 

Mais procurados (20)

Data Warehouse Modeling
Data Warehouse ModelingData Warehouse Modeling
Data Warehouse Modeling
 
Data Lifecycle Management
Data Lifecycle ManagementData Lifecycle Management
Data Lifecycle Management
 
Database System Architectures
Database System ArchitecturesDatabase System Architectures
Database System Architectures
 
Modern Data Architecture
Modern Data ArchitectureModern Data Architecture
Modern Data Architecture
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
 
Modern data warehouse presentation
Modern data warehouse presentationModern data warehouse presentation
Modern data warehouse presentation
 
Data Warehouse Fundamentals
Data Warehouse FundamentalsData Warehouse Fundamentals
Data Warehouse Fundamentals
 
Databases
DatabasesDatabases
Databases
 
Data Virtualization: An Essential Component of a Cloud Data Lake
Data Virtualization: An Essential Component of a Cloud Data LakeData Virtualization: An Essential Component of a Cloud Data Lake
Data Virtualization: An Essential Component of a Cloud Data Lake
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
DATA Warehousing & Data Mining
DATA Warehousing & Data MiningDATA Warehousing & Data Mining
DATA Warehousing & Data Mining
 
Snowflake Best Practices for Elastic Data Warehousing
Snowflake Best Practices for Elastic Data WarehousingSnowflake Best Practices for Elastic Data Warehousing
Snowflake Best Practices for Elastic Data Warehousing
 
XML Databases
XML DatabasesXML Databases
XML Databases
 
The oracle database architecture
The oracle database architectureThe oracle database architecture
The oracle database architecture
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data Virtualization
 

Destaque

Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse conceptsobieefans
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingsumit621
 
Lecture 13
Lecture 13Lecture 13
Lecture 13Shani729
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data WarehousingAswathy S Nair
 
Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)Harish Chand
 
Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Conceptsraulmisir
 
Data Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationData Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationSunderland City Council
 
Types of databases
Types of databasesTypes of databases
Types of databasesPAQUIAAIZEL
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data WarehousingJason S
 

Destaque (16)

Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse concepts
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Lecture 13
Lecture 13Lecture 13
Lecture 13
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
 
Database and types of databases
Database and types of databasesDatabase and types of databases
Database and types of databases
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data Warehousing
 
Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)
 
Types of database
Types of databaseTypes of database
Types of database
 
Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Concepts
 
Data Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationData Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data Visualisation
 
wi-fi ppt
wi-fi pptwi-fi ppt
wi-fi ppt
 
Types dbms
Types dbmsTypes dbms
Types dbms
 
Types of databases
Types of databasesTypes of databases
Types of databases
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 

Semelhante a data warehousing

data resource management
 data resource management data resource management
data resource managementsoodsurbhi123
 
Oracle: Fundamental Of Dw
Oracle: Fundamental Of DwOracle: Fundamental Of Dw
Oracle: Fundamental Of Dworacle content
 
Big Data For Investment Research Management
Big Data For Investment Research ManagementBig Data For Investment Research Management
Big Data For Investment Research ManagementIDT Partners
 
OLAP & Data Warehouse
OLAP & Data WarehouseOLAP & Data Warehouse
OLAP & Data WarehouseZalpa Rathod
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEZalpa Rathod
 
Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Cana Ko
 
Informatica Interview Questions & Answers
Informatica Interview Questions & AnswersInformatica Interview Questions & Answers
Informatica Interview Questions & AnswersZaranTech LLC
 
Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems divjeev
 
Establishing A Robust Data Migration Methodology - White Paper
Establishing A Robust Data Migration Methodology - White PaperEstablishing A Robust Data Migration Methodology - White Paper
Establishing A Robust Data Migration Methodology - White PaperJames Chi
 
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP OpsIRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP OpsIRJET Journal
 
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse EMC
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data SolutionsMark Kromer
 
Informatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityInformatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityDatabase Architechs
 
DATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forDATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forAyushMeraki1
 

Semelhante a data warehousing (20)

data resource management
 data resource management data resource management
data resource management
 
Oracle: Fundamental Of DW
Oracle: Fundamental Of DWOracle: Fundamental Of DW
Oracle: Fundamental Of DW
 
Oracle: Fundamental Of Dw
Oracle: Fundamental Of DwOracle: Fundamental Of Dw
Oracle: Fundamental Of Dw
 
Big Data For Investment Research Management
Big Data For Investment Research ManagementBig Data For Investment Research Management
Big Data For Investment Research Management
 
DW 101
DW 101DW 101
DW 101
 
OLAP & Data Warehouse
OLAP & Data WarehouseOLAP & Data Warehouse
OLAP & Data Warehouse
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
 
Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831
 
Informatica Interview Questions & Answers
Informatica Interview Questions & AnswersInformatica Interview Questions & Answers
Informatica Interview Questions & Answers
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
 
Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems
 
Establishing A Robust Data Migration Methodology - White Paper
Establishing A Robust Data Migration Methodology - White PaperEstablishing A Robust Data Migration Methodology - White Paper
Establishing A Robust Data Migration Methodology - White Paper
 
Ch03
Ch03Ch03
Ch03
 
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP OpsIRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
 
Data Management
Data ManagementData Management
Data Management
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
Use Big Data Technologies to Modernize Your Enterprise Data Warehouse
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data Solutions
 
Informatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityInformatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data Quality
 
DATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forDATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining for
 

Último

Easier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties ReimaginedEasier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties Reimaginedpanagenda
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGDSC PJATK
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsLeah Henrickson
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfFIDO Alliance
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxJennifer Lim
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon
 
Your enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4jYour enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4jNeo4j
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfFIDO Alliance
 
Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024Enterprise Knowledge
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024Lorenzo Miniero
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfFIDO Alliance
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIES VE
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceSamy Fodil
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfFIDO Alliance
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfSrushith Repakula
 
Designing for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at ComcastDesigning for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at ComcastUXDXConf
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaCzechDreamin
 
ECS 2024 Teams Premium - Pretty Secure
ECS 2024   Teams Premium - Pretty SecureECS 2024   Teams Premium - Pretty Secure
ECS 2024 Teams Premium - Pretty SecureFemke de Vroome
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024Stephanie Beckett
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentationyogeshlabana357357
 

Último (20)

Easier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties ReimaginedEasier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdf
 
Your enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4jYour enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4j
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
Designing for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at ComcastDesigning for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at Comcast
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
ECS 2024 Teams Premium - Pretty Secure
ECS 2024   Teams Premium - Pretty SecureECS 2024   Teams Premium - Pretty Secure
ECS 2024 Teams Premium - Pretty Secure
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 

data warehousing

  • 1. PRESENTATION ON DATA WAREHOUSING Presented By: Jagnesh Chawla Manpreet Singh Mintu
  • 2. CONTENTS:  Meaning Of data warehousing  Benefit of data warehousing  Problems  Architecture of data warehouse  Main components  Data flows  Tools and technologies  Data Mart
  • 3. MEANING:  Data warehouse is data management and data analysis  Goal: is to integrate enterprise wide corporate data into a single reository from which users can easily run queries
  • 4. BENEFITS:  The major benefit of data warehousing are high returns on investment.  Increased productivity of corporate decision- makers
  • 5. PROBLEMS:  Underestimation of resources for data loading  Hidden problems with source systems  Required data not captured  Increased end-user demands  Data homogenization  High demand for resources  Data ownership  High maintenance  Long-duration projects  Complexity of integration
  • 6. ARCHITECTURE: Operational Reporting, query, data source1 application development, High and EIS(executive Meta-data summarized data information system) Operational Query Manage tools data source 2 Lightly Load Manager summarized data Operational data source n Detailed data DBMS OLAP(online analytical processing) tools Operational Warehouse Manager data store (ods) ational data store (ODS) Data mining Archive/backup data End-user Typical architecture of a data warehouse access tools
  • 7. MAIN COMPONENTS:  Operational data sourcesfor the DW is supplied from mainframe operational data held in first generation hierarchical and network databases, departmental data held in proprietary file systems, private data held on workstaions and private serves and external systems such as the Internet, commercially available DB, or DB assoicated with and organization’s suppliers or customers  Operational datastore(ODS)is a repository of current and integrated operational data used for analysis. It is often structured and supplied with data in the same way as the data warehouse, but may in fact simply act as a staging area for data to be moved into the warehouse
  • 8. MAIN COMPONENTS:  query manageralso called backend component, it performs all the operations associated with the management of user queries. The operations performed by this component include directing queries to the appropriate tables and scheduling the execution of queries  end-user access toolscan be categorized into five main groups: data reporting and query tools, application development tools, executive information system (EIS) tools, online analytical processing (OLAP) tools, and data mining tools
  • 9. DATA FLOW:  Inflow- The processes associated with the extraction, cleansing, and loading of the data from the source systems into the data warehouse.  upflow- The process associated with adding value to the data in the warehouse through summarizing, packaging , packaging, and distribution of the data  downflow- The processes associated with archiving and backing-up of data in the warehouse
  • 10. DATA FLOW:  outflow- The process associated with making the data availabe to the end-users.  Meta-flow- The processes associated with the management of the meta-data
  • 11. Warehouse Manager Operational data source1 Meta-flow Meta-data High summarized data Inflow Outflow Lightly Load summarized data Manager Upflow Query Manage Operational DBMS data source n Detailed data Warehouse Manager Operational data store (ods) Data mining tools End-user Downflow access tools Archive/backup data Information flows of a data warehouse
  • 12. TOOLS AND TECHNOLOGIES:  The critical steps in the construction of a data warehouse: a. Extraction b. Cleansing c. Transformation
  • 13. TOOLS AND TECHNOLOGIES:  after the critical steps, loading the results into target system can be carried out either by separate products, or by a single, categories:  code generators  database data replication tools  dynamic transformation engines
  • 14. MANAGEMENT TOOLS:  For the various types of meta-data and the day- to-day operations of the data warehouse, the administration and management tools must be capable of supporting those tasks:  Monitoring data loading from multiple sources  Data quality and integrity checks  Managing and updating meta-data  Monitoring database performance to ensure efficient query response times and resource utilization
  • 15.  Auditing data warehouse usage to provide user chargeback information  Replicating, subsetting, and distributing data  Maintaining effient data storage management  Purging data;  Archiving and backing-up data  Implementing recovery following failure  Security management
  • 16. DATA MART:  Data mart a subset of a data warehouse that supports the requirements of particular department or business function  The characteristics that differentiate data marts and data warehouses include:  A data mart focuses on only the requirements of users associated with one department or business function
  • 17. Warehouse Manager Operational data source1 High Meta-data summarized data Operational data source 2 Lightly Query Load summarized data Manage Manager Operational DBMS Detailed data data source n Warehouse Manager Operational data store (ods) (First Tier) (Third Tier) Operational data store (ODS) Archive/backup End-user data access tools Data Mart summarized data(Relational database) Summarized data (Multi-dimension database) (Second Tier) Typical data warehouse adn data mart architecture
  • 18. DATA MART ISSUES:  Data mart functionalitythe capabilities of data marts have increased with the growth in their popularity  Data mart sizethe performance deteriorates as data marts grow in size, so need to reduce the size of data marts to gain improvements in performance  Data mart load performancetwo critical components: end-user response time and data loading performanceto increment DB updating so that only cells affected by the change are updated and not the entire MDDB structure
  • 19. REFERENCES:  Book of DBMS  Google.com  Wikipedia, the free encyclopedia  InformIT.com  Allfree-stuff.com