3. What is Data Warehouse ? A data warehouse is a repository of information gathered from multiple sources stored under a unified schema,at a single site. The data warehouse is a relational data base organised to hold information in a structure that best supports reporting and analysis.
4.
5.
6. Architecture : A Data Warehouse Architecture (DWA) is a way of representing the overall structure of data, communication, processing and presentation that exists for end-user computing within the enterprise. The architecture of data warehouse is as follows:
7.
8. Load Manager : Data flows into the data warehouse through the “load manager”.The data is extracted from the operational databases & supplemented by data imported from external sources.
9. Query manager : It provides an interface between the warehouse& its users.It performs task like directing the queries to appropriate tables, monitoring the effectiveness of the indexes & summary data & query scheduling.
10.
11.
12.
13. Data Sources: Data sources refers to any electronic repository of information where data is passed from these systems to the data warehouse either on a transaction-by transaction basis for real-time data warehouses or on a regular cycle. Data Transformation: The Data Transformation layer receives data from the data sources, cleans and standardizes it, and loads it into the data repository. Data Warehouse: The data warehouse is a relational database organized to hold information in a structure that best supports reporting and analysis.
14. Reporting: The data in the data warehouse must be available to all the users if the data warehouse is to be useful. Metadata: Metadata or "data about data", is used to inform users of the data warehouse about its status and the information held within the data warehouse. Operations: Data warehouse operations comprises of the processes of loading, manipulating and extracting data from the data warehouse. Operations also covers user management, security, capacity management and related functions.
15.
16.
17.
18.
19.
20.
21. Future Developments: Data Warehousing is such a new field that it is difficult to estimate what new developments are likely to most affect it. Clearly, the development of parallel DB servers with improved query engines is likely to be one of the most important. Parallel servers will make it possible to access huge data bases in much less time.
22. Conclusion : Data Warehousing is not a new phenomenon. All large organizations already have data warehouses, but they are just not managing them. Over the next few years, the growth of data warehousing is going to be enormous with new products and technologies coming out frequently. In order to get the most out of this period, it is going to be important that data warehouse planners and developers have a clear idea of what they are looking for and then choose strategies and methods that will provide them with performance today and flexibility for tomorrow.