Search any resource (Books, Web Sites, Papers, etc.) to find three definitions for Data Warehousing. Include the detailed information (Title, authors and the source of the definitions. For example:
“Data warehousing is a collection of decision support technologies, aimed at enabling the knowledge worker (executive, manager, analyst) to make better and faster decisions.” An overview of data warehousing and OLAP technology by S Chaudhuri, U Dayal, from ACM Sigmod record, Volume 26 , Issue 1 (March 1997) Pages: 65 – 74.
1. “A data warehouse is an integrated and time varying collection of data derived from operational data and primarily used in strategic decision making by means of online analytical processing (OLAP) techniques.” from “Conceptual data warehouse design” by B. Husemann, J. Lichtenberger, and G. Vossen. Page 1.
2. “A galactic data warehouse is a subject-oriented, integrated, time-variant, non-volatile collection of data in support of management’s decision making process about any and all enterprise business processes and departments, and about the enterprise taken as a whole. A business process-oriented data warehouse is a subject-oriented, integrated, time-variant, non-volatile collection of data in support of management’s decision making process about any and all business processes and their interactions with one another and the external world. A department-oriented data warehouse is a subject-oriented, integrated, time-variant, non-volatile collection of data in support of management’s decision making process about any and all departments, and their interactions with one another and with the external world.” From DKMS Brief No. Six: Data Warehouses, Data Marts, and Data Warehousing: New Definitions and New Conceptions by Joseph M.Firestone.
3. “Physically, a data warehouse system consists of databases (source databases, materialized views in the data warehouse), data transport agents that ship data from one database to another, and a repository which stores meta data about the system and its evolution.” From Architecture and Quality in Data warehouses: An Extended Repository Approach by M. Jarke, M. A. Jeusfeld, C. Quix, and P. Vassiliadis.
Provide a brief summary to compare the three definitions that you’ve found. Tell me which one is your favorite and why?
The first definition explains the components of a data warehouse and also its functionality in a general way. The second definition explains the function of a data warehouse and its components specific to each kind of a data warehouse like decision making with respect to a business module. The third definition explains the components of a data warehouse but does not specify the functionality of a data warehouse. I prefer the second definition over the other two definitions.