Uploaded on Nov 22, 2021
PPT on Data Warehousing.
Data Warehousing
DATA WAREHOUSING Introduction • A data warehouse is a type of data management system that is designed to enable and support business intelligence (BI) activities, especially analytics. • Data warehouses are solely intended to perform queries and analysis and often contain large amounts of historical data. Source: www.oracle.com Elements • A relational database to store and manage data • An extraction, loading, and transformation (ELT) solution for preparing the data for analysis • Statistical analysis, reporting, and data mining capabilities • Client analysis tools for visualizing and presenting data to business users Source: www.oracle.com Benefits of a Data Warehouse • Data warehouses offer the overarching and unique benefit of allowing organizations to analyze large amounts of variant data and extract significant value from it, as well as to keep a historical record. Source: www.oracle.com Characteristics • Subject-oriented: They can analyze data about a particular subject or functional area (such as sales). • Integrated: Data warehouses create consistency among different data types from disparate sources. • Nonvolatile: Once data is in a data warehouse, it’s stable and doesn’t change. • Time-variant: Data warehouse analysis looks at change over time. Source: www.oracle.com Common architectures • Simple: All data warehouses share a basic design in which metadata, summary data, and raw data are stored within the central repository of the warehouse. • Simple with a staging area: Operational data must be cleaned and processed before being put in the warehouse. Although this can be done programmatically, many data warehouses add a staging area for data before it enters the warehouse, to simplify data preparation. Source: www.oracle.com Common architectures cont. • Hub and spoke. Adding data marts between the central repository and end users allows an organization to customize its data warehouse to serve various lines of business. • Sandboxes. Sandboxes are private, secure, safe areas that allow companies to quickly and informally explore new datasets or ways of analyzing data without having to conform to or comply with the formal rules and protocol of the data warehouse. Source: www.oracle.com What is a Cloud Data Warehouse? • A cloud data warehouse uses the cloud to ingest and store data from disparate data sources. • The original data warehouses were built with on-premises servers. Source: www.oracle.com Advantages of cloud data warehouses • Elastic, scale-out support for large or variable compute or storage requirements • Ease of use • Ease of management • Cost savings Source: www.oracle.com What is a Modern Data Warehouse? • A modern data architecture addresses those different needs by providing a way to manage all data types, workloads, and analysis. • It consists of architecture patterns with necessary components integrated to work together in alignment with industry best practices. Source: www.oracle.com Designing a Data Warehouse • Any data warehouse design must address the following: – Specific data content – Relationships within and between groups of data – The systems environment that will support the data warehouse – The types of data transformations required – Data refresh frequency Source: www.oracle.com
Comments