Beruflich Dokumente
Kultur Dokumente
Advantages of Data Warehousing? Data warehouses tend to have a high query success, as they have complete control over the four main areas of data management systems: Clean data Indexes: multiple types Query processing: multiple options Security: data and access
The result is high costs associated with initial purchases from multiple vendors; and integration, administration, and maintenance of the BI solutions.
What is OLAP ?
OLAP stands for online analytical processing. Online: You have access to live data (rather than static data). Analytical processing: You can analyze your data for reporting. You can create reports that are: - Multidimensional -Calculation intensive - Supported by time-based analysis -Ideal for applications with unpredictable, ad hoc query requirements
A dimensional model can be implemented using tables or cubes. Tables In the case of tables, the dimensional model is typically implemented as a star or snowflake schema. Dimension tableswhich contain information about hierarchies, levels, and attributesjoin to one or more fact tables. Fact tables store quantifiable measures such as sales, expenses, and inventory. Cubes Cubes logically represent data in a similar way to tablesalthough the data is actually stored in multidimensional arrays. Like dimension tables, cube dimensions organize members into hierarchies, levels, and attributes.
Measures Represent factual data Are organized by one or more dimensions Populate the cells of a cube Can be numeric data, text, dates, Booleans,etc
Measure Types Stored measures store the result in data cells. Calculated measures evaluate calculated data from a formula.
Dimensions
Hierarchy A hierarchy is a parent-child relationship between the members of a dimension. Hierarchies enable logical groupings of dimension members for the purposes of: Navigation of data Aggregation of measures Allocation of data in a budgeting or planning application Certain calculations, such as shares and indexes Dimensions can have multiple hierarchies.
Level
Each level represents a position in the hierarchy. The members at different levels have a one-to- many parent/child relationship. A hierarchy typically contains several levels, and a single level can be included in multiple hierarchies.
Physical Layers of a Data Warehouse Staging Area You need to clean and process your operational data before putting it into the warehouse. Most data warehouses use a staging area to perform this operation. This is especially important when data is being extracted from multiple source systems
Atomic Data Layer The atomic data layer is the enterprise data model, which stores data in the third normal form.This is the most detail level of data; it does not hold any aggregate data. The atomic data layer is used as the source for dependent data marts and summary data.
Performance Data Layer The performance data layer is also known as the data exploration layer of the warehouse. It is built from the base data warehouse schema, containing summary data that allows for fast access for reporting and analytic applications. This summary layer is often implemented as a star or snowflake schema with materialized views and cubes.
Managing the Data Warehouse Warehouse Builder Oracle Database 11g delivers Oracle Warehouse Builder for managing the data warehouse. It is a best-of-breed design and ETL tool with advanced data quality and data integration capabilities.OWB can target star and snowflake schemas as well as OLAP cubes. Analytic Workspace Manager Oracle Database 11g also delivers Analytic Workspace Manager, which is a tool that is specifically designed to create and manage OLAP cubes. It assumes that the source data for the cube is of high quality, and is often used in combination with other ETL and Data Quality processes other than OWB.
ROLAP Versus MOLAP How to store the data for OLAP? In relational database tables to be used by OLAP metadata (relational OLAP or ROLAP) In analytical workspaces (multidimensional OLAP or MOLAP)
Which One to Choose: ROLAP or MOLAP? For business intelligence applications that are simple, with fixed or very predictable reporting patterns and simple calculations, the multidimensional data types (MOLAP) might not be required. You can use the relational data types. If, however, the system must support sophisticated calculations and excellent query performance in an unpredictable query environment, the performance benefits of the MOLAP option can be significant.