
Data Warehousing, Business Intelligence, and OLAP

What Is Data Warehousing?


A data warehouse is a strategic collection of all types of data in support of the decision-making process at all levels of an enterprise. It is a single data store created for two primary reasons: analytical reporting and decision support.

Technological Solutions for Data Warehousing

- Parallelism
- Partitioning
- Data compression

Advantages of Data Warehousing

Data warehouses tend to have a high query success rate because they have complete control over the four main areas of data management systems:
- Clean data
- Indexes: multiple types
- Query processing: multiple options
- Security: data and access

What Is Business Intelligence: Definition and Purpose


Business intelligence is the process of transforming data into information and, through discovery, transforming that information into knowledge.
The purpose of business intelligence is to convert the volume of data into business value through analytical reporting.

The User Dilemma


Where Is That Information? Can I Trust My Data?

Problem: Multivendor, Unintegrated Environment


Most BI solutions are incomplete, requiring you to integrate disparate BI systems from multiple vendors for capabilities such as ad hoc query, Web analysis, reporting, online analytical processing (OLAP), data mining, and extraction, transformation, and loading (ETL).

The result is high costs, both for the initial purchases from multiple vendors and for the integration, administration, and maintenance of the BI solutions.

The Optimal Information Platform


- Security and compliance
- Management and change
- High availability
- RAC

Business Intelligence: Requirements


- Efficient design of data warehouses
- Enterprise reporting
- Ad hoc query and analysis (relational and multidimensional)
- Advanced analytics
- Easy administration

What Is OLAP?

OLAP stands for online analytical processing.
- Online: You have access to live data (rather than static data).
- Analytical processing: You can analyze your data for reporting.

You can create reports that are:
- Multidimensional
- Calculation intensive
- Supported by time-based analysis
- Ideal for applications with unpredictable, ad hoc query requirements

The Dimensional Model


The multidimensional logical model has the following elements:
- Measures
- Dimensions
- Hierarchies
- Levels
- Attributes

A dimensional model can be implemented using tables or cubes.

Tables

In the case of tables, the dimensional model is typically implemented as a star or snowflake schema. Dimension tables, which contain information about hierarchies, levels, and attributes, join to one or more fact tables. Fact tables store quantifiable measures such as sales, expenses, and inventory.

Cubes

Cubes logically represent data in a similar way to tables, although the data is actually stored in multidimensional arrays. Like dimension tables, cube dimensions organize members into hierarchies, levels, and attributes.
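The table-based implementation can be illustrated with a small star schema. The following is a minimal sketch using Python's built-in sqlite3 module; the table names (dim_product, dim_time, fact_sales), columns, and sample rows are hypothetical and chosen only for illustration.

```python
import sqlite3

# Hypothetical star schema: one fact table joined to two dimension tables.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, name TEXT, category TEXT);
    CREATE TABLE dim_time (time_id INTEGER PRIMARY KEY, month TEXT, year INTEGER);
    CREATE TABLE fact_sales (
        product_id INTEGER REFERENCES dim_product(product_id),
        time_id INTEGER REFERENCES dim_time(time_id),
        sales REAL
    );
    INSERT INTO dim_product VALUES
        (1, 'Laptop', 'Hardware'), (2, 'Mouse', 'Hardware'), (3, 'Editor', 'Software');
    INSERT INTO dim_time VALUES (1, '2024-01', 2024), (2, '2024-02', 2024);
    INSERT INTO fact_sales VALUES
        (1, 1, 1000.0), (2, 1, 50.0), (3, 1, 200.0), (1, 2, 1500.0), (3, 2, 300.0);
""")

def sales_by_category_and_year(connection):
    """Join the fact table to its dimensions and aggregate the sales measure."""
    return connection.execute("""
        SELECT p.category, t.year, SUM(f.sales)
        FROM fact_sales AS f
        JOIN dim_product AS p ON p.product_id = f.product_id
        JOIN dim_time AS t ON t.time_id = f.time_id
        GROUP BY p.category, t.year
        ORDER BY p.category, t.year
    """).fetchall()

star_result = sales_by_category_and_year(conn)
```

The fact table holds only keys and the measure; descriptive attributes such as category and year live in the dimension tables and are reached through the joins.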

Measures

- Represent factual data
- Are organized by one or more dimensions
- Populate the cells of a cube
- Can be numeric data, text, dates, Booleans, etc.

Measure Types

- Stored measures store the result in data cells.
- Calculated measures evaluate calculated data from a formula.
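The difference between the two measure types can be sketched in a few lines of Python. The measure names (sales, cost, margin) and the values are assumptions made up for this example.

```python
# Stored measures: values held directly in the cells, keyed by (product, month).
sales = {("Laptop", "Jan"): 1000.0, ("Laptop", "Feb"): 1500.0}
cost = {("Laptop", "Jan"): 700.0, ("Laptop", "Feb"): 900.0}

def margin(cell):
    """Calculated measure: derived on request from a formula, never stored."""
    return sales[cell] - cost[cell]
```

Here margin occupies no storage of its own; it is evaluated from the stored measures each time a cell is requested.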

Dimensions

- Form the edges of the measure
- Provide pointers to the actual cells inside the multidimensional measures
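One way to picture dimensions as edges and pointers is a two-dimensional array whose rows and columns are labeled by dimension members. This is a simplified sketch with hypothetical products and months, not a description of how any particular OLAP engine stores its data.

```python
# Each dimension is an ordered list of members; together they form the edges of the measure.
products = ["Laptop", "Mouse"]
months = ["Jan", "Feb", "Mar"]

# The measure is a 2-D array with one cell per (product, month) combination.
sales_cells = [
    [1000.0, 1500.0, 1200.0],  # Laptop
    [50.0, 40.0, 60.0],        # Mouse
]

def cell_value(product, month):
    """Use the dimension members as pointers (indexes) into the measure's cells."""
    return sales_cells[products.index(product)][months.index(month)]
```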

Hierarchy

A hierarchy is a parent-child relationship between the members of a dimension. Hierarchies enable logical groupings of dimension members for the purposes of:
- Navigation of data
- Aggregation of measures
- Allocation of data in a budgeting or planning application
- Certain calculations, such as shares and indexes

Dimensions can have multiple hierarchies.
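Aggregation along a hierarchy can be sketched as rolling each lowest-level value up through its ancestors. The geography members and sales figures below are hypothetical.

```python
from collections import defaultdict

# Hypothetical geography hierarchy: each member maps to its parent (None marks the top).
parent_of = {
    "Berlin": "Germany", "Munich": "Germany", "Paris": "France",
    "Germany": "Europe", "France": "Europe", "Europe": None,
}

# Measure values are stored only for the lowest-level members.
leaf_sales = {"Berlin": 100.0, "Munich": 50.0, "Paris": 80.0}

def aggregate(leaf_values, parents):
    """Roll each leaf value up to every ancestor in the hierarchy."""
    totals = defaultdict(float)
    for member, value in leaf_values.items():
        while member is not None:
            totals[member] += value
            member = parents[member]
    return dict(totals)

totals = aggregate(leaf_sales, parent_of)
```

With the totals in place, a share calculation such as Germany's portion of Europe is simply totals["Germany"] / totals["Europe"].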

Level

Each level represents a position in the hierarchy. The members at different levels have a one-to-many parent/child relationship. A hierarchy typically contains several levels, and a single level can be included in multiple hierarchies.

Types of Hierarchy

- Level based
- Value based
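The two types can be contrasted with a short sketch, assuming the usual meaning of the terms: a level-based hierarchy is defined by named levels, whereas a value-based hierarchy is defined only by parent-child links between member values. The time and employee data below are hypothetical.

```python
# Level-based: every member belongs to a named level, so a row lists one ancestor per level.
time_levels = ["year", "quarter", "month"]
time_rows = [
    {"year": "2024", "quarter": "2024-Q1", "month": "2024-01"},
    {"year": "2024", "quarter": "2024-Q1", "month": "2024-02"},
]

# Value-based: only parent-child links between member values; there are no named levels.
manager_of = {"Ana": None, "Ben": "Ana", "Cho": "Ben"}

def ancestors_level_based(row, levels, level):
    """Read ancestors from the columns above the given level, nearest first."""
    return [row[name] for name in levels[:levels.index(level)]][::-1]

def ancestors_value_based(member, parents):
    """Follow parent links until the top of the hierarchy is reached."""
    chain = []
    while parents[member] is not None:
        member = parents[member]
        chain.append(member)
    return chain
```

A time dimension usually suits the level-based form because every month sits at the same depth, while an employee-manager structure suits the value-based form because branches can differ in depth.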

Physical Layers of a Data Warehouse

Staging Area

You need to clean and process your operational data before putting it into the warehouse. Most data warehouses use a staging area to perform this operation. This is especially important when data is being extracted from multiple source systems.
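A staging step might look roughly like the following sketch. The source rows, field names, and cleaning rules (trim and title-case names, reject unparseable amounts, treat identical name and amount pairs as duplicates) are all assumptions for illustration; real rules depend on the source systems.

```python
# Hypothetical raw rows extracted from two source systems with inconsistent formatting.
raw_rows = [
    {"customer": " Acme Corp ", "amount": "100.50", "source": "crm"},
    {"customer": "acme corp", "amount": "100.50", "source": "billing"},
    {"customer": "Globex", "amount": "n/a", "source": "crm"},
]

def clean(rows):
    """Standardize names, reject unparseable amounts, and drop duplicates."""
    seen, cleaned, rejected = set(), [], []
    for row in rows:
        name = row["customer"].strip().title()
        try:
            amount = float(row["amount"])
        except ValueError:
            rejected.append(row)  # keep rejected rows for later inspection
            continue
        key = (name, amount)
        if key not in seen:
            seen.add(key)
            cleaned.append({"customer": name, "amount": amount})
    return cleaned, rejected

cleaned_rows, rejected_rows = clean(raw_rows)
```

Only cleaned_rows would move on to the warehouse; rejected_rows stay in the staging area until they are corrected at the source or discarded.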

Atomic Data Layer

The atomic data layer is the enterprise data model, which stores data in third normal form. This is the most detailed level of data; it does not hold any aggregate data. The atomic data layer is used as the source for dependent data marts and summary data.

Performance Data Layer

The performance data layer is also known as the data exploration layer of the warehouse. It is built from the base data warehouse schema and contains summary data that allows fast access for reporting and analytic applications. This summary layer is often implemented as a star or snowflake schema with materialized views and cubes.
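The relationship between detail data and the summary layer can be sketched as a precomputed aggregate. The rows and the monthly grain below are hypothetical; in a database, the same idea would typically be realized with a materialized view or a cube.

```python
from collections import defaultdict

# Atomic layer: hypothetical detail rows, one per individual sale (no aggregates).
atomic_sales = [
    ("2024-01-03", "Laptop", 1000.0),
    ("2024-01-17", "Laptop", 500.0),
    ("2024-02-02", "Mouse", 50.0),
]

def build_summary(rows):
    """Precompute sales per (month, product), as a summary table would hold them."""
    summary = defaultdict(float)
    for day, product, amount in rows:
        summary[(day[:7], product)] += amount  # day[:7] is the YYYY-MM prefix
    return dict(summary)

monthly_summary = build_summary(atomic_sales)
```

Reports then read monthly_summary directly instead of rescanning the detail rows, which is the source of the fast access that the performance layer provides.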

Managing the Data Warehouse

Warehouse Builder

Oracle Database 11g delivers Oracle Warehouse Builder (OWB) for managing the data warehouse. It is a best-of-breed design and ETL tool with advanced data quality and data integration capabilities. OWB can target star and snowflake schemas as well as OLAP cubes.

Analytic Workspace Manager

Oracle Database 11g also delivers Analytic Workspace Manager, a tool specifically designed to create and manage OLAP cubes. It assumes that the source data for the cube is of high quality, and it is often used in combination with ETL and data quality processes other than OWB.

ROLAP Versus MOLAP

How should the data for OLAP be stored?
- In relational database tables accessed through OLAP metadata (relational OLAP, or ROLAP)
- In analytic workspaces (multidimensional OLAP, or MOLAP)

Which One to Choose: ROLAP or MOLAP?

For business intelligence applications that are simple, with fixed or very predictable reporting patterns and simple calculations, the multidimensional data types (MOLAP) might not be required. You can use the relational data types. If, however, the system must support sophisticated calculations and excellent query performance in an unpredictable query environment, the performance benefits of the MOLAP option can be significant.
