Beruflich Dokumente
Kultur Dokumente
Overview
q q q q
The Process of Planning and Building a Warehouse Data Warehouse Architecture (revisited) Classification of Tools Focus: OLAP Tools
Multidimensional Data Modeling OLAP Architectures OLAP query languages
q q
Implementation
Conceptual Design (Implementation Independent) Logical + Physical Design (e.g. Product specific)
Classification of Tools
q q q
Repository Systems
Metadata Storage
Repository Systems
q
Technical Metadata
When was data last loaded from which system Data model for OLTP and OLAP databases
q
Functionality
Communication hub for different tools Guides user exploration Guides development process Impact analysis
ETL Tools
q
Transformation
(Graphical) Specification of Transformation Rules (Expressive Power)
Loading
Ability to use database features (e.g. bulk loading)
Process Management
Scheduling, Monitoring, Error Handling
Databases for DW
q
q q
Support for Materialized Views (Preaggregation) Special Analytical Capabilities (e.g. SQL Extensions)
Top N Ranking
Frontend Tools
Reporting Why did it happen? Interactive OLAP Ad hoc-Queries What will happen?
Additional Benefit
DW: Tools and Projects
What happened?
Data Mining
Number of Users
2001 FORWISS,Carsten Sapia - sapia@forwiss.de
10
11
specialized database technology multidimensional storage structures E.g. Hyperion Essbase, Oracle Express, Cognos PowerPlay (Server) Query Performance Powerful MD Model write access Database Features
multiuser access/ backup and recovery
Frontend Tool
q q
+ + +
Multidim. Database
Sparsity
DW: Tools and Projects
12
idea: use relational data storage star (snowflake) schema E.g. Microstrategy, SAP BW advantages of RDBMS
+ scalability, reliability, security etc.
ROLAPEngine
SQL
Meta Data
Query Data
Relational DB
DW: Tools and Projects
no
write access
2001 FORWISS,Carsten Sapia - sapia@forwiss.de
13
proprietary data structure on the client data stored as file mostly RAM based architectures E.g. Business Objects, Cognos PowerPlay mobile user ease of installation and use volume multiuser capabilites
2001 FORWISS,Carsten Sapia - sapia@forwiss.de
+ +
data no
DW: Tools and Projects
14
DW Integration
MOLAP ROLAP ClientOLAP
15
Combining Architectures I
Drill through
qhighly
Multidim. Database
qdense q95%
Relational Database
DW: Tools and Projects
(sparse)
of the requirements
2001 FORWISS,Carsten Sapia - sapia@forwiss.de
16
Combining Architectures II
Hybrid OLAP (HOLAP)
qequal
Data
qStorage qCube
HOLAP System
Meta Data
Multidim. Storage
DW: Tools and Projects
Relational Storage
2001 FORWISS,Carsten Sapia - sapia@forwiss.de
17
OLAP Standards
q q q
Idea: define interface between client and server Benefit: Component oriented architectures Proposal 1: OLAP Council
union of OLAP Tool producers not implemented so far (even by the council members)
artwork copyright Intersystems GmbH artwork copyright Intersystems GmbH 1999 FORWISS
19
Conceptual Design
Using the Data Warehouse Requirement Analysis
Implementation
20
Which business process is being modeled? What is the subject of analysis (fact) and what is being measured? On what granularity level is active analysis being done? Which properties (dimensions) determine the measures? Which different levels of aggregation are meaningful? What additional information is needed for the different levels? What is the variability and the cardinality of the dimensions?
q q q q q
21
Facts
q q q q
Fact = Subject of Analysis Measures = Attributes describing facts Derived Measures Additivity of Measures
globally additiv additiv for some dimensions
22
Dimensions
q q q q
Dimensions = static structure of business information Used for navigating the data space Choosing the necessary granularity Dimension Members = Instances of a dimension
e.g. 8.12.1997 and Juli 1997 are members of dimension time
Structuring Dimension
using different dimension levels (hierarchies) using descriptive attributes
23
Simple Hierarchies
Month Quarter 1/2 Year Period Year
Dimension Level
2. Halbjahr 99
24
Unbalanced Hierarchies
Plant/Site Plant1 Div A ... Great Outdoors Bu 1 Div B Bu 2 ...
DW: Tools and Projects 2001 FORWISS,Carsten Sapia - sapia@forwiss.de
Business Unit
Business Division
Enterprise
25
Alternative Hierarchies
Customer Geogr. Region Bavaria Hessen Hamburg Germany Country
26
Alternative Pathes
Ort Geogr. Region Country
Bayern Hessen Hamburg Germany Germany (South) Germany (West) Germany (North)
Sales Region
DW: Tools and Projects 2001 FORWISS,Carsten Sapia - sapia@forwiss.de
27
q q
28
29
Branc h
Margin Range
30
Client
Impromptu
Server
31
Logical+Physical Design
Using the Data Warehouse Requirement Analysis
Implementation
32
Practical Demonstration
33
q q q
Hierarchical structures of dimensions can be complex ME/R notation can be used to document conceptual models Several ways to map an MD model to a relational DB
2001 FORWISS,Carsten Sapia - sapia@forwiss.de
34
A B A B
Result Measures m1 m2
35
SELECT g1,...,gn, aggr(m1),..., aggr(mk) FROM FactName, Dim1,..., Dimn WHERE Dim1.level(r1) = r1 AND ... AND Dimn.level(rn) = rn AND Dim1.d1=FactName.d1 AND ... AND Dimn.dn=FactName.dn GROUP BY g1,...,gn
DW: Tools and Projects 2001 FORWISS,Carsten Sapia - sapia@forwiss.de