Warehouse
Issues in DW design
Data Warehouse
A read-only database for decision analysis:
Subject oriented
Integrated
Time variant
Nonvolatile
Consisting of time-stamped operational and external data.
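The "time variant" and "nonvolatile" properties can be illustrated with a small append-only load. This is only a sketch using Python's standard `sqlite3` module; the table `customer_balance_history`, its columns, and the helper `load_snapshot` are hypothetical names chosen for illustration, not part of the original slides.

```python
import sqlite3

# In-memory database standing in for the warehouse (assumption for the sketch).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customer_balance_history ("
    " customer_id INTEGER, balance REAL, source TEXT, loaded_at TEXT)"
)

def load_snapshot(conn, rows, source, loaded_at):
    """Append a time-stamped snapshot; existing rows are never updated or deleted."""
    conn.executemany(
        "INSERT INTO customer_balance_history VALUES (?, ?, ?, ?)",
        [(cid, bal, source, loaded_at) for cid, bal in rows],
    )

# Two loads of the same customers at different times: both versions are kept.
load_snapshot(conn, [(1, 100.0), (2, 250.0)], "operational", "2024-01-31")
load_snapshot(conn, [(1, 80.0), (2, 300.0)], "operational", "2024-02-29")

# The history of one customer is available because nothing was overwritten.
history = conn.execute(
    "SELECT loaded_at, balance FROM customer_balance_history"
    " WHERE customer_id = 1 ORDER BY loaded_at"
).fetchall()
print(history)
```

Each load adds rows rather than changing them, so earlier values stay queryable by their time stamp.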
Data Warehouse vs Operational Databases
Operational databases:
Highly tuned
Real-time data
Detailed records
Current values
Access small amounts of data in a predictable manner
Data warehouse:
Flexible access
Consistent timing
Summarized as appropriate
Historical
Accesses large amounts of data in unexpected ways
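The contrast in access patterns can be sketched with two queries against the same data. This is an illustrative example only, using Python's standard `sqlite3` module; the `orders` table and its contents are made up for the sketch.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders ("
    " order_id INTEGER PRIMARY KEY, region TEXT, order_date TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?, ?)",
    [
        (1, "North", "2023-01-15", 120.0),
        (2, "South", "2023-01-20", 80.0),
        (3, "North", "2023-02-03", 200.0),
        (4, "North", "2024-01-09", 150.0),
    ],
)

# Operational style: a predictable lookup of one current, detailed record by key.
current_order = conn.execute(
    "SELECT amount FROM orders WHERE order_id = ?", (3,)
).fetchone()

# Warehouse style: an ad hoc summary that scans the whole history.
yearly_by_region = conn.execute(
    "SELECT region, substr(order_date, 1, 4) AS year, SUM(amount)"
    " FROM orders GROUP BY region, year ORDER BY region, year"
).fetchall()

print(current_order)
print(yearly_by_region)
```

The first query touches a single row through the primary key; the second reads every row and summarizes it, which is the kind of access a warehouse is expected to serve.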
Data Warehouse:
New Approach
An old idea with a new interest
because of:
Cheap Computing Power
Special Purpose Hardware
New Data Structures
Intelligent Software
Warehousing Problems
Business Issues
Data Quantity
Data Accuracy
Maintenance
Ownership
Cost
Database Issues
DBMS Software
Technology
Complexity
Data Issues
Analysis Issues
User Interface
Intelligent Processing
Three Approaches
Data Mart
Extracted and managerial support data designed for departmental or end-user computing (EUC) applications
Data Package
Data required for a specific application
Classical Warehouse
Source: archived data
Extraction
Data
Tool: VLDB technology
Analysis: IT driven software
Data Mart
Source
Extraction: batch summary
Data
Tool
Analysis
Data Package
Source: mart
Extraction
Data
Tool: PC tools
Analysis: trained user
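The relationship between a mart (built by batch summary) and a package (extracted from a mart for one application) can be sketched as two small functions. This is a minimal illustration in plain Python; `source_records`, `build_mart`, and `build_package` are hypothetical names, and the sample data is invented.

```python
from collections import defaultdict

# Hypothetical detailed source records.
source_records = [
    {"dept": "sales", "month": "2024-01", "amount": 100.0},
    {"dept": "sales", "month": "2024-01", "amount": 50.0},
    {"dept": "sales", "month": "2024-02", "amount": 70.0},
    {"dept": "hr", "month": "2024-01", "amount": 30.0},
]

def build_mart(records, dept):
    """Batch-summarize one department's records by month."""
    totals = defaultdict(float)
    for r in records:
        if r["dept"] == dept:
            totals[r["month"]] += r["amount"]
    return dict(totals)

def build_package(mart, months):
    """Pull only the data one specific application needs from the mart."""
    return {m: mart[m] for m in months if m in mart}

sales_mart = build_mart(source_records, "sales")
january_package = build_package(sales_mart, ["2024-01"])

print(sales_mart)
print(january_package)
```

The mart holds a departmental summary, while the package is a narrower slice of that mart sized for a single application.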
Three Fundamental
Processes
Data Acquisition
Data Storage
Data Access
Data Acquisition
Acquisition steps
Storage
The storage component holds the data so that the many different data mining, executive information, and decision support systems can make use of it effectively.
Specialized hardware
Storage
Access
Access Tools
OLAP
Data Visualization
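As a rough idea of what an OLAP-style access tool does, the sketch below rolls a small set of facts up along chosen dimensions. It is written in plain Python under invented data; the `facts` list and the `roll_up` helper are hypothetical and do not represent any particular OLAP product.

```python
from collections import defaultdict

# Hypothetical fact records with three dimensions and one measure.
facts = [
    {"region": "North", "product": "A", "year": 2023, "sales": 10},
    {"region": "North", "product": "B", "year": 2023, "sales": 5},
    {"region": "South", "product": "A", "year": 2023, "sales": 7},
    {"region": "North", "product": "A", "year": 2024, "sales": 12},
]

def roll_up(facts, dims, measure="sales"):
    """Sum the measure over every combination of the chosen dimensions."""
    totals = defaultdict(int)
    for f in facts:
        key = tuple(f[d] for d in dims)
        totals[key] += f[measure]
    return dict(totals)

by_region = roll_up(facts, ["region"])
by_region_year = roll_up(facts, ["region", "year"])

print(by_region)
print(by_region_year)
```

Changing the list of dimensions changes the level of summarization, which is the basic operation behind drilling down and rolling up.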
Hardware Budget
Design Issues
Relational and Multidimensional Models
Denormalized and indexed relational models are more flexible
Multidimensional models are simpler to use and more efficient
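The trade-off between the two models can be sketched by holding the same data both ways. This is an illustration only: the relational side uses Python's standard `sqlite3` module with a hypothetical indexed `sales` table, and the multidimensional side is approximated by a dictionary keyed on dimension coordinates, which is an assumption made for the sketch rather than how a real multidimensional engine stores data.

```python
import sqlite3

rows = [
    ("North", "A", 2023, 10),
    ("North", "B", 2023, 5),
    ("South", "A", 2023, 7),
]

# Relational form: a denormalized, indexed table that accepts arbitrary queries.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales (region TEXT, product TEXT, year INTEGER, amount INTEGER)"
)
conn.execute("CREATE INDEX idx_sales_region ON sales (region)")
conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?)", rows)
north_total = conn.execute(
    "SELECT SUM(amount) FROM sales WHERE region = 'North'"
).fetchone()[0]

# Multidimensional form: each cell is addressed directly by its coordinates.
cube = {(region, product, year): amount for region, product, year, amount in rows}
cell = cube[("North", "A", 2023)]

print(north_total)
print(cell)
```

The table can answer questions nobody planned for, while the cube gives a very simple and direct lookup for the combinations it was built around.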
Business Model
As always in life, there are some disadvantages to 3NF:
Performance can be truly awful. Most of the work that is performed on denormalizing a data model is an attempt to reach performance objectives.
The structure can be overwhelmingly complex. We may wind up creating many small relations which the user might think of as a single relation or group of data.
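Both disadvantages can be seen in a small sketch. Three normalized relations must be joined to produce what a user thinks of as one "order" record, and a denormalized copy avoids the joins at query time. The example uses Python's standard `sqlite3` module; all table and column names (`customer`, `city`, `orders`, `order_flat`) are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE city (city_id INTEGER PRIMARY KEY, city_name TEXT);
CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, name TEXT, city_id INTEGER);
CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
INSERT INTO city VALUES (1, 'Oslo');
INSERT INTO customer VALUES (10, 'Ada', 1);
INSERT INTO orders VALUES (100, 10, 25.0), (101, 10, 40.0);
""")

# Normalized form: the user's single logical record needs two joins.
JOIN_QUERY = """
SELECT o.order_id, c.name, ci.city_name, o.amount
FROM orders o
JOIN customer c ON c.customer_id = o.customer_id
JOIN city ci ON ci.city_id = c.city_id
ORDER BY o.order_id
"""
normalized_rows = conn.execute(JOIN_QUERY).fetchall()

# Denormalized form: the join result is stored once and then read without joins.
conn.execute("CREATE TABLE order_flat AS " + JOIN_QUERY)
flat_rows = conn.execute(
    "SELECT order_id, name, city_name, amount FROM order_flat ORDER BY order_id"
).fetchall()

print(normalized_rows)
print(flat_rows)
```

The flat table repeats the customer and city values on every row, trading redundancy for simpler and typically faster reads.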
Structural Dimensions
Simple DW pattern.
Other Dimensions