Beruflich Dokumente
Kultur Dokumente
Basic principles
Warehouse Database
Refresh
Relational tools
Text, image
Spatial
OLAP tools
External data
Web
Analysts Strategic
Oracle Reports
Oracle Discoverer
Oracle Express
Data Modeling
Oracle Data Mart Designer
OLTP Databases Data Mart Database Oracle8
SQL*PLUS
OLTP Engines
Warehousing Engines
Data Extraction
Oracle Data Mart Builder
Data Management
Oracle Enterprise Manager
Disadvantages:
Involves a high risk, takes a longer time Runs the risk of needing to change requirements
Data Mart
A subset of a data warehouse that supports the requirements of a particular department or business function. Characteristics include:
Do not normally contain detailed operational data unlike data warehouses. May contain certain levels of aggregation
Marketing
Sales
External Data
Sales or Marketing
External Data
Row 4
SALES
Product
GL_Line
Two dimensions
Three dimensions
Can access multidimensional and relational database structures Caters for calculated fields
Disadvantages:
Difficulty of changing model Lack of support for very large volumes of data May require significant processing power
MOLAP Server
The application layer stores data in a multidimensional structure DSS client The presentation layer provides the MOLAP multidimensional view Engine Efficient storage and processing Application layer Complexity hidden from the user Analysis using preaggregated summaries and precalculated Warehouse measures
ROLAP Server
The warehouse stores DSS client atomic data. The application layer ROLAP generates SQL for the engine three- dimensional view. Application The presentation layer Multiple layer SQL provides the multidimensional view.
Warehouse server
MOLAP
MDDB
Query Periodic load Warehouse Express Server
ROLAP
Cache
Live fetch Query
Data
Express user
ROLAP
Simple
Complex Analysis
Data Acquisition
Identify, extract, transform, and transport source data Consider internal and external data Perform gap analysis between source data and target database objects Plan move of data between sources and target Define first-time load and refresh strategy Define tool requirements Build, test, and execute data acquisition modules
Modeling
Warehouses differ from operational structures:
Analytical requirements Subject orientation
Identify dimension tables Link fact tables to the dimension tables Create views for users
Dimension Tables
Dimension tables have the following characteristics: Contain textual information that represents the attributes of the business Contain relatively static data Are joined to a fact table through a foreign key reference
Product
Channel
Fact Tables
Fact tables have the following characteristics:
Contain numeric measures (metrics) of the business May contain summarized (aggregated) data May contain date-stamped data Are typically additive Have key value that is typically a concatenated key composed of the primary keys of the dimensions Joined to dimension tables through foreign keys that reference primary keys in the dimension tables
Dimension tables
Sales Fact Table Product_id Store_id Item_id Day_id Sales_dollars Sales_units ...
Item Table Item_id Item_desc ...
Provides fast access to precomputed data Reduces use of I/O, CPU, and memory Is distilled from source systems and precalculated summaries Usually exists in summary fact tables
Total Percentage
Sales() Store
SALES BY MONTH Month Tot_Sales Jan 99 51,000 Feb 99 40,000 Mar 99 17,000
Summary advisor
Summary usage Summary recommendations Space requirements
Sales fact
Time dimension