Beruflich Dokumente
Kultur Dokumente
In simplest terms Data Warehouse can be defined as collection of Data marts. -Data marts : Subjective collection of Data.
Bill Inmon
A data warehouse is a subject-oriented, integrated, timevariant,and nonvolatile collection of data in support of managements decision-making process. ERP will Run the Business - like how Tyres Run the Car BI (Reports,Data mining,Dashboards,kpis) will help you to take business decisions based on your historical data. - like Steering, mirrors, breaks, dashboards will help, how smoothly you can run the Car or reach the Destination.
In What way a Data warehouse helps any Business Lets say A producer wants to know.
Which are our lowest/highest margin customers ? What is the most effective distribution channel? Who are my customers and what products are they buying?
What product prom-otions have the biggest impact on revenue? What impact will new products/services have on revenue and margins? 4
A single, complete and consistent store of data obtained from a variety of different sources made available to end users in a what they can understand and use in a business context.
[Barry Devlin]
6
Information
A process of transforming data into information and making it available to users in a timely enough manner to make a difference
[Forrester Research, April 1996]
Data
8
Oracle 10g
IBM DB2
Base Product
$ 25K
$ 40K
$ 25K
$ 25K
$ 40K 56K
$ 25K 35K
Business Intelligence
(included)
$ 25K
$ 116K $ 56K
$ $ 35K 154.5K
$ 25K
$ 232K 116K
$ 154.5K 164.5K
$116K $232K
$164.5K
Multi-core
High Availability
Business Intelligence
(included)
$ 25K
$348k $$464k232K
$$164.5K 329K
What happened?
Additional Benefit
Number of Users
OLTP Online Transaction Processing OLAP Online Analytical Processing MOLAP Multidimensional OLAP ROLAP Relational OLAP HOLAP Hybrid OALP Dimensions De-normalized master tables Attributes Columns of Dimensions Hierarchies sequential order of attributes Facts (Measure group) Transactions tables in DWH Fact (Measures) Cubes Multidimensional storage of Data KPIs Key performance indicator Dashboards combination of reports,kpis,charts Data Marts Subjective Collection of Data SCDs Slowly changing Dimensions Perspectives Child Cube
Data Analysis
Data Storage
Repository
Data-Migration
Middleware (Populations-Tools)
OLTP O
Stage DB Optional
A
CUBE
ROLAP
MOLAP
SSAS SSIS SSRS Integration Services Analysis Services Reporting Services Data Marts
1. OLTP (on-line transaction processing) 2. Day-to-day operations: purchasing, inventory, banking, manufacturing, payroll, registration, accounting, etc. 3. The tables are in the Normalized form.
3. The tables are in the De-Normalized form. 4. We Called the Storage objects as Dimension and Facts. i.e., All the masters Are dimension and the Transactions are Facts. 5. For Designing OLTP we used Dimension modeling. OLAP is classified into two i.e., MOLAP & ROLAP
4. We Called the Storage objects as Tables. i.e., All the masters and the Transactions are stored in the tables.
5. For Designing OLTP we used data modeling.
Base_Rate Category
Cat_Id Cat_Name Cat_Desc Cat_Name Cat_Desc Group_Name
Group
Group_Desc
Topics Later We will Cover 1. Types of Dimensions 2. Slowly changing Dimensions 3. Hierarchies
Group_Id
Group_Name Group_Desc
Group_Id
SalesOrderDetails Cust_Id SalesPerson Prod_Id Order_Date Booked_Date Delivery_Date Unit_Price Qty Tax Created_By
SalesOrder_Fact
Cust_Id Prod_Id Order_Date Reference keys of Dimensions
Delivery_Date
Unit_Price Qty Total_Amount Numeric fields called as Fact or measure
Tax
Qty*Unit_Price+Tax=Total Amount Usually calculate all the calculations before storing into OLAP
Org_Dim Org_Id
Order_Date
Delivery_Date Org_Id Unit_Price Time_Dim Date Year Month
Cust_Dim Cust_Id
STAR Schema
Product_Dim
Prod_Id Prod_Name Base_Rate
SalesOrder_Fact
Cust_Id Prod_Id Order_Date
Cat_Name
Cat_Desc Group_Name Group_Desc
Delivery_Date
Unit_Price Qty Total_Amount
Tax
1. Dimensions will have only relation with the Fact. (Normalized model) 2. One to many or One to One relation will Occur. 3. Performance is fast but required huge storage space.
1. Dimension will have a relation other than Fact. (DeNormalized model) 2. Used for many to many relation. 3. Performance is Low but required Less storage space.