Company Confidential
Data Warehousing
Introduction, Terminology
Necessity / Why Data Warehouse?
Characteristics of a Data Warehouse
Building phases
Security
Tools
The Impact of Web
Benefits
to transform it
For too long, enterprises have been data rich and information poor, technologically condemned to be informational mazes.
Most organizations remain structurally incapable of providing useful business intelligence to management.
Warehouse of Data
Data Warehouse Vs Operational Systems

Data Warehouse Data           Operational System Data
Long time frame               Short time frame
Static                        Rapid changes
Data is usually summarized    Record-level access
Ad hoc query access           SAP standard transactions
Updated periodically          Updated in real time
Data driven                   Event driven

Sunday, March 30, 2014
Aggregate Data
Meta Data
Staging Area
Data Mart
Granularity (or grain) defines the level of detail stored in the physical warehouse. Low granularity indicates a lot of detail, while high granularity indicates less detail.
Example: A commercial airline is building a data warehouse. What will the granularity be?
Choice A: Each record represents a flight (high granularity, less detail)
Choice B: Each record represents a customer on a flight (low granularity, more detail)
However, you should be aware that the granularity of data affects:
Volumes of Data
Data Maintenance
Indexing
Level of Data Exploration
Query and Reporting Constraints
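To make the airline example concrete, here is a minimal Python sketch that holds the same data at both grains. The booking records and the field names `flight`, `customer`, and `fare` are assumptions made up for illustration; they do not come from the slides. The detailed grain keeps one record per customer on a flight, while the summarized grain keeps one record per flight and can no longer answer customer-level questions.

```python
from collections import defaultdict

# Hypothetical bookings at the detailed grain: one record per customer on a flight.
bookings = [
    {"flight": "XY101", "customer": "C1", "fare": 120.0},
    {"flight": "XY101", "customer": "C2", "fare": 95.0},
    {"flight": "XY101", "customer": "C3", "fare": 110.0},
    {"flight": "XY202", "customer": "C1", "fare": 200.0},
    {"flight": "XY202", "customer": "C4", "fare": 180.0},
]


def summarize_by_flight(rows):
    """Roll the detailed records up to the coarser grain: one record per flight."""
    totals = defaultdict(lambda: {"passengers": 0, "revenue": 0.0})
    for row in rows:
        totals[row["flight"]]["passengers"] += 1
        totals[row["flight"]]["revenue"] += row["fare"]
    return [{"flight": flight, **total} for flight, total in sorted(totals.items())]


flights = summarize_by_flight(bookings)
print(len(bookings), "detailed records ->", len(flights), "summarized records")
for record in flights:
    print(record)
```

The record counts show the volume trade-off: the coarser grain stores fewer rows, but the customer column is gone, which limits the queries and reports that can be built on it.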
[Figure: cube with Month and Product axes and a geography hierarchy of Region, State, District, Location]
Drill down: an analytical technique whereby the user navigates from the most summarized to the most detailed level.
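The navigation described above, from the most summarized to the most detailed level, can be sketched in a few lines of Python. This is only an illustration: the sales records, the place names, and the `drill_down` helper are hypothetical, and the three levels are an assumed subset of the geography hierarchy shown in the figure.

```python
from collections import defaultdict

# Hypothetical sales facts; all names and amounts are made up for illustration.
sales = [
    {"region": "North", "state": "State A", "district": "District 1", "amount": 40},
    {"region": "North", "state": "State A", "district": "District 2", "amount": 60},
    {"region": "North", "state": "State B", "district": "District 3", "amount": 30},
    {"region": "South", "state": "State C", "district": "District 4", "amount": 50},
]

LEVELS = ["region", "state", "district"]  # most summarized to most detailed


def drill_down(rows, path):
    """Total `amount` one level below `path`, a list of values fixing the upper levels."""
    if len(path) >= len(LEVELS):
        raise ValueError("already at the most detailed level")
    next_level = LEVELS[len(path)]
    totals = defaultdict(int)
    for row in rows:
        # Keep only rows that match every level the user has already chosen.
        if all(row[LEVELS[i]] == value for i, value in enumerate(path)):
            totals[row[next_level]] += row["amount"]
    return dict(totals)


print(drill_down(sales, []))                    # totals per region
print(drill_down(sales, ["North"]))             # states within North
print(drill_down(sales, ["North", "State A"]))  # districts within State A
```

Each call fixes one more level of the hierarchy, so the user sees progressively smaller slices of the same total.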
[Figure: cube with Month, Product, and Region axes]
Consistent Data
Access to Corporate & Organizational Data
Separate
Available
Accessible
Subject Oriented
Integrated
Time Variant
Non Volatile
Designing
Extraction
Cleaning
Transformation
Loading
Querying
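The building phases listed above can be sketched end to end with Python's standard `sqlite3` module. This is a minimal illustration, not a production pipeline: the source rows are hard-coded stand-ins for an operational system, and the function names, the `sales` table, and the cleaning rules are all assumptions. The design phase is represented only by the table definition inside `load`.

```python
import sqlite3


def extract():
    """Extraction: stand-in for reading an operational source (rows are made up)."""
    return [
        {"order_id": "1", "product": " widget ", "qty": "2", "unit_price": "9.50"},
        {"order_id": "2", "product": "Gadget", "qty": "", "unit_price": "20.00"},
        {"order_id": "3", "product": "WIDGET", "qty": "1", "unit_price": "9.50"},
    ]


def clean(rows):
    """Cleaning: drop rows with a missing quantity and normalize product names."""
    return [
        {**row, "product": row["product"].strip().lower()}
        for row in rows
        if row["qty"].strip()
    ]


def transform(rows):
    """Transformation: cast text fields to numbers and derive the sales amount."""
    return [
        (int(r["order_id"]), r["product"], int(r["qty"]) * float(r["unit_price"]))
        for r in rows
    ]


def load(conn, records):
    """Loading: write the prepared records into a warehouse table."""
    conn.execute("CREATE TABLE sales (order_id INTEGER, product TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", records)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(conn, transform(clean(extract())))

# Querying: an ad hoc summary over the loaded data.
result = conn.execute(
    "SELECT product, SUM(amount) FROM sales GROUP BY product ORDER BY product"
).fetchall()
print(result)
```

Keeping each phase in its own function mirrors the slide's sequence and makes it easy to swap in a real source or a real warehouse connection later.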
Top Down Approach: set up the enterprise-wide architecture first, then build the individual data marts. Very difficult, time consuming, and expensive.
Bottom Up Approach: start with highly focused data marts, then combine them to meet enterprise-wide requirements.
Hybrid Approach: start with a data mart while keeping the focus on the enterprise-wide scope.
Logical Data Model Vs Physical Data Model

Logical Data Model                               Physical Data Model
Uses business names                              Names limited by DBMS
Business experts drive it                        DBAs drive it
Includes entities, attributes & relationships    Includes tables, columns, keys, database triggers, indexes, etc.
Star Schema
A model with a central fact table surrounded by many dimension tables. The central fact table is long and narrow, with many rows and few columns. Dimension tables are short and wide, with few rows and many columns.
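As a rough illustration of this layout, the following sketch builds a tiny star schema in an in-memory SQLite database using Python's standard `sqlite3` module. The table names (`fact_sales`, `dim_product`, `dim_store`), their columns, and the sample rows are assumptions chosen for the example, not part of the original slides.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Dimension tables: few rows, many descriptive columns.
    CREATE TABLE dim_product (
        product_id INTEGER PRIMARY KEY, name TEXT, category TEXT, brand TEXT
    );
    CREATE TABLE dim_store (
        store_id INTEGER PRIMARY KEY, name TEXT, city TEXT, region TEXT
    );
    -- Fact table: many rows, few columns (dimension keys plus a measure).
    CREATE TABLE fact_sales (
        product_id INTEGER REFERENCES dim_product(product_id),
        store_id INTEGER REFERENCES dim_store(store_id),
        amount REAL
    );
    INSERT INTO dim_product VALUES (1, 'Widget', 'Hardware', 'Brand X'),
                                   (2, 'Gadget', 'Hardware', 'Brand X');
    INSERT INTO dim_store VALUES (10, 'Store A', 'City A', 'North'),
                                 (20, 'Store B', 'City B', 'South');
    INSERT INTO fact_sales VALUES (1, 10, 100.0), (2, 10, 50.0),
                                  (1, 20, 70.0), (1, 10, 30.0);
""")

# A typical star join: the fact table is joined to each dimension, and the
# measure is grouped by descriptive attributes taken from the dimensions.
rows = conn.execute("""
    SELECT s.region, p.name, SUM(f.amount)
    FROM fact_sales AS f
    JOIN dim_product AS p ON p.product_id = f.product_id
    JOIN dim_store AS s ON s.store_id = f.store_id
    GROUP BY s.region, p.name
    ORDER BY s.region, p.name
""").fetchall()
print(rows)
```

Every query follows the same shape, one join per dimension from the central fact table, which is the main practical appeal of the star layout.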
Snowflake Schema
Normalized dimension tables. Each smaller dimension table is joined to a fact table and to a descriptive dimension table.
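A minimal sketch of normalized dimension tables, again using Python's standard `sqlite3` module with an in-memory database. It shows one common form, in which a smaller `dim_category` table is split out of the descriptive `dim_product` dimension and reached from the fact table through it. All table names, columns, and rows are assumptions for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Smaller, normalized table split out of the product dimension.
    CREATE TABLE dim_category (
        category_id INTEGER PRIMARY KEY, category TEXT
    );
    -- Descriptive dimension: stores a key instead of repeating the category text.
    CREATE TABLE dim_product (
        product_id INTEGER PRIMARY KEY, name TEXT,
        category_id INTEGER REFERENCES dim_category(category_id)
    );
    CREATE TABLE fact_sales (
        product_id INTEGER REFERENCES dim_product(product_id),
        amount REAL
    );
    INSERT INTO dim_category VALUES (1, 'Hardware'), (2, 'Software');
    INSERT INTO dim_product VALUES (1, 'Widget', 1), (2, 'Gadget', 1), (3, 'App', 2);
    INSERT INTO fact_sales VALUES (1, 100.0), (2, 50.0), (3, 25.0);
""")

# Reaching the category now takes one extra join compared with a flat dimension.
rows = conn.execute("""
    SELECT c.category, SUM(f.amount)
    FROM fact_sales AS f
    JOIN dim_product AS p ON p.product_id = f.product_id
    JOIN dim_category AS c ON c.category_id = p.category_id
    GROUP BY c.category
    ORDER BY c.category
""").fetchall()
print(rows)
```

The trade-off is visible in the query: normalization removes repeated category text from the dimension at the cost of an additional join.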
[Figure: schema diagram with tables CUSTOMER, PRODUCT, SALES, TIME, STORE, REGION, and REGION SUMMARY]
Identify Data
Classify Data
Quantify the value of Data
OLAP
Mining
Consistency. Everyone in the organization can draw upon a common pool of data and see reports that reflect their needs.
Accessibility. Accessing the data warehouse through a common pathway, the Web browser, simplifies the process of finding information.
Availability. Access to information is available to anyone at any time, even if the database administrator is not available. The data warehouse is independent of operational activities and can be accessed via the Web whenever necessary.
Low development costs. Software provides a standard framework for developing Web-enabled applications.
Low maintenance costs. Less time is spent maintaining client-side, typically PC-based, application software, and support can be focused on ensuring that information in the data warehouse brings competitive advantages.
Time savings. If information consumers are directed to reports, then information providers, typically qualified specialists whose time is expensive, spend less time answering the same questions again and again.
Improved business communications. By providing Web-enabled software-based corporate information to customers, business partners, and the public, you can improve your business performance, reinforce brand loyalties and increase your organization's exposure.
Data protection. Keeping the importance of standard Web security technology in mind, you can build and deploy secure applications for your organization.
Low marginal cost/scalable solutions. The Rapid Warehousing Methodology ("think big, start small") demonstrates that to maximize return on investment, it is best to develop for a small number of people first, and then extend the solution to larger groups. With browser technology, the cost of doubling or tripling the user community is negligible.
Low training costs. Web browsers are intuitive and easy to use.
Increase customer profitability
Cost effective decision making
Manage customer and business partner relationships
Manage risk, assets and liabilities
Integrate inventory, operations and manufacturing
Link multiple locations and geographies
Identify developing trends and reduce time to market
Facilitate process change
Improve quality assurance programs
Production & Performance awareness