Beruflich Dokumente
Kultur Dokumente
GAURAV KALRA
SAKSHI BAJ
PREKSHA JAIN
YASH PAREEK
JENTI TAHILIANI
UNDERSTANDING DATA WAREHOUSING
CUSTOMIZING DATA WAREHOUSES
DATA MODELING
DATA WAREHOUSING USING OLAP SYSTEM
TYPES OF OLAP
OLAP V/S OLTP
DEVELOPING BUSINESS STRATEGIES
A Data Warehouse is a
repository database that
efficiently extracts data
from existing systems for
the rapid analysis and
reporting of information in a
business enterprise.
RELATIONAL DATABASES ( RDBMS ):-
This organizes data into related rows
and columns as specified by the relational
model.
FLAT FILE STRUCTURE :-
This comprises fields that are the same
fixed length with records each separated
by a delimiter.
Formulate Business Strategies
Used for Analysis / Effective
Report Generation
Collate and consolidate data
Generate complex Query / Report
DATA VALIDATION
DATA PREPARATION
DATA CLEANSING AND
TRANSFORMATION
DATA LOADING
The following are the characteristics of
data in a Data Warehouse :-
NON VOLATILE DATA
EASY ACCESSIBLE DATA
CONSISTENT DATA
SUBJECT-ORIENTED DATA
A Data Mart is a customized smaller
version of a data warehouse. Data
Marts are smaller and focused on
the customization needs of a
particular department and the end
users. These provide inputs to the
decision support system for the
department.
Efficient management of data
The decision support systems can easily
analyze this information
The cost of data mart is less than that
of data warehouse
The performance of the systems is good.
Data Modeling is defined as the
process whereby integrated data
obtained from divergent sources is
modeled to map to the users
perspective.Data modeling is used to
design the data warehouse.
Dimension:-It is any category of stored
information..
Attribute:- It is a fact or a measure
that describes a dimension.
Hierarchy :- It describes the levels of
relationships between different attributes.
Fact table:- It contains well defined
numeric attributes or information pertaining
to the fact.
A Schema is a diagrammatic representation
of the relation that exist among a group of
tables in a database.
It is used for dimensional modeling
It provides a multi dimensional view to the end
user
Epicenter of this schema is a fact table
Queries that rely on multiple dimension can use the
schema to return accurate result
OLAP, originally introduced in 1994 in
a paper by EF Codd, is a decision
support counterpart to OLTP.
( Online Analytical Processing )
Decision support software that allows
the user to quickly analyze
information that has been summarized
into Multidimensional view and
hierarchies.
Ina survey, we asked respondents
about the extent to which they had
achieved eight separate potential
business benefits. The possible levels
varied from "Proven and Quantified" to
"Got Worse/More Expensive." We used
these responses to calculate weighted
scores, plotted in a Figure which can be
seen in the next slide
On-Line Analytical Processing (OLAP) tools meet the
need for interactive multidimensional reporting and
analysis. They allow
operational managers to perform trend, comparative,
and time-based
analysis by enabling exploration of pre-calculated and
summarized data
along multiple dimensions. Operational managers can
explore data first
at a summary level, then drill down through the data
hierarchy
In DATA WAREHOUSE using OLAP system,
information exploration and historical trend
analysis requirements of the organization
are assessed.
The data that is analyzed by OLAP tools is
obtained from the DATA WAREHOUSE to
determine the future ratio of success for
the companies.
Steps in the OLAP Creation Process
A Data cube stores dimensions,
hierarchies and facts, which are
analyzed according to the user
perspective.
Offers real time access to data.
Provides a multi-dimensional view.
Provides quick access to all
aggregates statements.
Management of the Data
Warehouse and Business
Intelligence.
OLAP systems have been traditionally
categorized using the following
taxonomy
Multidimensional OLAP (MOLAP)
Relational OLAP (ROLAP)
Hybrid OLAP (HOLAP)
MOLAP is the 'classic' form of OLAP and
is sometimes referred to as just OLAP.
MOLAP uses database structures that are
generally optimal for attributes.