Beruflich Dokumente
Kultur Dokumente
nonvolatile updateable
History of DATA
WAREHOUSING
concept dates back to the late
1980s when IBM researchers Barry Devlin
and Paul Murphy developed "business
data warehouse".
In essence, data warehousing concept
was intended to provide an architectural
model for the flow of data from
operational systems to decision
support environments
In absence of a data warehousing architecture, an
enormous amount of redundancy of information was
required to support the multiple decision support
environments that usually existed.
personal productivity
budgeting,
forecasting,
product line and customer profitability,
sales analysis,
financial consolidations
manufacturing mix analysis
--applications that use historical, projected and derived
data.
Benefits of Data
Warehousing
Has a subject area orientation
Data is arranged by subject rather than by application,
and is more intuitive for users to navigate.
closer in concept to the way decision makers think
about their business.
METADATA LAYER
The data directory -This is often usually more detailed than an
operational system data directory. There are dictionaries for the entire
warehouse and sometimes dictionaries for the data that can be
accessed by a particular reporting and analysis tool.
Data Warehouse
Components
LOAD MANAGEMENT
relates to the collection of info. from disparate internal or external
sources.
loading process includes summarizing, manipulating and changing data
structures into format that lends itself to analytical processing.
Actual raw data kept alongside or within data warehouse enabling
construction of new and different representations.
WAREHOUSE MANAGEMENT
relates to day-to-day management of data warehouse.
management tasks associated with warehouse includes ensuring its
availability, the effective backup of its contents & security.
QUERY MANAGEMENT
relates to provision of access to contents of warehouse
includes partitioning of info. into different areas with different
privileges to different users.
Access may be provided through custom-built applications, or ad hoc
query tools.
Technologies Involved In Data
Warehousing
Key advantage :
easier for the user to understand and to use.
the retrieval of data from the data warehouse tends to operate very quickly.
disadvantages
to maintain integrity of facts and dimensions, loading the data warehouse with data
from different operational systems is complicated
difficult to modify data warehouse structure if organization adopting the
dimensional approach changes the way in which it does business.
NORMALIZED APPROACH
the data in the data warehouse are stored following the Codd
normalization rule.
Key advantage
it is straightforward to add information into the database.
disadvantage
because of no. of tables involved, difficult for users to
1) Join data from different sources into meaningful information
2) Access the information without a precise understanding of
sources of data & data structure of the data warehouse.