Beruflich Dokumente
Kultur Dokumente
Copyright
Your use of this document is subject to the Terms of Use governing the
Cognos software products and related services which you have licensed or
purchased from Cognos. The information contained in this document is
proprietary information of Cognos Incorporated and/or its licensors and is
protected under copyright and other applicable laws. You may use the
information and methodologies described in this document 'as is' or you may
modify them, however Cognos will not be responsible for any deficiencies or
errors that result from modifications which you make. Copyright 2006 (c)
Cognos Incorporated. All Rights Reserved.
You can print selected pages, a section, or the whole book. Cognos grants
you a non-exclusive, non-transferable license to use, copy, and reproduce the
copyright materials, in printed or electronic format, solely for the purpose of
providing internal training on, operating, and maintaining the Cognos
software.
This document is maintained by the Best Practices, Product and Technology
team. You can send comments, suggestions, and additions to Best Practices,
Product and Technologies.
Contents
1. INTRODUCTION ..................................................................................................4
1.1 PURPOSE ................................................................................................................... 4
1.2 APPLICABILITY ............................................................................................................ 4
1.3 EXCLUSIONS AND EXCEPTIONS ......................................................................................... 4
2. WHAT IS A FLEXIBLE MODEL..............................................................................4
2.1 FLEXIBILITY FOR MODEL GROWTH .................................................................................... 4
2.2 FLEXIBILITY AND REPORTING .......................................................................................... 7
3. SUMMARY ...........................................................................................................9
1. Introduction
1.1 Purpose
This document is aimed towards the Framework Manager model developer.
The purpose of this document is to outline design theory that will aid in
building maintainable models and allowing for easy extension of the model as
reporting requirements grow and mature.
1.2 Applicability
The current document applies to modeling within Cognos ReportNet and
Cognos 8 although the theory has broader scope and applicability.
In the example above we can see two examples of table design within a data
warehouse. On the left there are multiple tables that combine to produce the
dimensional information for the Product dimension while a single table
populates the query subject on the right. By defining determinants in the
same order and with the same keys the rollups will be identical for both
approaches. The several tables that are used in the normalized approach can
be merged into a combined model query subject that will provide the same
look and feel as the denormalized dimension with its single point of entry for
the common product information.
Common Dimensions
Business
Process Date Branch Department Product Customer Employee Geography
Human X X X
Resources
Accounting X X X X
Sales X X X X X X X
1
R. Kimball and M. Ross, The Data Warehouse Toolkit, 2nd ed., (New York, Wiley Computer
Publishing, 2002)
The table above allows a quick scan of the relevant data for each business
process. The format allows common areas to be quickly identified such as the
Branch dimension. Depending on how the business processes operate the
Branch information could be modelled in several ways.
For example, a Branch may be closely tied to its Geographic location or to a
Department. If the Brach information only relates to one of these dimensions
then it would be a logical choice to generate a query subject that merges the
Branch with the single associated parent dimension. On the other hand, the
Branch may have no relationship at all to Department or Geography and
would stand as its own dimension without any ties to the other dimensions in
the mode. Finally, a Branch may be tied to both a Department and
Geographical location and it would make sense to have a specific Branch
dimension with multiple hierarchies based on Department and Geography
respectively.
The choice for the modelling approach in such a scenario depends largely on
the logical relationships between the types of data that are being tracked by
the business process. If there is no relationship between the data then the
choice should be for separate dimensions. If there is a relationship then
combining the dimensions for logical reporting should be considered.
The true strength of the Bus Matrix is that it allows for a forward-looking view
of the modelling process. The current project may only be addressing the
Human Resources reporting requirements but the Bus Matrix shows that
there are other business processes that could make use of the Date,
Employee, and Geography dimensions at a later date. By reviewing the
common requirements it is possible to build in appropriate join keys to relate
the other business processes and ensure that the dimensions are modelled to
reflect the appropriate dimension granularity across the multiple processes.
For example, Accounting may be interested in the Monthly grain total
expenses while Human Resources is tracking the daily movement of
employees within the organization. In such a case it would be valuable to
build in the relevant key field for the month join to the accounting fact tables
and build a determinant in the date dimension at the month level with the
group by clause. With these items added during initial model development it
is a simple process to link the Accounting fact tables into the model at a later
date. Likewise, if the model were starting with the Accounting processes then
it would be useful to include the Day level granularity into the Date dimension
to make it easier to integrate Human Resources fact data.
When modelling for these two types of reporting there is a tendency to use
considerably different approaches. Ad-hoc models will tend towards general
query subjects that allow a wide range of business questions to be answered
by a single model. Canned models often tend towards individual query
subjects that serve to address the specific requirements of the canned
reports. With regards to a scale of flexibility ad-hoc models are most flexible
while canned models are at the least flexible end of the scale. Consider the
following as an illustration of the narrowing of scope for the two applications.
Development Time
Development Time
3. Summary
A metadata model is a foundation for both future modelling and report
development. There are two key areas that need to be addressed to ensure a
successful model.
The first area is understanding the scope of the current data set in terms of
all the other data in the organization. By looking ahead it is possible to
esablish common data structures that will easily integrate the data from other
business processes. Likewise, by understanding different data formats you
can proactively model for this integration or identify areas where data
warehousing or tuning will benefit the organization.
The second area is understanding your user population and how they will
access the data that is being made accessible through the model. By focusing
on a nearsighted goal it may be possible that the long-term needs of the user
community are being lost. It is difficult to modify a model that has been
designed with too narrow a focus to meet wider user needs. By identifying
who the current users are and what your user base may be in the future it is
possible to select the approprate range of vision for a model.
With only a little forsight and planning at the beginning of a modelling project
it is possible to reap large gains when the model is called upon to grow with
your users. Further, proper planning will save considerable effort for
everyone involved in the maintenance of your model metadata.