Introductions
Presenter
Javier Loria
Solid Quality Learning
javier@solidqualitylearning.com
Agenda
Overview & BI Challenges
Introducing the UDM
The UDM in Detail
Data Mining Overview
Integrate
Data acquisition from source systems and integration
Data transformation and synthesis
Analyze
Data enrichment, with business logic, hierarchical views
Data discovery via data mining
Report
Data presentation and distribution
Data access for the masses
Overview
Data Models
Multiple Data Sources
Multiple APIs
Duplication of Data
What Is a Cube?
[Figure: a cube with three axes. Markets Dimension: Atlanta, Chicago, Denver, Dallas. Product Dimension: Grapes, Cherries, Melons, Apples. Time Dimension: Q1, Q2, Q3, Q4.]
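The cube idea can be sketched in a few lines of Python. This is only an illustration, not how Analysis Services stores data: the dimension members come from the slide, while the sales figures and the helper names `slice_total` and `roll_up` are hypothetical.

```python
from collections import defaultdict

# Hypothetical sales figures; only the dimension members come from the slide.
# Each cell is addressed by one member from each dimension.
cells = {
    ("Atlanta", "Apples", "Q1"): 10,
    ("Atlanta", "Grapes", "Q1"): 4,
    ("Chicago", "Apples", "Q1"): 7,
    ("Chicago", "Apples", "Q2"): 5,
    ("Denver", "Melons", "Q3"): 8,
    ("Dallas", "Cherries", "Q4"): 6,
}

DIMENSIONS = ("market", "product", "quarter")


def slice_total(cells, **fixed):
    """Sum the measure over every cell that matches the fixed members."""
    total = 0
    for key, value in cells.items():
        coords = dict(zip(DIMENSIONS, key))
        if all(coords[dim] == member for dim, member in fixed.items()):
            total += value
    return total


def roll_up(cells, dimension):
    """Aggregate the measure along a single dimension."""
    idx = DIMENSIONS.index(dimension)
    totals = defaultdict(int)
    for key, value in cells.items():
        totals[key[idx]] += value
    return dict(totals)
```

For example, `slice_total(cells, product="Apples")` adds up every Apples cell across all markets and quarters, and `roll_up(cells, "quarter")` gives one total per quarter.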
Enterprise BI Today
[Figure: three columns. Data Sources: DW, Datamart, Datamart. Data Models: MOLAP, MOLAP. Tools: OLAP Browser, Reporting Tool (1), Reporting Tool (2), Reporting Tool (3).]
[Table: Feature vs. OLAP. Rows: Flexible schema, Simple management, Detail reporting, High performance, End-user oriented, Rich analytics, Rich semantics.]
Agenda
Overview & BI Challenges
Introducing the UDM
The UDM in Detail
Data Mining Overview
OLAP Cubes
Multiple fact tables
Full richness of the dimensions' attributes
Transaction level access
Star, snowflake, 3NF
Complex relationships
Recursive self joins
Slowly changing dimensions
Multidimensional navigation
Hierarchical presentation
Friendly entity names
Powerful MDX calculations
Central KPI framework
Multiple perspectives
Partitions
Aggregations
Distributed sources
UDM's Role
Allows
[Figure: the UDM sits between the data sources (DW, Datamart, Datamart) and the tools (OLAP Browser, Reporting Tool, BI Applications), with MOLAP.]
platforms
XML/A client API
Managed
ADOMD.NET
OLE DB for OLAP
Streamlined BI Infrastructure
Unified
BI Development Studio
Complete,
Performance
Proactive
MOLAP
caching
becomes transparent
Relational
MOLAP Caching
[Figure: Analysis Services holds the UDM and a MOLAP cache over the data source (DW, Datamart, Datamart), with notifications between source and cache; the OLAP Browser, Reporting Tool, and BI Applications connect through XML/A or ODBO.]
Agenda
Overview & BI Challenges
Introducing the UDM
The UDM in Detail
Data Mining Overview
OLTP
OLAP
XML
Data Sources
Queries
Attribute-Based
from attributes
created
Cubes
No More Limits
Stored as XML
Logical Grouping of Measures and Dimensions
Perspectives
UDM
Categorization
Semantically
Measures
Dimensions
Attributes
Hierarchies
Meaningful Categories
Time
UDM
Natural (Calendar)
Fiscal
Reporting
Manufacturing
ISO 8601
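As a rough illustration of why one date can belong to several calendars at once, the sketch below derives calendar, fiscal, and ISO 8601 attributes for a single date. The function name `calendar_attributes` and the July fiscal-year start are assumptions made for the example; the slide does not specify them.

```python
from datetime import date


def calendar_attributes(d, fiscal_start_month=7):
    """Return a few time attributes for one date.

    Assumption: the fiscal year is named after the calendar year in which
    it ends, and starts on the first day of `fiscal_start_month`.
    """
    iso = d.isocalendar()  # (ISO year, ISO week number, ISO weekday)
    if fiscal_start_month == 1:
        fiscal_year = d.year
    else:
        fiscal_year = d.year + 1 if d.month >= fiscal_start_month else d.year
    return {
        "calendar_year": d.year,
        "calendar_quarter": (d.month - 1) // 3 + 1,
        "fiscal_year": fiscal_year,
        "iso_year": iso[0],
        "iso_week": iso[1],
    }
```

Note that 1 January 2005 falls in calendar year 2005 but in ISO week 53 of ISO year 2004, which is why the ISO 8601 calendar is kept as a separate hierarchy rather than derived from the natural calendar's year.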
Translations
UDM
Attribute Semantics
Names vs. Keys
Ordering
Discretization
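Discretization groups a continuous attribute into a small number of named buckets. A minimal sketch, assuming hand-picked boundaries; the function name `discretize`, the income thresholds, and the labels are hypothetical, and the UDM can also choose buckets automatically, which this example does not attempt.

```python
from bisect import bisect_right


def discretize(value, boundaries, labels):
    """Map a numeric value to a bucket label.

    `boundaries` must be sorted ascending; a value equal to a boundary
    falls into the bucket above it.
    """
    if len(labels) != len(boundaries) + 1:
        raise ValueError("need exactly one more label than boundaries")
    return labels[bisect_right(boundaries, value)]


# Hypothetical yearly-income buckets.
INCOME_BOUNDARIES = [30000, 70000]
INCOME_LABELS = ["Low", "Medium", "High"]
```

With these assumed boundaries, `discretize(25000, INCOME_BOUNDARIES, INCOME_LABELS)` gives `"Low"` and `discretize(70000, INCOME_BOUNDARIES, INCOME_LABELS)` gives `"High"`.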
Value
Goal Value
Status
Trend
Graphical Representation
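The KPI parts listed above can be illustrated with a small sketch. In the UDM these are MDX expressions; the Python below only mirrors the idea. The function name `kpi`, the 10% tolerance, and the colour names are assumptions for the example, and a positive goal is assumed.

```python
def kpi(value, goal, previous_value, tolerance=0.1):
    """Evaluate a KPI from its value, its goal, and the prior period's value.

    Status and trend are reported on a -1..1 scale:
      status  1 = goal met, 0 = within `tolerance` of the goal, -1 = below
      trend   1 = rising,   0 = flat,                          -1 = falling
    Assumes `goal` is positive.
    """
    ratio = value / goal
    if ratio >= 1:
        status = 1
    elif ratio >= 1 - tolerance:
        status = 0
    else:
        status = -1
    trend = (value > previous_value) - (value < previous_value)
    # Hypothetical graphical representation: a traffic-light colour.
    indicator = {1: "green", 0: "yellow", -1: "red"}[status]
    return {
        "value": value,
        "goal": goal,
        "status": status,
        "trend": trend,
        "indicator": indicator,
    }
```

Keeping the status and trend as plain numbers and mapping them to a graphic in one place is what lets every client tool show the same KPI the same way.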
Data Mining
Writeback
Actions
[Figure: Live Server, Dashboard Server, and Analytics Server over several OLAP cubes. Clients: Selector and KPI Designer (All Professional Clients); Web Standard (zero footprint); Web Professional (Includes Business Reporter for Excel); Desktop Professional (Includes Business Reporter for Excel).]
understanding
Totally customizable platform
Agenda
Overview & BI Challenges
Introducing the UDM
The UDM in Detail
Data Mining Overview
[Figure: data mining process. Labels: Historical Dataset, New Dataset, Cube, Mining Models, Prediction, Reporting, Operations (SSIS). Sources and interfaces: SQL, OLE/DB, Text File, Web, .NET, Native.]
http://www.crisp-dm.org
Decision Trees
Clustering
Sequence Clustering
Association
Time Series
Naive Bayes
Neural Net
Examples
Algorithms
Decision Trees
Naive Bayes
Neural Nets
Clustering
Sequence Clustering
Decision Trees
Association
Forecast sales
Predict stock prices
Time Series
All
All
Thank You
Javier Loria
Business Intelligence,
Solid Quality Learning
javier@solidqualitylearning.com
Decision Trees
Naive Bayes
for classification
Some typical questions:
Cluster Analysis
Partitioning methods
Hierarchical methods
Density based methods
Model-based methods, and more
Divides a heterogeneous population into a number of more homogeneous subgroups or clusters
Some typical questions:
Discover distinct groups of customers
Identify groups of houses in a city
In biology, derive animal and plant taxonomies
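To make the partitioning idea concrete, here is a minimal one-dimensional k-means sketch. k-means is just one well-known partitioning method; the slide does not say which algorithm is used, and the function name `kmeans_1d`, the sample points, and the starting centroids are all hypothetical.

```python
def kmeans_1d(points, centroids, iterations=10):
    """Partition numbers into clusters around the nearest centroid.

    Repeats two steps until the centroids stop moving (or the iteration
    limit is hit): assign each point to its nearest centroid, then move
    each centroid to the mean of its assigned points.
    """
    centroids = list(centroids)
    clusters = []
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # An empty cluster keeps its previous centroid.
        new_centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, clusters
```

Called as `kmeans_1d([1, 2, 3, 10, 11, 12], [0, 5])`, the mixed list separates into a low group and a high group, which is the "more homogeneous subgroups" outcome described above.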
Sequence Clustering
Analyzes
Typically
Sequence
news news
weather
Association Rules
For
Considers
item
An item set is a combination of items in a single transaction.
The algorithm scans through the dataset trying to find item sets that tend to appear in many transactions.
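The scan for frequent item sets can be sketched as follows. This is a brute-force count, not the product's algorithm and not an optimized method such as Apriori; the function name `frequent_item_sets`, the `min_support` and `max_size` parameters, and the shopping baskets are assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def frequent_item_sets(transactions, min_support, max_size=2):
    """Count item sets of up to `max_size` items and keep the frequent ones.

    An item set's support is the number of transactions that contain
    every item in the set.
    """
    counts = Counter()
    for transaction in transactions:
        items = sorted(set(transaction))  # ignore duplicates and order
        for size in range(1, max_size + 1):
            for item_set in combinations(items, size):
                counts[item_set] += 1
    return {s: n for s, n in counts.items() if n >= min_support}


# Hypothetical shopping baskets, one list per transaction.
baskets = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["butter", "milk"],
    ["bread", "butter"],
]
```

With `min_support=2`, every pair in these baskets qualifies; raising it to 3 leaves only the single items, which shows how the threshold controls what counts as "many transactions".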
Time Series
Predict
Neural Network
It can be slow
Back-Propagation
Training