Beruflich Dokumente
Kultur Dokumente
Speaker Introduction
Bala Chandran Dir. Enterprise BI, MicroStrategy
Drivers Of High
Performance
Strageloopnetworks.com
Torbit.com
INTRODUCING MicroStrategy
PRIME
INTRODUCING MicroStrategy
PRIME
PARALLEL
Linear scalability
to 1,000s of
CPUs
INTRODUCING MicroStrategy
PARALLEL
Linear scalability
to 1,000s of
CPUs
PRIME
RELATIONAL
Flexible schema &
Partitioned data
INTRODUCING MicroStrategy
PARALLEL
Linear scalability
to 1,000s of
CPUs
RELATIONAL
Flexible schema &
Partitioned data
PRIME
IN-MEMORY
3x to 10x faster
7x to 20x more
users
INTRODUCING MicroStrategy
PARALLEL
Linear scalability
to 1,000s of
CPUs
RELATIONAL
Flexible schema &
Partitioned data
PRIME
IN-MEMORY
3x to 10x faster
7x to 20x more
users
ENGINE
Tightly-coupled
interactive exploration
10
200 + petabytes of
Hadoop Source Data
30 + Terabytes
Analyzed in PRIME
3500+ Cores
Guy Bayes
Head of Enterprise BI, Facebook
11
User Scale
HADOOP
Data Scale
12
User Scale
MPP
Databases
HADOOP
Data Scale
13
User Scale
Inmemory
DBs
MPP
Databases
HADOOP
Data Scale
14
User Scale
Complex
Custom Development
Risky
Java + Transactional DB clusters + Web 2.0 +
In-memory + BI Tools + .
Slow
Inmemory
DBs
MPP
Databases
HADOOP
Data Scale
15
User Scale
MicroStrategy PRIME
Inmemory
DBs
MPP
Databases
HADOOP
Data Scale
16
Example Applications
17
Example Applications
Application Characteristics
dimensions
18
1
9
MicroStrategy PRIME - 7x more users and 3x faster than the next best inmemory technology
3x
Faster
Complex analytical
dashboard
7x More
Users
19
OLAP Services
PRIME
SMP architecture
MPP architecture
Data Size
100GB Limit
No theoretical limit
Tested to 4.6 TB
Data Rows
2B Limit
No theoretical limit
Tested to 200B
Load Rate
8 GB/Hr
No theoretical limit
Tested to 7TB/Hr
20
Interactive Exploration
of
Terabyte Datasets
by
100,000s of Users
21
22
Traditional Disk speed is a banana slug with a top speed of 0.007 mph
23
24
Query Engines
Parallel Execution
Bottleneck
Shared Memory
Memory
Memory
Memory
Distributed Data
http://blog.delloem.com/2010/12/talking-hpc-with-sagiv-tech/image001/
26
Pros
Cons
More expensive
Limited upgradeability
Pros
Easy to upgrade
Cons
http://www.edn.com/design/communication
s-networking/4313434/The-evolution-tonetwork-flow-processing
Oracle, 2012
28
Query Engines
Parallel Execution
Memory
Memory
Memory
Distributed Data
29
30
Traditional BI
Visualization
Layer
Loosely
Coupled
Analytics layer
optimizes queries for
data
Visualization Layer
Data Layer
Data Layer
31
Traditional BI
Even if you install BI and DB
on the same server
They run in separate
processes
MicroStrategy PRIME
Query Engine and
Application Engine run
In-process analytics
App Engine
App Engine
Query
Processing
Query
Processing
Process 0
Process 1
32
75+data sets
33
34
VISUALIZATION
API
Application Engines
Analytics Engines
Parallel query
execution
Data partitioning within and across
nodes
Optimized in-memory data
structure
Tightly
coupled for
minimal
computation
al distance
DATA
DATA
DATA
DATA
Commodity hardware
Parallel data
loading
SOURCE DATA
35
Data
Warehouse
MicroStrategy
PRIME
SOURCE DATA
36
Thank You
@BG_Chandran #MSTRPrime
37