Beruflich Dokumente
Kultur Dokumente
+
Live Query Engine
Supports the worlds largest databases
Data
warehouses
Attaches directly to enterprise data stores Datamarts
and cubes
Compatible with major data platforms Fast
databases
Leverages enterprise data models and
security
In-Memory Engine
Extremely fast computational ability
Tableau v8.2.3
IBM BigInsights
Tableau v7.0.10
Tableau v8.0
Cloudera Impala
Google BigQuery
What is Big Data?
Modern data drives need for a new generation of
databases
Relational Databases
Application independent
Scale-up architecture
Structured data only
Schema-on-write
Limited data processing
High cost
Volume of Data
DATA WAREHOUSE v DATA LAKE
structured, processed DATA structured, semi-
structured, un-structured,
raw
schema-on-write PROCESSING schema-on-read
expensive for large data STORAGE designed for low cost
volumes storage
less agile, fixed configuration AGILITY highly agile, configure
and reconfigure as
needed
mature SECURITY rapidly maturing
business professionals USERS data scientists, business
professionals
DATA WAREHOUSE v DATA LAKE
structured, processed DATA structured, semi-
structured, un-structured,
raw
schema-on-write PROCESSING schema-on-read
expensive for large data STORAGE designed for low cost
volumes storage
less agile, fixed configuration AGILITY highly agile, configure
and reconfigure as
needed
mature SECURITY rapidly maturing
business professionals USERS data scientists, business
professionals
Challenges: Query speed
minutes
Hive on
MapReduce sub-minute
Hive on Tez or
Hive on Spark
sub-second
Modern Analytics (aka Big Data) Stack
Query design
best practice Tableau Data Engine
Fast analytical
DBs
Is OLAP back?
Cold Warm Hot Framework
Performance
Technologies
Hive on MapReduce Cloudera, Hortonworks, MapR, Amazon EMR, and others
Hadoop
HDFS Hive
Cold Use Cases
HDFS Hive
TDE
+
Tableau Data Extracts When to use them?
Execution engines
Certain pitfalls must be
Hadoop
avoided
Data blending large datasets
Avoid large Hadoop dataset to second dataset blending
Executed on the Tableau client side
Unnecessary joins
Imperfectly implemented on many big data systems
Connections with huge number of schemas and columns
Inefficient formulas
Leverage a multi-tiered approach based
on your data
Aggregated
data
TDE
Hadoop
Impala
Raw data
+
Spark SQL (large)
HDFS Hive LLAP
Presto
Drill
Prepared
data Fast
analytical
database
Human Scale of Data
Single
Consumable
Chunk of Data
Chunks of Human
at the Human Scale
(Dashboard)
Aggregation Level Consumable Data
Aggregation of Data Tiers
Year (4)
Filter Year
Month (48)
Filter Month
Region to Country to State to County to
Select Dimension Select Dimension
Zip Code
Week (105)
Filter Week Drill Down to Raw Data with
Context
Day (90)
Select Month Filter Day
Use Aggregates for Guided
Raw Data Drilling
In the Weeds
Select Week
Use Action Filters to Navigate
the Pyramid
Action Filters: Big Data Secret Weapon
Data Mining
COLD Detailed Data
Raw Data
Machine Learning
Tableau Big Data Customer Use Cases
1. Use Case
I. Move from Legacy System & Excel to Agile Visual System
II. Game Health: Are games being played as designed?
III. Business Health: Metrics and measures of regional revenue objectives
IV. Market Health: Global competitive, economic and other external factors
1. Use Case
I. 1.5 Billion Game Plays per day with 149 million daily active users
II. Generate 1 Petabyte+ of data per year; 25 Billion Events per Day!
III. Striking the right balance between making games fun while also making them challenging to
players
IV. Understanding how, when and why players make in-game purchases
V. Overcoming the limitations of Hadoop as an open-source solution for data management and
analytics
2. High Level Architecture
Data Scientist
Game Server
Logs/Activity
Tableau Desktop
Business Analyst
King Lessons Learned
1. Use Case
I. Created Data Science and Engineering Team inside Product Org
II. Cloudera Data Lake Platform called TPS (The Philosophers Stone)
III. Combining Data from Email Campaign Data + Google Analytics (CSV & JSON)
2. High Level Architecture
Tableau Desktop
GoPro Lessons Learned
Main topic 2:
Subtopic copy goes here
Subtopic copy goes here
Main topic 3:
Subtopic copy goes here
Subtopic copy goes here
Lorem ipsum dolor sit amet, error possim
abhorreant vix ne, ne mel debitis iudicabit
voluptatibus. Affert timeam debitis no nam. Sint
democritum complectitur his an.
Subheader
Sample Text
Sample Text
Sample Text
Sample Text
Sample Text
Sample Text
Please add a 5% gray, .5pt
thickness border around
vizzes
ullamcorper ipsum suscipit in.
Curabitur fermentum lacinia
lectus non laoreet. Sed volutpat,
dui eu rutrum volutpat, nulla mi
accumsan dui, non venenatis
mauris augue nec lectus.
Sample Code:
var pd = require('pretty-data').pd;
Friday, October 23
Beginning Your Geographic Analysis Journey
11:30am 12:30pm | Location
Friday, October 23
Beginning Your Geographic Analysis Journey
11:30am 12:30pm | Location
Friday, October 23
Beginning Your Geographic Analysis Journey
11:30am 12:30pm | Location
Please complete
the session survey
from the Session
Details screen in
your TC16 app
email@email.com
To modify table, first click anywhere in table, Layout > Shrink or expand column widths by adjusting the
so the Table Tools menu is highlighted at top Cell Size, or set them to same size with Distribute
To modify the table layout, click Table Tools > Layout To Layout > Use Alignment settings to adjust text
modify the table style, click Table Tools > Design alignment and cell margins
Layout > To add columns, click into cell and choose Tip: To quickly add a row, place cursor in this last cell
Insert Left or Insert Right and hit Tab key
Header 3 Header 4
Body Copy
Body copy should be set to Arial 24pt when possible.
Try to limit each slide to a maximum of 3 font sizes.
Type Tips
Create visual differentiation/focus by using scale and color versus using bullets.
PowerPoint palette for this Slide Master
Text/ Text/
Background Background Accent 3 Accent 4 Accent 6
Light 1 Light 2
Embedded Analytics Folder Location Mobile Menu Mobile People Scalable Search Security Server Admin
These icons are provided so you can use them to for diagrams showing architecture, workflow, etc. Icon colors can be
modified by right-clicking item and selecting theme color.