Beruflich Dokumente
Kultur Dokumente
www.bi-q.ie
• ODI Performance
• ODI Scheduling
• ODI Deployment/Release
ULI BETHKE BI-Quotient
www.bi-q.ie
• Dublin based
• Blog www.bi-q.ie
• ODI 2007
• Reviewer two ODI books
• ODI articles OTN
• Deputy chair OUG BI SIG. Next event 11th June
• ODI advanced trainer
ODI PERFORMANCE BI-Quotient
www.bi-q.ie
Target
- Least amount of roundtrips. Network (JDBC, XML)
- One target database server only (DW)
Another Server
- ODBC drivers
- JEE agent on Weblogic
- No support for target OS
- Resources on target
- DBA
INTERFACES BI-Quotient
www.bi-q.ie
• JDBC drivers
• JVM
• Type 4 or 5 JDBC drivers (Data Direct)
• Array fetch size.
• DB packet size.
• Network packet size.
PERFORMANCE MONITORING BI-Quotient
www.bi-q.ie
• dbms_sqltune_util0.sqltext_to_sqlid
• Link to Data Dictionary Tables
BI-Quotient
www.bi-q.ie
MACIEJ KOCON
• Dublin based
• ODI 2005 (Sunopsis)
• Reviewer two ODI books
• Blog www.bi-q.ie
• maciek@bi-q.ie
BI-Quotient
ORCHESTRATING DWH www.bi-q.ie
PROCESSES
• Orchestration of Data Process Flow
– Standard DWH Process flow orchestration
– Packages in Oracle Data Integrator 10g
– Load Plans in Oracle Data Integrator 11g
• Process Flow use cases - efficiency analysis
• Alternative scheduling
– benefits
TYPICAL DATA FLOW in DWH BI-Quotient
www.bi-q.ie
step 1
STAGE E-LT
DATA EXTRACT
loads data from
sources
TYPICAL DATA FLOW in DWH BI-Quotient
www.bi-q.ie
step 1 step 2
STAGE DIMs E-LT
INT_A
PRC_B
INT_C
PKG_DE
INT_D
INT_E
ORCHESTRATION – ODI PACKAGES BI-Quotient
www.bi-q.ie
using object directly using scenarios – compiled code
PKG_ABC SYNCHRONOUS PKG_ABCDE
INT_A INT_A
PRC_B PRC_B
INT_C INT_C
PKG_DE
PKG_DE
INT_D
INT_E
ORCHESTRATION – ODI PACKAGES BI-Quotient
www.bi-q.ie
using object directly using scenarios – compiled code
PKG_ABC SYNCHRONOUS PKG_ABCDE
INT_A INT_A
PRC_B PRC_B
INT_C INT_C
PKG_DE
PKG_DE
INT_D
INT_E
ASYNCHRONOUS PKG_ABCDE
INT_A
PRC_B
INT_C
PKG_DE
ODI 10g vs. ODI 11 BI-Quotient
www.bi-q.ie
A D F
B E G
C
ODI 10g vs. ODI 11 BI-Quotient
www.bi-q.ie
ODI 11
Load plans
ODI 10g vs. ODI 11 BI-Quotient
www.bi-q.ie
ODI 11
Load plans
A D F
SAME
B E G EFFECT!
C
PROCESS FLOW EFFICIENCY ANALYSIS BI-Quotient
www.bi-q.ie
A
30 sequential
10 10
D F
B
10
30
A
10
D F
parallel
E G
C
10 B
30 10 10 E G
10 C
30 10
30 + 30 + 10 = 70
10
PROCESS FLOW EFFICIENCY ANALYSIS BI-Quotient
www.bi-q.ie
A
30 sequential
10 10
D F
B
10
30
A
10
D F
parallel
E G
C
10 B
30 10 10 E G
10 C
30 10
30 + 30 + 10 = 70
10
DOWNSIDES:
• POSSIBLE INEFFICIENCIES (IDLE RESOURCES)
PROCESS FLOW EFFICIENCY ANALYSIS BI-Quotient
www.bi-q.ie
OPTIMIZATION ATTEMPT
30
A 10 10
D F
10
B 30 10
E G
10
C
PROCESS FLOW EFFICIENCY ANALYSIS BI-Quotient
www.bi-q.ie
OPTIMIZATION ATTEMPT
30
sequential
A 10 10
D F
10 A D
B
parallel
30 10
E G F
10
C B
30 10 10
E G
10 30 + 10
30 10 + 10 = 50 C
10 10 + 30
OPTIMIZATION ATTEMPT
30
sequential
A 10 10
D F
10 A D
B
parallel
30 10
E G F
10
C B
30 10 10
E G
10 30 + 10
30 10 + 10 = 50 C
10 10 + 30
OPTIMIZATION ATTEMPT
A
30 sequential
D
10
F
10 70
B
10
30
A
10
D F
parallel
E G 70
C
10 B
30 10 10 E G
10 C
30 10
30 + 30 + 10 = 70
10
DOWNSIDE:
• INEFFICIENCY EXISTS BUT CAN’T BE RESOLVED
• CONSUMER WAITING & IMPACT
TRADITIONAL SCHEDULING - LIMITATIONS BI-Quotient
www.bi-q.ie
SCHEDULER
DEPENDENCY DRIVEN SCHEDULING BI-Quotient
www.bi-q.ie
A
D
B
E
C
A A A
B B B
C C C
D D D
E E E
A A A
B B B
C C C
D D D
E E E
DEPENDENCY DRIVEN SCHEDULING BI-Quotient
www.bi-q.ie
A
D
PACKGAGES
B &
E LOAD PLANS
C
A A A
B B B
C C C
D D D
E E E
A A A
B B B
C C C
D D D
E E E
PROCESS FLOW EFFICIENCY ANALYSIS BI-Quotient
www.bi-q.ie
A
30 sequential
10 10
D F 70
B
10
30
A
10
D F
parallel
E G 70
C
10 B
30 10 10 E G
10 C
30 10
30 + 30 + 10 = 70
10
30
A 10 10
D F
10
B 30 10
E G
10
C
10 10 10
10
30 30 10
PROCESS FLOW EFFICIENCY ANALYSIS BI-Quotient
www.bi-q.ie
A
30 sequential
10 10
D F 70
B
10
30
A
10
D F
parallel
E G 70
C
10 B
30 10 10 E G
10 C
30 10
30 + 30 + 10 = 70
10
30
A 10 10
D F 30
10
B 30 10
E G 70
10
C
10 10 10
10
30 30 10
• Notifications
– errors (+auto-restartability)
– finish summary
– logging
• Multiple/overlapping E-LT streams
– load with different frequencies
• Parameterization
– improved system stress control
– process prioritization
F I R S T RUN
10
processes
F I R S T RUN T O D AY
10 584
processes processes
1389
DEPENDENCIES
F I R S T RUN T O D AY
10 584
processes processes
1389
DEPENDENCIES
10 584
processes processes
1389
DEPENDENCIES
12h43m
TIME
LOAD PLANS
F I R S T RUN T O D AY
10 584
processes processes
1389
DEPENDENCIES