Sie sind auf Seite 1von 3

Data Stage Course content

Data warehousing concepts


About IBM web sphere DS & QS
 Data warehouse & Data
warehousing
 Data warehousing Architectures
 ETL project phases
 ETL Process
 ETL Architecture
 Over view of project Architecture
 ER, Dimensional Model & I E
Notation
 Data warehousing Teams & Roles
 Understand Mapping Documents
Introduction to Data Stage












History & Features


Difference between 7.5X2 & 8.x
IBS DS & QS 8.0.1
DS info Sphere 8.5 Enhancements
View on Web console
IBM Data Stage 7.5x2 & 8.x
Architecture
Traditional Batch Processing
Partition & Pipeline parallelism
Partitioning & re- Partitioning
techniques
Combinality, Combining & collecting
Techniques
Configuration File & Node
Components

 7.5X2 & 8X Enhancements-New


Features
 Parallel Processing environments
(SMP, MPP etc.)
 DS Client Components Overview
 DS architecture Components
Overview

Data Stage Jobs & Stages


 Difference between server & parallel
jobs
 Overview of Parallel jobs, server
jobs, mainframe jobs, migration jobs
and job sequencing jobs
 Repository, DS Designer Toolbar &
Palette
 Active & Passive stages
 Palette Customization
 About Link Markers
 Frame work Operators

Design, Compile & Run DS jobs


 DS Job Design Process
 Designer Canvas customization
 Compile, Force Compile & multiple
job compile
 DTD & OSH code

Working with file stages







Sequential file & stage rules


Data set & types
File set
Differences-Data et, file set & seq
file stages
 Lookup file set stage
Development/Debug & STUB Stages





Row generator & column generator


Head, tail & sample stages
Peek stage
Copy stage

Working with relational databases


 Enterprise and plug-in stages
overview
 Oracle, DB/2, SQL server stages
 TERADATA
 ODBC (oracle, SQL server, Excel &
Access)
 Dynamic RDBMS stage
 Orchestrate schema import

Processing stages
Horizontal combining
 Join stage
 Lookup stage (normal, sparse, range,
case less)s
 Merge state
 Funnel stage & funneling
Sorting & vertical combining
 In-stage sorts (traditional sort)

 Sort stage (complex & simple sorts)


 Aggregate stage
 Remove duplicate stage
Data transformation with transformer
 Basic transformer vs. parallel
transformer
 External functions & macros
 Stage variables & system variables
 Transformer constraints
 Transformer execution order
Filtering methods





Constraints & source level


Filter stage
Switch stage
External filter stage

Other stages





Modify stage
Compress & Expand stage
Decode & Encode stage
Pivot stage

Slowly changing dimensions


 Type -0, 1, 2 Dimensions
 Implementing SCD -I & II in 8.0.1 &
7.5x2
 SCD stage
 Change capture & change apply
stage
 Surrogate key stage (state file &
sequence object)

Performance tuning tips


 Performance tuning with best
practices
 Performance tuning with
partitioning techniques
 Performance Analysis & Estimate
resource
Advance topics
 Parameter set & Parameter file
creation
 Data connection
Job control
 Job sequencing (run stages, error
handling stages, flow control stages
& etc)
 Batch jobs
Data stage designer additional features
 Import & export of repository
components
 Node configuration
 Generate exports

Data stage director


 Schedule/run/validate a compiled
job
 Message handling (job & project
level)
 Monitor jobs, filter/purge logs
 Unlocking job & customize

Web console & DS administrator


 security services, session
management
 Domain management, reporting,
scheduling..
 ADD, Delete and Organize projects
Information analyzer
 Column, primary key, foreign key,
cross domain & base line analysis