
Introduction to DataStage

The DataStage ETL component


DataStage Server components
  Repository
  DataStage Server
  DataStage Package Installer
DataStage Client components
  Manager
  Designer
  Director
  Administrator

Server Components
Repository - A central store that contains all the information required to build a data mart or data warehouse.

DataStage Server - Runs executable jobs that extract, transform, and load data into a data warehouse.

DataStage Package Installer - A user interface used to install packaged DataStage jobs and plug-ins.

Client Components
DataStage Manager
The DataStage Manager is the basic metadata management tool.

Used to create and organize the metadata types residing in the DataStage Repository, including legacy source and data warehouse target data definitions and ETL job components.
Can import or export jobs, table definitions, routines, transforms, containers, etc.
Provides reporting and documentation tools for viewing the detailed metadata in the repository.

Client Components
DataStage Designer
In the Designer, ETL tasks execute within individual "stage" objects that you assemble to create complete ETL "jobs."
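
As a rough illustration of the stage-and-job idea above, the following Python sketch models each stage as a generator function and a job as the chain of those stages. It is not DataStage code: the function names (`source_stage`, `transformer_stage`, `target_stage`, `run_job`) and the sample rows are hypothetical, chosen only to show how separate stages are assembled into one job.

```python
from typing import Dict, Iterable, Iterator, List

Row = Dict[str, str]

def source_stage(lines: Iterable[str]) -> Iterator[Row]:
    """Extract: parse comma-separated 'id,name' lines into rows."""
    for line in lines:
        cust_id, name = line.strip().split(",")
        yield {"id": cust_id, "name": name}

def transformer_stage(rows: Iterable[Row]) -> Iterator[Row]:
    """Transform: trim whitespace and upper-case the name column."""
    for row in rows:
        yield {"id": row["id"], "name": row["name"].strip().upper()}

def target_stage(rows: Iterable[Row], table: List[Row]) -> int:
    """Load: append rows to an in-memory 'table' and return the row count."""
    count = 0
    for row in rows:
        table.append(row)
        count += 1
    return count

def run_job(lines: Iterable[str], table: List[Row]) -> int:
    """A 'job' is the stages assembled in order; each hand-off plays the role of a link."""
    return target_stage(transformer_stage(source_stage(lines)), table)

# Hypothetical sample input standing in for a source file.
warehouse: List[Row] = []
loaded = run_job(["1, alice", "2,bob "], warehouse)
```

Each function only knows about the rows it receives, so stages can be rearranged or replaced without touching the others, which is the property the Designer's drag-and-drop stages rely on.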

Types of Jobs in DataStage
  Server
  Parallel
  Job Sequencer
  Shared Container
  Mainframe

Different stages for Server Jobs


Sequential File stages
Partitioner/Collector & IPC stages
Shared in-memory hashed files stages
Oracle Bulk Loader
OCI Oracle stage
ODBC stage
Transformer Stage
Link Stage

Different stages for Parallel Extender Jobs


Join Stage
Funnel Stage
Merge Stage
Remove Duplicates Stage
Compress Stage
Row Generator Stage
Column Generator Stage
Change Capture Stage
Change Apply Stage
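
To make a few of the stages listed above more concrete, here is a minimal Python sketch of what Funnel, Remove Duplicates, and Change Capture do conceptually. It is an assumption-laden illustration, not DataStage's implementation: the function names, the string change labels (`"insert"`, `"edit"`, `"delete"`), and the sample data are all hypothetical.

```python
from typing import Dict, Iterable, List, Tuple

Row = Dict[str, object]

def funnel(*inputs: Iterable[Row]) -> List[Row]:
    """Funnel: combine several inputs with the same columns into one output."""
    out: List[Row] = []
    for rows in inputs:
        out.extend(rows)
    return out

def remove_duplicates(rows: Iterable[Row], key: str) -> List[Row]:
    """Remove Duplicates: keep the first row seen for each key value."""
    seen = set()
    out: List[Row] = []
    for row in rows:
        if row[key] not in seen:
            seen.add(row[key])
            out.append(row)
    return out

def change_capture(before: Iterable[Row], after: Iterable[Row],
                   key: str) -> List[Tuple[str, Row]]:
    """Change Capture: compare two data sets by key and label the differences."""
    before_by_key = {row[key]: row for row in before}
    after_by_key = {row[key]: row for row in after}
    changes: List[Tuple[str, Row]] = []
    for k, row in after_by_key.items():
        if k not in before_by_key:
            changes.append(("insert", row))
        elif row != before_by_key[k]:
            changes.append(("edit", row))
    for k, row in before_by_key.items():
        if k not in after_by_key:
            changes.append(("delete", row))
    return changes

# Hypothetical sample data.
merged = funnel([{"id": 1}], [{"id": 1}, {"id": 2}])
unique = remove_duplicates(merged, "id")
changes = change_capture(
    before=[{"id": 1, "city": "Pune"}, {"id": 2, "city": "Delhi"}],
    after=[{"id": 1, "city": "Mumbai"}, {"id": 3, "city": "Goa"}],
    key="id",
)
```

The list of labelled changes is the kind of output a Change Apply step would consume to bring the "before" data set up to date.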

Different stages for Sequencer Job


Routine
Job
ExecCommand
Email Notification
Wait for file
Run activity on exception
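
The sequencer activities above can be pictured as ordinary control flow. The sketch below is a hypothetical Python analogue, not DataStage's sequencer: `wait_for_file`, `exec_command`, and `run_sequence` are illustrative names, and the `on_exception` callback merely stands in for an exception-handling or notification activity.

```python
import subprocess
import sys
import time
from pathlib import Path
from typing import Callable, List

def wait_for_file(path: Path, timeout_s: float, poll_s: float = 0.05) -> bool:
    """Wait for file: poll until the file exists or the timeout expires."""
    deadline = time.monotonic() + timeout_s
    while True:
        if path.exists():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(poll_s)

def exec_command(args: List[str]) -> str:
    """ExecCommand: run an external command; raises if it exits non-zero."""
    result = subprocess.run(args, capture_output=True, text=True, check=True)
    return result.stdout.strip()

def run_sequence(activities: List[Callable[[], object]],
                 on_exception: Callable[[Exception], object]) -> bool:
    """Run activities in order; on the first failure call the handler and stop."""
    for activity in activities:
        try:
            activity()
        except Exception as exc:
            on_exception(exc)
            return False
    return True

# Hypothetical usage: run a trivial command through the current interpreter.
greeting = exec_command([sys.executable, "-c", "print('ok')"])
```

In a real sequence the handler would typically send a notification or trigger a recovery job; here it is left as a plain callback so the control flow stays visible.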

Shared Container Job


A container is a group of stages and links. Containers help simplify and modularize server job designs by replacing complex areas of the diagram with a single container stage.

Types of containers
  Shared Container
  Local Container
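
The container idea described above, several stages collapsed into one, can be sketched in Python as function composition. This is only an analogy under stated assumptions: `make_container`, `trim_names`, `drop_empty_names`, and the sample rows are hypothetical and do not correspond to any DataStage API.

```python
from typing import Callable, Dict, Iterable, Iterator

Row = Dict[str, str]
Stage = Callable[[Iterable[Row]], Iterator[Row]]

def make_container(*stages: Stage) -> Stage:
    """Group several stages so callers see them as a single stage."""
    def container(rows: Iterable[Row]) -> Iterator[Row]:
        for stage in stages:
            rows = stage(rows)  # the output of one stage feeds the next
        return iter(rows)
    return container

def trim_names(rows: Iterable[Row]) -> Iterator[Row]:
    """Strip surrounding whitespace from the name column."""
    for row in rows:
        yield {**row, "name": row["name"].strip()}

def drop_empty_names(rows: Iterable[Row]) -> Iterator[Row]:
    """Discard rows whose name is empty."""
    for row in rows:
        if row["name"]:
            yield row

# One grouping, used from two different (hypothetical) jobs.
cleanse = make_container(trim_names, drop_empty_names)
customers = list(cleanse([{"name": " Ann "}, {"name": "  "}]))
suppliers = list(cleanse([{"name": "Acme"}]))
```

Reusing `cleanse` from two places loosely mirrors a shared container; a local container would be the same grouping used inside a single job only.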

Client Components
DataStage Director
Director interactively monitors and controls the operation of the DataStage Server.

Schedules and monitors jobs
Can view the job log file (messages and warnings)
Collects statistics
Performs recoveries
Can validate, schedule, run, and delete jobs
Can create, run, schedule, and unschedule a job batch

Client Components
DataStage Administrator
Create projects
Assign users and roles
Run the scheduled jobs
Set the buffer properties
Tune the job log files
