Beruflich Dokumente
Kultur Dokumente
Agenda
Center Of Excellence
Overview &
Components
Target
s
Center Of Excellence
PowerCenter 8x Architecture
Client Tools
Sources
Designer
Standards,
Messaging,
Web Services
Packaged
Applications
Relational/Flat
Files
Mainframe/
Midrange
WF Manager
Rep. Manager
Targets
Monitor
Administration Console
Application Services
Integration
Service(s)
Repository
Service(s)
Web Services
Hub
SAP BW
Service
PowerCenter
Connects
Repository
Database
Standards,
Messaging,
Web Services
Packaged
Applications
Relational/Flat
Files
Mainframe/
Midrange
PowerExchange
Core Services
Domain/Gateway Service
Log Service
Authentication
Configuration
Service management
-
Center Of Excellence
What is a Domain?
Unified and single point of admin/config for:
Domain Consists of
Set of Nodes
Set of Services
Zero or more Grids
Set of Resources
Center Of Excellence
Center Of Excellence
Services
Application Service
Service that is configured by the end user and represents a key
visible component (Integration Service, Repository Service etc)
External clients directly interact with these services
Core Service
Infrastructure (internal) service (Gateway Service, Logging
Service etc)
Center Of Excellence
HA Setup
One Primary node and list of Backup nodes (Active/Passive
mode)
Application Services
Core Services
Center Of Excellence
Overview .. Informatica
Repository
Center Of Excellence
10
Grid
Collection on nodes
Integration Service can be assigned to Grid
Service runs on all nodes in the grid
Center Of Excellence
11
PowerCenter Architecture:
Data Flow
Windows 95,
98, NT 4.0
or 2000
Workflow
Monitor
Workflow
Manager
Client Tools
Repository
Manager
Designer
ODBC
ODBC
TCP/IP
Heterogeneous
Sources
Oracle
MS SQL Server
Sybase
SAP R/3 & BW Informix
PeopleSoftDB2 UDB
Siebel ODBC
MQ Series Flat File
XML
TIBCO
VSAM/COBOL
JMSCopybook
Remote Files
Heterogeneous
Targets
Repository
Service
Native/ODBC
Sources
Targets
Object
Repository
Native
ODBC
Oracle
MS SQL
Server
Sybase
DB2 UDB
PowerConnect
TCP/IP
Native
ODBC
Remote Files
PowerConnect
TCP/IP
PowerCenter Server Engine
Buffers
Key
Data
Metadata
Reader
DTM
Writer
Center Of Excellence
12
Center Of Excellence
13
Center Of Excellence
14
Overview .. Sources
Center Of Excellence
15
Overview .. Targets
Center Of Excellence
16
Questions
Center Of Excellence
17
Center Of Excellence
19
Informatica Server
Center Of Excellence
20
Center Of Excellence
21
Center Of Excellence
22
Center Of Excellence
23
DTM Threads
Center Of Excellence
24
DTM Threads
For example, a pipeline contains one source, one target. You configure two
partitions in the session properties. The DTM creates the following threads to
process the pipeline:
Two reader threads - One for each partition.
Two writer threads - One for each partition
When the pipeline contains an Aggregator or Rank transformation, the DTM
creates one additional set of threads for each Aggregator or Rank transformation
Center Of Excellence
25
DTM Threads
Center Of Excellence
26
Questions
Center Of Excellence
27
Repository Server
Repository Server
Center Of Excellence
29
Center Of Excellence
30
Questions
Center Of Excellence
31
Repository Manager
Center Of Excellence
33
Repository
Center Of Excellence
34
Repository
Can create and store the
following types of metadata in
the repository:
Database connections
Global objects
Mappings
Mapplets
Multi-dimensional metadata
Reusable transformations
Sessions and batches
Shortcuts
Source definitions
Target definitions
Transformations
Center Of Excellence
35
Repository Types
Center Of Excellence
36
Center Of Excellence
37
Center Of Excellence
38
Dependency Window
The Dependency window can display the following types of
dependencies:
Center Of Excellence
39
Center Of Excellence
40
Crystal Reports
The Repository Manager includes four Crystal Reports that provide
views of your metadata:
Center Of Excellence
41
Repository Security
Center Of Excellence
42
Center Of Excellence
43
Types of Locks
There are five kinds of locks on repository objects:
Read lock - Created when you open a repository object in a folder
for which you do not have write permission
Save lock
Center Of Excellence
44
Folders
Center Of Excellence
45
Folders
Center Of Excellence
46
Folders
Any mapping in a folder can use only those source and target
definitions or reusable transformations that are stored:
in the same folder
in a shared folder and accessed through a shortcut
Center Of Excellence
47
Folders
Center Of Excellence
48
Copying Folders
Center Of Excellence
49
Copying Folders
When you copy a folder,
the Repository Manager
allows to:
Re-establish
shortcuts
Choose an
Informatica Server
Copy connections
Copy persisted values
Compare folders
Replace folders
Center Of Excellence
50
Comparing Folders
Center Of Excellence
51
Comparing Folders
Center Of Excellence
52
Comparing Folders
Center Of Excellence
53
Folder Versions
Center Of Excellence
54
Center Of Excellence
55
Questions
Center Of Excellence
56
Designer
Center Of Excellence
58
Designer Workspace
Center Of Excellence
Navigator
Workspace
Status bar
Output
59
Designer Tools
Source Analyzer
To import or create source definitions for flat file, XML,
Cobol, ERP, and relational sources
Warehouse Designer
To import or create target definitions
Transformation Developer
To create reusable transformations
Mapplet Designer
To create mapplets
Mapping Designer
To create mappings
Center Of Excellence
60
Source Analyzer
Center Of Excellence
61
Center Of Excellence
62
The source definition appears in the Source Analyzer. In the Navigator, the new
source definition appears in the Sources node of the active repository folder,
under the source database name
Center Of Excellence
63
Center Of Excellence
64
Center Of Excellence
65
Warehouse Designer
Center Of Excellence
66
Center Of Excellence
67
Center Of Excellence
68
Mapping
Center Of Excellence
69
Mapping
Sample Mapping
Center Of Excellence
70
Mapping - Invalidation
Center Of Excellence
71
Mapping - Components
Center Of Excellence
72
Mapping - Updates
Center Of Excellence
73
Mapping - Validation
The Designer marks a mapping valid for the following reasons:
Connection validation - Required ports are connected and that
all connections are valid
Expression validation - All expressions are valid
Object validation - The independent object definition matches
the instance in the mapping
The Designer performs connection validation each time you
connect ports in a mapping and each time you validate or save a
mapping
You can validate an expression in a transformation while you are
developing a mapping
Center Of Excellence
74
Mapping - Validation
Center Of Excellence
75
Questions
Center Of Excellence
76
Transformations used
in Informatica
Transformations
Data passes into and out of transformations through ports that you
connect in a mapping or mapplet
Center Of Excellence
78
Transformation Types
Center Of Excellence
79
Center Of Excellence
80
Center Of Excellence
81
Transformations - Properties
Port Name
Copied ports will inherit the name of contributing port
Copied ports with the same name will be appended with a
number
Data types
Transformations use internal data types
Data types of input ports must be compatible with data types
of the feeding output port
Center Of Excellence
82
Aggregator Transformation
Performs aggregate
calculations
Components of the
Aggregator
Transformation
Aggregate expression
Group by port
Sorted Input option
Aggregate cache
The Aggregator is an
active and connected
transformation
Center Of Excellence
83
Aggregator Transformation
Center Of Excellence
84
Expression Transformation
Center Of Excellence
85
Expression Transformation
Center Of Excellence
86
Filter Transformation
Center Of Excellence
87
Joiner Transformation
Center Of Excellence
88
Joiner Transformation
Use the Joiner transformation to join two sources with at
least one matching port
It uses a condition that matches one or more pairs of ports
between the two sources
Requires two input transformations from two separate data
flows
It supports the following join types
Normal (Default)
Master Outer
Detail Outer
Center Of Excellence
Full Outer
89
Lookup Transformation
Used to look up data in a relational table, view, or synonym
The Informatica Server queries the lookup table based on the lookup
ports in the transformation
It compares Lookup transformation port values to lookup table column
values based on the lookup condition
Can use the Lookup transformation to perform many tasks, including:
Get a related value
Perform a calculation
Update slowly changing dimension tables
Center Of Excellence
90
For each input row, the Informatica Server queries the lookup table or
cache based on the lookup ports and the condition in the
transformation
Center Of Excellence
91
Lookup Transformation
With unconnected Lookups, you can pass multiple input values into the
transformation, but only one column of data out of the transformation
Use the return port to specify the return value in an unconnected lookup
transformation
Center Of Excellence
92
Lookup Caching
Session performance can be improved by
caching the lookup table
Caching can be static or dynamic
By default, the lookup cache remains static
and does not change during the session
Caching can be persistent Cache used
across sessions
Center Of Excellence
93
Router Transformation
A Router transformation tests data for one or
more conditions and gives the option to route
Itany
has the
following types
rows of data that do not meet
of the
of groups:
conditions to a default output group
Input
Output
There are two types of
output groups:
User-defined groups
Default group
Center Of
Center Of Excellence
95
Sequence Generator
Transformation
Generates numeric
values
It can be used to
create unique primary key values
replace missing primary keys
cycle through a sequential range of numbers
It provides two output ports: NEXTVAL and CURRVAL
These ports can not be edited or deleted
Can not add ports to the sequence generator
transformation
When NEXTVAL is connected to the input port of another
transformation, the Informatica Server generates a
sequence of numbers
Center Of Excellence
96
Sequence Generator
Transformation
Center Of Excellence
97
Source Qualifier
Transformation
The Source Qualifier represents the records that the
Informatica Server reads when it runs a session
Can use the Source Qualifier to perform the following tasks:
Join data originating from the same source database
Filter records when the Informatica Server reads source
data
Specify an outer join rather than the default inner join
Specify sorted ports
Select only distinct values from the source
Create a custom query to issue a special SELECT
statement for the Informatica Server to read source data
Center Of Excellence
98
For relational sources, the Informatica Server generates a query for each Source
Qualifier when it runs a session
The default query is a SELECT statement for each source column used in the
mapping
The Informatica Server reads only those columns in Source Qualifier that are
connected to another transformation
Center Of Excellence
99
DD_INSERT 0
DD_UPDAT 1
E
Delete
DD_DELET
E
Reject
DD_REJEC 3
Center Of Excellence
T
100
Rank Transformation
Allows to select only the top or bottom rank of data, not just
one value
Can use it to return
the largest or smallest numeric value in a port or group
the strings at the top or the bottom of a session sort order
During the session, the Informatica Server caches input
data until it can perform the rank calculations
Can select only one port to define a rank
Center Of Excellence
101
Rank Transformation
When you create a Rank transformation, you can configure the following properties:
Enter a cache directory
Select the top or bottom rank
Select the input/output port that contains values used to determine the rank. You
can select only one port to define a rank
Select the number of rows falling within a rank
Define groups for ranks
Center Of Excellence
102
Rank Transformation
Rank Transformation
Ports:
Variable port - Can
use to store values
or calculations to
use in an expression
Rank port - Use to
designate the
column for which
you want to rank
values
Center Of Excellence
103
Stored Procedure
Transformation
A Stored Procedure transformation is
an important tool for populating and maintaining databases
a precompiled collection of Transact-SQL statements and
optional flow control statements, similar to an executable
script
used to call a stored procedure
The stored procedure must exist in the database before
creating a Stored Procedure transformation
One of the most useful features of stored procedures is the
ability to send data to the stored procedure, and receive data
from the stored procedure
Center Of Excellence
104
Stored Procedure
Transformation
There are three
types of data that pass between the
Informatica Server and the stored procedure:
Input/Output parameters - For many stored
procedures, you provide a value and receive a
value in return
Return values - Most databases provide a return
value after running a stored procedure
Status codes - Status codes provide error handling
for the Informatica Server during a session
Center Of Excellence
105
Stored Procedure
Transformation
Center Of Excellence
106
Stored Procedure
Transformation
Can set up the Stored Procedure transformation in one of two modes, either
connected or unconnected
The flow of data through a mapping in connected mode also passes through the
Stored Procedure transformation
Cannot run the same instance of a Stored Procedure transformation in both
connected and unconnected mode in a mapping. You must create different
instances of the transformation
Center Of Excellence
107
Stored Procedure
Transformation
Center Of Excellence
108
Dynamic Lookup
Transformation
Center Of Excellence
109
Dynamic Lookup
Transformation
Center Of Excellence
110
Transformation Language
The designer provides a transformation language to help you write
expressions to transform source data
With the transformation language, you can create a transformation
expression that takes the data from a port and changes it
Can write expressions in the following transformations:
Aggregator
Expression
Filter
Rank
Router
Update Strategy
Center Of Excellence
111
Transformation Language
Expressions can consist of any combination of the
following components:
Ports (input, input/output, variable)
String literals, numeric literals
Constants
Functions
Local and system variables
Mapping parameters and mapping variables
Operators
Return values
Center Of Excellence
112
Transformation Language
The functions available in PowerCenter are
Aggregate Functions e.g. AVG, MIN, MAX
Character Functions e.g. CONCAT, LENGTH
Conversion Functions e.g. TO_CHAR, TO_DATE
Date Functions e.g. DATE_DIFF, LAST_DAY
Numeric Functions e.g. ABS, CEIL, LOG
Scientific Functions e.g. COS, SINH
Special Functions e.g. DECODE, IIF, ABORT
Test Functions e.g. ISNULL, IS_DATE
Variable Functions e.g. SETMAXVARIABLE
Center Of Excellence
113
Questions
Center Of Excellence
114
Re usable
Transformations and
Mapplets
Reusable Transformation
A Transformation is said to be in reusable mode
when multiple instances of the same
transformation can be created.
Reusable transformations can be used in multiple
mappings.
Creating Reusable transformations:
Design it in the Transformation Developer
Promote a standard transformation from the Mapping
Designer.
Center Of Excellence
116
Mapplet
A mapplet is a reusable object that represents a set of
transformations
It allows to reuse transformation logic and can contain as
many transformations as needed
Mapplets can:
Include source definitions
Accept data from sources in a mapping
Include multiple transformations
Pass data to multiple pipelines
Contain unused ports
Center Of Excellence
117
Center Of Excellence
118
Mapplet - Components
Each mapplet must include the following:
One Input transformation and/or Source Qualifier
transformation
At least one Output transformation
A Mapplet should contain exactly one of the following:
Input transformation with at least one port
connected to a transformation in the mapplet
Source Qualifier transformation with at least one
port connected to a source definition
Center Of Excellence
119
Mapplet
Center Of Excellence
120
Expanded Mapplet
For example, in the figure, the mapplet uses the Input transformation
IN_CustID_FirstLastName to define mapplet input ports. The Input
transformation is connected to one transformation, EXP_WorkaroundLookup,
which passes data to two separate transformations
Center Of Excellence
121
Questions
Center Of Excellence
122
1. Task Developer
2. Workflow
Designer
3. Worklet Designer
Workflow Monitor
1. Gantt
Chart
2. Task View
Center Of Excellence
124
Workflow Manager
The Workflow Manager replaces the Server Manager in version
5.0. Instead of running sessions, you now create a process called
the workflow in the Workflow Manager.
A workflow is a set of instructions on how to execute tasks such as
sessions, emails, and shell commands.
A session is now one of the many tasks you can execute in the
Workflow Manager.
The Workflow Manager provides other tasks such as Assignment,
Decision, and Events. You can also create branches with
conditional links. In addition, you can batch workflows by creating
worklets in the Workflow Manager.
Center Of Excellence
125
Workflow
Manager
Center Of Excellence
126
Workflow Designer
Use the Workflow Designer to create a workflow by
connecting tasks with links. You can also create tasks in
the Workflow Designer as you develop the workflow.
Worklet Designer
Use the Worklet Designer to create a worklet.
Center Of Excellence
127
Workflow Tasks
Center Of Excellence
128
Create Task
Center Of Excellence
129
Workflow Monitor
PowerCenter 6.0 provides a new tool, the Workflow
Monitor, to monitor workflow, worklets, and tasks.
The Workflow Monitor displays information about
workflows in two views:
1. Gantt Chart view
2. Task view.
130
Center Of Excellence
131
Center Of Excellence
132
Questions
Center Of Excellence
133
Performance Tuning
Performance Tuning
First step in performance tuning is to identify the
performance bottleneck in the following order :
Target
Source
Mapping
Session
System
The most common performance bottleneck occurs
when the Informatica Server writes to a target database.
Center Of Excellence
135
Target Bottlenecks
Identifying
A target bottleneck can be identified by
configuring the session to write to a flat file
target.
Optimizing
Dropping Indexes and Key Constraints before
loading.
Increasing commit intervals.
Use of Bulk Loading / External Loading.
Center Of Excellence
136
Source Bottlenecks
Identifying
Add a filter condition after Source qualifier to false so
that no data is processed past the filter transformation.
If the time it takes to run the new session remains
about the same, then there is a source bottleneck.
In a test mapping remove all the transformations and if
the performance is similar, then there is a source
bottleneck.
Optimizing
Optimizing the Query by using hints.
Use informatica Conditional Filters if the source
system lacks indexes.
Center Of Excellence
137
Mapping Bottlenecks
Identifying
If there is no source bottleneck, add a Filter
transformation in the mapping before each target
definition. Set the filter condition to false so that no
data is loaded into the target tables. If the time it
takes to run the new session is the same as the
original session, there is a mapping bottleneck.
Optimizing
Configure for Single-Pass reading
Avoid unnecessary data type conversions.
Avoid database reject errors.
Use Shared Cache / Persistant Cache
Center Of Excellence
138
Session Bottlenecks
Identifying
If there is no source, Target or Mapping bottleneck, then there
may be a session bottleneck.
Use Collect Performance Details. Any value other than zero in
the readfromdisk and writetodisk counters for Aggregator,
Joiner, or Rank transformations indicate a session bottleneck.
Low (0-20%) BufferInput_efficiency and BufferOutput_efficiency
counter values also indicate a session bottleneck.
Optimizing
Increase the number of partitions.
Tune session parameters.
Center Of Excellence
139
Configure the index and data cache memory for the Aggregator, Rank, and
Joiner transformations in the Configuration Parameters dialog box
The amount of memory you configure depends on partitioning, the
transformation that requires the largest cache, and how much memory cache
and disk cache you want to use
Center Of Excellence
140
Center Of Excellence
141
Incremental Aggregation
First Run creates idx and dat files.
Second Run performs the following actions:
For each i/p record, the Server checks historical information in the index
file for a corresponding group, then:
If it finds a corresponding group, it performs the aggregate operation
incrementally, using the aggregate data for that group, and saves the
incremental change
If it does not find a corresponding group, it creates a new group and
saves the record data
When writing to the target Informatica Server
Updates modified aggregate groups in the target
Inserts new aggregate data
Deletes removed aggregate data
Ignores unchanged aggregate data
Saves modified aggregate data in the index and data files
Center Of Excellence
142
Incremental Aggregation
You can find options for incremental aggregation on the Transformations tab in the
session properties
The Server Manager displays a warning indicating the Informatica Server overwrites
the existing cache and a reminder to clear this option after running the session
Center Of Excellence
143
System Bottlenecks
Identifying
If there is no source, Target, Mapping or Session bottleneck,
then there may be a system bottleneck.
Use system tools to monitor CPU usage, memory usage, and
paging.
On Windows :- Task Manager
On Unix Systems toots like sar, iostat. For Eg: sar u
(%usage on user, idle time, i/o waiting time)
Optimizing
Improve network speed.
Improve CPU performance
Check hard disks on related machines
Reduce Paging
Center Of Excellence
144
PMCMD
Can use the command line program pmcmd to communicate with the
Informatica Server
Can perform the following actions with pmcmd:
Determine if the Informatica Server is running
Start sessions and batches
Stop sessions and batches
Recover sessions
Stop the Informatica Server
Can configure repository usernames and passwords as
environmental variables with pmcmd
can also customize the way pmcmd displays the date and time on
the machine running the Informatica Server
pmcmd returns zero on success and non-zero on failure
You can use pmcmd with operating system scheduling tools like cron
to schedule sessions, and you can embed pmcmd into shell scripts
or Perl programs to run or schedule sessions
Center Of Excellence
145
PMCMD
Need the following information to use pmcmd:
Repository username
Repository password
Connection type - The type of connection from the client
machine to the Informatica Server
Port or connection - The TCP/IP port number or IPX/SPX
connection (Windows NT/2000 only) to the Informatica
Server
Host name - The machine hosting the Informatica Server
Session or batch name - The names of any sessions or
batches you want to start or stop
Folder name - The folder names for those sessions or
batches
Parameter file Center Of Excellence
146
Commit Points
A commit interval is the interval at which the Informatica
Server commits data to relational targets during a session
The commit point can be a factor of the commit interval, the
commit interval type, and the size of the buffer blocks
The commit interval is the number of rows you want to use
as a basis for the commit point
The commit interval type is the type of rows that you want
to use as a basis for the commit point
Can choose between the following types of commit interval
Target-based commit
Source-based commit
During a source-based commit session, the Informatica
Server commits data to the target based on the number of
rows from an active Center
sourceOfinExcellence
a single pipeline
147
Commit Points
During a target-based
commit session, the
Informatica Server continues
to fill the writer buffer after it
reaches the commit interval
When the buffer block is
filled, the Informatica Server
issues a commit command
As a result, the amount of
data committed at the
commit point generally
exceeds the commit interval
Center Of Excellence
148
Commit Points
During a source-based commit session, the Informatica Server
commits data to the target based on the number of rows from an
active source in a single pipeline
These rows are referred to as source rows
A pipeline consists of a source qualifier and all the transformations
and targets that receive data from the source qualifier
An active source can be any of the following active
transformations:
Advanced External Procedure
Source Qualifier
Normalizer
Aggregator
Joiner
Rank
Mapplet, if it contains one of the above transformations
Center Of Excellence
149
Commit Points
When the Informatica Server runs a source-based commit session, it identifies the
active source for each pipeline in the mapping
The Informatica Server generates a commit row from the active source at every
commit interval
When each target in the pipeline receives the commit row, the Informatica Server
performs the commit
Center Of Excellence
150
Commit Points
Center Of Excellence
151
Multiple Servers
You can register multiple PowerCenter Servers with a
PowerCenter repository
Can run these servers at the same time
Can distribute the repository session load across available
servers to improve overall performance
Can use the Server Manager to administer and monitor
multiple servers
With multiple Informatica Servers, you need to decide which
server you want to run each session and batch
You can register and run only one PowerMart Server in a
local repository
Cannot start a PowerMart Server if it is registered in a local
repository that has multiple servers registered to it
Center Of Excellence
152
Multiple Servers
When attached to multiple servers, you can only view, or monitor, one Informatica
Server at a time, but you have access to all the servers in the repository
Center Of Excellence
153
Questions
Center Of Excellence
154
Debugger
Debugger
After you save a mapping, you can run some initial tests
with a debug session before you configure and run a
session in the Server Manager
Center Of Excellence
156
Debugger
Center Of Excellence
157
Debugger
Debugger
After you set the instance name, breakpoint type, and optional data condition,
you can view each parameter in the Breakpoints section of the Breakpoint Editor
Center Of Excellence
159
Questions
Center Of Excellence
160