Beruflich Dokumente
Kultur Dokumente
Course Objectives
At the end of this course you will be able to:
Understand how to use all major PowerCenter
components
Agenda
Duration 1.5 hrs per day, 4 Weeks training course
Version 8.6
Operational Systems
RDBMS Mainframe
Data
Warehouse
Other
Aggregate Data
Cleanse Data
Consolidate Data
Apply Business Rules
De-normalize
Aggregated data
Historical
Transform
Extract
ETL
Load
4
Informatica as ETL
Client Objects
Designer
Repository Manager
Workflow Manager
Workflow Monitor
8
Informatica Repository
The Informatica repository is a set of tables that store the
metadata created using the Informatica Client tools.
Metadata is added to the repository tables when we
perform tasks in the Informatica Client application such
as developing mappings or creating sessions.
The Workflow Manager adds metadata to the repository
tables in the form of tasks and workflows.
The Integration Service creates metadata in the
repository such as start and finish times of tasks as well
as workflow status.
10
11
Integration Service
The Integration Service reads mapping and session
information from the repository.
It extracts data from the mapping sources and stores the
data in memory while it applies the transformation rules
that are configured in the mapping.
The Integration Service loads the transformed data into
the mapping targets.
12
Repository Service
The Repository Service is an application service that
manages the repository. It retrieves, inserts, and updates
metadata in the repository database tables. Select a
Repository Service in the Navigator to access information
about the service.
Integration Service
Repository
Service
Repository
Manager
Repository
Agent(s)
Repository
13
14
Installations Required
Oracle : 10G or any Relational DB
PL/SQL Developer or Toad
Informatica 8x
15
Lab
Install
Repository Creation
16
17
18
Design Process
1. Create Source definition(s)
2. Create Target definition(s)
3. Create a Mapping
4. Create a Session Task
5. Create a Workflow from Task components
6. Run the Workflow and verify the results
19
20
21
Repository
Relational
XML file
Source
Analyzer
Flat file
COBOL file
22
Relational Source
ODBC
Table
View
Synonym
DEF
Repository
Server
TCP/IP
Repository
Agent
native
DEF
Repository
23
24
Source
Analyzer
Flat File
DEF
Fixed Width or
Delimited
Repository
Server
TCP/IP
Repository
Agent
native
DEF
Repository
25
26
27
Data Previewer
Preview data in
Relational Sources
Flat File Sources
Relational Targets
Flat File Targets
28
29
Data Display
30
31
32
33
34
35
Repository
Server
Database
DEF
TCP/IP
Table
View
Synonym
Repository
Agent
native
DEF
Repository
36
37
38
39
DEF
DEF
DEF
Execute SQL
via
Designer
Target database
physical tables
40
41
42
43
Lookup properties
Can override
Lookup SQL
Toggle
caching
Database
Connection
Object name
171
Make cache
persistent
Set cache
sizes
172
Lookup Conditions
Multiple conditions are supported
173
Uncached
Each Mapping row needs one SQL SELECT
175
Dynamic Lookup
176
Dynamic Lookup
By the end of this section you will be familiar with:
Dynamic lookup theory
Dynamic lookup advantages
Dynamic lookup limitations
177
Persistent caches
By default, Lookup caches are not persistent
When Session completes, cache is erased
Cache can be made persistent with the Lookup
properties
When Session completes, the persistent cache is
stored on server hard disk files
The next time Session runs, cached data is loaded
fully or partially into RAM and reused
Can improve performance, but stale data may pose
a problem
179
180
181
182
183
Associated Port
184
185
186
187
INSERT
DD_INSERT
UPDATE DD_UPDATE1
DELETE DD_DELETE
REJECT DD_REJECT
189
inserted
191
Router transformation
192
Router transformation
By the end of this section you will be familiar with:
Using a Router
Router groups
193
Router transformation
Rows sent to multiple filter conditions
Active Transformation
Connected
Ports
All input/output
Specify filter condition
for each group
Usage
Link source data in
one pass to multiple
filter conditions
194
195
Router Groups
Input group (always one)
User-defined groups
Default group (always one)
Each group has one condition
ALL group conditions are
evaluated for each row
Group outputs can be ignored
196
Reusable transformations
197
Reusable transformations
By the end of this section you will be familiar with:
Reusable transformation advantages
Reusable transformation limitations
Promoting transformations to reusable
Demoting reusable transformations
198
Transformation Developer
Reusable
transformations
199
Reusable transformations
Define once - reuse many times
Reusable Transformations
201
203
204
Passive Transformation
Connected
Ports
Two predefined output
ports, NEXTVAL and
CURRVAL
No input ports allowed
Usage
Generate sequence
numbers
Shareable across
mappings
205
Target Options
206
Target options
By the end of this section you will be familiar with:
Row operations
Load types
Constraint-based loading
Error handling
207
Target properties
Session Task
Chose target
Row operations
Error handling
208
Delete SQL
DELETE from <target> WHERE <primary key> = <pkvalue>
The SQL statement used will appear in the Session log file
209
Constraint-based Loading
Maintains referential integrity in the Targets
pk1
fk1, pk2
fk2
pk1
fk1, pk2
fk2
Example 1
With only One Active source, rows
for Targets 1-3 will be loaded
properly and maintain referential
integrity
Example 2
With Two Active sources, it is not
possible to control whether rows for
Target 3 will be loaded before or
after those for Target 2
210
212
213
Multi-Task Workflows
Tasks can be run sequentially, like this:
Tasks shows are all Sessions, but they can also be other
Tasks such as Commands, Timer, Email, etc.
214
Multi-Task Workflows
Tasks can be run concurrently, like this:
Tasks shows are all Sessions, but they can also be other
Tasks such as Commands, Timer, Email, etc.
215
Multi-Task Workflows
Tasks can be run in a combination concurrent and
sequential pattern within one Workflow, like this:
Tasks shows are all Sessions, but they can also be other
Tasks such as Commands, Timer, Email, etc.
216
Additional transformations
217
Additional transformations
By the end of this section you will be familiar with:
The Rank transformation
The Normalizer transformation
The Stored Procedure transformation
The External Procedure transformation
The Union transformation
218
Rank transformation
Filters the top or bottom range of records
Active Transformation
Connected
Ports
Mixed
One predefined output
port RANKINDEX
Variables allowed
Group By allowed
Usage
Select top/bottom
Number of records
219
Normalizer transformation
Normalizes records from relational or VSAM sources
Active Transformation
Connected
Ports
Input / output or output
Usage
Required for VSAM
Source definitions
Normalize flat file or
relational source
definitions
Generate multiple records
from one record
220
Normalizer transformation
YEAR,ACCOUNT,MONTH1,MONTH2,MONTH3, MONTH12
1997,Salaries,21000,21000,22000,19000,23000,26000,29000,29000,34000,34000,4
0000,45000
1997,Benefits,4200,4200,4400,3800,4600,5200,5800,5800,6800,6800,8000,9000
1997,Expenses,10500,4000,5000,6500,3000,7000,9000,4500,7500,8000,8500,8250
221
Normalizer transformation
Generated Column ID
222
Passive Transformation
Connected/Unconnected
Ports
Mixed
R denotes port will return
a value from the stored
function to the next
transformation
Usage
Perform transformation
logic outside PowerMart /
PowerCenter
223
Union Transformation
Multiple input group transformation that can be used to
merge data from multiple pipelines or pipeline branches
into one pipeline branch.
Similar to the UNION ALL SQL statement
Union transformation does not remove duplicate rows
224
225
226
Java Transformation
Transformation type:
Active/Passive
Connected
Java transformation behavior is based on the following
events:
The transformation receives an input row
The transformation has processed all input rows
The transformation receives a transaction notification
such as commit or rollback
227
229
HTTP Transformation
Transformation type:
Passive
Connected
The HTTP transformation enables you to connect to an
HTTP server to use its services and applications. When
you run a session with an HTTP transformation, the
Integration Service connects to the HTTP server and
issues a request to retrieve data from or update data on
the HTTP server, depending on how you configure the
transformation.
231
SQL Transformation
Transformation type:
Active/Passive
Connected
The SQL transformation processes SQL queries
midstream in a pipeline. You can insert, delete, update,
and retrieve rows from a database. You can pass the
database connection information to the SQL
transformation as input data at run time. The
transformation processes external SQL scripts or SQL
queries that you create in an SQL editor. The SQL
transformation processes the query and returns rows and
database errors
232
233
Conditional lookups
By the end of this section you will know conditional
lookup:
Technique
Advantages
Limitations
234
lookup function
Unconnected lookup
Always literally unconnected from other transformations.
There are no blue data flow arrows leading to or from an
unconnected lookup
Unconnected lookup
236
Conditional Lookups
237
Return port
WARNING !
If the return port is not
defined, the lookup
function expression will
be invalid
239
Syntax: :lkp.lookupname(portname)
CONNECTED LOOKUP
UNCONNECTED LOOKUP
Heterogeneous Targets
241
Heterogeneous Targets
By the end of this section you will be familiar with:
Heterogeneous target types
Heterogeneous target limitations
Target conversions
242
Oracle table
Flat file
244
245
246
Mapplets
247
Mapplets
By the end of this section you will be familiar with:
Mapplet advantages
Mapplet types
Mapplet limitations
248
Mapplet Designer
Mapplet
Transformation
Icons
Mapplet Output
Transformation
249
Mapplet Advantages
Useful for repetitive tasks / logic
Represents a set of transformations
Mapplets are reusable
Use an instance of a Mapplet in a Mapping
Changes to a Mapplet are inherited by all instances
Server expands the Mapplet at runtime
250
251
252
Unsupported Transformations
You may use any transformation in a Mapplet except:
XML Source definitions
COBOL Source definitions
Normalizer
Pre and post-Session stored procedures
Target definitions
Other Mapplets
253
254
255
Transformation
Transformation
256
Mapplet Output
Use a Mapplet Output transformation
Define Mapplet Output ports
Mapplets must contain at least one Output
transformation
An Output transformation must have at least one port
connected to another transformation within the Mapplet
257
Mapplet Output
Group NOT mapped
259
260
Passive
Active
Multiple Passive
Mapplets can populate
the same target
instance
261
Repository Topics
By the end of this section you should be familiar with:
The purpose of the Repository Server and Agent
The Repository Manager interface
Repository maintenance operations
Security and privileges
Object sharing, searching, and locking
262
Repository Service
Each repository has an independent architecture for the
management of the physical repository tables
Components: one Repository Service
Informatica Adminconsole
Domain
Repository
Agent(s)
Repository
263
264
Repository Management
Perform all repository maintenance tasks using the
Informatica Admin Console
Maintenance tasks:
Create
Copy
Backup
Restore
Upgrade
Register
Un-Register
Delete
Notify Users
Last activity log
265
Analysis Window
Dependency Window
Output Window
266
Steps:
Create groups
Assign privileges to
groups
Create users
Assign users to
groups
Assign additional
privileges to users
(optional)
267
User Management
GROUP STRUCTURE
Groups
Users
Privileges
Administrators
Administrator
Database user
(all privileges)
Public
As defined
As defined
As defined
Use Designer
Browse Repository
Use Workflow Manager
As defined
SECURITY CONTROL
Security Access
To
Issued By
Issued To
Privileges
Repository
Super User
Administrator
Groups
Users
Permissions
Folder
Super User
Folder Owner
Administrator
Folder Owner
Owners Group
Repository
268
270
Thank You
271