Sie sind auf Seite 1von 7

c 



How do you fix the error "OCI has fetched truncated data" in DataStage 
What are the command line functions that import and export the DS jobs?

What is job control?how can it used explain with steps?

How do you do Usage analysis in datastage ? 


How to run the job in command prompt in unix?

Do you know about INTEGRITY/QUALITY stage? 


How can we write parallel routines in data stage PX, can anyone help out in this one quickly. If is any example programe
its very helpful to me 

What is APT_CONFIG in datastage 


What are OConv () and Iconv () functions and where are they used? 
What is hashing algorithm and explain breafly how it works? 
Difference between Hashfile and Sequential File? 
What are the enhancements made in datastage 7.5 compare with 7.0 
What is data file & descriptor file in dataset stage ? Plese expain briefly

How will you call external function or subroutine from datastage? 


What is the difference between datastage and informatica 
Do u know about METASTAGE? 
Why do you use SQL LOADER or OCI STAGE? 
How i create datastage Engine stop start script. Actually my idea is as below. !#bin/bash dsadm - user su - root password
(encript) DSHOMEBIN=/Ascential/DataStage/home/dsadm/Ascential/DataStage/DSEngine/bin if check ps -ef | grep
DataStage (client connection is there) { kill -9 PID (client connection) } uv -admin - stop > dev/null uv -admin - start >
dev/null verify process check the connection echo "Started properly" 

Types of Parallel Processing? 


What are the Repository Tables in DataStage and What are they?

What is the transaction size and array size in OCI stage?how these can be used? 
wht is the difference beteen validated ok and compiled in datastage. 
Differentiate Database data and Data warehouse data? 
What is the exact difference betwwen Join,Merge and Lookup Stage?? 
How to find the number of rows in a sequential file? 
Dimensional modelling is again sub divided into 2 types. 
How to implement slowly changing dimentions in Datastage? 
How to find errors in job sequence? 
What is NLS in datastage? how we use NLS in Datastage ? what advantages in that ? at the time of installation i am not
choosen that NLS option , now i want to use that options what can i do ? to reinstall that datastage or first uninstall and
install once again ? 

How do you populate source files? 


What is the difference between sequential file and a dataset? When to use the copy stage? 
Types of vies in Datastage Director? 
What are all the third party tools used in DataStage? 
Tell me the environment in your last projects 
How good are you with your PL/SQL? 
How do we do the automation of dsjobs? 
how to convert datastage server shared container to parallel shared container

If a DataStage job aborts after say 1000 records, how to continue the job from 1000th record after fixing the error? 
What are Static Hash files and Dynamic Hash files? 
How to handle Date convertions in Datastage? Convert a mm/dd/yyyy format to yyyy-dd-mm?

How can we join one Oracle source and Sequential file?. 


If data is partitioned in your job on key 1 and then you aggregate on key 2, what issues could arise? 
Where we use link partitioner in data stage job?explain with example? 
How can we implement Lookup in DataStage Server jobs? 
What is DS Administrator used for - did u use it? 
What about System variables? 
How can I connect my DB2 database on AS400 to DataStage? Do I need to use ODBC 1st to open the database
connectivity and then use an adapter for just connecting between the two? Thanks alot of any replies. 

What are the different types of lookups in datastage? 


How we can call the routine in datastage job?explain with steps? 
What are the environment variables in datastage?give some examples? 
What are constraints and derivation? * Explain the process of taking backup in DataStage? *What are the different types of
lookups available in DataStage? 

It is possible to call one job in another job in server jobs? 


What is the meaning of the following.. 1)If an input file has an excessive number of rows and can be split-up then use
standard 2)logic to run jobs in parallel 3)Tuning should occur on a job-by-job basis. Use the power of DBMS. 

What is a project? Specify its various components? 


What happens if the job fails at night? 
What Happens if RCP is disable ? 
How do u clean the datastage repository. 
What other ETL's you have worked with? 
what is the mean of Try to have the constraints in the 'Selection' criteria of the jobs itself. This will eliminate the
unnecessary records even getting in before joins are made? 

How do you pass filename as the parameter for a job?

What is difference between Merge stage and Join stage? 


Explain the differences between Oracle8i/9i? 
What is the order of execution done internally in the transformer with the stage editor having input links on the lft hand
side and output links? 

How is datastage 4.0 functionally different from the enterprise edition now?? what are the exact changes? 
What is DS Manager used for - did u use it? 
How do you track performance statistics and enhance it? 
What is Modulus and Splitting in Dynamic Hashed File? 
How can you implement Complex Jobs in datastage 
How many jobs have you created in your last project? 
what is the diffrance betwin DNS DHCP ? file system of windows? compare with nis & dns

What are Routines and where/how are they written and have you written any routines before? 
Please list out the versions of datastage Parallel , server editions and in which year they are realised. 
How to implement type2 slowly changing dimenstion in datastage? give me with example? 
How do you pass the parameter to the job sequence if the job is running at night? 
What happends out put of hash file is connected to transformer .. What error it throughs 
It is possible to access the same job two users at a time in datastage? 
What is the default cache size? How do you change the cache size if needed? 
What is Hash file stage and what is it used for? 
What r XML files and how do you read data from XML files and what stage to be used? 
How do you eliminate duplicate rows?

How to parametarise a field in a sequential file?I am using Datastage as ETL Tool,Sequential file as source. 
What are the differences between the data stage 7.0 and 7.5 in server jobs? 
DataStage from Staging to MDW is only running at 1 row per second! What do we do to remedy? 
Can we use shared container as lookup in datastage server jobs? 
What are Sequencers? 
It is possible to run parallel jobs in server jobs? 
What is the difference between Datastage and Datastage TX? 
What does a Config File in parallel extender consist of? 
What is the difference between Symetrically parallel processing,Massively parallel processing? 
My requirement is like this : Here is the codification suggested: SALE_HEADER_XXXXX_YYYYMMDD.PSV
SALE_LINE_XXXXX_YYYYMMDD.PSV XXXXX = LVM sequence to ensure unicity and continuity of file exchanges Caution,
there will an increment to implement. YYYYMMDD = LVM date of file creation COMPRESSION AND DELIVERY TO:
SALE_HEADER_XXXXX_YYYYMMDD.ZIP AND SALE_LINE_XXXXX_YYYYMMDD.ZIP if we run that job the target file names
are like this sale_header_1_20060206 & sale_line_1_20060206. If we run next time means the target files we like this
sale_header_2_20060206 & sale_line_2_20060206. If we run the same in next day means the target files we want like
this sale_header_3_20060306 & sale_line_3_20060306. i.e., whenever we run the same job the target files automatically
changes its filename to filename_increment to previous number(previousnumber + 1)_currentdate; Please do needful by
repling this question.. 

Dimension Modelling types along with their significance 


How can I specify a filter command for processing data while defining sequential file output data? 
Job type can't runnable in this plot form and view data server command error. to running the jobs this error can be
displayed.

Did you work in UNIX environment? 


How to kill the job in data stage? 
Will the data stage consider the second constraint in the transformer once the first condition is satisfied ( if the link odering
is given) 

How can we improve the performance of DataStage jobs? 


What are other Performance tunings you have done in your last project to increase the performance of slowly running
jobs? 

What is iconv and oconv functions? 


What is merge and how it can be done plz explain with simple example taking 2 tables .......

What are the Job parameters? 


Containers : Usage and Types? 
How do you merge two files in DS? 
How do you remove duplicates without using remove duplicate stage? 
what is meaning of file extender in data stage server jobs. can we run the data stage job from one job to another job that
file data where it is stored and what is the file extender in ds jobs. 

Can any one tell me how to extract data from more than 1 hetrogenious Sources. mean, example 1 sequenal file, Sybase ,
Oracle in a singale Job. 

What is data set? and what is file set? 


How can ETL excel file to Datamart? 
How we use NLS function in Datastage? what are advantages of NLS function? where we can use that one? explain briefly? 
How to handle the rejected rows in datastage? 
What is version Control? 
Importance of Surrogate Key in Data warehousing? 
What's the difference between Datastage Developers and Datastage Designers. What are the skill's required for this. 
What is the flow of loading data into fact & dimensional tables? 
How I can convert Server Jobs into Parallel Jobs? 
Is it possible to move the data from oracle ware house to SAP Warehouse using with DATASTAGE Tool. 
How can we create Containers? 
What user varibale activity when it used how it used !where it is used with real example 
How can I extract data from DB2 (on IBM iSeries) to the data warehouse via Datastage as the ETL tool. I mean do I first
need to use ODBC to create connectivity and use an adapter for the extraction and transformation of data? Thanks so
much if anybody could provide an answer. 

How to implement type2 slowly changing dimensions in data stage?explain with example?

What is the OCI? and how to use the ETL Tools? 


Where does unix script of datastage executes weather in clinet machine or in server.suppose if it eexcutes on server then
it will execute ? 

How do you eliminate duplicate rows? 


What is the purpose of exception activity in data stage 7.5? 
How to drop the index befor loading data in target and how to rebuild it in data stage? 
If the size of the Hash file exceeds 2GB..What happens? Does it overwrite the current rows? 
How much would be the size of the database in DataStage ? What is the difference between Inprocess and Interprocess ? 
How can we pass parameters to job by using file. 
What is difference between data stage and informatica 
Whats difference betweeen operational data stage (ODS) & data warehouse? 
What are Stage Variables, Derivations and Constants? 
What are the most important aspects that a beginner must consider doin his first DS project ? 
How to remove duplicates in server job 
How can you do incremental load in datastage? 
What is DS Designer used for - did u use it? 
Did you Parameterize the job or hard-coded the values in the jobs? 
What are types of Hashed File? 
What is the max capacity of Hash file in DataStage? 
What is the utility you use to schedule the jobs on a UNIX server other than using Ascential Director? 
Suppose if there are million records did you use OCI? if not then what stage do you prefer?

What is the difference between LOOK UP File Stage and LookUP stage ? 

What are the Steps involved in development of a job in DataStage? 


Defaults nodes for datastage parallel Edition 
How to use rank&updatestratergy in datastage 
What are validations you perform after creating jobs in designer. What r the different type of errors u faced during
loading and how u solve them 

Functionality of Link Partitioner and Link Collector? 


How can u implement slowly changed dimensions in datastage? explain? 2) can u join flat file and database in
datastage?how? 

How to improve the performance of hash file? 


What is job control?how it is developed?explain with steps? 
If your running 4 ways parallel and you have 10 stages on the canvas, how many processes does datastage create? 
Which one is the better partition to stages for performance in Enterprise edition? 
What is difference between serverjobs & paraller jobs 
How does DataStage handle the user security? 
1.What about System variables? 2.How can we create Containers? 3.How can we improve the performance of
DataStage? 4.what are the Job parameters? 5.what is the difference between routine and transform and function? 6.What
are all the third party tools used in DataStage? 7.How can we implement Lookup in DataStage Server jobs? 8.How can we
implement Slowly Changing Dimensions in DataStage?. 9.How can we join one Oracle source and Sequential file?. 10.What
is iconv and oconv functions? 11.Difference between Hashfile and Sequential File? 12. Maximum how many characters we
can give for a Job name in DataStage? 

How to implement routines in data stage,have any one has any material for data stage pl send to me 
I want to process 3 files in sequentially one by one , how can i do that. while processing the files it should fetch files
automatically . 

Purpose of using the key and difference between Surrogate keys and natural key 
If I add a new environment variable in Windows, how can I access it in DataStage? 
How the hash file is doing lookup in serverjobs?How is it comparing the key values? 
What is merge ?and how to use merge?

What is the difference between drs and odbc stage 


What are orabulk and bcp stages? 
What is DS Director used for - did u use it? 
Briefly describe the various client components? 
What is SQL tuning? how do you do it ? 
What is difference b/w stage and informatica ?

What does separation option in static hash-file mean? 


There are three different types of user-created stages available for PX. What are they? Which would you use? What are
the disadvantage for using each type? 

Does Enterprise Edition only add the parallel processing for better performance? Are any stages/transformations
available in the enterprise edition only? 

What are the difficulties faced in using DataStage ? or what are the constraints in using DataStage ? 
What is ' insert for update ' in datastage 
What is environment variables?what is the use of this? 
What will you in a situation where somebody wants to send you a file and use that file as an input or reference and then
run job. 

Where actually the flat files store?what is the path? 


how to improve the job performance?

Das könnte Ihnen auch gefallen