Sie sind auf Seite 1von 12

1

Data Warehousing Questions: 1. Casual dimensions can be used a) As a helper table b) For explaining why a record exists in a fact table c) For integrating data marts into a data warehouse d) For handling changes to the data. 2. Factless Fact tables are used a) For event tracking b) For handling multi valued dimensions c) As a helper table d) None of the above 3. Helper tables are used a) For handling one to man relationships b) For handling man to one relationships c) For handling many to many relationships d) None of the above !. "hich of the following is not true with for an #$%& a) 't is integrated and sub(ect oriented b) Contains current and detailed data c) Contains years of historical data d) )sed for ma*ing tactical decisions +. Conformed dimensions are used a) For handling multi valued dimensions b) For integrating data marts into a data warehouse c) For event trac*ing d) For e,plaining wh a record e,ists in a fact table -. "hich of the following statements are not true& a) A collection of data marts does not e.ual a data warehouse. b) A data warehouse contains summari/ed data. c) data warehouse contains sub!ect oriented" volatile" integrated and cleansed data# d) None of the above 0. 1etadata is associated with the following a) 2,traction 3ransformation and 4oading b) Front end access c) $ata warehouse administration d) ll of the above 5. Compound 6rowth Factor 7C6F) is associated with a) Data explosion b) $ata integration c) %calabilit d) $ata warehouse performance 8. 3he relationship between a fact table and a dimension table is a) #ne to #ne b) #ne to 1an c) $any to %ne d) 1an to 1an 19. "hich of the following is suited for large: transaction intensive applications& a) &%' ( b) 1#4A; c) H#4A; d) $#4A; 11. An Architected $ata "arehouse a) $oes not maintain historical integrit b) 's inefficient with respect to data ac.uisition c) &educes server and client processing re)uired d) $oes not store historical data

12. A fact able is li*ed to a dimension table using a) *urrogate keys b) Compound *e s c) Comple, *e s d) None of the above 13. 3he following methods can be used for handling slowl changing dimensions a) 'nserting a new field in the table b) 'nserting a new record in the table c) #verwrite the e,isting data d) ll of the above 1!. 'n a <ottom )p data warehouse architecture a) 3he data warehouse is created first and then the data marts b) +he data marts are created first and then the data warehouse c) A set of independent data marts are created d) An enterprise data warehouse is created without creating data marts 1+. 'n a 3op $own data warehouse architecture a) +he data warehouse is created first and then the data marts b) 3he data marts are created first and then the data warehouse c) A set of independent data marts are created d) An enterprise data warehouse is created without creating data marts 1-. "hich of the statements are not true& a) A data mart contains less information than a data warehouse b) A data mart covers a single sub(ect area c) $ata marts can be used for creating a data warehouse d) ,one of the above. 10. "hich of the statements are not true for 1#4A; Architecture& a) ;rocessing overhead for large input data sets is high b) 3he no of dimensions is usuall restricted to 19 or less c) *calability is good d) 't has a good user interface and functionalit 15. "hich of the following is associated with coverage tables& a) Factless fact tables b) Conformed dimensions c) Casual dimensions d) Helper tables 18. "hich of the following statements are false for surrogate *e s& a) %urrogate *e s should not be composed of natural *e s glued together. b) %urrogate *e s should ideall be numeric integers. c) *urrogate keys should contain meaningful information# d) None of the above 29. How are dimensions in a 1ulti $imensional database related& a) -ierarchically b) 'n a Networ* c) < an inverse list d) Non of the above 21. 3he process of standardi/ing the data that is e,tracted from the operational s stems is *nown as a) +ransformation b) Cleansing c) ;opulation d) $ata %taging

2
22. "hich of the following is used for calculating the Compound 6rowth Factor for an application& a) %parsit of the data b) Clustering of the data c) Number of dimensions and the number of levels in the dimension hierarch d) ll of the above 23. "hich of the following is an 1#4A; tool & a) -olos .*egate *oftware) b) <usiness #b(ects c) <rio =uer 2nterprise d) $%% %erver 71icro%trateg ) 2!. 3he term Corporate 'nformation Factor is associated with a) /ill 0nmon b) >alph ?imball c) %id Adelman d) Chuc* ?elle 2+. "hich of the following is suited for Near 4ine %torage& a) ;hoto optical storage b) %iloed tape storage c) and / d) None of the above 2-. "hich of the following statements are false for >#4A;& a) =uer performance is slower compared to 1#4A; b) %uitable for fre.uent updates c) *calability is poor d) %ummar tables are implemented in the relational database 20. "hich t pe of modeling techni.ue is associated with a data warehousing& a) Dimensional $odeling b) 2> 1odeling c) #b(ect #riented 1odeling d) All of the above 25. #4A; that access its own proprietar data storage for reporting is called as a) $ultidimensional %' ( b) >elational #4A; c) H brid #4A; d) All of the above 28. "hich of the following is not an 234 tool& a) ;ower 1art b) $ata %tage c) *cenario d) $ata @unction 39. 1a(orit of wor* involved in a data warehousing pro(ect is carried out in this phase a) >e.uirements gathering b) $ata warehouse design c) 1+' d) $ata warehouse testing 31. "hat two t pes of processing are done in an e,ploration warehouse& a) 1xploration and Data $ining b) #4A; and $ata 1ining c) 2,ploration and #4A; d) $ata 1ining and 1ultidimensional anal sis 32. $ata that is loaded into a data warehouse and never used is referred to as a) Dormant data b) c) d) %tatic data #perational data 3ime dependant data

33. "hat determines the class of an #$%& a) 3he t pe of data that is stored in the #$% b) +he speed of update synchroni2ation from the operational environment to the %D* c) 3he duration of data that is stored in the #$%. d) 3he amount of time it ta*es to update the #$% 3!. "hat is meant b a Awrin*le of timeB& a) 3he time it ta*es to populate a data warehouse b) +he amount of time that elapses between the update of a record in the operational environment and the time that the update is reflected in the data warehouse c) 3he granularit of the time dimension that is present in the data warehouse d) 3he time it ta*es to populate the data warehouse from the data staging area 3+. "hat is the metadata called that is used for the corporate information factor & a) 2,ploration metadata b) Corporate metadata c) Distributed metadata d) 'ntegrated metadata 3-. 3he process of viewing detailed data from summari/ed data is *nown as a) Drill up $rill down b) $rill through c) $rill across 30. "here in the corporate information factor is the most volume of data *ept& a) $ata "arehouse b) #$% c) $ata staging area d) ,ear line storage 35. "hat two pieces of software are needed in order to ma*e near line storage applicable to the data warehouse environment& a) Cross media storage manager and an activity monitor b) Cross media storage manager and an .uer enhancer c) Activit monitor and an anal sis software d) Activit 1onitor and an .uer enhancer 38 #$% can be classified into CCCCCCCC classes a) 2 b) 3 c) 3 d) + !9. How fre.uentl is a data warehouse refreshed& a) Hourl b) $ail c) 1onthl d) Depends on the users re)uirements !1. "hat is reach through in #4A; technolog & a) 's a means of bringing the data accessible to the end user from that which is stored in the #4A; server b) 0s a means of extending the data accessible to the end user beyond that which is stored in the %' ( server c) 3o allow $ill upDdown anal sis

3
d) 's a means of restricting the part of the data that is stored in the #4A; server +2. "hich of the following is an e,ample of a >ule 'nduction Algorithm a) 6>' b) C +.9 c) and / d) None of the above +3. Aggregating the detail data from production s stems for update of the <usiness 'ntelligence environment provides all of the following benefits e,ceptH a) &educing the length of the records written to the warehouse

!2. "hich one of the following is not an e,ample of a measure& a) ;roduct cost b) ;roduct .uantit c) (roduct name d) ;roduct price !3. <it mapped inde,ing can be used a) "hen the distinct value of a column is high b) When the distinct value of a column is low c) "hen the storage space available is less d) "hen the table has a few rows

b) c) d)

!!. 1ultiple processors sharing the same hard dis* and >A1 is called a a) 1assive ;arallel ;rocessing 2nvironment b) Clustered % mmetric 1ultiE;rocessor ;rocessing 2nvironment c) Non )niform 1emor Access 2nvironment d) *ymmetric $ulti4(rocessor (rocessing 1nvironment !+. For multi dimensional design which of the following statements are generall true a) 't is highl normali/ed b) 0t is highly de normali2ed c) 's based on an 2> 1odel d) 's based on the users re.uirements !-. "hat is derived data& a) $ata that is reconciled and stored in the data warehouse b) Data that is calculated and stored in the data warehouse c) $ata that is stored in the #$% d) Numeric data that is stored in the data warehouse !0. "hich of the following statements are true& a) $ata mining is the process of carr out multidimensional anal sis b) $ata mining is used for reporting c) Data mining is used for uncovering patterns that are present in data d) None of the above !5. "hich of the following is an e,ample of a Neural Networ* algorithm a) <a esian Networ*

%implif ing the table structures in the <usiness 'ntelligence environment >educing the amount of data written to the <usiness 'ntelligence environment 'mproving the performance of .ueries against the <usiness 'ntelligence data structures

+!. How man techni.ues are there for handling slowl changing dimensions& a) 1 b) 2 c) 5 d) ! ++. "hich of the following is a disadvantage of the Clustering 3echni.ue& a) 't is difficult to set initial parameters b) 't can be hard to interpret the final results c) 't is difficult to choose the right similarit measure d) ll of the above +-. "hich of the following is an advantage of 6enetic Algorithms& a) 't is eas to appl the results b) 't is able to handle a wide range of data t pes c) 't integrates well with Neural Networ*s d) ll of the above +0. >ule 'nduction methods: Neural Networ*s: ?Enearest Neighbor: $ecision 3rees and Case <ased >easoning fall under this classification of data mining techni.ues a) Classification b) ;rediction c) $ependenc anal sis d) $ata description and summari/ation +5. "hich of the following is a H#4A; product& a) 'nformi, I 1etacube b) $icrosoft *Q' *erver %' ( *ervices c) Cognos I ;ower;la 3ransformer d) H perion I 2ssbase +8. "hich of the following statements are not true for Clustering 3echni.ues& a) Automated cluster are eas to appl . b) 0t cannot be applied on diverse data types# c) 't can be hard to interpret the final result. d) 't is difficult to set the initial parameter. -9. "hich of the following statements are not true& a) $ecision trees perform classification without re.uiring much computation. b) $ecision trees can at best wor* on small samples of data and cannot easil approach large data sets c) Decision trees are more appropriate for estimation tasks where the goal is to predict the value of a continuous attribute# d) None of the above.

b) c) d)

>adial <asis Function 1ultila er ;erceptron ll of the above

!8. "hich of the following is an e,ample of a decision tree algorithm a) '$3 b) HuntFs Algorithm c) CA>3 d) ll of the above +9. "hich inde,ing techni.ue would the .uer optimi/er choose if the table being .ueried has a small number of distinct *e values& a) $>$A b) ;arallel data loader c) Dynamic bitmapped index d) ;arallel inde, maintenance +1. "hich of the following products perform the data cleansing process& a) 1+0 b) Gal2, c) $ata @oiner d) <usiness #b(ects

!
-1. A customer has its data warehouse loading directl from its operational s stem on a nightl basis. 3he loading process ta*es between si, and eight hours. "hich of the following actions will most li*el reduce this load time& a) 'mplement referential integrit -8. A data mart is based on an #perational $ata %tore that is updated b a batch process that starts dail at 2H99 oJcloc* am. "hat scheduling method should be used to populate this data mart& a) 1xternally based b) 'nternall based c) Cascade based d) 3ime based 09. "hich of the following tools from Cognos is a data mining tool& a) *cenario b) $ecision %tream c) A,iant d) ;ower House 01. "here will ou find measures in a data warehouse& a) $imension tables b) Fact tables c) Helper tables d) 4oo* up tables 02. A compan ac.uires a smaller competitor and needs to integrate its data into the enterprise data warehouse. "hat should be the primar concern when loading their data& a) Networ* protocols and bandwidth

b) c) d)

>educe the level of data summari/ation 'ncrease the lineDlin* speed between the two machines Create an operational data store on the warehouse machine and load from there

-2. "hich of the following access methods is li*el to ield the best performance& a) A full table scan

b) c) d)

An adEhoc .uer over a data mart pre4defined )uery over a data mart A data miner e,traction re.uest over an operational data store

-3. "here should data be stored to optimi/e departmental .uer and reporting s stems& a) sub!ect4oriented data mart

b) c) d)

An enterprise data warehouse An operational data store A transactional database

b) c) d)

$ata normali/ation and securit Data cleansing and transformation >aw data si/e and software license management

-!. A business manager needs to create a trend anal sis of sales over the last five ears. 3here is uncertaint of what business rules will affect the trending calculation. "here should the <usiness 1anager loo* to understand the information in the data warehouse& a) 3he fact table

b) c) d)

$etadata repository File la out comments %tored ;rocedure definitions

03. "hich of the following must be a function of a data e,traction and transformation tool& a) bility to retrieve data from all known database management systems b) Abilit to store the data mart database designs and ma*e those designs available to the business anal sts c) Abilit to translate data elements in the source s stems into data warehouse data d) Abilit to run on all *nown platforms and operating s stems 0!. "here can an end user .uic*l find out details about a field that is present in the data warehouse& a) 3he 234 scripts that are used for loading the data into the data warehouse. b) +he metadata that is associated with the data warehouse# c) 3he user manual for the data warehouse. d) All of the above. 0+. "hich of the following approaches should be used in the first phase of creating a data model for a data mart& a) Focus on the key elements of the business area

-+. "hat is the best tool to use for cluster anal sis& a) #4A; tool

b) c) d)

=uer tools Gisual "arehouse 0ntelligent $iner for Data

--. 3 picall : when should the selection of presentation tools ta*e place& a) After the data warehouse is built

b) c) d)

$uring the creation of the data models $uring the building of the data warehouse fter determining end user skills and needs

b) c) d)

'nclude *e elements for additional business areas )se a highl normali/ed model which allows for fle,ible additions for the future 'nclude foreign *e s for an of the compan Js business areas that are candidates for <usiness 'ntelligence

-0. "hich of the following is the fastest method for the initial load of data into a data warehouse table& a) 'mport b) 'oad utility c) Full refresh d) $ifferential cop -5. "hen moving data into a data warehouse from heterogeneous data sources: which of the following tools is appropriate& a) $ata Hub b) $ata 6uide c) Data 6oiner d) $ata ;ropagator

0-. Are most e,ploration data warehouses permanent structures& a) Kes b) ,o c) $epends on the users d) $epends on the t pe of data that is stored in it 00. "hich of the following is a t pe of #$% a) % nchroni/ed update b) %tore and forward update c) #n an unscheduled basis

+
d) ll of the above c) d) -elper table 1ultiple dimension tables

05. "here does most clic* stream data come from that is fed into the data warehouse& a) log tape b) Coo*ies c) ;rofile record d) % stem of record 08. "hat is the difference between e,ploration processing and data mining& a) +here are no assumptions and hypotheses going into exploration processing where as there are assumptions and hypotheses when going into data mining# b) 2,ploration processing is used for identif ing patterns that are stored in the data where as data mining is used for anal /ing the data. c) 'n the case of e,ploration processing users donFt *now what the are loo*ing for where as in the case of data mining users *now what the are loo*ing for. d) 3here is no difference between e,ploration processing and data mining. 59. "hat is the *e to most clic* stream records found in the corporate information factor & a) A log tape b) Cookies c) ;rofile record d) % stem of record 51. "hat is the primar interface from the corporate information factor to the web site& a) $ata "arehouse b) $ata 1art c) %D* d) $ata staging area 52. "hich of the following statements are false& a) A collection of data warehouse will e.ual a data mart. b) A data warehouse is used for ma*ing tactical decisions. c) A data warehouse is updated b transactions. d) ll of the above# 53. "hat is another name for a %atellite data mart a) Dependant data mart b) 'ndependent data mart c) %tovepipe data mart d) Function data mart 5!. "hat does the term LAdEhoc Anal sisL mean& a) <usiness anal sts access the data present in the data warehouse data from different locations. b) <usiness anal sts use data mining techni.ues for anal /ing the data. c) /usiness analysts carry out on the fly analysis#

50. 'f ou want to *now wh a record e,ists in a fact table ou would use the following& a) 4oo* up table b) Conformed dimension c) Casual dimension d) Helper table 55. Kou can use a factless fact table as aH a) s a coverage table b) As a helper table c) As a loo* up table d) All of the above 58. For a casual dimension which of the following statements are true& a) casual dimension should not change the grain of the fact table# b) A casual dimension can change the grain of the fact table. c) Casual dimension should be avoided in a dimensional model. d) Casual dimensions should be used when there are multiple facts in the fact table. 89. "hich of the following statements are false for a %tar %chema& a) 3he star schema is built for .uic* access to the data. b) star schema is highly normali2ed# c) 'n a star schema: facts are in a the fact table and the descriptions that lead to those facts are in dimension tables. d) None of the above.

81. "hich of the following is not a benefit of an #$%& a) 't integrates the transaction data. b) 't provides transaction level reporting on the data. c) 't s nchroni/es the structural differences in the data. d) ,one of the above# 82. 'n an Architected data mart environment which of the following statements are false& a) 1etadata is consistent across the data marts. b) +hey do not re)uire an enterprise data mart architecture to succeed# c) 3he are used for an incremental approach to build the data warehouse. d) $ata is consistent across the data marts. 83. "hat is a (un* dimension& a) 't contains data that is no longer useful for the end users. b) 't does not contain an numeric facts and is basicall used for lin*ing the dimension tables. c) 0t is a convenient grouping of random flags and attributes to get them out of the out of a fact table and into a useful dimensional framework# d) 't is a stand alone fact table resulting from the deletion of dimension tables. 8!. "here do degenerate dimensions usuall occur& a) 'n covering invoice numbers. b) 0n line item oriented fact table designs# c) 'n handling man to man mapping between a fact table and a dimension table. d) "hen a particular value is shared across dimensions. 8+. "hen would ou ideall use multiple fact tables& a) "hen all the fact canFt be place in a single fact table. b) 0n order to support a business with many process# c) 'n order to trac* multiple events.

d)

<usiness anal sts access the data warehouse data infre.uentl .

5+. "hich of the following statements are false for Association 3echni.ues& a) 't produces clear results. b) 't is eas to implement. c) 't produces understandable results. d) ,one of the above 5-. 'n order to handle a man to man relationship in a dimensional model ou will use which of the following& a) Casual dimension b) 4oo* up table

d) 'n order to support users that are located at different locations and re.uire high response times. d) Dimension tables describe the data that is stored in the fact table#

8-. "hich of the following statements are true with respect to aggregation& a) Aggregate data should be stored in the original fact and dimension tables that contain the detail data. b) Aggregate data has to be stored in separate fact tables but the same dimension tables can be used. c) 1ach level of aggregate data should be stored in separate set of fact and dimension tables# d) All levels of aggregation should be stored in a set of fact and dimension tables and the detailed data should be stored in separate fact and dimension table. 80. 3he term shrun*en dimension is associated with. a) ggregation b) Filtering c) 2,traction d) 4oading 85. 'n t he case of value based reporting which of the following statements are true& a) 't is beneficial to have an inde, created directl on the affected fact in the fact table. b) 3he possible upper and lower limits that are re.uired should be stored in the dimensional table that is li*ed to the fact table that contains the re.uired numeric value. c) A surrogate *e is not re.uired for lin*ing the fact table that contains the re.uired numeric attribute with the dimension table that contains the list of ranges. d) ll of the above# 88. "hich time of inde,ing benefits data warehouse applications a) Gector inde,ing b) <E3ree 'nde,ing c) /itmapped indexing d) @oin 'nde,ing 199. %tovepipe data marts are also called asH a) Architected data marts. b) 0ndependent data marts# c) $ependant data marts. d) %atellite data marts. 191. 3he most useful facts in a fact table are& a) ,umeric and additive# b) Numeric and non additive. c) Numeric and alphanumeric. d) Alphanumeric and non additive. 192. "hich of the following statements is false for disposable data marts& a) $isposable data marts are created to support a specific short lived business situation. b) 0t is a permanent structure# c) 't allows the data to be designed specificall for the re.uirement. d) None of the above 193. )suall what does a large number of dimensions indicate& a) 3hat the application is ver large. b) *everal dimensions are not at all independent and should be combined# c) 3he design of the data warehouse was not done correctl . d) 3he application should be bro*en down. 19!. "hich of the following statements are true for dimensional tables& a) +hey are the entry points into the data warehouse# b) $ata in a dimension table should not change. c) 2ver dimension table should be lin*ed to onl a single fact table.

19+. "hich of the following statements are true for conformed dimensions& a) 't is used for describing the data that is stored in the data warehouse. b) 0t is a dimension that means the same thing with every possible fact table to which it can be !oined# c) Without a strict adherence to conformed dimensions the data warehouse cannot function as an integrated whole# d) 't is used for handling the man to man mapping between the data that is present in a data warehouse. 19-. How can an open ended man valued attribute can be associated with a fact table record& a) /y using a bridge table between the dimension table and the fact table# b) < creating a new loo* up table. c) < going in for a conformed dimension. d) All of the above. 190. 3he best wa to get started on the data warehouse design is to& a) 'dentif the tools that will be used for creating the data warehouse. b) 'dentif the team that will be wor*ing on the creation of the data warehouse. c) /uild a matrix of data marts and dimensions# d) ;repare the architecture diagram for the data warehouse. 195. 'f a dimension that has to be inserted into a dimensional model does not match the grain of the model what can me done& a) Do not include the dimension# b) Change the grain declaration for the dimension# c) Change the other dimension tables to suit the new dimension table. d) Change the grain of the fact table. 198. "hich of the following statements are true& a) 3he si/e of the dimension tables is the same as the si/e of the fact table. b) +he si2e of the dimension tables is less than the si2e of the fact table# c) 3he si/e of the dimension tables is greater than the si/e of the fact table. d) 3he si/e of the dimension tables compared to the fact table varies from case to case. 119. "hich of the following statements are true for %now Fla*e %chemas a) 0t affects cross attribute browsing performance# b) 't should alwa s be used in the design of a data warehouse. c) 0t defeats the purpose of using bitmap indexes# d) All of the above 111. "hen would ou ideall use a t pe 3 %lowl Changing dimension 7b adding an addition field)& a) 0t is used when a change is tentative b) When we want to keep the tracking history with the old value of the attribute as well as the new# c) "hen we want to save on storage space. d) 't should be used when the old value of the attribute is of no importance. 112. 't is reasonable to plan on a CCCCCC percent storage overhead for aggregates as a target for the overall data warehouse& a) 09 b) 59 c) 89 d) 788

0
113. "hich of the following statements are true when it come to aggregation& a) ggregates must be stored in their own fact tables separate from the base atomic data# b) +he dimension tables attached to the aggregate fact tables must be shrunken versions of the dimension tables associated with the base fact table# c) 3he base fact table and all of its aggregated fact tables should be independent. d) Force all *Q' created by any end user data access tool or application to refer exclusively to the base fact table and its associated dimension tables# 11!. "hich of the following statements are true when it come to Neural Networ*s& a) +hey perform very well on non linear domains# b) +hey are difficult to understand# c) 3he cannot be used for e,tracting comple, trends present in the data. d) 3he are unable to derive meaning from imprecise data. 11+. "hich of the following statements are true for genetic algorithms& a) )nable to handle a wide range of data t pes. b) )sed widel in commercial pac*ages. c) 9sed for generating scoring functions for $emory /ased &easoning# d) 9sed as an embedded optimi2ation engine in scheduling packages# 11-. "hich of the following statements are true for a star schema& a) A star schema is highl normali/ed b) 0t is built for simplicity and speed# e) +he star schema is built for )uick access to the data# c) All of the above 110. "hich of the following statements are true& a) Decision trees perform classification without re)uiring much computation# b) $ecision trees can at best wor* on large data sets c) $ecision trees are more appropriate for estimation tas*s where the goal is to predict the value of a continuous attribute. d) Decision trees provide a clear indication of which fields are most important for prediction and classification# 115. "hich of the following factors influence the 234 architecture& a) 3he t pe of end user access tool that will be used. b) 3he t pe of anal sis that will be carried out on the data warehouse. c) :olume at each data warehouse component# d) Complexity of the process at each stage# 118. "hich of the following statements are false for a data warehouse data modeling techni.ue& a) 't should accommodate changes to the business rules. b) 0t should be influenced by the operation processes that create the data# c) 't should be possible to integrate data from additional sources over time. d) 't should provide unbiased data that can subse.uentl be filtered to meet specific ob(ectives. 129. "hich of the following statements are true for >#4A;& a) Query performance is slower compared to $%' ( b) 't is not suitable for fre.uent updates c) *calability is good d) *ummary tables are implemented in the relational database 121. After which stage would ou carr out product selection and installation for a data warehouse& a) b) c) d) +echnical architecture design stage# $imensional modeling. ;h sical design. >e.uirements definition.

122. After which stage will ou carr out the data staging design and development& a) $imensional modeling b) (hysical design c) 3echnical architecture design d) >e.uirements definition 123. "hich of the following statements are true for Association 3echni.ues& a) 0t produces clear results# b) 't is difficult to implement. c) 0t produces understandable results# d) None of the above 12!. "hat is an audit dimension generall used for& a) 't is used for locating data that is stored in the data warehouse. b) 't is used for storing metadata such as the data sources that are used for populating the data warehouse. c) 't is used for *eeping trac* of the users who access the data warehouse. d) 0t is used for keeping track of details such as when the data was loaded into the data warehouse and the total number of records extracted# 12+. After which stage will ou carr out the ph sical design for the data warehouse& a) Dimensional modeling b) 3echnical architecture design c) >e.uirements definition d) ;roduct selection and installation 12-. "hat is the use of a data staging area& a) 't serves as a storage area on which end users can carr out anal sis. b) A data staging area serves the same purpose as an #$%. c) 0t serves as a storage area where the source data can be prepared for loading into the data warehouse# d) All of the above. 120. 3he process of transforming involves the following& a) (urging selected fields from the legacy data that is not useful for the data warehouse# b) Creating surrogate keys for each dimension record in order to avoid a dependence on legacy defined keys# c) 'nde,ing the data that is loaded into the data warehouse. d) All of the above. 125. "hich of the following is true for dimensional modeling& a) 't cannot be easil e,tended to accommodate une,pected new data elements. b) 0t is a predictable and standard framework# c) 't is difficult to design a data warehouse based on a dimensional model. d) +he framework of the star !oin schema withstands unexpected changes in user behavior# 128. "hat could be the reason for a data model not being able to easil accommodate new sources of data& a) +he data has prematurely been aggregated# b) 3here are too man dimensions in the data model. c) 3he data model is too comple,. d) All of the above. 139. "hat are the t pes of metadata that are available&

5
a) b) c) d) <usiness metadata %perational metadata +echnical metadata Anal tical metadata c) d) 't is usuall specific to the metadata management tool that is being used. All of the above.

131. "hat are the levels of metadata that are available& a) 'ntermediate b) /asic c) Advanced d) Core 132. "hich of the following statements are true for an #$%& a) 't addresses anal tical needs. b) 0t synchroni2es the structural differences in the data c) +he update schedule is either daily or less time fre)uency# d) $etail of data is mostl *ept for 89 to 159 da s. 133. "hich of the following statements are true& a) #nl a data warehouse is sub(ect oriented and not an #$%. b) A data warehouse is used for tactical decisions where as an #$% is used for strategic decisions. c) A data warehouse and an #$% contains current and detail data. d) %nly an %D* is updated by transactions and not a data warehouse# 13!. "hat is the first step that is carried out in a bottom up implementation approach for a data warehouse& a) n enterprise data mart architecture is developed# b) An initial sub(ect areas is selected for the first architected data mart c) 3he enterprise data warehouse is architected and an initial sub(ect area is selected for the data mart. d) %imultaneousl start wor* on two or more architected data marts based on the users re.uirements. 13+. 'n a non architected data mart environment the following is true& a) 0t could result in multiple business rules# b) 3he cost of developing the data marts will be high. c) A single e,traction processes can be used for populating the data marts d) *emantics across the data marts could be different# 13-. "hen comparing >#4A; with 1#4A3 which of the following statements are true& a) )ser interface and functionalit is good in >#4A; and normal in 1#4A;. b) 1#4A; and >#4A; stores details and summari/ed data. c) 3he common access language for >#4A; and 1#4A; is %=4. d) &%' ( support for a large number of users is good where as there is limited support in the case of $%' (# 130. "hen summari/ing multi valued attributes facts in a man to man relationship scenario what should be used& a) <alance weight b) Weighting factor c) Common ratio factor d) Additive weight 135. 3he ?Emeans Algorithm is used for a) Association b) Clustering c) >ule induction d) $ecision trees 138. "hich of the following statements are true for content simplification metadata& a) 0t is usually specific to the front end tool that is being used# b) 't is usuall specific to the 234 tool that is being used.

1!9. 3he Ashared ever thingB architecture means %1; machines are well suited for& a) Canned .ueries b) d hoc )ueries c) Canned and ad hoc .ueries d) None of the above. 1!1. Apriori principle is used in the following techni.ue a) ssociation b) Clustering c) >ule induction d) $ecision trees 1!2. "hich of the following is not true when it comes to dimensional modeling& a) 'mplementing a dimensional data model will lead to a stovepipe decision support s stem. b) $imensional models onl wor* with retail databases. c) %nowfla*ing is an alternative to dimensional modeling. d) ll of the above 1!3. 3he most useful fact table grains are& a) 0ndividual transactions b) %ummari/ed operational data c) 'ine items from control documents like invoices d) $erived operational data. 1!!. Hunts Algorithm is an e,ample of a CCCCCCCCCCC algorithm a) Decision tree b) Neural Networ* c) >ule 'nduction d) 6enetic Algorithm 1!+. >egression anal sis: >egression trees: Neural Networ*s and ?Enearest Neighbor fall under this classification of data mining techni.ues a) Classification b) (rediction c) $ependenc anal sis d) $ata description and summari/ation 1!-. "hich of the following statements are true for artificial attributes& a) 3he are derived from the operational data. b) +hey do not exist in the current business# c) +hey can be used when dimension need to be combined together# d) 3he are used for describing the data that is stored in the data warehouse. 1!0. "hat is an atomic data mart& a) A data mart that periodicall undergoes changes. b) data mart that stores the most detailed data# c) A data mart that is used for feeding the data warehouse. d) A stand alone data mart. 1!5. "hich of the following is a characteristic of an #$% a) *ub!ect oriented b) Non volatile c) 0ntegrated d) Contain detail data 1!8. %;>'N3 is an e,ample of a CCCCCCCCCCC algorithm& a) >ule 'nduction b) Decision tree c) Association d) Clustering

8
1+9. Correlation anal sis: >egression anal sis: Association rules: <a esian Networ*s and 'nductive logic programming fall under this classification of data mining techni.ues a) Classification b) ;rediction c) Dependency analysis d) $ata description and summari/ation 1. "hat is a $ata 1art& a) $atabaseJs used b a single business anal st b) $atabaseJs used b the whole business organi/ation c) %caled down version of a $ata "arehouse usuall developed to solve a particular business problem d) A LviewL of the $ata "arehouse created within the database management s stem "hat is $ata "ebhouse& a) An Active $ata "arehouse b) A $ata "arehouse which has the data feed from the 'nternet log and related data: and the information from the $ata "arehouse web enabled c) A collection $ata 1arts interEconnected as a "eb d) A #perational $ata %tore: $ata 1art and $ata "arehouse is called a $ata "ebhouse $ata 1art is a) *ingle *ub!ect %riented Data Warehouse b) A collection of $ata "arehouse c) An Application on a $ata "arehouse d) None of the Above "hich one of the following is N#3 an e,ample of #perational % stems& a) #rder 3rac*ing Applications: such as catalog sales b) Customer service applications: such as setting up customer accounts c) <an*ing functions: such as deposits and withdrawals d) *ales forecasting applications "hich la er of the data warehouse architecture does an end user directl deals with a) %taging la er b) 2,ternal data la er c) 0nformation access layer d) $ata warehouse la er "hat should be the primar source of data for a data mart& a) subset of the data warehouse created in the database management system b) $ata e,tracted from the target s stems c) 3he operational databases d) $ata e,tracted from the data warehouse databases "here is 1eta data usuall store& a) "ord ;rocessing documents b) %pread sheets c) 0nformation &epository d) All the above "hich of the following are the e,amples of dimensions& a) Customers b) 3ime c) ;roduct d) ll the above 3he process of populating a data warehouse is called a) 'oading b) 2,tracting c) d) 3ransforming None of the Above

19. Architected $ata "arehouse a) &educes server and client processing re)uired b) $oes not maintain historical integrit c) 'nefficient data ac.uisition d) None of the Above 11. "hich one of the following are called J%tove ;ipe % stemsJ& a) $ata"arehouse b) $ependent $ata 1arts c) 0ndependent Data $arts d) #perational $ata %tores 12. "hat is the advantage in going for a <ottom )p approach in building a 2nterprise $ata "arehouse a) $ata Cleaning is not re.uired b) 1nsures that all departments get a Data $art c) >eturn on 'nvestment can be felt at the earliest d) $ata 4oading is Faster 13. "hich of the following must be a function of a data 2,traction and transformation tool& a) bility to retrieve data from all known database management systems b) Abilit to store the data mart database designs and ma*e those designs available to the business anal sts c) Abilit to translate data elements in the source s stems into data warehouse data d) Abilit to run on all *nown platforms and operating s stems 1!. 3he process of re.uesting detailed information a) $rill up b) $rill left c) Drill down d) $rill right 1+. "hat is the difference between a $ata "arehouse or $ata 1art and an #perational $ata %tore& a) n operational data store contains more current data than either a Data Warehouse or a Data $art b) An operational data store and a data warehouse or data mart trac* different sub(ect areas in the organi/ation c) An operational data store is a cop of the data warehouse or data mart d) 3he operational data store tends to be larger than the data warehouse or data mart 1-. $ata "arehousing Characteristics 7i) %ub(ect #riented 7ii) Non Golatile 7iii) 3ime Gariant 7iv) 'ntegrated 7v) 3ime 'nvariant a) 7iii) M7i)M7iv)M7v) #nl b) 7i)M7ii)M7iv)M7v) #nl c) .i);.ii);.iii);.iv) %nly d) All 10. "hich process loads the data from heterogeneous source s stems to the data warehouse a) Cleaning b) 1ining c) 1+' (rocess d) None of the Above 15. "hat does the term LAdEhoc Anal sisL mean& a) <usiness anal sts access the $ata "arehouse data from different locations. b) <usiness anal sts use sampling techni.ues c) /usiness analysts start )uery and analysis on the fly

2.

3.

!.

+.

-.

0.

5.

8.

19
d) <usiness anal sts access the $ata "arehouse data infre.uentl a) b) c) d) Creation of large" detailed transaction level reports L"HA3 'FL anal sis L%licing and $icingL of the data with drill down when something interesting is found. 3ime series anal sis

18. %parse $ata in #4A; Cube indicates a) $issing Data b) $ata >epetitions c) Neroes d) >are $ata 29. "hat is an #perational % stem& a) n application system that supports the organi2ation<s day to day activities b) An application s stem that trac*s and manages the financial assets of the organi/ation c) An application s stem that supports the creation of products7s) that the organi/ation mar*ets d) An application s stem that supports the planning and forecasting within the organi/ation 21. "hich of the following are the advantages of creating individual marts and then rolling them up into a central warehouse& 1)=uic* %uccesses 2)>apid protot ping of data transformations 3)$uplication of data a) 1 onl b) 7 and = only c) 1: 2 and 3 d) None of the Above 22. $isadvantage of data mart is a) $oes not provide integrated view of information b) )ncontrolled proliferation results in redundanc c) 1ore number of $ata 1arts are comple, to maintain d) ll of the above 23. "hich of the following statements correctl describe a dimension table in $imensional 1odeling& a) $imension tables do not contain numeric fields b) $imension tables do not need s stemEgenerated *e s c) $imension tables usuall have fewer fields than fact tables d) Dimension tables contain fields that describe the facts 2!. "hat is the most common use of a multi dimensional database 71$$<)& a) ccess pre determined aggregated data across several dimensions b) Access huge data warehouses c) Access application pac*ages d) As the onl t pe of database management s stem used for a data warehouse 2+. 3able $enormali/ation 7a)'mproves =uer performance 7b) $uplicates 'nformation a) #nl 7a) 3rue b) #nl 7b) 3rue c) /oth .a) and .b) +rue d) None 2-. <it 1apped 'nde,ing can be usedO a) "hen the distinct value of a column is high b) When the distinct value of a column is low c) "hen the storage space available is less d) "hen the table is having ver few rows

25. %cheduling of various tas*s needed to build and maintain a data warehouse is ta*en care of b a) %taging la er b) (rocess management layer c) 'nformation access la er d) 3ransformation la er 28. "hat is an #perational $ata %tore 7#$%)& a) A set of database that support reporting from an application s stem b) set of databases that provide integrated operations data to serve the organi2ation<s day to day activities c) A set of database to provide operational data for a single department d) A set of databases that support #4A; 39. #4A; that accesses the raw data l ing in the >$<1% for reporting is called as a) 1ultidimensional #4A; b) &elational %' ( c) H brid #4A; d) All the above 31. $ata "arehouses and $ata 1arts assist a) >eports to regulator agencies b) Audit reporting c) Decision *upport d) Accounting >eporting 32. #perational $ata %tore 7#$%) Characteristics 7a) %ub(ect #riented 7b) Golatile 7c) Current or Near Current collection of data 7d) integrated a) 7b)M7a) b) .a);.b);.c);.d) c) 7d)M7c) d) 7a)M7b)M7c) 33. "hat is a Fact 3able in a $ata "arehouse terminolog & a) An 3able that has the histor of a <usiness. b) 3able that contains 3ime related data c) An 3able present in a $ata "arehouse 3able d) +able that contains measurable data# 3!. "hat is the difference between a $ata "arehouse or $ata 1art and an #perational $ata %tore& a) n operational data store contains more current data than either a Data Warehouse or a Data $art b) An operational data store and a data warehouse or data mart trac* different sub(ect areas in the organi/ation c) An operational data store is a cop of the data warehouse or data mart d) 3he operational data store tends to be larger than the data warehouse or data mart 3+. "h data in a $ata "arehouse called as 3ime Gariant& a) <ecause data in the data warehouse is accurate as of some moment in time b) <ecause ever *e structure in the data warehouse contains E implicitl or e,plicitl Ean element of time: such as da : wee*: month: etc. c) .a);.b) d) #nl 7a)

20. "hich of the following is N#3 a t pe of process t picall done b a #4A; tool&

11
b) c) d) Decision support system #perational s stem 3ransactional s stem

3-. $ata "arehouse is a) Collection of Histor $ata b) =uer Centric c) $ecision %upport % stem d) ll the above 30. 2nterprise $ata warehouse contains a) #nl detailed data b) #nl summari/ed data c) Detailed and summari2ed data d) None of the Above 35. "hat is %now Fla*e %chema in a $atabase design& a) +he Dimension +ables have a Foreign >ey +able# b) 3he $imension 3ables do not have a Foreign ?e 3able. c) 3he Fact 3able has one $imension 3able d) 3he $imension table is #nl of 3ime 38. 'n order to be successful at decisionEma*ing: what does an organi/ation need& a) 2,perienced decision ma*ers b) de)uate and timely data c) A corporate strateg d) All the above !9. Non Architectured $ata 1arts are also *nown as a) 4egac $ata 1arts b) 4ega 1arts c) NonE'ntegrated $ata 1arts d) ll of the above !1. 'n building a data warehouse -9 E 59 P of wor* is re.uired in which stage& a) $atabase $esign b) 1+' deployment c) #4A; deplo ment d) Cleaning !2. Categories of #4A; 3oolsH 4evel 1E <asic .uer and displa of dataQ 4evel 2E 4evel 1 R advanced selection and arithmetic operationsQ 4evel 3E 4evel 1 and 4evel 2 R sophisticated data anal sis techni.ues. "hich of the following is an e,ample of a process a) $ispla a report based on specific selection criteria b) Drill down to another level of detail c) Calculate a rolling average on a set of data d) $ispla the top 19 items that meet a specific selection criteria !3. 3he Fact 3able is related to a $imension 3able in the $imensional 1odeling b a) #ne to 1an b) $any to %ne c) #ne to #ne d) 1an 3o 1an !!. "hich of the following are considered to be advantages while building a data warehouse& a) 3he abilit to access 2nterpriseE"ide data b) 3he abilit to have consistent data c) 3he abilit to perform anal sis .uic*l d) ll the above !+. AdEhoc access path: low transaction volume: low number of users are the characteristics of which s stem a) 'nformational s stem

!-. 3he *e performance indicators of an enterprise are a) $imension attributes b) Facts" $easures c) %ummar data d) $etailed data !0. "hat is an Active $ata "arehouse& a) A ;roduction $ata "arehouse b) Close4coupled %'+( and Data Warehouse c) CloseEcoupled $ata 1arts d) None !5. 2,ecuting a decision support .uer against an operational s stem usuall results in a) No change in performance b) Degradation in performance c) 'mprovement in performance d) None !8. $ata can be cleaned 7a)at the source7b)during transformation7c)in the $ata "arehouse a) #nl 7a) b) #nl 7b) c) Could be by all methods or any one of the methods based on the 1nvironment of Data Warehouse *etup d) #nl 7c) +9. "hat modeling techni.ue would be used to design a specific database that will be implemented with star schema& a) #b(ect 1odeling b) 2ntit >elationship 1odeling c) Dimensional $odeling d) None of the Above +1. 3he person who defined rules for #4A;7#n4ine Anal tical ;rocessing) is a) <ill 'nmon b) >alph ?imball c) C @ $ate d) 1 F Codd +2. "hat is a $imension 3able in $ata "arehouse terminolog & a) An 3able present in a $ata "arehouse 3able b) +able of members" positions" or units of the same type which is used as categories by which data is analy2ed c) An 3able storing measurable unit d) 3able storing $ata t pe that has a value to be anal /ed +3. 'n which schema onl primar dimension tables are (oined to fact tables& a. %tar %chema b# *now flaked schema c. <oth star schema and snow fla*ed schema d. None of the Above +!. "hat is the substantial benefit of implementing a $ata "arehouse or $ata 1art& a) 'mproves the morale of the organi/ationJs *nowledge wor*ers b) ;rovides man new tools to the e,ecutives within the organi/ation

12
c) d) 'ncreases technical *nowledge within the e,ecutive ran*s of the organi/ation because the will have to learn to use a computer 0mproves decision4making within the organi2ation -!. "hich one of the following is N#3 an advantage of brea*ing a $ata "arehouse up into smaller $ata 1arts a) $ata 1arts are guaranteed to have shared fields b) All data transformations from sources are common c) 0t takes longer to develop the Data Warehouse due to increased time for advanced planning and design d) None of the Above -+. "hich of the following statements are true about #4A;& a) Ad Hoc >eporting 3ool b) Has $rill through: %lice M $ice facilit c) <uilt on a $ata "arehouseD$ata 1art d) ll the above

++. #4A; that accesses its own proprietar $ata %torage for reporting is called as a) $ultidimensional %' ( b) >elational #4A; c) H brid #4A; d) All the above +-. Characteristics of dimension tables are a) Contains a primar *e b) Has one to man relationship to the fact table c) Contains other attribute columns that are useful for levels of aggregation d) ll the above +0. 3he t pe of 1odeling used for the $atabase designing in %tar %chema is a) 2> 1odeling b) Dimensional $odeling c) An 1ethod d) All the methods +5. 3he following statement fits for $ata 1ining 7a) 'tFs an ad hoc reporting tool 7b) 'tFs a technolog to find Hidden patterns in huge data a) #nl 7a) b) %nly .b) c) <oth 7a) and 7b) d) None +8. $egenerate dimensions can be represented as a) n entry in the fact table without an associated dimension table b) $imension table that contain all attributes that are necessar to provide .uer values c) An entr in the fact table with an associated dimension table d) None of the Above -9. >2$<>'C? and 32>A$A3A databases are mostl used for a) #43; applications b) %' ( applications c) <oth #43; and #4A; applications d) ## $atabase applications -1. "hich one of the following is N#3 an e,ample of measures& a) #rder =uantit b) (roduct name c) $ollar Galue d) 'nventor Count -2. "hich of the following is a problemDanal sisDstud that could be assisted b using Clustering $ata 1ining 3echni.ue a) 1ar*et <as*et Anal sis b) Credit Card Fraud $etection c) Campaign $arketing d) 3ime %eries Anal sis -3. "hich one of the following is the complete general process in the order involved in building a $ata "arehouse& a) 2valuate $ata ES2,tract $ata ES%tore $ata b) 2,tract $ata ES2valuate $ata c) 1xtract Data4?*tore Data 4?1valuate Data d) None of the Above

Das könnte Ihnen auch gefallen