Sie sind auf Seite 1von 24

Topic Questions

Option (A)

Option (B)

Can be updated by end users

Contains numerous naming


conventions and formats.

DM

A data warehouse is which of the


following?

DM

A star schema has what type of relationship


between a dimension and fact table?
Many-to-many

One-to-one

DM

Fact tables are which of the following?

Completely demoralized

Partially demoralized

DM

A snowflake schema is which of the


following types of tables?

Fact

Dimension

DM

A goal of data mining includes which of the To explain some observed event
following?
or condition
To confirm that data exists

DM

OLAP databases are called decision support


system ?
TRUE

FALSE

DM

In Star Schema Dimension tables are

Short and Fat

Long and Thin

DM

The data in Data Warehouse is generally

Clean Data
Ralph Kimball believes that
portions of data can be
combined based on relevance of
data and can be used for
reporting

Dirty Data
Inmon believes that portions of
data can be combined based on
relevance of data and can be
used for reporting

DM

Choose two

DM

In which type of SCD(Slowly changing


dimensions) do we preserve history of data: Type One

Type Two

ETL

During ETL load we generally have

Unsorted data for Aggregator

Sorted data for Aggregator

DM

Sequence of jobs to load data in to


warehouse

First load data into fact tables


then dimension tables, then
Aggregates if any

First load data into dimension


tables, then fact tables, then
Aggregates if any

DM

Snowflaking means

Normalizing the data

Denormalizing the data

Drill Across generally use the following


OLAP join to generate report

Self Join

Inner Join

DM

In general data in Data Warehousing is

Normalized

Denormalized

DM

Consolidated data mart is

First level data mart

Second level data mart

DM

In 4 step dimensional process, declaring


gain of business process is

First Step

DM

Dimensions are Confirmed when

They are different

Second
They areStep
either same or one is
subset of another

DM

You need to create an index on the SALES


table, which is 10 GB in size. You want
your index to be spread across many
tablespaces, decreasing contention for
index lookup, and increasing scalability and
manageability.Which type of index would
be best for this table?
bitmap

DM

Which of the following statements is true?

unique

A data warehouse is valuable


A data warehouse is useful to all only if the organisation has an
organisations that currently use interest in analysing historical
OLTP's
data.

Analytical processing is

the act of using software to


analyse highly consolidated
data, often to view the changes
over time.

the act of exporting data into a


spreadsheet for analysis

DM

Which of the following statements is true?

The fact table of a data


warehouse is the main store of
descriptions of the transactions
stored in a DWH

The fact table of a data


warehouse is the main store of
all of the recorded transactions
over time.

DM

Which of the following is associated with a


data warehouse
A relation

DM

A flat file

Which of the following statements is true?

A data warehouse
automatically makes a copy of
The more data a data warehouse every transaction recorded in
has, the better it is
an OLTP system

A data warehouse

must import data from


transactional systems whenever
significant changes occur in the takes regular copies of
transactional data.
transaction data

DM

Granularity refers to

The number of fact tables in a


data warehouse

DM

Dimensionality refers to
The main organisational justification for
implementing a data warehouse is to
provide

DM

DM

DM

OLAP OLTP stands for

The level of detail of the data


descriptions held in a data
warehouse

The level of detail of data that is The data that describes the
held in the fact table
transactions in the fact table.
lagre scale transaction
processing

Cheaper ways of handling


transactions

DM

Data in a data warehouse

On Line Transaction Processing On Line Terminal Protocol


must be in normalised form to
in a flat file format
at least 3NF

DM

A data warehouse needs to be

time varient

DM

Transaction processing is

Subject orientated
the act of analysing each
the act of processing individual transaction to verify that it is
transactions
valid
On Line Analytical Protocol

On Line Abstraction
Processing

Attributes

Entity identifier

DM

What is a technique for documenting the


relationships between entities in a database
environment?
Attributes

Entity identifier

DM

What indicates having the potential to


contain more than one value for an attribute
at any given time?
Constraint

Single-valued

DM

Which relationship is between two entities


in which an instance of entity A can be
related to zero, one, or more instances of
entity B and entity B can be related to zero,
one, or more instances of entity A?
One-to-many relationship

One-to-one relationship

OLAP OLAP stands for


What is a formal way to express data
relationships to a database management
DM
system?

DM

Which of the following uses a series of


logically related two-dimensional tables or
files to store information in the form of a
database?
Database

All of the following terms describe OLAP, The gathering of input


OLAP except
information

Database management system

Processing input information

DM

Which tool is used to help an organization


build and use business intelligence?

Data warehouse

Data mining tools

DM

What does the data dictionary identify?

Field names

Field types

DM

DM

Which of the following is a data


manipulation tool?
File generators
When gathering business information
requirements, you should focus only on the
requirements provided by the business
groups.
TRUE

Query by example tool

FALSE

One difference between the design of


online transaction processing (OLTP) and
online analytical processing (OLAP)
systems is that the OLTP system design is
OLAP optimized for getting data into the database. TRUE

FALSE

DM

Designing a data warehouse in first normal


form (1NF) is not recommended.
TRUE

FALSE

DM

Cardinality is defined as the number of


relationships existing between entities.

TRUE

FALSE

DM

It is not important to include metadata when


designing a data warehouse.
TRUE

FALSE

DM

There is no need to include a time


dimension in the data warehouse.

FALSE

TRUE

DM

The level of granularity you choose for the


time dimension has no significant impact on
the size of your database.
TRUE
Surrogate keys are generated on tables in
the data warehouse after the table is
populated.
TRUE

DM

To improve performance, all tables in the


data warehouse should be indexed.

TRUE

FALSE

DM

Fact tables are often referred to as the


measures of business performance.

TRUE

FALSE

TRUE

FALSE

TRUE

FALSE

TRUE

FALSE

DM

DM

Dimension tables are used to provide


descriptions of the business subjects and
descriptive information about each row in
the fact table.
A high level of granularity means more
detail; a low level of granularity mean less
detail.

DM

One method of managing the history in


dimension tables is to drop the dimension
and rebuild the table from scratch.

DM

FALSE

FALSE

DM

You do not need to be concerned with


maintaining the history of changing data in
the dimension tables.
TRUE
Effective use of summaries is the best
technique for improving performance in
data warehouses.
TRUE

DM

Summary data cannot be combined with


detailed fact data.

TRUE

FALSE

DM

When choosing a level of summarization,


there are two approaches: summarizing the
entire dimension, or summarizing part of
the dimension and partially improving
performance
TRUE

FALSE

DM

Table partitioning splits the storage of a


table into smaller individual units.

TRUE

FALSE

DM

Denormalization is the factor that increases


the sparseness in a database.
TRUE

FALSE

DM

FALSE

FALSE

What are the actual data values that occupy


the cells as defined by the dimensions
OLAP selected?
Nesting

Aggregation

The term that defines filtering data in an


OLAP OLAP cube is ___________ .

dicing

slicing

What is an item that matches a specific


OLAP description or classification?

Category

Measure

The cube structure in OLAP achieves the


OLAP __________ functionality.

shared

information

Aggregation provides OLAP with


OLAP __________

multidimensional data

pre-calculated data

OLAP What is the acronym that defines OLAP?

FHTMI

FASMI

__________ in OLAP allows you to define


OLAP a subcube of the original space.
Dicing

Slicing

What term in OLAP defines changing the


dimensional orientation of the report from
OLAP the cube data?

Dicing

Slicing

The _________ in OLAP enable you to


drill-up or drill-down to view different
OLAP levels of detail about your data.

dimensions

measures

select multiple cube


OLAP When you nest in OLAP, you _________ . aggregations
A process that transforms
information using a common set
ETL Which of the following describes ETL?
of enterprise definitions

DM

What is data mining

A particular attribute of
information

select multiple cube measures


A process that loads
information into a data
warehouse

The common term for the


representation of
multidimensional information

DM
DM

A collection of related data fields is called a


____.
byte
interface between the database
and application programs

record

DM

A DBMS is a(n) ____.


A(n) ____ is a generalized class of people,
places, or things for which data is collected,
stored, and maintained
record
Which attribute would make the best
primary key?
Social security number

DM

The ____ data model follows a treelike


structure.

DM

The most popular database model currently


in use is the ____.
relational model

hierarchical model

DM

A primary key is a field or set of fields that


uniquely identifies a record.
TRUE

FALSE

DM

One of the goals of a DBMS is to increase


data redundancy thereby making it less
vulnerable to hackers.

TRUE

FALSE

DM

A Data Warehouse would most likely be


part of a(n)

ERP System

Small MIS System

DM

Data Mining would most likely be used

to streamline a Transaction
Processing System

to model data in a DBMS

DM

Which of the following is a valid key field

A Book Title

House number + Street Name

DM

A Table

Can only store data of one type Consists of Alphanumeric data

DM

A RDBMS cannot store data without


knowing the data type. Which of the
following statements are true?

A Logical data type can store


three values, TRUE,
UNKNOWN and FALSE

DM

A FLAT FILE database management


system is

A database design that only has A DBMS that can only have
one table in it
simple data tables in it

DM

Assume you are extending the design of


The College Student Records System to
include details on each classroom. The
college is never likely to have more than ten
classrooms and definitely not ever going to
have more than 25 classrooms. What data
type would you select
Numeric - Byte

DM

OLAP A report must

distributed

data repository

entity
Last name
hierarchical

Numerical data can be stored


in different formats

Numeric - Single

be exported to a word processor be based on an underlying data


for printing
source (a table or a query)

The layout of a report is independant of the


OLAP number of records held in a table or query True

OLAP A report is used to

DM

The rule that prohibits transitive


dependencies is

FALSE
produce output that is ready
produce output that is ready for for publication on the Web
e-mailing
(HTML)

third normal form

first normal form

DM

The rule that requires that each non-key


field (attribute) should be fully functionally
dependent on the primary key is
Third Normal Form

First Normal Form

DM

The rule that specifies that there should be


no repeating fields and that fields should be
atomic is
third normal form

second normal form

DM

The process of combining two tables in a


relational database is known as

a Join

a Combine

DM

The ER model is meant to

enable low level descriptions of


data
replace relational design

DM

The Entity Relation Model models

Entities

Relationships

DM

Which of the following statements best


decribes the function of an entity relation
model?

An ER model provides a view


of the logic of the data and not
the physical implementation.

An ER model is concerned
primarily with a physical
implementation of the data and
secondly with the logical view

DM

SQL stands for

Sequential Question Language

Structured Query Language

DM

Which of the following are elements of


SQL?

Data Query Language

Data Definition Language

DM

Consider the table (STUDREC). Which of


the following statements will list columns
INIT, SNAME, GENDER and KIDS (in
that order) for all students who have more
then 1 child.

SELECT init, sname, gender,


kids FROM studrec WHERE
kids <1;

SELECT init, sname, gender,


kids FROM studrec WHERE
kids >1;

OLAP A typical data warehouse consists of

Staging area

What are the three layers of Data


OLAP warehouse architecture?

Data Marts
Data Modelling layer, Data
Data staging layer, Data Extract Accesses layer, Data Storage
layer, Data transactional layer
layer

OLAP Staging Area comes under which layer?

Data Storage layer

Data Access layer

What are Limitations of Traditional


OLAP techniques ?

Extensive programming

Redundant reporting

OLAP Different categories of Data Access are?


OLAP OLAP stands for

Web Access
Online Access Processing

Data Mining
Online Analytic Processing

A process that uses a variety of statistical


and artificial intelligence frameworks to
OLAP discover patterns and relationships in data Data Access Process
A category of data access solutions in
which information is viewed through a web
OLAP browser
Data Access Process

OLAP What is importance of Data Access?

Businesses today face


challenges like

Data Mining Process

Data Mining Process


Data Access is the last mile
that enables decision makers to

Enterprise Data Warehouse


OLAP What are different types of reporting?
Transaction Systems Reporting Reporting
In Transaction Systems Reporting,
Reporting Tool has a native connectivity to
OLAP ?
Views
Tables
To provide consolidated and
An enterprise data warehouse (EDW) is
To combine data from multiple cleansed data to an array of
OLAP designed to
OLTP systems
data marts
OLAP Examples of Managed Query Tool
Business Objects
MS Query
Multidimensional viewing
Time Intelligence - Time
OLAP Which are the OLAP features ?
Capabilities
Series analysis
Relatively standardized and
simple queries returning
OLAP OLAP system is
Decision support
relatively few records
OLAP What is measure?
Is not a number
represents factual data
Data is stored in
Support for large databases
OLAP What is ROLAP?
multidimensional cubes
with good performance
SQL Which one is DDL command?
Insert
Update
How many types of Normalization rules are
SQL there?
4
5
SQL Which are pseudocolumns
CURRVAL
NEXTVAL
Can you use select in FROM clause of SQL
SQL select ?
YES
NO
It allows you to associates a
It allows you to associate a
Describe the use of %ROWTYPE in
variable with a single column
variable with an entire table
SQL PL/SQL ?
type
row
SQL How many types of triggers are there?
9
10

SQL

What is the default ordering of an ORDER


BY clause in a SELECT statement?
Descending

SQL

Union All returns

Ascending

All rows selected by either


query

All rows selected by either


query and including duplicates
ETL is the set of processes by
which data is extracted from
various sources and loaded
into target systems
High speed loading of target
data warehouses

ETL

What is ETL process?

ETL

What is Importance of ETL?

ETL is the set of processes by


which data is extracted from
various sources, transformed
and loaded into target systems
Closely integrated with
RDBMSs

ETL
ETL
ETL

Which are ETL Activities ?


Data Extraction Methods are
Which are the examples of ETL tools?

Data Extraction, Data


transformation, Data loading
Incremental Extraction
Informatica PowerCenter

ETL

What is Bulk Load?

Format of Archived data


different from operational data

Data Extraction, Data


Extraction Cleanup, Data
loading
Real Time Extraction
Ab Initio
It limits your ability to recover
because no database logging
occurs

ETL

Which one is not GUI based Scheduler ?

Tool Specific

Autosys

perform a variety of
transformations unique to the
source, depending on business
requirements

performs the access and


extraction of data from the
source system and builds a
temporal view of the data at
the time of extraction

Target-based commit

Source-based commit

ETL

What do you mean by Source alteration


stage in ETL ?
What are the different types of Commit
intervals?

ETL

Which is the first step of the ETL process ? Data Extraction Cleanup

ETL

ETL

Quick and relatively easy to


write scripts for doing exports
and imports

ETL

Which is not pros of Batch Extraction ?


Which tool does not support Change-DataCapture Feature ?
Ascential Data Stage XE

DW

What is Data Warehouse ?

DW

What is the Need of Data Warehousing ?


Which one is not Characteristic of Data
Mart ?
Which is the information need for recent
data ?

DW
DW

DW
DW
DW
DW

DM

Does not usually require


additional hardware

Informatica PowerCenter
A data warehouse is a subjectoriented, integrated,
Data Warehouse is integarted of nonvolatile, time-variant
data in support of management's collection of data in support of
decisions
management's decisions
Better business intelligence for
To store Operational Data
end-users
Restrictive, non extensible

Short life/tactical

ODS

OLTP

What type of Data Structure Characteristic


does Data Warehousing has ?
Detailed
What are Components of a Data Warehouse
Architecture ?
Data Cleansing tool
Clean up source data in-place
What is use of Data Cleaning Tools ?
on the host
What is the use of Data Mining Tools ?
Slice and Dice
A known fact that can be
recorded and that have implicit
What is Database ?
meaning
A collection of concepts that
can be used to describe the
structure of a database

DM

Data Extraction

Summarized
ETL tool
Generate and maintain
centralized metadata
What If analysis
The data is perceived by the
user as tables
Representation of a set of
business requirements in a
standard structured framework
understood by the users

DM

What is Data Model ?


Which Data Modelling approach suit for
corporate data Warehouse ?
Dimensional Approach
What are the different types of relationship
notations ?
IEX

DM

What is Physical Data Model ?

Conceptual

DM

What are different types of Data Model ?

Physical model, Logical model, Hybrid model

DM

Entity Relational Approach


IDFIX
Geared for performance and
may consists of redundant data

DM

Can we have multiple foreign keys in a


table ?

TRUE

FALSE

Answer
s

Option (C)

Option (D)

Organized around important


subject areas.

Contains only current data.

One-to-many

All of the above

Completely normalized

Partially normalized

Helper

All of the above

To analyze data for expected


relationships

To create a new data warehouse

A
A

Long and Fat

Short and thin

Clean and Dirty Data

None of above

Inmon believes that DW is built Ralph Kimball believes that DW


and should be used for
is built and should be used for
B and D
reporting.
reporting.

Type Three

None of above

Does not matter if we use Sorted


or Unsorted data for
Aggregation
None of the Above

First Aggregates then load data


into dimension tables, then fact
tables

Does not matter if we load either


of fact, dimensions, or
aggregates

None of Above
Outer Join

None of the Above

C
C

None of Above
All of these

None of Above

Third
When Step
they can be compared
mathematically

Fourth Step

None of these

partitioned

reverse Key

A data warehouse is valuable to


thiose organisations that need to A data warehouse is necessary
keep an audit trail of their
to all those organisations that
activities
are using relational OLTP's

the act of using a relational


database to produce reports
giving data summaries on a
regular basis (e.g. monthly)

the act of sumarising data on a


regular basis (e.g. month end
summaries)

A fact table describes the


granularity of data held in a
DWH

A fact table describes the


transactions stored in a DWH

A hierachical and/or network


structure

A star schema

Adding data for the sake of it


may well degrade the
A data warehouse is a relatively effectiveness of data
straighttforward thing to set up. warehouseing analysis

takes regular copies of


transaction data and stores it in a has to work on live transactional
way that is optimised for query data to provide up to date and
and reporting
vaild results

The level of detail of the data


stored in a data warehouse.

The number of dimensions in a


data warehouse

The level of detail that is held in The number of dimension tables


the Data Warehouse
that exist in a star schema

Storing large volumes of data

Decision support

On Line Terminal Processing

On Line Transaction Protocol


must be in normalised form to at
can be normalised but often isn't least 2NF
Capable of integrating data from
a wide variety of sources
non-volatile
the act of processing, recording
the act of analysing transactions and storing individual
on a regular basis (e.g. monthly) transactions in a database

A
C
A,B,C,D

On Line Abstraction Protocol

On Line Analytical Processing

Data model

Entity-relationship diagram

Data model

Entity-relationship diagram

All of the above

None of the above

Many-to-many relationship

Many-to-one relationship

None of the above

Updating existing information to


reflect to the gathered and
processed information
None of the above

Database management systems

All of the above

Field formats

All of the above

Structure question language

All of the above

Data warehouse

A
B
A
B
B

B
B
A

A
B

A
A
A

Dimensions

Measures

rotating

nesting

Dimension

Nest

collection

multidimensional

nested data

slow data retrieval

ASFMI

MASHF

Rotating

Nesting

Rotating

Nesting

nesting

aggregation

select multiple cube dimensions select multiple cube slices


A process that extracts
information from internal and
external databases
All
above
Usesofathe
variety
of techniques to
find patterns and relationships in
large volumes of information
The process of analyzing data to and infer rules from them that
extract information not offered predict future behavior and
by the raw data alone
guide decision making

character

bit

knowledge base

unique group of records

attribute

file

First name

Age

network

relational

network model

object model

A
A

B
DBMS

to help transform data into


useful information that can be
used by a DSS

Expert system
to help transform data from
different sources so that they
can be stored in a single Data
Warehouse.

Car Registration number

Initials + Family Name + Date


of Birth

Consists of Rows and Columns

Cannot be empty

Some DBMS's can use DATE


data types

A Character (Text) data type can


contain 0,1,2,3,4,5,6,7,8 and 9
A DBMS that contains records
A DBMS that can only have one that have a large number of
table in it
fields in them

Numeric - Integer

Numeric - Long integer

Be redefined each time it is used be password protected

B,C,D

A
produce output that is formatted produce output formatted for
for display on a computer screen print

second normal form

None of the Above

Second Normal Form

None of the Above

first normal form

None of the Above

a Relate

a Construc

be close to a users perception of enable detailed descriptions of


the data
data query processing

Entities, Relationships and


Processes

Entities and Relationships

An ER model is entirely
concerned with modelling the
physical implemetation

An ER model is concerned
primarily with a logical view of
the data and secondly with the
physical implementation

Structured Question Language

Sequential Query Language

Data Modification Language

Data Manipulation Language

A,B,D

SELECT init, sname, gender,


kids FROM studrec WHERE
kids >'1';

SELECT init sname, gender,


kids FROM studrec WHERE
kids >1;

Analytical environment
Data Extraction layer, Data
Accesses layer, Data Storage
layer

All of the above

None

Data Extract layer


None
SQL does not have a natural way
of providing flexible view
reorganizations that will
transpose the data
All of the above

Both A and B
Both A and B

None
None

C
B

Web Access Process

None

Web Access Process

Reporting

Prompt, reliable data access

All of the above

Both A and B

None

OLAP

OLTP

Both A and B
Microsoft Access

None
All of the above

C
A

Only A

Both A and B

Both A and B
description of subject
Good to access pre-aggregated
data
Drop

None
Both B and C
Compilation intensive
architecture
Select

A
B

6
ROWID

7
All of the above

B
D

B
C

A
It allows you to associate a
variable with an entire table
column
11

Both A and C
12

B
D

B
All distinct rows selected by
both queries

All rows selected by the first


query but not the seconds

Both A and B

None

Both A and B

Only A

Data Extraction, Data loading


Full Extraction
Business Objects

Data Extraction, Data


transformation, Data Extraction
Cleanup, Data loading
All of the above
Both A and B

D
D
D

Lengthy and Complex process

All of the above

CRON jobs

All of the above

performs final formatting of data


to produce load-ready files for
the target table; identifies and
segregates rows to be inserted
vs. updated (if applicable);
applies remaining technical meta
data tagging; and processes data
into the RDBMS

final stage, uses the load- ready


files from Stage 4 to build
aggregation tables needed to
improve query performance
against the warehouse

Only A

Both A and B

Data transformation

Data loading

Not event driven--does not


facilitate notification or change Almost all applications provide
in another application at the time utilities for exporting and
of a change in first application importing

Ab Initio

All of the above

Both A and B

None

Used by Operational users

Both B and C

Project Orientation

Flexible, extensible

OLAP

All of the above

Detailed and Summarized

Detailed and lightly summarized

Data Modelling tool


Automatic generation of data
extract programs
Dill Down
It is designed, built, and
populated with data for a
specific purpose

All of the above

All of the above


Static Reports

A
B

All of the above

Both A and B

None

Both A and B

None

IE

Both B and C

Both A and B

None

Conceptual model

Both A and C