
Topic | Question | Option (A) | Option (B)

DM | A data warehouse is which of the following? | Can be updated by end users | Contains numerous naming conventions and formats.
DM | A star schema has what type of relationship between a dimension and fact table? | Many-to-many | One-to-one
DM | Fact tables are which of the following? | Completely denormalized | Partially denormalized
DM | A snowflake schema is which of the following types of tables? | Fact | Dimension
DM | A goal of data mining includes which of the following? | To explain some observed event or condition | To confirm that data exists
DM | OLAP databases are called decision support systems? | TRUE | FALSE
DM | In a star schema, dimension tables are | Short and Fat | Long and Thin
DM | The data in a data warehouse is generally | Clean Data | Dirty Data
DM | Choose two | Ralph Kimball believes that portions of data can be combined based on relevance of data and can be used for reporting | Inmon believes that portions of data can be combined based on relevance of data and can be used for reporting
DM | In which type of SCD (slowly changing dimension) do we preserve the history of data? | Type One | Type Two
ETL | During ETL load we generally have | Unsorted data for Aggregator | Sorted data for Aggregator
DM | Sequence of jobs to load data into the warehouse | First load data into fact tables, then dimension tables, then aggregates if any | First load data into dimension tables, then fact tables, then aggregates if any
DM | Snowflaking means | Normalizing the data | Denormalizing the data
OLAP | Drill Across generally uses the following join to generate a report | Self Join | Inner Join
DM | In general, data in data warehousing is | Normalized | Denormalized
DM | Consolidated data mart is | First level data mart | Second level data mart
DM | In the 4-step dimensional process, declaring the grain of the business process is | First Step | Second Step
DM | Dimensions are conformed when | They are different | They are either the same or one is a subset of another

DM | You need to create an index on the SALES table, which is 10 GB in size. You want your index to be spread across many tablespaces, decreasing contention for index lookup, and increasing scalability and manageability. Which type of index would be best for this table? | bitmap | unique
DM | Which of the following statements is true? | A data warehouse is useful to all organisations that currently use OLTP | A data warehouse is valuable only if the organisation has an interest in analysing historical data.
DM | Analytical processing is | the act of using software to analyse highly consolidated data, often to view the changes over time. | the act of exporting data into a spreadsheet for analysis
DM | Which of the following statements is true? | The fact table of a data warehouse is the main store of descriptions of the transactions stored in a DWH | The fact table of a data warehouse is the main store of all of the recorded transactions over time.
DM | Which of the following is associated with a data warehouse? | A relation | A flat file
DM | Which of the following statements is true? | The more data a data warehouse has, the better it is | A data warehouse automatically makes a copy of every transaction recorded in an OLTP system
DM | A data warehouse | must import data from transactional systems whenever significant changes occur in the transactional data. | takes regular copies of transaction data
DM | Granularity refers to | The number of fact tables in a data warehouse | The level of detail of the data descriptions held in a data warehouse
DM | Dimensionality refers to | The level of detail of data that is held in the fact table | The data that describes the transactions in the fact table.
DM | The main organisational justification for implementing a data warehouse is to provide | large-scale transaction processing | Cheaper ways of handling transactions

OLAP | OLTP stands for | On Line Transaction Processing | On Line Terminal Protocol
DM | Data in a data warehouse | in a flat file format | must be in normalised form to at least 3NF
DM | A data warehouse needs to be | time variant | Subject orientated
DM | Transaction processing is | the act of processing individual transactions | the act of analysing each transaction to verify that it is valid
OLAP | OLAP stands for | On Line Analytical Protocol | On Line Abstraction Processing
DM | What is a formal way to express data relationships to a database management system? | Attributes | Entity identifier
DM | What is a technique for documenting the relationships between entities in a database environment? | Attributes | Entity identifier
DM | What indicates having the potential to contain more than one value for an attribute at any given time? | Constraint | Single-valued
DM | Which relationship is between two entities in which an instance of entity A can be related to zero, one, or more instances of entity B and entity B can be related to zero, one, or more instances of entity A? | One-to-many relationship | One-to-one relationship
DM | Which of the following uses a series of logically related two-dimensional tables or files to store information in the form of a database? | Database | Database management system
OLAP | All of the following terms describe OLAP, except | The gathering of input information | Processing input information
DM | Which tool is used to help an organization build and use business intelligence? | Data warehouse | Data mining tools
DM | What does the data dictionary identify? | Field names | Field types
DM | Which of the following is a data manipulation tool? | File generators | Query by example tool
When gathering business information
requirements, you should focus only on the
requirements provided by the business

OLAP | One difference between the design of online transaction processing (OLTP) and online analytical processing (OLAP) systems is that the OLTP system design is optimized for getting data into the database. | TRUE | FALSE
DM | Designing a data warehouse in first normal form (1NF) is not recommended. | TRUE | FALSE
DM | Cardinality is defined as the number of relationships existing between entities. | TRUE | FALSE
DM | It is not important to include metadata when designing a data warehouse. | TRUE | FALSE
DM | There is no need to include a time dimension in the data warehouse. | TRUE | FALSE
DM | The level of granularity you choose for the time dimension has no significant impact on the size of your database. | TRUE | FALSE
DM | Surrogate keys are generated on tables in the data warehouse after the table is populated. | TRUE | FALSE
DM | To improve performance, all tables in the data warehouse should be indexed. | TRUE | FALSE
DM | Fact tables are often referred to as the measures of business performance. | TRUE | FALSE
DM | Dimension tables are used to provide descriptions of the business subjects and descriptive information about each row in the fact table. | TRUE | FALSE
A high level of granularity means more detail; a low level of granularity means less

DM | One method of managing the history in dimension tables is to drop the dimension and rebuild the table from scratch. | TRUE | FALSE
DM | You do not need to be concerned with maintaining the history of changing data in the dimension tables. | TRUE | FALSE
DM | Effective use of summaries is the best technique for improving performance in data warehouses. | TRUE | FALSE
DM | Summary data cannot be combined with detailed fact data. | TRUE | FALSE
DM | When choosing a level of summarization, there are two approaches: summarizing the entire dimension, or summarizing part of the dimension and partially improving performance | TRUE | FALSE
DM | Table partitioning splits the storage of a table into smaller individual units. | TRUE | FALSE
DM | Denormalization is the factor that increases the sparseness in a database. | TRUE | FALSE
OLAP | What are the actual data values that occupy the cells as defined by the dimensions selected? | Nesting | Aggregation

OLAP | The term that defines filtering data in an OLAP cube is __________. | dicing | slicing
OLAP | What is an item that matches a specific description or classification? | Category | Measure
OLAP | The cube structure in OLAP achieves the __________ functionality. | shared | information
OLAP | Aggregation provides OLAP with __________ | multidimensional data | pre-calculated data
OLAP | What is the acronym that defines OLAP? | FHTMI | FASMI
OLAP | __________ in OLAP allows you to define a subcube of the original space. | Dicing | Slicing
OLAP | What term in OLAP defines changing the dimensional orientation of the report from the cube data? | Dicing | Slicing
OLAP | The __________ in OLAP enable you to drill-up or drill-down to view different levels of detail about your data. | dimensions | measures
OLAP | When you nest in OLAP, you __________. | select multiple cube aggregations | select multiple cube measures
ETL | Which of the following describes ETL? | A process that transforms information using a common set of enterprise definitions | A process that loads information into a data warehouse
DM | What is data mining | A particular attribute of information | The common term for the representation of multidimensional information
DM | A collection of related data fields is called a ____. | byte | record
DM | A DBMS is a(n) ____. | interface between the database and application programs | data repository
DM | A(n) ____ is a generalized class of people, places, or things for which data is collected, stored, and maintained | record | entity
DM | Which attribute would make the best primary key? | Social security number | Last name
DM | The ____ data model follows a treelike structure. | distributed | hierarchical
DM | The most popular database model currently in use is the ____. | relational model | hierarchical model
DM | A primary key is a field or set of fields that uniquely identifies a record. | TRUE | FALSE
DM | One of the goals of a DBMS is to increase data redundancy thereby making it less vulnerable to hackers. | TRUE | FALSE
DM | A Data Warehouse would most likely be part of a(n) | ERP System | Small MIS System
DM | Data Mining would most likely be used | to streamline a Transaction Processing System | to model data in a DBMS
DM | Which of the following is a valid key field | A Book Title | House number + Street Name
DM | A Table | Can only store data of one type | Consists of Alphanumeric data
DM | A RDBMS cannot store data without knowing the data type. Which of the following statements are true? | A Logical data type can store three values, TRUE, UNKNOWN and FALSE | Numerical data can be stored in different formats
DM | A FLAT FILE database management system is | A database design that only has one table in it | A DBMS that can only have simple data tables in it
DM | Assume you are extending the design of The College Student Records System to include details on each classroom. The college is never likely to have more than ten classrooms and definitely not ever going to have more than 25 classrooms. What data type would you select | Numeric - Byte | Numeric - Single

OLAP | A report must | be exported to a word processor for printing | be based on an underlying data source (a table or a query)
OLAP | The layout of a report is independent of the number of records held in a table or query | TRUE | FALSE
OLAP | A report is used to | produce output that is ready for e-mailing | produce output that is ready for publication on the Web (HTML)
DM | The rule that prohibits transitive dependencies is | third normal form | first normal form
DM | The rule that requires that each non-key field (attribute) should be fully functionally dependent on the primary key is | Third Normal Form | First Normal Form
DM | The rule that specifies that there should be no repeating fields and that fields should be atomic is | third normal form | second normal form
DM | The process of combining two tables in a relational database is known as | a Join | a Combine
DM | The ER model is meant to | enable low level descriptions of data | replace relational design
DM | The Entity Relation Model models | Entities | Relationships
DM | Which of the following statements best describes the function of an entity relation model? | An ER model provides a view of the logic of the data and not the physical implementation. | An ER model is concerned primarily with a physical implementation of the data and secondly with the logical view
DM | SQL stands for | Sequential Question Language | Structured Query Language
DM | Which of the following are elements of SQL? | Data Query Language | Data Definition Language
DM | Consider the table (STUDREC). Which of the following statements will list columns INIT, SNAME, GENDER and KIDS (in that order) for all students who have more than 1 child. | SELECT init, sname, gender, kids FROM studrec WHERE kids <1; | SELECT init, sname, gender, kids FROM studrec WHERE kids >1;

OLAP | A typical data warehouse consists of | Staging area | Data Marts
OLAP | What are the three layers of Data warehouse architecture? | Data staging layer, Data Extract layer, Data transactional layer | Data Modelling layer, Data Accesses layer, Data Storage layer
OLAP | Staging Area comes under which layer? | Data Storage layer | Data Access layer
OLAP | What are Limitations of Traditional techniques? | Extensive programming | Redundant reporting
OLAP | Different categories of Data Access are? | Web Access | Data Mining
OLAP | OLAP stands for | Online Access Processing | Online Analytic Processing
OLAP | A process that uses a variety of statistical and artificial intelligence frameworks to discover patterns and relationships in data | Data Access Process | Data Mining Process
OLAP | A category of data access solutions in which information is viewed through a web browser | Data Access Process | Data Mining Process
OLAP | What is importance of Data Access? | Businesses today face challenges like | Data Access is the last mile that enables decision makers to
OLAP | What are different types of reporting? | Transaction Systems Reporting | Enterprise Data Warehouse Reporting
OLAP | In Transaction Systems Reporting, Reporting Tool has a native connectivity to? | Views | Tables
OLAP | An enterprise data warehouse (EDW) is designed to | To combine data from multiple OLTP systems | To provide consolidated and cleansed data to an array of data marts
OLAP | Examples of Managed Query Tool | Business Objects | MS Query
OLAP | Which are the OLAP features? | Multidimensional viewing Capabilities | Time Intelligence - Time Series analysis
OLAP | OLAP system is | Decision support | Relatively standardized and simple queries returning relatively few records
OLAP | What is measure? | Is not a number | represents factual data
OLAP | What is ROLAP? | Data is stored in multidimensional cubes | Support for large databases with good performance
SQL | Which one is DDL command? | Insert | Update
SQL | How many types of Normalization rules are there? | 4 | 5
SQL | Which are pseudocolumns | CURRVAL | NEXTVAL
SQL | Can you use select in FROM clause of SQL select? | YES | NO
SQL | Describe the use of %ROWTYPE in PL/SQL? | It allows you to associate a variable with a single column type | It allows you to associate a variable with an entire table row
SQL | How many types of triggers are there? | 9 | 10
SQL | What is the default ordering of an ORDER BY clause in a SELECT statement? | Descending | Ascending
SQL | Union All returns | All rows selected by either query | All rows selected by either query and including duplicates
ETL | What is ETL process? | ETL is the set of processes by which data is extracted from various sources, transformed and loaded into target systems | ETL is the set of processes by which data is extracted from various sources and loaded into target systems
ETL | What is Importance of ETL? | Closely integrated with RDBMSs | High speed loading of target data warehouses

ETL | Which are ETL Activities? | Data Extraction, Data transformation, Data loading | Data Extraction, Data Extraction Cleanup, Data loading
ETL | Data Extraction Methods are | Incremental Extraction | Real Time Extraction
ETL | Which are the examples of ETL tools? | Informatica PowerCenter | Ab Initio
ETL | What is Bulk Load? | Format of Archived data different from operational data | It limits your ability to recover because no database logging occurs
ETL | Which one is not GUI based Scheduler? | Tool Specific | Autosys
ETL | What do you mean by Source alteration stage in ETL? | perform a variety of transformations unique to the source, depending on business requirements | performs the access and extraction of data from the source system and builds a temporal view of the data at the time of extraction
ETL | What are the different types of Commit intervals? | Target-based commit | Source-based commit
ETL | Which is the first step of the ETL process? | Data Extraction Cleanup | Data Extraction
ETL | Which is not pros of Batch Extraction? | Quick and relatively easy to write scripts for doing exports and imports | Does not usually require additional hardware
ETL | Which tool does not support Change-Data-Capture Feature? | Ascential Data Stage XE | Informatica PowerCenter
DW | What is Data Warehouse? | Data Warehouse is integrated of data in support of management's decisions | A data warehouse is a subject-oriented, integrated, nonvolatile, time-variant collection of data in support of management's decisions
DW | What is the Need of Data Warehousing? | To store Operational Data | Better business intelligence for end-users
DW | Which one is not Characteristic of Data Mart? | Restrictive, non extensible | Short life/tactical
DW | Which is the information need for recent data? | ODS | OLTP
DW | What type of Data Structure Characteristic does Data Warehousing have? | Detailed | Summarized
DW | What are Components of a Data Warehouse Architecture? | Data Cleansing tool | ETL tool
DW | What is use of Data Cleaning Tools? | Clean up source data in-place on the host | Generate and maintain centralized metadata
DW | What is the use of Data Mining Tools? | Slice and Dice | What If analysis
DM | What is Database? | A known fact that can be recorded and that have implicit meaning | The data is perceived by the user as tables
DM | What is Data Model? | A collection of concepts that can be used to describe the structure of a database | Representation of a set of business requirements in a standard structured framework understood by the users
DM | Which Data Modelling approach suit for corporate data Warehouse? | Dimensional Approach | Entity Relational Approach
DM | What are the different types of relationship notations? | IEX | IDFIX
DM | What is Physical Data Model? | Conceptual | Geared for performance and may consist of redundant data

DM What are different types of Data Model ? Physical model, Logical model, Hybrid model
Can we have multiple foreign keys in a
