1 a. Define data warehouse.

b. Compare OLTP with OLAP.
c. Discuss the 3 main types of data warehouse usage: information processing,
analytical processing and data mining.
d. Draw a star schema of a data warehouse for sales containing a sales fact table and
dimension tables for time, item, branch and location.

2 a. Discuss the various methods for handling missing values.

b. Describe the various techniques for data transformation.
c. In a data analysis problem we have an attribute age. The age values for the data
tuples are (in increasing order):
13,15,16,16,19,20,20,21,22,22,25,25,25,25,30,33,33,35,35,35,35,36,40,45,46,53 and
i. Use min-max normalization to transform the value 35 for age onto the range
ii. Use z-score normalization to transform the value 35 for age where the
standard deviation of age is 12.94 yrs.
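Note: the two normalizations in question 2 c can be sketched as below. This is only an illustration under stated assumptions: the age list is cut off after 53 in this copy of the paper, and the target range for part i is missing, so the range [0.0, 1.0] and the resulting numbers are hypothetical and are not the expected answers. The function names `min_max` and `z_score` are also illustrative.

```python
from statistics import mean

# Age values exactly as they appear in the question. The list is truncated
# in the source after 53, so every result below is illustrative only.
ages = [13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25,
        30, 33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 53]


def min_max(v, old_min, old_max, new_min, new_max):
    """Map v linearly from [old_min, old_max] onto [new_min, new_max]."""
    return (v - old_min) / (old_max - old_min) * (new_max - new_min) + new_min


def z_score(v, mu, sigma):
    """Express v as a number of standard deviations away from the mean."""
    return (v - mu) / sigma


# Assumption: the target range [0.0, 1.0] is hypothetical; the paper's
# actual range is not visible in this copy.
scaled = min_max(35, min(ages), max(ages), 0.0, 1.0)

# The standard deviation 12.94 is the value stated in the question; the mean
# is taken over the visible (incomplete) list, which is an assumption.
standardized = z_score(35, mean(ages), 12.94)

print(round(scaled, 4), round(standardized, 4))
```

Once the full list and the intended range are known, only `ages` and the last two arguments of `min_max` need to change.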

3 a. Discuss the various data mining primitives by means of examples.

b. What do the following mean in a data mining environment?
1) No coupling
2) Loose coupling
3) Semi-tight coupling
4) Tight coupling

4 a. Discuss the Apriori algorithm by means of an example.

b. We are given the following set of 4 transactions in a market basket model.

Transaction ID    Items bought

104               Milk, bread, cookies, juice
792               Milk, juice
1130              Milk, eggs
1735              Bread, cookies

Build a frequent pattern (FP) tree.
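Note: the FP-tree construction for the four transactions above can be sketched as follows. The question does not state a minimum support, so the threshold of 2 used here is an assumption, as is breaking ties between equally frequent items alphabetically. The names `Node`, `build_fp_tree` and `show` are illustrative, and the header table of node links is left out to keep the sketch short.

```python
from collections import Counter


class Node:
    """One FP-tree node: an item, its count, and its child nodes."""

    def __init__(self, item, parent=None):
        self.item = item
        self.count = 0
        self.parent = parent
        self.children = {}


def build_fp_tree(transactions, min_support):
    # Pass 1: count how many transactions contain each item.
    counts = Counter(item for t in transactions for item in set(t))
    frequent = {i: c for i, c in counts.items() if c >= min_support}

    # Pass 2: insert each transaction, keeping only frequent items, ordered
    # by descending support (ties broken alphabetically, an assumption).
    root = Node(None)
    for t in transactions:
        items = sorted((i for i in set(t) if i in frequent),
                       key=lambda i: (-frequent[i], i))
        node = root
        for item in items:
            if item not in node.children:
                node.children[item] = Node(item, node)
            node = node.children[item]
            node.count += 1
    return root, frequent


def show(node, depth=0):
    """Return the tree as indented 'item:count' lines."""
    lines = []
    for child in sorted(node.children.values(), key=lambda n: n.item):
        lines.append("  " * depth + f"{child.item}:{child.count}")
        lines.extend(show(child, depth + 1))
    return lines


transactions = [
    ["milk", "bread", "cookies", "juice"],  # 104
    ["milk", "juice"],                      # 792
    ["milk", "eggs"],                       # 1130
    ["bread", "cookies"],                   # 1735
]

# Assumption: min_support = 2 is hypothetical; the paper gives no threshold.
root, frequent = build_fp_tree(transactions, min_support=2)
print("\n".join(show(root)))
```

With a threshold of 2, eggs is dropped as infrequent and milk becomes the most heavily shared prefix; a different threshold or tie-breaking rule would give a different but equally valid tree.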

5 a. Differentiate between classification and prediction.

b. Give an example of Bayesian Belief Networks.
c. Discuss the working of back propagation algorithm in terms of various steps.
d. Given the table below for X (years of experience) and Y (corresponding salary of
engineers in Rs), predict the salary of an engineer with 12 years of experience
using linear regression.
X Y
1 20
3 36
6 43
3 57
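Note: a least-squares fit for the four (X, Y) pairs listed above can be sketched as below. It assumes simple linear regression of the form Y = intercept + slope * X and takes the data values exactly as printed; the name `fit_line` is illustrative.

```python
def fit_line(xs, ys):
    """Ordinary least squares for a single predictor: y = intercept + slope * x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Slope = sum of cross-deviations / sum of squared x-deviations.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Data as printed in the question (X = years of experience, Y = salary).
xs = [1, 3, 6, 3]
ys = [20, 36, 43, 57]

slope, intercept = fit_line(xs, ys)
predicted = intercept + slope * 12  # salary estimate for 12 years of experience

print(round(slope, 3), round(intercept, 3), round(predicted, 3))
```

Note that 12 years lies well outside the observed range of X, so the prediction is an extrapolation from only four points.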

6 a. What is clustering?
b. Discuss density based methods of clustering.
c. Given 2 objects represented by the tuples (22,1,42,10) and (20,0,36,8), compute the following:
1. Euclidean distance between the 2 objects
2. Manhattan distance between the 2 objects
3. Minkowski distance between the 2 objects using the power q=3.
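Note: all three distances in question 6 c are special cases of the Minkowski distance, which the sketch below uses for the two tuples given. The function name `minkowski` is illustrative.

```python
def minkowski(a, b, q):
    """Minkowski distance of order q between two equal-length tuples."""
    return sum(abs(x - y) ** q for x, y in zip(a, b)) ** (1 / q)


p1 = (22, 1, 42, 10)
p2 = (20, 0, 36, 8)

manhattan = minkowski(p1, p2, 1)   # q = 1: sum of absolute differences
euclidean = minkowski(p1, p2, 2)   # q = 2: square root of summed squares
minkowski3 = minkowski(p1, p2, 3)  # q = 3: as asked in part 3

print(round(euclidean, 4), round(manhattan, 4), round(minkowski3, 4))
```

The coordinate differences are 2, 1, 6 and 2, so the Manhattan distance is 11, the Euclidean distance is the square root of 45, and the order-3 distance is the cube root of 233.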

7 a. What are the requirements of clustering in data mining?

b. Briefly describe any of the following approaches to clustering methods:
Partitioning methods or hierarchical methods giving examples wherever

8 a. Discuss data mining applications either for

1. financial data analysis OR
2. telecommunication industry
b. Describe the trends in data mining.