Sie sind auf Seite 1von 3

APOLLO ENGINEERING COLLEGE

DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING


CIA-II (Set - 1)

SUBJECT:DATA ANALYTICS TIME: 1.30 HRS


YEAR/SEM:II/ III MARKS:50

PART-A (Answer all questions) 5*2=10


1. Define RTAP.
Real Time Analytics Platform. It makes use of all available data and recourses.
Based on dynamic analysis and reporting, it denotes the ability to process data as it
arrives, rather than storing and retrieving.
2. Define bloom filter.
A bloom filter is an space efficient probabilistic data structure. It is used to test wether
an element is a member of set.
3. What is sampling?
Sampling is a common practice for selecting a subset of data to be analyzed. Instead
of dealing an entire data stream, we select an instances at periodic intervals.
4. What is data streaming?
Data stream is a real-time, continuous and ordered sequence of items. It is not
possible to control the order in which the items arrive.
5. What is decaying window?
A decaying window introduces a decay factor f during the data mining process.
Specifically the weight of each transaction is multiplied by a factor of f<1.

PART-B (2*16)+(1*8)=40
1. Explain about estimating the moment. (16)
1. 1st moment
2.8th moment
3. 2nd moment
Example:
5,5,5,5,5 surprise number = 5*52=125

2. Explain Apriori algorithm. (16)


2 Step process:
1. Generate the candidate key
2. set the frequent itemset
First pass apriori
second pass apriori
3. Discuss about market based model. (8)
Market basket is an abstraction that models any many-many relationship between two
concepts: items and baskets.
frequent-itemset mining
Example:
Item={milk(m),Coke ( c) , Pepsi (p), bear (b), Juice (j)}
Minimum support s = 3.
Transactions:
1. T1={m,c,b}
2. T2={m,p,j}
3. T3={m,b}
4. T4={c,j}
5. T5={m,p,b},
6. T6={m,c,b,j}
7. T7={c,b,j}
8. T8={b,c}
Frequent itemsets:{m},{c},{b},{j},{m,b},{c,b},{j,c}

----------------------------------------------------------------------------------------------------------------
APOLLO ENGINEERING COLLEGE
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING
CIA-I (Set - 2)

SUBJECT:DATA ANALYTICS TIME: 1.30 HRS


YEAR/SEM: II/ III MARKS: 50

PART-A (Answer all questions) 5*2=10


1. Define sampling.
2. What is web data?
3. What are the characteristics of big data

PART-B (2*16)+(1*8)=40

1. Define member function and explain the types of member function.(16)


2. i) Discuss about the concept of pointer with and example program. (8)
ii) Describe with an example program about access specifiers.
3. Write a C++ program with the class name image, and the function name as area, to
find the area of a rectangle using constructor concept. (8)

-------------------------------------------------------------------------------------------------------------

Das könnte Ihnen auch gefallen