Beruflich Dokumente
Kultur Dokumente
UNIT-1
PART A(2 MARKS)
1. Define Big Data? Why it is need?
2. List the Analytics tools?
3. Define Analytics?
4. Define Data Acquisition?
5. Give some commonly used protocols for Data Acquisition.
6. What are the pain points of EDW?
7. List the features of Big Data?
8. What are the five aspects of data that data professionals consider for analytics?
9. What are the software tools used for Big Data?
10. Write any 4 Uses of informatica
UNIT II
UNIT III
Part A
1. What is meant by stream computing?
2. What are the types of stream?
3. List the queries of data streams
4. Define Classification
5. Define cluster
6. What is meant by Kafka?
7. Define Population Sampling?
8. Why we need TRAP?
9. What is meant by Web Analytics?
10.What are three core Components of Infosphere Streams
Part B
1. Write short notes on Kafka?
2. Discuss About sampling in a stream
3. Write Short notes on counting one’s in a Window
4. Discuss in detail about filtering in data Streams with the function of Bloom
Filters?
5. Draw the architecture of RTAP
Part C
1. Write a detail about stream data architecture?
2. Discuss in detail about real time analytics platform applications
3. Explain in detail about IBM Infosphere
4. Discuss in detail about counting distinct elements in a stream
5. Explain about decaying window s.
UNIT IV
PART A
1. Define Predictive Analytics
2. Give some examples of predictive analytics software
3. Define supervised and unsupervised machine learning
4. Give some application of neural networks
5. Define support and confidence
6. Write advantages and disadvantages of APRIORI algorithm
7. Define false negative and false positive
8. What is meant by Market based model?
9. Define partial clustering and hierarchical clustering
10.Define data visualization
Part B
1. Discuss briefly about machine learning algorithms
2. Explain in detail about kohonen’s model
3. Discuss in detail association rules
4. How to handle the large data in a main memory?
5. Explain in detail about clustering high dimensional data.
PART C
1. Explain in detail about predictive Analytics
2. Discuss briefly about Neural networks
3. Explain APRIORI algorithm with a suitable example
4. Explain in detail about clustering algorithms
5. Discuss in detail about Limited Pass Algorithms.
UNIT V
Part A
1. Define Map Reduce. Name the three stages of Map Reduce
2. Define HDFS
3. What are the advantages of HADOOP?
4. What are features of HIVE?
5. Define Sharding
6. What are the 3 types of databases?
7. Differentiate RDBMS and NOSQL
8. What are the features of HBase?
9. Define Impala
10. What is IMPALA Metastore?
Part B
1. Explain in detail about IBM Framework
2. Discuss in detail about HDFS
3. Discuss Sharding Technology in detail for Big Data
4. Discuss about S3 and its benefits briefly
5. Discuss briefly about big data for Blogs and E-Commerce
Part C
1. Discuss in detail about map reduce algorithm and framework
2. Explain HIVE framework with neat architecture diagram
3. Discuss NOSQL database in detail
4. Describe IMPALA architecture with neat diagram
5. Explain clearly about analyzing big data with twitter