DATA ANALYTICS QUESTION BANK

UNIT 1

2 MARKS
1. What is Big Data Analytics?
2. What are the characteristics of Big Data?
3. Describe the structure of Big Data.
4. What is Web Data?
5. Define Analytic sandbox and its types.
6. Differentiate between Analysis and Reporting.
7. List some modern data analytic tools.
8. Define Sampling Distribution.
9. State the Central Limit Theorem.
10. Define Resampling and its methods.
11. Define Statistical Inference.
12. Define Hypothesis testing.

16 MARKS
1. Explain in detail about the statistical concepts of sampling
distributions and resampling distributions.
2. Explain in detail about the evolution of analytic scalability,
analytic processes and tools.
3. Explain the challenges of conventional systems in big data
analytics.

1. Explain in detail about the Regression Modelling and its
techniques.
2. Explain in detail about Bayesian modeling, inference and
Bayesian networks.
3. Explain the concepts in the analysis of Time Series.
4. Explain in detail about the Support vector and kernel methods.

1. Define Data Stream and Data Stream Mining.
2. Give some examples of Data Stream Sources.
3. What is sampling in a Data Stream? List its methods.
4. Describe Filtering in a stream.
5. State and describe the Bloom Filter.
6. Discuss counting distinct elements in a stream and list
the algorithms used.

1. What is Regression analysis? Define its types.
2. Define Logit (Logistic) regression.
3. Define Multivariate analysis and list some methods.
4. State Bayes' Theorem.
5. State the Naive Bayes algorithm.
6. Define SVM and its types of classification.
7. What is Time Series Analysis?
8. Define Rule Induction.
9. What is the Sequential Covering algorithm?

1. Explain in detail about Stream data model and architecture in
Stream Computing and Sampling data in a stream.
2. Explain in detail about RTAP.
3. Explain in detail about Counting Distinct Elements in a data
stream.

7. Define Estimating moments in a stream.
8. Describe counting ones in a window.
9. What is a Decaying window?
10. Define and discuss RTAP.
1. Define the Market Basket Model, Association Rule
Mining, and the Apriori Algorithm.
2. How are larger datasets handled in main memory? What
algorithms are used?
3. Describe Limited Pass Algorithms.
4. Describe counting frequent items in a stream.
5. State the Multistage and Multihash algorithms.

1. Define MapReduce and its stages.
2. What is Hadoop, and why use Hadoop?
3. What is HIVE? List its operations.
4. What is MapR?
5. What is Sharding?
6. What is NoSQL, and why use NoSQL?
7. Describe Amazon S3 (or simply S3).
8. What is HDFS?
9. List some visual data analytic techniques.

1. Explain in detail about the Market Basket Model in relation to the
Apriori Algorithm.
2. Explain in detail about handling large data sets in main memory.
3. Explain in detail about Limited Pass algorithms.

1. Explain in detail about MapReduce and its operations.
2. Explain in detail about HDFS (Hadoop Distributed File System)
and its operations.
3. Explain in detail about Visual Data Analytic techniques and their
applications.