Beruflich Dokumente
Kultur Dokumente
UNIT –I
1. Classification of Digital Data. Explain?
2. Write about challenges with Big Data?
3.Explain about 5v's?
4. What is Big Data Analytics?
5.What Big Data Analytics isn't?
6. Explain about classification of Analytics?
7.Write about top challenges facing big data?
8. Discuss why is big data analytics important?
9. Write about Data Scence?
10. Explain the difference between parallel and distributed system?
11. Explain CAP Theorem?
UNIT 2
1. Compare Reporting and Analysis with its process.
2. Explain the following
a. Advanced analytics
b. Operationalized analytics
c. Monetized analytics
3. How to develop an analytical team and what is the skill required for an analyst?
4. Distinguish statistical significance and business importance.
5. What are the roles of analytical team and IT team with a detailed note on text analysis?
6. Explain in detail the commonly used analytical approaches?
7. Discuss in detail the history of analytical tools.
8. How analytical tools have evolved from graphical user interfaces to point solutions to data
visualization tools?
9. Give a detailed note on features and limitations of R programming and IBM SPSS.
Big Data (16CS520) Page 1
QUESTION BANK 2018
UNIT 3
1. List the main feature of MapReduce.
2. Describe the working of Map reduce with an relevant example.
3. Discuss the techniques which is used to optimize the map reduce jobs.
4. Discuss the points to be considered while designing a file system in mapreduce.
5. What is HBASE? Give detailed note on features of HBASE.
6. Write a short note on the Hadoop ecosystem and HDFS archiecture.
7. How does HDFS ensure data integrity in a Hadoop cluster?
8. Discuss the following terms
a.Streaming information access.
b.Low latency information access.
c.Rest and thrift
d.org.apcahe.hadoop.io.package
9. What is Meta data? What information does it provide and explain the role of Namenode in a HDFS
clusters?
10. Define Command line interface using HDFS files and give a brief note on Hadoop-specific file
system types and HDFS commands.
UNIT 4
1. What is NoSQL? What are the advantages of NoSQL? And Explain types of NoSQL
Databases?
2. Differentiate between SQL vs NoSQL?
3. What is NewSQL? Differentiate between NewSQL and NoSQL?
4. With Neat sketch explain in detail Hadoop architecture and its components?
5. a) List hadoop distributions
b) Compare Hadoop vs SQL
6. With neat sketch explain HDFS?
7. With neat sketch explain processing data with Hadoop?
8. Explain in detail interacting with Hadoop Ecosystem?
Big Data (16CS520) Page 2
QUESTION BANK 2018
UNIT 5
1. List some key elements of social media.
2. Describe the steps to perform text mining.
3. Discuss some commonly used text mining software.
4. List some common online tools used to perform sentiment analysis.
5. What do you understand by sentiment analysis?
6. Discuss some application areas of mobile analytics.
7. Briefly explain some popular mobile analytics tools available in the market.
8. What is the importance of location –based tracking tools?
9. Discuss the necessity of keeping data secure while conducting analytics.
10. Discuss some fields where mobile analytics can be used.
UNIT – I
1. Data is present in a _____________ source [ ]
3. XML is an Example of [ ]
10. What category you place the consumer complaints and feedback [ ]
11.Yotaabytes is equal to [ ]
12.The human and technical infrastructure needed to support storage, processing and _______
[ ]
13.RDMS is an example of [ ]
15.In SMP,there is a single Common main memory that is shared by _____ processor [ ]
19. ________ deals with wide range of data types and source of data. [ ]
A) 2 B) 1 C) 3 D) 0
26. _______implies that the system will continue to function when network partition occurs.
[ ]
34.Which of the four characteristics of the Big data indicates that many data formats can be store and
analyze? [ ]
36. A system that has achieved eventual consistency is said to have converged or achieved [ ]
37. Big data analytics is about a tight handshake between three communities IT, Business user and
____ [ ]
39. A coordinated processing of a program by multiple processors, each working on different parts of
the program and using its own operating system and memory is called [ ]
40. A collection of independent computers that appear to its user as a single coherent system is
[ ]
Unit-II
1. Which among the following is not a characteristic of reporting? [ ]
A] Provides data B] Provide answers C] Is fairly inflexible D] Provides what is asked
2. Exploratory data analysis using graphs in nothing but: [ ]
A] Data cleaning B] Basic reporting C] Predictive modelling D] Model implementation
3. Reporting does not involve: [ ]
A] Predictive models B] Graphs C] Charts D] Tables
4. In a data analysis report, you will find: [ ]
A] Descriptive statistics B] Optimization C] Formatted text D] White papers or journals
5. Data collection is one of the steps in statistical data analysis. This step is performed after which of
the following steps? [ ]
A] Model building B] Model implementation C] Business objective D] Evaluation
6. When we don’t have access to a population, we tend to consider: [ ]
A] A simulated population B] A survey C] A random sample D] Judgemental insights
7. Missing value treatment of the data is: [ ]
A] Necessary to get the correct results B] Often leads to wrong results
C] Should never be practiced D] Simply dropping all the missing records
8. Which of the following is not a task of an analytics team? [ ]
UNIT III
1. Which of the following options most aptly explains the reason behind the creation of Mapreduce?
A)Need to increase the processing of new h/w B)Need to perform complex analysis of structured data.
C)Need to increase the number of web users D)Need to spread distributed computing. [ ]
2. In designing the mapreduce framework,which of the following needs did the engineers consider?
C)columnar data base D)new key value pair to answer the query. [ ]
11. Which of the following term is used to denote the small subsets of a large file created by HDFS
12. What message is generated by a datanode to indicate its connectivity with name nod
A)data about data B)data from web logs C)data from govt sources D) data from market [ ]
17. Which of the following commands of HDFS can issue directives to blocks
18. Which of the file system provides read-only access to hdfs over HTTPs.
19. ------ is a tool used to transfer data between hadoop and relational database
20. ------ used to transfer large amount of data from distributed resources to a single repository.
21. ________ systems are scale-out file-based (HDD) systems moving to more uses of memory in the
nodes.
A) NoSQL B) NewSQL C) SQL D) All of the mentioned [ ]
23. Which of the following command sets the value of a particular configuration variable (key)?
A) set –v B) set <key>=<value> C) set D) reset [ ]
25. The Pig Latin scripting language is not only a higher-level data flow language but also has
operators similar to :
A) SQL B) JSON C) XML D) All of the mentioned [ ]
26. The Pig Latin scripting language is not only a higher-level data flow language but also has
operators similar to :
A) SQL B) JSON C) XML D) All of the mentioned [ ]
27. Which of the following is used for the MapReduce job Tracker node?
A) mradmin B) tasktracker C) jobtracker D) none of the mentioned [ ]
31. ________ is the slave/worker node and holds the user data in the form of Data Blocks.
32. HDFS provides a command line interface called __________ used to interact with HDFS.
33. The __________ is responsible for allocating resources to the various running applications subject
to familiar constraints of capacities, queues etc.
A) Manager B) Master C) Scheduler D) manager [ ]
35. The Hadoop list includes the HBase database, the Apache Mahout ________ system, and matrix
operations.
A) Machine learning B) Pattern recognition C) Statistical classification D) Artificial intelligence [ ]
36. InputFormat class calls the ________ function and computes splits for each file and then sends
them to the jobtracker.
A) puts B) gets C) getSplits D) all of the mentioned [ ]
37. _________ is the primary interface for a user to describe a MapReduce job to the Hadoop
framework for execution.
A) Map Parameters B) JobConf C) MemoryConf D) None of the mentioned [ ]
39. ___________ is an open source SQL query engine for Apache HBase
A) Pig B) Phoenix C) Pivot D) None [ ]
Unit-IV
1.The expansion for CAP is Consistency, Availability, and _______ [ ]
5. ________is a robust database that supports ACID properties of transactions and has the scalability of
NoSQL. [ ]
21._________ is used to transfer bulk data between Hadoop and structured data stores such as
relational databases [ ]
23.Pig is a [ ]
30.________ traditional IT company is the largest Big Data vendor in the world [ ]
31._______ is Splunk’s new product to search, access and report on Hadoop data sets [ ]
Unit-V
1. Which of the following collectively represent a social network bound via specific sets of social
relationships? [ ]
(A) Websites (B) Big Data (C) People (D) Analytics Tools
(A) Participation (B) Online shopping (C) Content Sharing (D) Conversation
3. Which of the following text mining tools is used to extract who, what, where, when and why
facts? [ ]
(A) Active-point (B) Attensity (C) Cross minder (D) Compare suite
(A) Face book (B) LinkedIn (C) Twitter (D) Word press
5. Which of the following terms represents passive observation of social media activities?[ ]
6. Social media denotes a group of internal –based applications build over the foundation of
______ of that support the creation and exchange of user generated content. [ ]
(A)Web 2.0 (B) Web 3.0 (C) Web 2.1 (D) web 4.0
7. Which blogs allow people to share and showcase small posts and are suitable for quick sharing
of content in a few lines of a text or an individual photo or video. [ ]
(A) Blogs (B) Micro blogs (C) Wiki (D) Face Book.
10. _______ used for statistical data analysis, text processing and sentiment analysis. [ ]
(A)Cross minder (B) Compare suite (C) SAS Text miner (D) Attensity.
(A)Sentiment Analysis (B) one pass Clustering (C) Buckshot Clustering (D) Monarch
14. Which software used for analysis and transformation of reports into live data. [ ]
(A)Monarch (B) Text alyzer (C) SAS Text Miner (D) Compare suite.
17. Which tools helps in tracking data and scheduling and organizing pin in advance? [ ]
(A)Text alyzer (B) Monarch (C) Active point (D) Compare suite.
(A) Digital quality (B) Multimedia application (C) Mobile voice (D) Both a and b
21. ____ can be used by service provider to help them monitor and improve their service. [ ]
(A) Session (B) Bounce rate (C) track sales (D) Customers engaged.
22. The use of mobile phone or other device like tables to view online content via light we browser
refers as [ ]
23. Which is a big Marketing and Analytics platform for mobile and Web application? [ ]
(A)Data Winner (B) Statviz (C) Test flight (D) Both A and C
26. Which of the following Technologies supports LIT and Wi-Max techniques? [ ]
27. Which of the following mobile Analytics tools will you use to work on all platforms for the
measurement of user acquisition, engagement, and outcomes in native mobile apps? [ ]
28. Which of the following location –based mobile analytics tracking tools will you use to
incorporate advanced geolocation functionality to mobile devices running on iOS, Windows, as
well as Android? [ ]
29. Which of the following mobile analytics tools is used to test an app? [ ]
(A)Test Flight (B) Mobile App Tracking (C) Apsalar (D) Mixpanel.
30. Which of the following mobile analytics data collection tools provides data collection services
and reduce decision –making time by interpreting data efficiently? [ ]
(A) Open Data Kit (B) Data Winners (C) Command Mobile (D) Enterprise Server.
31. Which of the following types of mobile app analytics reports will you use to understand the
demographics of the people using a particular mobile application? [ ]
32. Which of the following reports will you use to display the details about the actual sign-ups
and sale of mobile applications? [ ]
(A)Mobile device (B) Mobile application (C) Mobile platform (D) Mobile Analytics tool
35. Some of the popular mobile analytics tools available in the market are: [ ]
37. An Application that can perform mobile data collection and workforce management service.[ ]
(A)COMMAND mobile (B) Data winners (C) Stat Viz (D) Play store
38. _____ used to describe the process in which the system automatically opens another page.[ ]
(A) Zahi Boussiba (B) Yoni Douek (C) Both A and B (D) IBM
OBJECTIVE - ANSWERS
4 4 B 4 D 4 A 4 D
5 5 C 5 D 5 C 5 C
6 6 C 6 A 6 A 6 A
7 7 A 7 B 7 B 7 B
8 8 C 8 A 8 A 8 A
9 9 C 9 B 9 A 9 C
10 10 B 10 C 10 B 10 A
11 11 B 11 C 11 A 11 A
12 12 D 12 B 12 B 12 A
13 13 B 13 A 13 C 13 A
14 14 14 D 14 B 14 A
15 15 C 15 D 15 A 15 A
16 16 A 16 A 16 A 16 A
17 17 A 17 C 17 A 17 A
18 18 B 18 D 18 A 18 A
19 19 D 19 A 19 D 19 C
20 20 A 20 B 20 B 20 C
21 21 C 21 A 21 B 21 B
22 22 B 22 A 22 D 22 A
23 23 B 23 B 23 A 23 D
24 24 B 24 B 24 B 24 B
25 25 A 25 A 25 C 25 D
26 26 D 26 B 26 A 26 D
27 27 A 27 C 27 A 27 C
28 28 B 28 B 28 D 28 A
29 29 A 29 C 29 A 29 A
30 30 D 30 C 30 B 30 B
31 31 B 31 A 31 B 31 A
32 32 A 32 B 32 C 32 D
33 33 A 33 C 33 C 33 D
34 34 C 34 C 34 B 34 A,D
35 35 B 35 A 35 B 35 D
36 36 A 36 C 36 B 36 A
37 37 A 37 B 37 A 37 A
38 38 B 38 B 38 D 38 B
39 39 C 39 B 39 A 39 C
40 40 B 40 B 40 C 40 B