Marks-100
Internal-40 External-60
Course Objectives
To have a good understanding of Big Data concepts and the design of HDFS for handling
Big Data.
To learn how to use PIG, HIVE, SQOOP, HBASE, OOZIE and FLUME.
To be familiar with the Job Tracker and Task Tracker, along with the Name Node and Data Nodes.
UNIT-I [15h]
Introduction to Big Data and Hadoop: Big Data and its Characteristics, Problems with Big Data,
Handling Big Data, Difference between Structured, Semi-Structured and Unstructured Data.
Introduction to Hadoop, Scope of Hadoop, Components of Hadoop.
Hadoop Distributed File System: Introduction to HDFS, HDFS Design, Role of HDFS in Hadoop, Features
of HDFS, Daemons of Hadoop and their functionality: Name Node, Data Node, Secondary Name Node,
Job Tracker, Task Tracker.
UNIT-II [15h]
HDFS Architecture: Concept of Nodes, Racks and Data Center. Basic Configuration for HDFS. Data
Organization: Blocks and Replication. Anatomy of File Write, Anatomy of File Read. Rack Awareness,
Heartbeat Signal. Storing Data into and Reading Data from HDFS.
UNIT-III [15h]
Introduction to PIG, SQOOP and HIVE: Introduction to PIG Data Flow Engine, Uses of PIG, Modes of
Execution in PIG: Local Mode and MapReduce Mode. Introduction to SQOOP, Uses of SQOOP,
Introduction to HIVE, HIVE Architecture.
Introduction to HBASE, OOZIE and FLUME: Introduction to HBASE, Basic Fundamentals of HBASE.
Introduction to OOZIE, Uses of OOZIE, Introduction to FLUME, Uses of FLUME, FLUME Architecture: Flume
Master, Flume Collectors, Flume Agents.
Text Books:
1. Data Analytics by Radha Shankarmani and M. Vijayalakshmi, Technical Publications.
2. Big Data Analytics with R and Hadoop by Vignesh Prajapati.
Reference Books: