Sie sind auf Seite 1von 2

SYLLABUS

CHANDIGARH UNIVERSITY, GHARUAN

Big Data Analytics L T P C


Subject Code Total Contact Hours : 45Hours
(CST-301) Common to all Specializations of CSE 3rd 3 0 0 3
Year

Marks-100

Internal-40 External-60

Course Objectives

 To have good understanding of BigData concepts and design of HDFS for handling
Big Data.

 To learn how to use PIG, HIVE, SCOOP, HBASE, OOZIE and FLUME.

 To understand Map Reduce Architecture.

 To understand daemons of Hadoop and their functionality.


Unit Course Outcomes

I  Be familiar in Handling BigData for storage and retrieval using Hadoop

II  Be familiar with the Job Tracker, Task Tracker with Name Node and Data Nodes.

III  Familiar with uses of FLUME, OOZIE,HBASE,HIVE and SCOOP


Contents of the Syllabus

UNIT-I [15h]

Introduction to BigData and Hadoop: BigData and its Characterstics, Problems with BigData,
Handling BigData, Difference between Structured, Semi- Structured and Unstructured Data.
Introduction to Hadoop, Scope of Hadoop, Components of Hadoop,

Hadoop Distributed File System: Introduction of HDFS, HDFS Design, HDFS role in Hadoop, Features
of HDFS, Daemons of Hadoop and its functionality- Name Node, Data Node, Secondary Name Node,
Job Tracker, Task Tracker.

UNIT-II [15h]

HDFS Architecture: Concept of Nodes, Racks and Data Center. Basic Configuration for HDFS. Data
Organization- Blocks and Replication. Anatomy of File Write, Anatomy of File Read. Rack Awareness,
Heartbeat Signal. Storing and Reading Data into HDFS.

MapReduce: Introduction of MapReduce, MapReduce Architecture, Data flow in MapReduce- Splits,


Mapper , Partitioner, Sort and shuffle, Combiner, Reducer. Difference between Block and InputSplit.
Role of RecordReader, Basic Configuration of MapReduce, MapReduce life cycle- Driver Code,
Mapper and Reducer. Working of MapReduce.

UNIT-III [15h]

Introduction to PIG, SQOOP and HIVE: Introduction to PIG Data Flow Engine, Uses of PIG, Modes of
Execution in PIG- Local Mode and MapReduce Mode. Introduction to SQOOP, Use of SQOOP,
Introduction to HIVE, HIVE Architecture.

Introduction to HBASE, OOZIE and FLUME: Introduction to HBASE, Basic Fundamentals of HBase.
Introduction to OOZIE, Use of OOZIE, Introduction to Flume, Uses of Flume, Flume Architecture- Flume
Master, Flume Collectors, Flume Agents.

Text Books:
1. Data Analytics by Radha Shankarmani, M. Vijayalakshmi by Technical Publications.
2. Big Data Analytics with R and Hadoop by Vignesh Prajapati.

Reference Books:

1. Hadoop - The Definitive Guide by Tome White, O’REILLY Publications.

2. Hadoop In Practice by Alex Holmes, DreamTech Publications.

Das könnte Ihnen auch gefallen