Beruflich Dokumente
Kultur Dokumente
HDFS
Richa
Assistant Professor
CSE Dept
Chandigarh University
Introduction To HDFS
Features of HDFS/ Hadoop
Hadoop Daemons
HDFS Architecture/Design
2. HDFS Viewpoint
-Locate processing logic near data instead of moving data to the
application.
3. It restricts one writer at one time for data writing. Bytes are
always appended at the end of the stream in order.
4. Distributed Data
-HDFS takes care of splitting and distributing the data across all the
nodes within a cluster
-It also replicates the data over the entire cluster
8. Scalability
- It is the ability of adding or removing the nodes or the hardware
components to or from the cluster.
- We can add or remove a node from a cluster without effecting the
cluster operation.
- Each individual hardware components such as RAM or hard disks
can be added or removed from the cluster.
Daemons???????????
-Daemons in computing term is a process that runs in the
background.
1. NameNode
2. DataNode
3. SecondaryNameNode
4. JobTracker
5. TaskTracker
Master-Slave Model
A cluster is having thousands of nodes interconnected with
each other.
There is one Master node within the cluster that is also
referred to as Head of the cluster.
All other nodes are the Slave nodes or also referred to as the
Worker nodes.
DN1 DN4
DN2 DN5
DN3 DN6
Rack1 Rack2
No. of Blocks 3
Block ID [1, 2, 3]
User Hduser
Group Hadoop
Permission rw
Replication 3
64 MB 64 MB 22 MB
22MB
DN3 DN6
Rack1 Rack2
Map-
Reduce
TaskTracker TaskTracker
Layer