Apache Hadoop
Learn to deploy, configure, and manage Cloudera's Apache Hadoop
implementation and HDFS.
In this interactive, hands-on Apache Hadoop course, you will gain a comprehensive
understanding of all the steps necessary to operate and maintain a Hadoop cluster. Covering
topics from installation and configuration through load balancing and tuning, this course
prepares you for the real-world challenges faced by Hadoop administrators.
This course covers concepts addressed on the Cloudera Certified Administrator for Apache
Hadoop (CCAH) exam and includes a CCAH exam voucher, which you receive at the end of class.
The internals of MapReduce and HDFS and how to build a Hadoop architecture
Proper cluster configuration and deployment to integrate with systems and hardware in
the data center
How to load data into the cluster from dynamically generated files using Flume and
from an RDBMS using Sqoop
Prerequisites
This course is designed for system administrators and IT managers who have basic Linux
systems administration experience. Prior knowledge of Hadoop is not required.
Follow-On Courses
Course Outline
1. The Case for Apache Hadoop
Why Hadoop?
Fundamental Concepts
2. HDFS
HDFS Features
NameNode Considerations
HDFS Security
REST Interfaces
4. MapReduce
Features of MapReduce
Basic Concepts
Architectural Overview
MapReduce Version 2
Failure Recovery
Network Considerations
Configuring Nodes
Deployment Types
Installing Hadoop
Hive
Impala
Pig
8. Hadoop Clients
9. Cloudera Manager
Cluster Upgrading
15. Conclusion
Labs
Throughout the course, you'll participate in hands-on labs to help build your knowledge and
apply the concepts discussed.