Beruflich Dokumente
Kultur Dokumente
Hive Pig
Giraph
Spark
Storm
Flink
MapReduce
HBase
Cassandra
MongoDB
Zookeeper
YARN
HDFS
Hadoop evolved over time!
Giraph
Spark
Storm
Flink
MapReduce
HBase
MapReduce
Cassandra
MongoDB
Zookeeper
YARN
HDFS HDFS
Hadoop 1.0
Only
MapReduce Hive Pig Others Other
jobs applications not
MapReduce supported
HDFS
Poor
Resource
utilization
One dataset many applications
HADOOP 1.0 HADOOP 2.0
MAP
SPARK OTHERS
REDUCE
HDFS HDFS
Central Resource Manager Each machine
== gets a Node
ultimate decision maker
Manager
Resource Manager Node Manager
Data Computation
Framework
Application Master =
personal negotiator
Negotiates
Resource
Manager
Node Manager
Container
2X ↑ Jobs 2.5X ↑
per day Number of
2X ↑ CPU tasks from all
utilization jobs
* Source: Apache Hadoop YARN: Yet Another Resource Negotiator.” In Proceedings of the 4th Annual Symposium on Cloud
Computing, 5:1–5:16. SOCC ’13.
YARN More Applications
Apache Hama
and growing …
Data Value Many choices in Hadoop 2.0