Wednesday, August 22, 12

INFRASTRUCTURE

Big Data
• 2.5B - content items shared
• 2.7B - ‘Likes’
• 300M - photos uploaded
• 100+PB - disk space in a single HDFS cluster
• 105TB - data scanned via Hive (30min)
• 70,000 - queries executed
• 500+TB - new data ingested

Life of data at Facebook
[Diagram. Labeled components: Data Tools, Workflow, Hive, MapReduce, Real-time HDFS, Import, Copier/Loader, Realtime Analytics (PUMA), Scribe/ScribeH, www.facebook.com, UDB]