Sie sind auf Seite 1von 2

Want to supercharge your creative skills?

Pluralsight has acquired Digital-Tutors, giving you access to its 1,500+ creative skills courses.

Go to







SUP P OR T Search the library

Sign in

Full Library | Categories | Authors | Popular | New Releases

SQL on Hadoop - Analyzing Big Data with Hive

This course will teach you the Hive query language and how to apply it to solve common Big Data problems. This includes an introduction to distributed computing, Hadoop, and MapReduce fundamentals and the latest features released with Hive 0.11
12 Tw eet 14 Like Share 43 Share 14

Authored by: Ahmad Alkilani Duration: 4h 16m Level: Intermediate Released: 10/8/2013 Course Rating:

Table of Contents


Exercise Files


You are currently not signed in. Please sign in to access subscriber-only content. expand all | collapse all Progress Duration
00:24:05 01:48 01:12 03:01 01:30 04:08 01:27 09:49 01:10 00:46:17 01:17 01:36 02:24 01:03 02:29 04:09 07:51 12:09 02:27 09:48 01:04 01:28:26 00:49 07:39 01:28

Introduction to Hadoop
Introduction Motivation for Hadoop Distributed Computing Challenges Hadoop File System (HDFS) MapReduce Word Count Example Demo: Basic Hadoop Commands and Environment Setup Summary

Introduction to Hive
Introduction Hive Motivation Hive Architecture Hive Principles - Schema on Read Hive Principles - The Hive Warehouse Hive Query Language Basics - SELECT and Sub Queries Creating Databases and Tables with HiveQL Demo: Working with Hive Tables and Loading Data into Warehouse Loading Data - Hive Managed and External Tables Demo: External Tables and Create Table Alternatives Summary

Hive Query Language

Introduction Data Types Type Conversions

Managed Partitioned Tables External Partitioned Tables Demo: Table Partitioning Multi Inserts and Dynamic Partition Inserts Demo: Loading Data Use Case Data Retrieval - Group By and Functions Sorting and Controlling Data Flow The CLI and Variable Substitution Summary

06:46 03:50 19:00 13:40 05:38 13:26 08:11 06:41 01:18 01:18:09 01:03 04:15 04:01 03:44 06:22 01:37 02:32 06:05 07:03 04:39 03:23 01:00 01:23

Advanced HiveQL
Introduction Bucketing Bucket and Block Sampling Joins Joins in Depth and Join Optimizations Map-side Joins for Bucketed Tables Distributed Cache UDTFs, Explode and Lateral View Demo: Extending Hive - Creating Your own UDF Demo: Extending Hive - Compiling and Testing Custom UDF Extending Hive - Custom UDF Recap Demo: Hive Initialization File Accessing The Distributed Cache

Hadoop Streaming and Transform() Windowing and Analytics Functions

05:10 03:14

Demo: Putting it All Together Using Transform Demo: Analytics Functions Demo: Ranking Functions Summary

12:41 03:56 05:01 01:00 00:19:09 07:26 02:29 04:14 00:43 01:59 01:33 00:45

Storage and The Eco-System

Create Table Statement - File Formats and SerDes HCatalog Sqoop DistCP Hadoop Eco-System Projects References and Resources Summary