Hive - Self Learning Notes

Hochgeladen von

Karvendhan Balasubramanian

0% fanden dieses Dokument nützlich (0 Abstimmungen)

192 Ansichten69 Seiten

Self Learning notes to refresh about the hive concepts

Copyright

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Dieses Dokument melden

Self Learning notes to refresh about the hive concepts

Copyright:

Markieren Sie unangemessene Inhalte

0% fanden dieses Dokument nützlich (0 Abstimmungen)

192 Ansichten69 Seiten

Hive - Self Learning Notes

Hochgeladen von

Karvendhan Balasubramanian

Self Learning notes to refresh about the hive concepts

Copyright:

Markieren Sie unangemessene Inhalte

Zu Seite

Sie sind auf Seite 1von 69

Im Dokument suchen

HIVE

Self learning notes

What is Hive
Hive is most suited for data warehouse applications, where relatively static
data is analyzed, fast response times are not required, and when the data is
not changing rapidly

Hive is not a full database. The design constraints and limitations of hadoop
and HDFS impose limits on what Hive can do.

The biggest limitation is that Hive does not provide record-level update,
insert, nor delete. You can generate new tables from queries or output query
results to files.

Also, because Hadoop is a batch-oriented system, Hive queries have higher

latency, due to the start-up overhead for MapReduce jobs.

Queries that would finish in seconds for a traditional database take longer for
Hive, even for relatively small data sets. Finally, Hive does not provide
transactions.
Limitations of hive
So, Hive doesnt provide crucial features required for OLTP, Online
Transaction Processing.

Its closer to being an OLAP tool, Online Analytic Processing, but as

well see, Hive isnt ideal for satisfying the online part of OLAP, at
least today, since there can be significant latency between issuing a
query and receiving a reply, both due to the
overhead of Hadoop and due to the size of the data sets Hadoop
was designed to serve.

If you need OLTP features for large-scale data, you should consider
using a NoSQL database. Examples include HBase, a NoSQL
database integrated with Hadoop.

So, Hive is best suited for data warehouse applications, where a

large data set is maintained and mined for insights, reports, etc.
Hive is best suited for data warehouse applications,
where real-time responsiveness to queries and
Hive Modules record-level inserts, updates, and deletes are not
required

Bundled with the Hive

distribution is the CLI, a simple
web interface called Hive web
interface (HWI), and
programmatic access through
JDBC, ODBC, and a Thrift
server

All commands and queries go to

the Driver, which compiles the
input, optimizes the
computation required, and
executes the required steps,
usually with Map Reduce jobs.

The Metastore is a separate relational database (usually a MySQL instance)

where Hive persists table schemas and other system metadata
Getting Started CLI
HIVE Data Definition Language

Das könnte Ihnen auch gefallen

Learn Hbase in 24 Hours
Von Everand
Learn Hbase in 24 Hours
Alex Nordeen
Noch keine Bewertungen
Learn Hive in 24 Hours
Von Everand
Learn Hive in 24 Hours
Alex Nordeen
Noch keine Bewertungen
Learn Hadoop in 24 Hours
Von Everand
Learn Hadoop in 24 Hours
Alex Nordeen
Noch keine Bewertungen
Professional Hadoop Solutions
Von Everand
Professional Hadoop Solutions
Boris Lublinsky
Bewertung: 4 von 5 Sternen
4/5 (2)
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Von Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
Noch keine Bewertungen
Apache HIVE
Dokument105 Seiten
Apache HIVE
hemanth kumar p
100% (1)
Hive
Dokument41 Seiten
Hive
Sumanth Pantangi
Noch keine Bewertungen
BMC Control-M 7: A Journey from Traditional Batch Scheduling to Workload Automation
Von Everand
BMC Control-M 7: A Journey from Traditional Batch Scheduling to Workload Automation
Qiang Ding
Noch keine Bewertungen
Airflow Introduction
Dokument9 Seiten
Airflow Introduction
Paresh Bhatia
Noch keine Bewertungen
NiFi A Complete Guide - 2021 Edition
Von Everand
NiFi A Complete Guide - 2021 Edition
Gerardus Blokdyk
Noch keine Bewertungen
Mastering Azure Synapse Analytics: Learn how to develop end-to-end analytics solutions with Azure Synapse Analytics (English Edition)
Von Everand
Mastering Azure Synapse Analytics: Learn how to develop end-to-end analytics solutions with Azure Synapse Analytics (English Edition)
Debananda Ghosh
Noch keine Bewertungen
Apache Airflow
Dokument8 Seiten
Apache Airflow
Bhanu Prakash
50% (2)
Hadoop BIG DATA Interview Questions You'll Most Likely Be Asked
Von Everand
Hadoop BIG DATA Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
Noch keine Bewertungen
Couchbase Essentials
Von Everand
Couchbase Essentials
John Zablocki
Noch keine Bewertungen
Define The Difference Between Hive and Hbase?
Dokument11 Seiten
Define The Difference Between Hive and Hbase?
ashok c
Noch keine Bewertungen
2016 05 10 Apache Nifi Deep Dive 160511170654
Dokument34 Seiten
2016 05 10 Apache Nifi Deep Dive 160511170654
Shyam Babu
Noch keine Bewertungen
Apache Cassandra Essentials
Von Everand
Apache Cassandra Essentials
Padalia Nitin
Bewertung: 4 von 5 Sternen
4/5 (1)
Instant Pentaho Data Integration Kitchen
Von Everand
Instant Pentaho Data Integration Kitchen
Sergio Ramazzina
Noch keine Bewertungen
Hadoop: Fasilkom/Pusilkom UI (Credit: Samuel Louvan)
Dokument44 Seiten
Hadoop: Fasilkom/Pusilkom UI (Credit: Samuel Louvan)
Johan Rizky Aditya
Noch keine Bewertungen
Running Airflow Reliably With Kubernetes
Dokument47 Seiten
Running Airflow Reliably With Kubernetes
atulw
100% (1)
Creating Development Environments with Vagrant - Second Edition
Von Everand
Creating Development Environments with Vagrant - Second Edition
Michael Peacock
Noch keine Bewertungen
Monitoring Hadoop
Von Everand
Monitoring Hadoop
Gurmukh Singh
Noch keine Bewertungen
Learning Apache Cassandra
Von Everand
Learning Apache Cassandra
Mat Brown
Noch keine Bewertungen
Azure Databricks A Complete Guide - 2020 Edition
Von Everand
Azure Databricks A Complete Guide - 2020 Edition
Gerardus Blokdyk
Noch keine Bewertungen
Database server The Ultimate Step-By-Step Guide
Von Everand
Database server The Ultimate Step-By-Step Guide
Gerardus Blokdyk
Noch keine Bewertungen
Oracle SQL Developer 2.1
Von Everand
Oracle SQL Developer 2.1
Sue Harper
Noch keine Bewertungen
Advanced SQL:1999: Understanding Object-Relational and Other Advanced Features
Von Everand
Advanced SQL:1999: Understanding Object-Relational and Other Advanced Features
Jim Melton
Bewertung: 4.5 von 5 Sternen
4.5/5 (2)
Sssis Interview Questins
Dokument7 Seiten
Sssis Interview Questins
Kali Raj
Noch keine Bewertungen
Introduction To Hadoop Administration - SpringPeople
Dokument13 Seiten
Introduction To Hadoop Administration - SpringPeople
SpringPeople
Noch keine Bewertungen
Learning Couchbase
Von Everand
Learning Couchbase
Potsangbam Henry
Noch keine Bewertungen
Parjdev
Dokument1.130 Seiten
Parjdev
Vineet Pal Singh
Noch keine Bewertungen
Learn Data Analysis with Python: Lessons in Coding
Von Everand
Learn Data Analysis with Python: Lessons in Coding
A.J. Henley
Noch keine Bewertungen
Data Lake Architecture Strategy A Complete Guide - 2021 Edition
Von Everand
Data Lake Architecture Strategy A Complete Guide - 2021 Edition
Gerardus Blokdyk
Noch keine Bewertungen
When Where and Why To Use NoSQL
Dokument13 Seiten
When Where and Why To Use NoSQL
Venkat Krishnan
Noch keine Bewertungen
Getting Started With Apache Nifi
Dokument10 Seiten
Getting Started With Apache Nifi
Mario Soares
Noch keine Bewertungen
Cloudera A Complete Guide - 2019 Edition
Von Everand
Cloudera A Complete Guide - 2019 Edition
Gerardus Blokdyk
Noch keine Bewertungen
Kubernetes A Complete Guide - 2019 Edition
Von Everand
Kubernetes A Complete Guide - 2019 Edition
Gerardus Blokdyk
Noch keine Bewertungen
Introducing Maven: A Build Tool for Today's Java Developers
Von Everand
Introducing Maven: A Build Tool for Today's Java Developers
Balaji Varanasi
Noch keine Bewertungen
V2R5 2 Tera-Cram SQLZ Final Final2 Docustar1
Dokument309 Seiten
V2R5 2 Tera-Cram SQLZ Final Final2 Docustar1
Rampraveen Lanka
Noch keine Bewertungen
Getting Started with Big Data Query using Apache Impala
Von Everand
Getting Started with Big Data Query using Apache Impala
Agus Kurniawan
Noch keine Bewertungen
A Review Paper On Big Data Database'S: Cassandra, Hbase, Hive
Dokument6 Seiten
A Review Paper On Big Data Database'S: Cassandra, Hbase, Hive
shani thakur
Noch keine Bewertungen
Apache Solr for Indexing Data
Von Everand
Apache Solr for Indexing Data
Handiekar Sachin
Noch keine Bewertungen
Hive Query Optimization Infinity
Dokument13 Seiten
Hive Query Optimization Infinity
shashwat2010
Noch keine Bewertungen
Building Data Pipelines - 4
Dokument38 Seiten
Building Data Pipelines - 4
Gusti Adli Anshari
Noch keine Bewertungen
Data Warehouse Trends Report 2017
Dokument12 Seiten
Data Warehouse Trends Report 2017
suryansh
Noch keine Bewertungen
Mongo DB Using Python
Dokument7 Seiten
Mongo DB Using Python
PRAMEELA NILLA
Noch keine Bewertungen
Top 10 ETL Design Tips
Dokument37 Seiten
Top 10 ETL Design Tips
will3323
Noch keine Bewertungen
Hadoop Developer Training - Hive Lab Book
Dokument51 Seiten
Hadoop Developer Training - Hive Lab Book
Karthick selvam
Noch keine Bewertungen
HADOOP
Dokument35 Seiten
HADOOP
Ekapop Verasakulvong
100% (1)
SQL Refresher Complete Notes PDF
Dokument352 Seiten
SQL Refresher Complete Notes PDF
Deepasri Prasad
Noch keine Bewertungen
SQL Join: Anjan Banerjee
Dokument11 Seiten
SQL Join: Anjan Banerjee
Sudipta Mukhopadhyay
Noch keine Bewertungen
Advanced SAS Interview Questions You'll Most Likely Be Asked: Job Interview Questions Series
Von Everand
Advanced SAS Interview Questions You'll Most Likely Be Asked: Job Interview Questions Series
Vibrant Publishers
Noch keine Bewertungen
HBase Administration Cookbook
Von Everand
HBase Administration Cookbook
Yifeng Jiang
Noch keine Bewertungen
Hive Using Hiveql PDF
Dokument40 Seiten
Hive Using Hiveql PDF
Subhash Subha
Noch keine Bewertungen
Docker A Complete Guide - 2020 Edition
Von Everand
Docker A Complete Guide - 2020 Edition
Gerardus Blokdyk
Noch keine Bewertungen
HBase Essentials
Von Everand
HBase Essentials
Nishant Garg
Noch keine Bewertungen
Understanding Azure Monitoring: Includes IaaS and PaaS Scenarios
Von Everand
Understanding Azure Monitoring: Includes IaaS and PaaS Scenarios
Bapi Chakraborty
Noch keine Bewertungen
Practical Oracle Cloud Infrastructure: Infrastructure as a Service, Autonomous Database, Managed Kubernetes, and Serverless
Von Everand
Practical Oracle Cloud Infrastructure: Infrastructure as a Service, Autonomous Database, Managed Kubernetes, and Serverless
Michał Tomasz Jakóbczyk
Noch keine Bewertungen
UNIX Programming: UNIX Processes, Memory Management, Process Communication, Networking, and Shell Scripting
Von Everand
UNIX Programming: UNIX Processes, Memory Management, Process Communication, Networking, and Shell Scripting
Dr. Vineeta Khemchandani
Noch keine Bewertungen
02 - Lecture Note - TensorFlow Ops
Dokument21 Seiten
02 - Lecture Note - TensorFlow Ops
Roberto Pereira
Noch keine Bewertungen
MAD Chapter - 1
Dokument32 Seiten
MAD Chapter - 1
M N.B.B.Chari
Noch keine Bewertungen
Toad For Oracle Installation Guide 2018 R2
Dokument18 Seiten
Toad For Oracle Installation Guide 2018 R2
Gabino
Noch keine Bewertungen
OceanStor 2000 V3 Series Storage Systems V300R006C20 Upgrade Guide 06
Dokument295 Seiten
OceanStor 2000 V3 Series Storage Systems V300R006C20 Upgrade Guide 06
Pelon Sinpelo
Noch keine Bewertungen
TOPOLOGY
Dokument9 Seiten
TOPOLOGY
Leonardo Noran Jr
Noch keine Bewertungen
TurbineTracker Troubleshooting
Dokument3 Seiten
TurbineTracker Troubleshooting
paul
Noch keine Bewertungen
Palo Alto Interview Questions
Dokument5 Seiten
Palo Alto Interview Questions
Ramesh S
0% (1)
M03 - Install and Manage Network Protocols
Dokument50 Seiten
M03 - Install and Manage Network Protocols
Alazar walle
100% (1)
PCU20 Series Startup
Dokument6 Seiten
PCU20 Series Startup
Sam eagle good
Noch keine Bewertungen
Mpi MCQ
Dokument83 Seiten
Mpi MCQ
AKSHAY Dagare
Noch keine Bewertungen
Test Bank For Fundamentals of Python First Programs 1st Edition
Dokument7 Seiten
Test Bank For Fundamentals of Python First Programs 1st Edition
frederickmarcusrxbsr
Noch keine Bewertungen
HP Compaq Presario CQ40 - CQ45 1 Compal - La-4101p - Rev0.3 PDF
Dokument46 Seiten
HP Compaq Presario CQ40 - CQ45 1 Compal - La-4101p - Rev0.3 PDF
Electrolaptops Pereira
Noch keine Bewertungen
Galinger v. Hinds Preservation of Evidence Letter
Dokument3 Seiten
Galinger v. Hinds Preservation of Evidence Letter
NewsChannel 9 Staff
Noch keine Bewertungen
UMTS - Creating Vista DUN Connection
Dokument9 Seiten
UMTS - Creating Vista DUN Connection
Rendra Imron
Noch keine Bewertungen
AZ 104.exmcl - Premium.exam.318q
Dokument321 Seiten
AZ 104.exmcl - Premium.exam.318q
vs storage
Noch keine Bewertungen
Palo Alto Networks - Edu-210: Document Version
Dokument33 Seiten
Palo Alto Networks - Edu-210: Document Version
clara
Noch keine Bewertungen
Basic Troubleshooting
Dokument10 Seiten
Basic Troubleshooting
Derick Morante
Noch keine Bewertungen
What Is SAN Zoning?: Hard Zoning Soft Zoning
Dokument10 Seiten
What Is SAN Zoning?: Hard Zoning Soft Zoning
Pabbathi Prabhakar
0% (1)
CS401 Final Term Solved Paperz MAGA Files All Paperz in ONe File
Dokument47 Seiten
CS401 Final Term Solved Paperz MAGA Files All Paperz in ONe File
Umair Masood
100% (3)
Morpho RD Service Installation
Dokument20 Seiten
Morpho RD Service Installation
zxy_cbe
Noch keine Bewertungen
On "IOT Smart Bulb": Mini Project Report
Dokument7 Seiten
On "IOT Smart Bulb": Mini Project Report
Ranveer Rotwal
Noch keine Bewertungen
98-366 Test Bank Lesson - 01
Dokument6 Seiten
98-366 Test Bank Lesson - 01
klarascom
100% (1)
Wifi Based Led Text Scrolling Display-1
Dokument11 Seiten
Wifi Based Led Text Scrolling Display-1
Rakesh Reddy Baddipadige
Noch keine Bewertungen
Fast. Dependable. Compact.: Black & White Printing and Copying With Full Colour Scanning
Dokument4 Seiten
Fast. Dependable. Compact.: Black & White Printing and Copying With Full Colour Scanning
Gabi
Noch keine Bewertungen
CCMS Agent Installation NW 7.1 Onwards
Dokument23 Seiten
CCMS Agent Installation NW 7.1 Onwards
Ippili Paparao
Noch keine Bewertungen
Solutions Enabler Implementation
Dokument78 Seiten
Solutions Enabler Implementation
Hakan Tuysuzoglu
Noch keine Bewertungen
Netscreen - VPN Troubleshooting.
Dokument31 Seiten
Netscreen - VPN Troubleshooting.
Amandeep Singh
Noch keine Bewertungen
DPDK Fulltext01
Dokument65 Seiten
DPDK Fulltext01
Gabriel Francischini
Noch keine Bewertungen
Cheat Sheet SSH v4
Dokument1 Seite
Cheat Sheet SSH v4
Michael
Noch keine Bewertungen
Cara Setting APN Di Iphone
Dokument13 Seiten
Cara Setting APN Di Iphone
Wifia Fitri
Noch keine Bewertungen
Yhamk1924 02 e
Dokument70 Seiten
Yhamk1924 02 e
drgn997
Noch keine Bewertungen