HBase Administration Cookbook
Ebook, 778 pages

About this ebook

As part of Packt’s cookbook series, each recipe offers a practical, step-by-step solution to common problems found in HBase administration. This book is for HBase administrators, developers, and will even help Hadoop administrators. You are not required to have HBase experience, but are expected to have a basic understanding of Hadoop and MapReduce.
Language: English
Release date: Aug 16, 2012
ISBN: 9781849517157



    HBase Administration Cookbook - Yifeng Jiang

    Table of Contents

    HBase Administration Cookbook

    Credits

    About the Author

    Acknowledgement

    About the Reviewers

    www.PacktPub.com

    Support files, eBooks, discount offers and more

    Why Subscribe?

    Free Access for Packt account holders

    Preface

    What this book covers

    What you need for this book

    Who this book is for

    Conventions

    Reader feedback

    Customer support

    Downloading the example code

    Errata

    Piracy

    Questions

    1. Setting Up HBase Cluster

    Introduction

    Quick start

    Getting ready

    How to do it...

    How it works...

    Getting ready on Amazon EC2

    Getting ready

    How to do it...

    How it works...

    Setting up Hadoop

    Getting ready

    How to do it...

    How it works...

    Setting up ZooKeeper

    Getting ready

    How to do it...

    How it works...

    There's more...

    Changing the kernel settings

    Getting ready

    How to do it...

    How it works...

    See also

    Setting up HBase

    Getting ready

    How to do it...

    How it works...

    Basic Hadoop/ZooKeeper/HBase configurations

    How to do it...

    How it works...

    See also

    Setting up multiple High Availability (HA) masters

    Getting ready

    How to do it...

    Install and configure Heartbeat and Pacemaker

    Create and install a NameNode resource agent

    Configure highly available NameNode

    Start DataNode, HBase cluster, and backup HBase master

    How it works...

    There's more...

    2. Data Migration

    Introduction

    Importing data from MySQL via single client

    Getting ready

    How to do it...

    How it works...

    Importing data from TSV files using the bulk load tool

    Getting ready

    How to do it...

    How it works...

    There's more...

    Writing your own MapReduce job to import data

    Getting ready

    How to do it...

    How it works...

    There's more...

    Generating HFile files in MapReduce

    Important configurations affecting data migration

    See also

    Precreating regions before moving data into HBase

    Getting ready

    How to do it...

    How it works...

    See also

    3. Using Administration Tools

    Introduction

    HBase Master web UI

    Getting ready

    How to do it...

    How it works...

    Using HBase Shell to manage tables

    Getting ready

    How to do it...

    How it works...

    There's more...

    Using HBase Shell to access data in HBase

    Getting ready

    How to do it...

    How it works...

    See also

    Using HBase Shell to manage the cluster

    Getting ready

    How to do it...

    How it works...

    See also

    Executing Java methods from HBase Shell

    Getting ready

    How to do it...

    How it works...

    There's more...

    Row counter

    Getting ready

    How to do it...

    How it works...

    There's more...

    WAL tool—manually splitting and dumping WALs

    Getting ready

    How to do it...

    How it works...

    See also

    HFile tool—viewing textualized HFile content

    Getting ready

    How to do it...

    How it works...

    There's more...

    HBase hbck—checking the consistency of an HBase cluster

    Getting ready

    How to do it...

    How it works...

    See also

    Hive on HBase—querying HBase using a SQL-like language

    Getting ready

    How to do it...

    How it works...

    4. Backing Up and Restoring HBase Data

    Introduction

    Full shutdown backup using distcp

    Getting ready

    How to do it...

    How it works...

    Using CopyTable to copy data from one table to another

    Getting ready

    How to do it...

    How it works...

    Exporting an HBase table to dump files on HDFS

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Restoring HBase data by importing dump files from HDFS

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Backing up NameNode metadata

    Getting ready

    How to do it...

    How it works...

    There's more...

    Backing up region starting keys

    Getting ready

    How to do it...

    How it works...

    See also

    Cluster replication

    Getting ready

    How to do it...

    How it works...

    There's more...

    5. Monitoring and Diagnosis

    Introduction

    Showing the disk utilization of HBase tables

    Getting ready

    How to do it...

    How it works...

    There's more...

    Setting up Ganglia to monitor an HBase cluster

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    OpenTSDB—using HBase to monitor an HBase cluster

    Getting ready

    How to do it...

    How it works...

    There's more...

    Setting up Nagios to monitor HBase processes

    Getting ready

    How to do it...

    How it works...

    There's more...

    Using Nagios to check Hadoop/HBase logs

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Simple scripts to report the status of the cluster

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Hot region—write diagnosis

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    6. Maintenance and Security

    Introduction

    Enabling HBase RPC DEBUG-level logging

    Getting ready

    How to do it...

    How it works...

    There's more...

    Graceful node decommissioning

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Adding nodes to the cluster

    Getting ready

    How to do it...

    How it works...

    There's more...

    Rolling restart

    Getting ready

    How to do it...

    How it works...

    There's more...

    Simple script for managing HBase processes

    Getting ready

    How to do it...

    How it works...

    Simple script for making deployment easier

    Getting ready

    How to do it...

    How it works...

    There's more...

    Kerberos authentication for Hadoop and HBase

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Configuring HDFS security with Kerberos

    Getting ready

    How to do it...

    How it works...

    There's more...

    HBase security configuration

    Getting ready

    How to do it...

    How it works...

    There's more...

    7. Troubleshooting

    Introduction

    Troubleshooting tools

    Getting ready

    How to do it...

    How it works...

    See also

    Handling the XceiverCount error

    Getting ready

    How to do it...

    How it works...

    Handling the too many open files error

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Handling the unable to create new native thread error

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Handling the HBase ignores HDFS client configuration issue

    Getting ready

    How to do it...

    How it works...

    Handling the ZooKeeper client connection error

    Getting ready

    How to do it...

    How it works...

    There's more...

    Handling the ZooKeeper session expired error

    Getting ready

    How to do it...

    How it works...

    See also

    Handling the HBase startup error on EC2

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    8. Basic Performance Tuning

    Introduction

    Setting up Hadoop to spread disk I/O

    Getting ready

    How to do it...

    How it works...

    There's more...

    Using network topology script to make Hadoop rack-aware

    Getting ready

    How to do it...

    How it works...

    Mounting disks with noatime and nodiratime

    Getting ready

    How to do it...

    How it works...

    There's more...

    Setting vm.swappiness to 0 to avoid swap

    Getting ready

    How to do it...

    How it works...

    See also

    Java GC and HBase heap settings

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Using compression

    Getting ready

    How to do it...

    How it works...

    There's more...

    Managing compactions

    Getting ready

    How to do it...

    How it works...

    There's more...

    Managing a region split

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    9. Advanced Configurations and Tuning

    Introduction

    Benchmarking HBase cluster with YCSB

    Getting ready

    How to do it...

    How it works...

    There's more...

    Increasing region server handler count

    Getting ready

    How to do it...

    How it works...

    See also

    Precreating regions using your own algorithm

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Avoiding update blocking on write-heavy clusters

    Getting ready

    How to do it...

    How it works...

    See also

    Tuning memory size for MemStores

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Client-side tuning for low latency systems

    Getting ready

    How to do it...

    How it works...

    There's more...

    Configuring block cache for column families

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Increasing block cache size on read-heavy clusters

    Getting ready

    How to do it...

    How it works...

    See also

    Client side scanner setting

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Tuning block size to improve seek performance

    Getting ready

    How to do it...

    How it works...

    There's more...

    See also

    Enabling Bloom Filter to improve the overall throughput

    Getting ready

    How to do it...

    How it works...

    There's more...

    Index

    HBase Administration Cookbook

    Copyright © 2012 Packt Publishing

    All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

    Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.

    Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

    First published: August 2012

    Production Reference: 1080812

    Published by Packt Publishing Ltd.

    Livery Place

    35 Livery Street

    Birmingham B3 2PB, UK.

    ISBN 978-1-84951-714-0

    www.packtpub.com

    Cover Image by Asher Wishkerman (<a.wishkerman@mpic.de>)

    Credits

    Author

    Yifeng Jiang

    Reviewers

    Masatake Iwasaki

    Tatsuya Kawano

    Michael Morello

    Shinichi Yamashita

    Acquisition Editor

    Sarah Cullington

    Lead Technical Editor

    Pramila Balan

    Technical Editors

    Merin Jose

    Kavita Raghavan

    Manmeet Singh Vasir

    Copy Editors

    Brandt D'Mello

    Insiya Morbiwala

    Project Coordinator

    Yashodhan Dere

    Proofreader

    Aaron Nash

    Indexer

    Hemangini Bari

    Graphics

    Manu Joseph

    Valentina D'silva

    Production Coordinator

    Arvindkumar Gupta

    Cover Work

    Arvindkumar Gupta

    About the Author

    Yifeng Jiang is a Hadoop and HBase Administrator and Developer at Rakuten—the largest e-commerce company in Japan. After graduating from the University of Science and Technology of China with a B.S. in Information Management Systems, he started his career as a professional software engineer, focusing on Java development.

    In 2008, he began following the Hadoop project. In 2009, he led the development of his previous company's display advertisement data infrastructure using Hadoop and Hive.

    In 2010, he joined his current employer, where he designed and implemented the Hadoop- and HBase-based large-scale item ranking system. He is also a member of the company's Hadoop team, which operates several Hadoop/HBase clusters.

    Acknowledgement

    Little did I know, when I was first asked by Packt Publishing in September 2011 whether I would be interested in writing a book about HBase administration, how much work and stress (but also a lot of fun) it was going to be.

    Now that the book is finally complete, I would like to thank those people without whom it would have been impossible to get done.

    First, I would like to thank the HBase developers for giving us such a great piece of software. Thanks to all of the people on the mailing list providing good answers to my many questions, and all the people working on tickets and documents.

    I would also like to thank the team at Packt Publishing for contacting me to get started with the writing of this book, and providing support, guidance, and feedback.

    Many thanks to Rakuten, my employer, who provided me with the environment to work on HBase and the chance to write this book.

    Thank you to Michael Stack for helping me with a quick review of the book.

    Thank you to the book's reviewers—Michael Morello, Tatsuya Kawano, Kenichiro Hamano, Shinichi Yamashita, and Masatake Iwasaki.

    To Yotaro Kagawa: Thank you for supporting me and my family from the very start and ever since.

    To Xinping and Lingyin: Thank you for your support and all your patience—I love you!

    About the Reviewers

    Masatake Iwasaki is a Software Engineer at NTT DATA Corporation, providing technical consultation for open source software such as Hadoop, HBase, and PostgreSQL.

    Tatsuya Kawano is an HBase contributor and evangelist in Japan. He has been helping the Japanese Hadoop and HBase community to grow since 2010.

    He is currently working for Gemini Mobile Technologies as a research and development software engineer. He is also developing Cloudian, a fully S3-API-compliant cloud storage platform, and Hibari DB, an open source, distributed key-value store.

    In 2012, he co-authored a Japanese book, Basic Knowledge of NOSQL, which introduces 16 NoSQL products, such as HBase, Cassandra, Riak, MongoDB, and Neo4j, to novice readers.

    He studied graphic design in New York in the late 1990s. He loves playing with 3D computer graphics as much as he loves developing highly available, scalable storage systems.

    Michael Morello holds a Masters degree in Distributed Computing and Artificial Intelligence. He is a Senior Java/JEE Developer with a strong Unix and Linux background. His areas of research are mostly related to large-scale systems and emerging technologies dedicated to solving scalability, performance, and high availability issues.

    I would like to thank my wife and my little angel for their love and support.

    Shinichi Yamashita is a Chief Engineer at the OSS Professional Service unit of NTT DATA Corporation, in Japan. He has more than seven years of experience in software and middleware (Apache, Tomcat, PostgreSQL, and the Hadoop ecosystem) engineering.

    Shinichi has written a few books on Hadoop in Japan.

    I would like to thank my colleagues.

    www.PacktPub.com

    Support files, eBooks, discount offers and more

    You might want to visit www.PacktPub.com for support files and downloads related to your book.

    Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and, as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us for more details.

    At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.

    http://PacktLib.PacktPub.com

    Do you need instant solutions to your IT questions? PacktLib is Packt's online digital book library. Here, you can access, read and search across Packt's entire library of books.

    Why Subscribe?

    Fully searchable across every book published by Packt

    Copy and paste, print and bookmark content

    On demand and accessible via web browser

    Free Access for Packt account holders

    If you have an account with Packt at www.PacktPub.com, you can use this to access PacktLib today and view nine entirely free books. Simply use your login credentials for immediate access.

    Preface

    As an open source, distributed, big data store, HBase scales to billions of rows and millions of columns, and sits on top of clusters of commodity machines. If you are looking for a way to store and access a huge amount of data in real time, look no further than HBase.

    HBase Administration Cookbook provides practical examples and simple step-by-step instructions for administering HBase with ease. The recipes cover a wide range of processes for managing a fully distributed, highly available HBase cluster in the cloud. Working with such a huge amount of data means that an organized and manageable process is key, and this book will help you achieve that.

    The recipes in this practical cookbook start with setting up a fully distributed HBase cluster and moving data into it. You will learn how to use all the tools for day-to-day administration tasks, as well as for efficiently managing and monitoring the cluster to achieve the best performance possible. Understanding the relationship between Hadoop and HBase will allow you to get the best out of HBase; so this book will show you how to set up Hadoop clusters, configure Hadoop to cooperate with HBase, and tune its performance.

    What this book covers

    Chapter 1, Setting Up HBase Cluster: This chapter explains how to set up an HBase cluster, from a basic standalone HBase instance to a fully distributed, highly available HBase cluster on Amazon EC2.

    Chapter 2, Data Migration: In this chapter, we will start with the simple task of importing data from MySQL to HBase, using its Put API. We will then describe how to use the importtsv and bulk load tools to load TSV data files into HBase. We will also use a MapReduce sample to import data from other file formats. This includes putting data directly into an HBase table and writing to HFile format files on Hadoop Distributed File System (HDFS). The last recipe in this chapter explains how to precreate regions before loading data into HBase.

    This chapter ships with several sample sources written in Java. It assumes that you have basic Java knowledge, so it does not explain how to compile and package the sample Java source in the recipes.
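
    To give a flavor of the bulk load recipes summarized above, the importtsv and completebulkload invocations look roughly like the following sketch. The table name (hly_temp), the column mapping (t:v01), and the HDFS paths are illustrative only, and the HBase jars must already be on the Hadoop classpath.

```shell
# One-step import: importtsv writes each TSV row to HBase via Put calls.
# HBASE_ROW_KEY maps the first TSV column to the row key.
$ hadoop jar $HBASE_HOME/hbase-0.92.1.jar importtsv \
    -Dimporttsv.columns=HBASE_ROW_KEY,t:v01 \
    hly_temp /user/hac/input

# Two-step bulk load: first generate HFiles on HDFS, then hand the
# finished files over to the running cluster.
$ hadoop jar $HBASE_HOME/hbase-0.92.1.jar importtsv \
    -Dimporttsv.bulk.output=/user/hac/output \
    -Dimporttsv.columns=HBASE_ROW_KEY,t:v01 \
    hly_temp /user/hac/input
$ hadoop jar $HBASE_HOME/hbase-0.92.1.jar completebulkload \
    /user/hac/output hly_temp
```

    The two-step variant bypasses the write path entirely, which is why it is the preferred approach for large initial loads; the recipes in this chapter walk through both in detail.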

    Chapter 3, Using Administration Tools: In this chapter, we describe the usage of various administration tools such as HBase web UI, HBase Shell, HBase hbck, and others. We explain what the tools are for, and how to use them to resolve a particular task.

    Chapter 4, Backing Up and Restoring HBase Data: In this chapter, we will describe how to back up HBase data using various approaches, their pros and cons, and which approach to choose depending on your dataset size, resources, and requirements.

    Chapter 5, Monitoring and Diagnosis: In this chapter, we will describe how to monitor and diagnose an HBase cluster with Ganglia, OpenTSDB, Nagios, and other tools. We will start with a simple task: showing the disk utilization of HBase tables. We will install and configure Ganglia to monitor HBase metrics and show an example usage of Ganglia graphs. We will also set up OpenTSDB, which is similar to Ganglia but more scalable, as it is built on top of HBase.

    We will set up Nagios to check everything we want to check, including HBase-related daemon health, Hadoop/HBase logs, HBase inconsistencies, HDFS health, and space utilization.

    In the last recipe, we will describe an approach to diagnosing and fixing the frequently encountered hot spot region issue.

    Chapter 6, Maintenance and Security: In the first six recipes of this chapter we will learn about the various HBase maintenance tasks, such as finding and correcting faults, changing cluster size, making configuration changes, and so on.

    We will also look at security in this chapter. In the last three recipes, we will install Kerberos and then set up HDFS security with Kerberos, and finally set up secure HBase client access.

    Chapter 7, Troubleshooting: In this chapter, we will look through several of the most commonly encountered issues. We will describe the error messages for these issues, why they happen, and how to fix them with the troubleshooting tools.

    Chapter 8, Basic Performance Tuning: In this chapter, we will describe how to tune HBase to gain better performance. We will also include recipes for other tuning points, such as Hadoop configurations, JVM garbage collection settings, and OS kernel parameters.
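
    To give a flavor of the kernel and JVM tuning points this chapter covers, the following sketch shows the kind of settings involved. The values shown are placeholders for illustration, not recommendations; appropriate numbers depend on your hardware and workload.

```shell
# OS: discourage swapping; a swapped-out region server can miss its
# ZooKeeper heartbeat and be evicted from the cluster
$ sudo sysctl -w vm.swappiness=0

# JVM: heap size and CMS garbage collector options, typically set
# through HBASE_OPTS in hbase-env.sh (values here are illustrative)
export HBASE_OPTS="-Xmx8g -XX:+UseConcMarkSweepGC \
  -XX:CMSInitiatingOccupancyFraction=70"
```

    These commands are configuration fragments meant to be applied on each region server node; the recipes in the chapter explain when and why to change each knob.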

    Chapter 9, Advanced Configurations and Tuning: This is the book's second chapter on performance tuning. The previous chapter describes recipes to tune Hadoop, OS settings, Java, and HBase itself to improve the overall performance of the HBase cluster; those are general improvements for many use cases. In this chapter, we will describe more specific recipes, some for write-heavy clusters and some aimed at improving the cluster's read performance.

    What you need for this book

    Everything you need is listed in each recipe.

    The basic list of software required for this book is as follows:

    Debian 6.0.1 (squeeze)

    Oracle JDK (Java Development Kit) SE 6

    HBase 0.92.1

    Hadoop 1.0.2

    ZooKeeper 3.4.3

    Who this book is for

    This book is for HBase administrators, developers, and will even help Hadoop administrators. You are not required to have HBase experience, but are expected to have a basic understanding of Hadoop and MapReduce.

    Conventions

    In this book, you will find a number of styles of text that distinguish between different kinds of information. Here are some examples of these styles, and an explanation of their meaning.

    Code words in text are shown as follows: HBase can be stopped using its stop-hbase.sh script.

    A block of code is set as follows:

    nameserver 10.160.49.250 #private IP of ns

    search hbase-admin-cookbook.com #domain name

    When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

    MAJOR_COMPACTION_KEY = \x00

    MAX_SEQ_ID_KEY = 96573

    TIMERANGE = 1323026325955....1323026325955

    hfile.AVG_KEY_LEN = 31

    hfile.AVG_VALUE_LEN = 4

     

    hfile.COMPARATOR = org.apache.hadoop.hbase.KeyValue$KeyComparator

    Any command-line input or output is written as follows:

    $ bin/ycsb load hbase -P workloads/workloada -p columnfamily=f1 -p recordcount=1000000 -p threadcount=4 -s | tee -a workloada.dat

    YCSB Client 0.1

    Command line: -db com.yahoo.ycsb.db.HBaseClient -P workloads/workloada -p columnfamily=f1 -p recordcount=1000000 -p threadcount=4 -s -load

    Loading workload...

    New terms and important words are shown in bold. Words that you see on the screen, in menus or dialog boxes for example, appear in the text like this: Verify the startup from AWS Management Console.

    Note

    Warnings or important notes appear in a box like this.

    Tip

    Tips and tricks appear like this.

    Reader feedback

    Feedback from our readers is always welcome. Let us know what you think about this book—what you liked or may have disliked. Reader feedback is important for us to develop titles that you really get the most out of.

    To send us general feedback, simply send an e-mail to <feedback@packtpub.com>, and mention the book title through the subject of your message.

    If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, see our author guide on www.packtpub.com/authors.

    Customer support

    Now that you are the proud owner of a Packt book, we have a number of things to help you to get the most from your purchase.

    Downloading the example code

    You can download the example code files for all Packt books you have purchased from your account at http://www.packtpub.com. If you purchased this book elsewhere, you can visit http://www.packtpub.com/support and register to have the files e-mailed directly to you.

    Errata

    Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you find a mistake in one of our books—maybe a mistake in the text or the code—we would be grateful if you would report this to us. By doing so, you can save other readers from frustration and help us improve subsequent versions of this book. If you find any errata, please report them by visiting http://www.packtpub.com/support, selecting your book, clicking on the errata submission form link, and entering the details of your errata. Once your errata are verified, your submission will be accepted and the errata will be uploaded to our website, or added to any list of existing errata, under the Errata section of that title.

    Piracy

    Piracy of copyright material on the Internet is an ongoing problem across all media. At Packt, we take the protection of our copyright and licenses very seriously. If you come across any illegal copies of our works, in any form, on the Internet, please provide us with the location address or website name immediately so that we can pursue a remedy.
