HBase for Developers
Enable participants to design schemas, develop Java API code, and administer HBase clusters for scalable, low‑latency data access.
Audience: Developers & Administrators
Duration: 3 days
Format: Lectures and hands‑on labs (50% lecture, 50% lab)
Overview
This course introduces HBase, a NoSQL store built on top of Hadoop. It is intended for developers who will build applications on HBase and administrators who will manage HBase clusters.
What You Will Learn
- HBase architecture and design
- HBase internals
- Using APIs to interact with HBase
- Data modeling on HBase
Course Details
Prerequisites: Comfortable with the Java programming language • Comfortable in a Linux environment (navigating the command line, editing files with vi / nano) • A Java IDE such as Eclipse or IntelliJ
Setup: A working HBase cluster will be provided • SSH client • Browser • Zero-Install (no HBase software needed on student machines)
Detailed Outline
- Big Data ecosystem
- NoSQL overview
- CAP theorem
- When is NoSQL appropriate
- Columnar storage
- HBase and NoSQL
- Concepts and Design
- Architecture (HMaster and Region Server)
- Data integrity
- HBase ecosystem
- Lab: Exploring HBase
- Namespaces, Tables and Regions
- Rows, columns, column families, versions
- HBase Shell and Admin commands
- Lab: HBase Shell
- Introduction to Java API
- Read / Write path
- Time Series data
- Scans
- MapReduce
- Filters
- Counters
- Coprocessors
- Labs: Using the HBase Java API for time series, MapReduce, filters, and counters
- Real-world use-case presentations
- Group design solutions
- Discussion / critique
- Lab: implement a scenario in HBase
- MemStore / HFile / WAL
- HDFS storage
- Compactions
- Splits
- Bloom Filters
- Caches
- Diagnostics
- Hardware selection
- Install methods
- Common configurations
- Lab: installing HBase
- Developing applications using HBase
- Interacting with Hadoop stack (MapReduce, Pig, Hive)
- Frameworks around HBase
- Advanced concepts (coprocessors)
- Labs: writing HBase applications
- Monitoring tools and practices
- Optimizing HBase
- HBase in the cloud
- Real-world HBase use cases
- Lab: checking HBase vitals
Ready to Get Started?
Contact us to learn more about this course and schedule your training.