🗄️ HBase on HDInsight¶
Master HBase on HDInsight. Learn NoSQL design, real-time reads/writes, and integration patterns.
🎯 Learning Objectives¶
- Understand HBase architecture
- Design schemas for NoSQL
- Perform CRUD operations
- Optimize for performance
- Integrate with Phoenix
📋 Prerequisites¶
- HDInsight cluster with HBase
- NoSQL concepts - Row keys, column families
- Java or Python
🏗️ HBase Architecture¶
- Region Servers - Store data
- Master Server - Coordinates regions
- ZooKeeper - Distributed coordination
- HDFS - Underlying storage
📊 Schema Design¶
# Create table
create 'users', 'profile', 'activity'
# Put data
put 'users', 'user001', 'profile:name', 'John Doe'
put 'users', 'user001', 'profile:email', 'john@example.com'
# Get data
get 'users', 'user001'
# Scan
scan 'users'
🔍 Phoenix SQL Layer¶
-- Create Phoenix table
CREATE TABLE users (
user_id VARCHAR PRIMARY KEY,
name VARCHAR,
email VARCHAR
);
-- Query with SQL
SELECT * FROM users WHERE name = 'John Doe';
📚 Resources¶
Last Updated: January 2025