Hadoop · data · ~30 mins

HBase data model (column families) in Hadoop - Mini Project: Build & Apply

Understanding HBase Data Model with Column Families
📖 Scenario: You are working with a large dataset of customer information stored in HBase. To organize the data efficiently, you need to create a table with column families that group related data together.
🎯 Goal: Create an HBase table schema with two column families named personal and contact. Then, add sample data to these column families and finally retrieve the data to see how column families organize the information.
📋 What You'll Learn
Create an HBase table named customers with column families personal and contact
Insert sample data into the personal and contact column families for a customer with row key cust1
Retrieve and display the data stored in the customers table for cust1
💡 Why This Matters
🌍 Real World
HBase is used in big data applications to store large, sparse datasets efficiently. Each column family is stored separately on disk, so grouping related columns into one family lets a read touch only the data it needs and lets storage settings be tuned per family.
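To make the grouping concrete, the logical layout of an HBase table can be pictured as nested maps: row key, then column family, then column qualifier, then value. The sketch below is plain Python rather than any HBase API, it leaves out cell timestamps, and the second row `cust2` is a hypothetical addition used only to show sparsity.

```python
# A plain-Python picture of HBase's logical layout (not an HBase API):
# row key -> column family -> column qualifier -> value.
customers = {
    "cust1": {
        "personal": {"name": "John Doe", "age": "30"},
        "contact": {"email": "john@example.com", "phone": "1234567890"},
    },
    # Hypothetical second row: it has no contact cells, and nothing is
    # stored for them. This is what "sparse" means here.
    "cust2": {
        "personal": {"name": "Jane Roe"},
    },
}


def read_family(table, row_key, family):
    """Return only the cells of one column family for a row."""
    return table.get(row_key, {}).get(family, {})


print(read_family(customers, "cust1", "contact"))
print(read_family(customers, "cust2", "contact"))
```

Reading one family never touches the other, which mirrors why related columns are grouped together.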
💼 Career
Understanding the HBase data model and column families is essential for roles in big data engineering, data architecture, and backend development within the Hadoop ecosystem.
Step 1: Create HBase table with column families
Write the HBase shell command to create a table named customers with two column families: personal and contact.
Hint: Use the create command in the HBase shell, followed by the table name and then the column family names, each in single quotes and separated by commas.
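As a sketch of what the hint describes, the small Python helper below assembles the text of the `create` command. The helper name `create_command` is hypothetical; only the printed line is meant to be typed into the HBase shell.

```python
def create_command(table, families):
    """Build an HBase shell `create` command: table name first, then families."""
    # The shell expects every name as a quoted string, separated by commas.
    quoted = [f"'{name}'" for name in [table, *families]]
    return "create " + ", ".join(quoted)


print(create_command("customers", ["personal", "contact"]))
# prints: create 'customers', 'personal', 'contact'
```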

Step 2: Insert sample data into column families
Write HBase shell commands to insert the following data for row key cust1: in the personal column family, add name as John Doe and age as 30; in the contact column family, add email as john@example.com and phone as 1234567890.
Hint: Use the put command once per value. Each put names the table, the row key, the column written as family:qualifier, and the value.
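The four writes for `cust1` can be sketched by generating one `put` line per cell. The helper `put_commands` and the `sample` dictionary are hypothetical names for illustration; the printed lines are the commands for the HBase shell. Values are passed as quoted strings here, so the age is written as `'30'`.

```python
def put_commands(table, row_key, data):
    """Yield one HBase shell `put` command per cell in `data`.

    `data` maps column family -> {qualifier: value}.
    """
    for family, cells in data.items():
        for qualifier, value in cells.items():
            # A column is addressed as family:qualifier; each put writes one cell.
            yield f"put '{table}', '{row_key}', '{family}:{qualifier}', '{value}'"


sample = {
    "personal": {"name": "John Doe", "age": "30"},
    "contact": {"email": "john@example.com", "phone": "1234567890"},
}

for line in put_commands("customers", "cust1", sample):
    print(line)
```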

Step 3: Retrieve data for the customer
Write the HBase shell command to get all data for the row key cust1 from the customers table.
Hint: Use the get command with the table name and the row key; with no further arguments it returns every cell in that row.
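The shell command for this step is `get 'customers', 'cust1'`. The sketch below imitates what it returns using a plain Python dictionary in place of a real table; `get_row` is a hypothetical helper, not an HBase API, and it ignores timestamps. Cells come back ordered by column name, which is why the sketch sorts them.

```python
# Stand-in for the table after the inserts of the previous step.
customers = {
    "cust1": {
        "personal": {"name": "John Doe", "age": "30"},
        "contact": {"email": "john@example.com", "phone": "1234567890"},
    },
}


def get_row(table, row_key):
    """Flatten one row into (family:qualifier, value) pairs, sorted by column."""
    row = table.get(row_key, {})
    cells = [
        (f"{family}:{qualifier}", value)
        for family, columns in row.items()
        for qualifier, value in columns.items()
    ]
    return sorted(cells)


for column, value in get_row(customers, "cust1"):
    print(column, value)
```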

Step 4: Display the retrieved data
Run the HBase shell command to display the data for cust1 from the customers table and observe the output showing data grouped by column families.
Hint: Each cell in the output is labeled family:qualifier, so the contact cells for cust1 appear together, followed by the personal cells.
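To see the grouping explicitly, the flat list of cells can be split on the family prefix. This is a minimal sketch in plain Python: the `cells` list is typed in by hand from the sample data rather than read from HBase, and `group_by_family` is a hypothetical helper.

```python
from collections import defaultdict

# Cells for cust1, written as (family:qualifier, value) pairs.
cells = [
    ("contact:email", "john@example.com"),
    ("contact:phone", "1234567890"),
    ("personal:age", "30"),
    ("personal:name", "John Doe"),
]


def group_by_family(cells):
    """Group (family:qualifier, value) pairs into {family: {qualifier: value}}."""
    grouped = defaultdict(dict)
    for column, value in cells:
        # Split only on the first colon: the family name comes before it.
        family, qualifier = column.split(":", 1)
        grouped[family][qualifier] = value
    return dict(grouped)


for family, columns in group_by_family(cells).items():
    print(family)
    for qualifier, value in columns.items():
        print(f"  {qualifier} = {value}")
```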