Challenge - 5 Problems
Bigtable Schema Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
🧠 Conceptual
intermediate2:00remaining
Understanding Row Key Design in Bigtable
Why is it important to design row keys in Bigtable to avoid sequential patterns?
Attempts:
2 left
💡 Hint
Think about how Bigtable distributes data across nodes.
✗ Incorrect
Sequential row keys cause all writes to go to the same node, creating a hotspot and reducing performance.
❓ query_result
intermediate2:00remaining
Predicting Data Retrieval Order
Given a Bigtable with row keys formatted as 'userID#timestamp', what will be the order of rows returned when scanning the table?
Attempts:
2 left
💡 Hint
Remember how Bigtable sorts rows lexicographically by row key.
✗ Incorrect
Bigtable sorts rows lexicographically by row key, so 'userID#timestamp' sorts first by userID, then timestamp.
📝 Syntax
advanced2:00remaining
Identifying Invalid Row Key Design
Which of the following row key designs is invalid because it can cause performance issues in Bigtable?
Attempts:
2 left
💡 Hint
Consider how row key prefixes affect data distribution.
✗ Incorrect
Using a timestamp prefix causes sequential writes to the same node, creating hotspots and degrading performance.
❓ optimization
advanced2:00remaining
Optimizing Column Family Usage
Which practice optimizes Bigtable schema design regarding column families?
Attempts:
2 left
💡 Hint
Think about how column families affect read and write efficiency.
✗ Incorrect
Grouping columns with similar access patterns reduces unnecessary data reads and improves performance.
🔧 Debug
expert3:00remaining
Diagnosing Hotspotting in Bigtable
A Bigtable instance is experiencing slow write performance. The row keys are user IDs followed by timestamps in ascending order (e.g., 'user123#20240601'). What is the most likely cause?
Attempts:
2 left
💡 Hint
Consider how row key order affects write distribution.
✗ Incorrect
Ascending timestamps at the end of the row key do not distribute writes evenly if user IDs are not well distributed or if many writes target the same user.