You want to store sensor readings collected every second in Bigtable. Which row key design will best prevent hotspotting and allow efficient range scans for recent data?
Think about how Bigtable stores rows lexicographically and how to avoid all writes going to the same region.
Using the sensor ID followed by a reversed timestamp distributes writes across Bigtable nodes and allows efficient scans for recent data by scanning rows with decreasing timestamps.
You query Bigtable for sensor data over the last hour using a row key range scan. What behavior should you expect regarding read latency?
Consider how Bigtable stores data and how scanning more rows affects performance.
Bigtable stores data sorted by row key. Scanning a larger time range means reading more rows, so latency increases roughly linearly with the number of rows scanned.
You have multiple IoT devices sending time-series data to Bigtable. Which approach best secures write access while minimizing operational overhead?
Think about security best practices and operational simplicity when many devices are involved.
Using a proxy service centralizes authentication and authorization, reducing the risk of key leakage and simplifying permission management. Devices authenticate to the proxy, which writes to Bigtable with a controlled service account.
Which schema design choice helps Bigtable compress time-series data most effectively?
Consider how Bigtable compresses data within column families and the effect of sorted column qualifiers.
Bigtable compresses data within column families and benefits from sorted column qualifiers. Using timestamps as sorted column qualifiers in a single column family allows better compression and efficient scans.
You notice Bigtable write throughput drops significantly during peak sensor data ingestion. Which cause best explains this behavior?
Think about how Bigtable partitions data and how row key design affects write distribution.
Monotonically increasing row keys cause all writes to go to the same tablet server, creating a hotspot that limits write throughput. Proper row key design avoids this by distributing writes.