You need to design a search system that can handle millions of products with fast response times. Which architecture component is most critical to ensure scalability and low latency?
Think about how to handle large data and many users at the same time.
Distributed indexing with sharding and replication allows the system to split data across servers and handle many queries in parallel, ensuring scalability and low latency.
Your recommendation system receives a sudden spike in user requests. Which approach best helps maintain performance under high load?
Think about reducing repeated work and fast access.
Caching popular recommendations reduces computation and database hits, allowing the system to serve many users quickly during spikes.
Which tradeoff is true when choosing real-time recommendation updates over batch updates?
Consider freshness versus resource cost.
Real-time updates keep recommendations fresh but need continuous computation, increasing resource use compared to batch processing.
What is the cold start problem in recommendation systems?
Think about what happens when the system has no data about a user.
The cold start problem occurs when the system cannot generate personalized recommendations because it lacks user interaction data.
You have 100 million documents averaging 1 KB each. The search index typically requires 30% of the original data size. How much storage is needed for the index?
Calculate 30% of total data size.
100 million documents * 1 KB = 100 million KB = 100 GB. 30% of 100 GB is 30 GB.
