Which of the following options correctly lists the essential components of a typical distributed file system?
Think about components that handle file metadata, actual file storage, and client access.
A distributed file system typically includes client nodes that request files, metadata servers that manage file information, storage nodes that hold the actual data, and a network layer to connect them all.
You want to increase the storage capacity of a distributed file system without downtime. Which approach is most suitable?
Consider how to add capacity while keeping the system available.
Adding storage nodes and updating metadata allows the system to grow without downtime. Replacing disks or compressing files may cause downtime or performance issues.
In a distributed file system, choosing strong consistency over availability means:
Think about what happens when parts of the system cannot communicate.
Strong consistency requires all nodes to agree on data before responding, so during network issues, some requests may be rejected to avoid stale data, sacrificing availability.
Which replication strategy ensures high availability and fault tolerance by storing multiple copies of data across different nodes?
Consider how data copies are kept consistent and available.
Synchronous replication stores multiple copies on different nodes and updates them simultaneously, ensuring availability and fault tolerance.
A distributed file system stores 1 billion files. Each file's metadata requires 1 KB of memory. If the metadata server has 256 GB of RAM, approximately how many metadata servers are needed to hold all metadata in memory?
Calculate total metadata size and divide by server RAM.
Total metadata size = 1 billion files * 1 KB = 1 TB. Each server has 256 GB RAM, so 1 TB / 256 GB = 4 servers needed.