0
0
HLDsystem_design~5 mins

Distributed file systems in HLD - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is a distributed file system?
A distributed file system is a way to store and access files across multiple computers as if they were on a single machine. It helps share data easily and reliably over a network.
Click to reveal answer
beginner
Why do distributed file systems use replication?
Replication means keeping copies of files on different machines. It helps protect data if one machine fails and improves speed by letting users access the closest copy.
Click to reveal answer
intermediate
What is the role of a metadata server in a distributed file system?
The metadata server keeps track of where files are stored, their names, and permissions. It helps find files quickly without searching all machines.
Click to reveal answer
advanced
How does a distributed file system handle concurrent writes to the same file?
It uses locking or version control to make sure changes don’t conflict. This keeps data consistent and prevents errors when many users write at once.
Click to reveal answer
intermediate
Name two popular distributed file systems and a key feature of each.
HDFS (Hadoop Distributed File System) is designed for big data and stores large files across many machines. Ceph is highly scalable and supports block, object, and file storage.
Click to reveal answer
What is the main purpose of a distributed file system?
ATo compress files to save space
BTo backup files on a single machine
CTo encrypt files for security
DTo store files across multiple machines for easy access
Which component tracks file locations in a distributed file system?
AData node
BMetadata server
CClient
DLoad balancer
Why is replication important in distributed file systems?
ATo improve fault tolerance and speed
BTo reduce file size
CTo encrypt data
DTo limit user access
How do distributed file systems maintain consistency during concurrent writes?
ABy locking files or using version control
BBy ignoring conflicts
CBy deleting old files
DBy copying files to clients
Which distributed file system is designed for big data storage?
ACeph
BNFS
CHDFS
DFTP
Explain how a distributed file system ensures data availability and fault tolerance.
Think about how copies help when one machine fails.
You got /4 concepts.
    Describe the role of metadata servers and data nodes in a distributed file system.
    Consider who knows where files are and who holds the files.
    You got /4 concepts.