8. What Distributed FileWhat Distributed File
System ProvidesSystem Provides
• Provide accesses to data stored at servers using file
system interfaces
• What are the file system interfaces?
o Open a file, check status on a file, close a file
o Read data from a file
o Write data to a file
o Lock a file or part of a file
o List files in a directory, delete a directory
o Delete a file, rename a file, add a symbolic link to a file
etc.
10. NFS architectureNFS architecture
Client computer Server computer
UNIX
file
system
NFS
client
NFS
server
UNIX
file
system
Application
program
Application
program
Virtual file systemVirtual file system
PCDOS
UNIX kernel
system calls
RPC for
(remote operations)
UNIX
Operations
on local files
Operations
on
remote files
UNIX kernel
Net
work
11. Google File SystemGoogle File System
Metadata:
namespace, access
control, mapping of
files to chunks, and
current location of
chunks
1
2
3
4
12. HDFS DesignHDFS Design
•Files stored as blocks
o Default 64MB
•Reliability through replication
o replicated across 3+ DataNodes
•Single NameNode coordinates access, metadata
o Centralized management
•No data caching
o Little benefit due to large data sets, streaming reads
14. HDFS ArchitectureHDFS Architecture
HDFS-Aware Application
POSIX API HDFS API
Regular VFS with local and
NFS-supported files
Specific drivers
Separate HDFS view
Network stack
HDFS
NameNode
HDFS
NameNode
HDFS DataNodeHDFS DataNode
HDFS DataNodeHDFS DataNode