Abstract
This talks describes the architecture of the Apache Hadoop Distributed File System (HDFS). It analyzes the evolution of HDFS by discussing why certain design decisions are made, what features are deemed more important than others and the type of applications that use HDFS. It contends that HDFS has been a creative but disruptive force in the world of general purpose file-systems.