What is Hadoop Distributed File System?
The Hadoop Distributed File System (HDFS) is the primary data storage system used by Hadoop applications. Designed for high fault tolerance to run on commodity hardware, it implements a distributed file system that provides high-throughput data access across highly scalable Hadoop clusters.