This document provides an overview of the Hadoop framework. It describes the key components of Hadoop including the NameNode, DataNodes, JobTracker, TaskTracker, and SecondaryNameNode. The NameNode manages file metadata and location information stored across DataNodes. The JobTracker schedules and tracks jobs on TaskTrackers running on slave nodes. The SecondaryNameNode helps recover metadata if the NameNode fails. Hadoop uses a master/slave architecture with the NameNode and JobTracker on the master and DataNodes and TaskTrackers on slave nodes.