The document proposes and analyzes the concept of "page farms", which are sets of pages that contribute to the PageRank score of a target page. It finds that computing page farms is NP-hard, so it develops a greedy algorithm to extract approximate page farms. It then analyzes the page farms of over 3 million randomly sampled web pages, finding that the distributions of page farm sizes follow a power law distribution and reflect the importance of pages.