Hadoop 101 v2
- 9. What if you need compute power for complex algorithms?
- 12. Run jobs on PART of the data on each computer, then
AGGREGATE the intermediary results from each computer.
- 13. Let’s add a computer to manage the process of
job delegation, merging the results...
and keeping track of the results...
- 14. We also need something to keep track of what files are
where, so we know where the data is that needs to be
computed...
- 15. When you have a lot of computers, and even more hard
drives,
one thing I can guarantee...
- 23. If a computer fails and you only have one copy of your
data...
- 25. So let's store multiple copies of the data. Hard drives are
CHEAP!
- 31. Even if a whole rack fails... we are still OK
- 32. Once we find a failure let’s have the system recopy the
copies.
- 47. Job tracker manages task trackers, ships code to compute
nodes
(Diagram labels: Data Node, Task Tracker, Job Tracker)
- 48. Name node manages distribution and replication on the
data nodes
(Diagram labels: Data Node, Task Tracker, Job Tracker, Name Node)
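
The idea from slide 12 (run the job on part of the data on each computer, then aggregate the intermediate results) can be sketched in a few lines of plain Python. This is only an illustration and does not use the Hadoop API: the word-count job, the function names `map_partition`, `aggregate`, and `run_job`, and the sample data are all assumptions made for the example, and the "computers" are simulated as lists in a single process.

```python
from collections import Counter

def map_partition(lines):
    """Work done on ONE computer: count words in its own part of the data."""
    counts = Counter()
    for line in lines:
        counts.update(line.lower().split())
    return counts

def aggregate(partials):
    """Work done after the per-computer jobs: merge the intermediate results."""
    total = Counter()
    for partial in partials:
        total.update(partial)  # Counter.update adds counts together
    return total

def run_job(partitions):
    """Stand-in for the coordinating computer: delegate, then merge."""
    partials = [map_partition(part) for part in partitions]
    return aggregate(partials)

# Hypothetical data: each inner list is the part stored on one computer.
partitions = [
    ["hard drives are cheap", "hard drives fail"],
    ["computers fail"],
]
result = run_job(partitions)
print(result.most_common())
```

In a real cluster the per-partition work would run on the machines that hold the data, which is why the slides add one component to delegate jobs and another to track where the files are.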