SlideShare a Scribd company logo
1 of 22
HADOOP
                           Framework and
                           Applications




Prepared by: TEAM HADOOP                   slide1/22
CONTENTS
   WHY   HADOOP?




   INTRODUCTION      TO MapReduce




Prepared by: TEAM HADOOP             slide 2/22
WHAT?
  “... to create building blocks for programmers
  who just happen to have lots of data to
  store, lots of data to analyze, or lots of machines
  to coordinate, and who don‟t have the
  time, the skill, or the inclination to become
  distributed systems experts to build the
  infrastructure to handle it.”
                                           -Tom White

  Source: Hadoop: The Definitive Guide



Prepared by: TEAM HADOOP                        slide 3/22
WHAT?
     Hadoop contains many subprojects:
     Hadoop Common
     Chukwa
     HBase
     ZooKeeper
     Pig
     Zombie
     Hive
     MapReduce

  We will focus on MapReduce



Prepared by: TEAM HADOOP                  slide 4/22
WHO & WHEN?
   Pre-2004 : Cutting and Cafarella develop
    open source projects for web-scale
    indexing, crawling and search.




Prepared by: TEAM HADOOP                slide 5/22
WHO & WHEN?
   2004: Jeffrey Dean and Sanjay
    Ghemawat introduce map reduce model
    used internally at Google.




Prepared by: TEAM HADOOP           slide 6/22
WHO & WHEN?
   2006:Hadoop becomes official Apache
    project, Cutting joins Yahoo!Yahoo
    adopts Hadoop.




Prepared by: TEAM HADOOP            slide 7/22
TRENDS




Prepared by: TEAM HADOOP   slide 8/22
WHO USES IT?




Prepared by: TEAM HADOOP   slide 9/22
Roughly how long to read 1TB
  from a commodity hard disk?




Prepared by: TEAM HADOOP   slide 10/22
Roughly how long to read 1TB
  from a commodity hard disk?


                     Around 4 hours
WITH HADOOP..



                      62 seconds…



Prepared by: TEAM HADOOP              slide 11/22
INTRODUCTION TO MapReduce




   "Break large problem into smaller parts, solve in
   parallel, combine results."



 Prepared by: TEAM HADOOP                              slide 12/22
Typical scenario
   How  many times is the word „IT‟ present?
    You‟ll probably count but in a 30k paged
    document, can you??




Prepared by: TEAM HADOOP                 slide 13/22
Map Reduce Typical Illustration




 Prepared by: TEAM HADOOP    slide 14/22
Map Reduce paradigm

                                 Input




               Output                                   Map




                        Reduce           Shuffle/Sort




Prepared by: TEAM HADOOP                                      slide 15/22
Map Reduce paradigm
   Map:  transforms input record to
    intermediate (key, value) pair




Prepared by: TEAM HADOOP               slide 16/22
Map Reduce paradigm
   Reduce:   transforms all records for given
    key to final output.




Prepared by: TEAM HADOOP                    slide 17/22
Map reduce principles

                                           Move code to data (local
                                                computation)




                  Abstract away fault                                    Allow programs to scale
            tolerance, synchronization, etc.                          transparently w.r.t size of input




Prepared by: TEAM HADOOP                                                                                  slide 18/22
Implementation: Hardware




Prepared by: TEAM HADOOP sroy choudhury7@gmail.com   slide 19/22
Map Reduce: strengths
   Batch,   offline jobs

   Write-once,   read-many across full data
    set

   Usually,
          though not always, simple
    computations

   I/O   bound by disk/network bandwidth


Prepared by: TEAM HADOOP                  slide 20/22
What it‟s not!

  What it‟s not:

   High-performance parallel
    computing, e.g. MPI

   Low-latency    random access relational
    database

   Always   the right solution


Prepared by: TEAM HADOOP                  slide 21/22
THANK YOU!
                           QUESTIONS?




Prepared by: TEAM HADOOP                slide 22/22

More Related Content

Viewers also liked

Zaidan ismail rashid original
Zaidan ismail rashid originalZaidan ismail rashid original
Zaidan ismail rashid originalMuhiss Rahman
 
Bo p, disequlibrium,
Bo p, disequlibrium,Bo p, disequlibrium,
Bo p, disequlibrium,wwgreatmutha
 
Estrategias de ensenanza_cap6 Anijovich Mora 2009_
Estrategias de ensenanza_cap6 Anijovich Mora 2009_Estrategias de ensenanza_cap6 Anijovich Mora 2009_
Estrategias de ensenanza_cap6 Anijovich Mora 2009_María Julia Bravo
 
Међумолекулске интеракције и водонична веза
Међумолекулске интеракције и водонична везаМеђумолекулске интеракције и водонична веза
Међумолекулске интеракције и водонична везаTanja Milanović
 
Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...
Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...
Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...Lorenc Gordani
 
Chris Hamilton news:rewired presentation
Chris Hamilton news:rewired presentationChris Hamilton news:rewired presentation
Chris Hamilton news:rewired presentationrachelmcathy
 
22號 周玟伽
22號 周玟伽22號 周玟伽
22號 周玟伽輝 哲
 
Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...
Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...
Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...South Asia Fast Track
 
EDS selection & implementation @ CCC
EDS selection & implementation @ CCCEDS selection & implementation @ CCC
EDS selection & implementation @ CCCMolly Beestrum
 
Salon Maison Passive - Enterprise Europe Brussels - Technology Watch services
Salon Maison Passive - Enterprise Europe Brussels - Technology Watch servicesSalon Maison Passive - Enterprise Europe Brussels - Technology Watch services
Salon Maison Passive - Enterprise Europe Brussels - Technology Watch servicesecobuild.brussels
 
закон о пдн_последние_изменения_разбегаев_в_авг_2014
закон о пдн_последние_изменения_разбегаев_в_авг_2014закон о пдн_последние_изменения_разбегаев_в_авг_2014
закон о пдн_последние_изменения_разбегаев_в_авг_2014Vladimir Razbegaev
 
動画の作り方から稼ぎ方まで20130720
動画の作り方から稼ぎ方まで20130720動画の作り方から稼ぎ方まで20130720
動画の作り方から稼ぎ方まで20130720Keiko Morita
 

Viewers also liked (19)

Zaidan ismail rashid original
Zaidan ismail rashid originalZaidan ismail rashid original
Zaidan ismail rashid original
 
Bo p, disequlibrium,
Bo p, disequlibrium,Bo p, disequlibrium,
Bo p, disequlibrium,
 
加拉太書
加拉太書加拉太書
加拉太書
 
Expresiòn oral - Cassany
 Expresiòn oral - Cassany Expresiòn oral - Cassany
Expresiòn oral - Cassany
 
Supersticiones
SupersticionesSupersticiones
Supersticiones
 
La amistad
La amistadLa amistad
La amistad
 
Estrategias de ensenanza_cap6 Anijovich Mora 2009_
Estrategias de ensenanza_cap6 Anijovich Mora 2009_Estrategias de ensenanza_cap6 Anijovich Mora 2009_
Estrategias de ensenanza_cap6 Anijovich Mora 2009_
 
Pat7.3 253
Pat7.3 253Pat7.3 253
Pat7.3 253
 
Међумолекулске интеракције и водонична веза
Међумолекулске интеракције и водонична везаМеђумолекулске интеракције и водонична веза
Међумолекулске интеракције и водонична веза
 
Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...
Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...
Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...
 
Chris Hamilton news:rewired presentation
Chris Hamilton news:rewired presentationChris Hamilton news:rewired presentation
Chris Hamilton news:rewired presentation
 
22號 周玟伽
22號 周玟伽22號 周玟伽
22號 周玟伽
 
Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...
Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...
Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...
 
EDS selection & implementation @ CCC
EDS selection & implementation @ CCCEDS selection & implementation @ CCC
EDS selection & implementation @ CCC
 
disleksia kanak2
 disleksia kanak2 disleksia kanak2
disleksia kanak2
 
Tugas 4
Tugas 4Tugas 4
Tugas 4
 
Salon Maison Passive - Enterprise Europe Brussels - Technology Watch services
Salon Maison Passive - Enterprise Europe Brussels - Technology Watch servicesSalon Maison Passive - Enterprise Europe Brussels - Technology Watch services
Salon Maison Passive - Enterprise Europe Brussels - Technology Watch services
 
закон о пдн_последние_изменения_разбегаев_в_авг_2014
закон о пдн_последние_изменения_разбегаев_в_авг_2014закон о пдн_последние_изменения_разбегаев_в_авг_2014
закон о пдн_последние_изменения_разбегаев_в_авг_2014
 
動画の作り方から稼ぎ方まで20130720
動画の作り方から稼ぎ方まで20130720動画の作り方から稼ぎ方まで20130720
動画の作り方から稼ぎ方まで20130720
 

Similar to Hadoop and MapReduce

Hadoop live online training
Hadoop live online trainingHadoop live online training
Hadoop live online trainingHarika583
 
Intro to BigData , Hadoop and Mapreduce
Intro to BigData , Hadoop and MapreduceIntro to BigData , Hadoop and Mapreduce
Intro to BigData , Hadoop and MapreduceKrishna Sangeeth KS
 
Report Hadoop Map Reduce
Report Hadoop Map ReduceReport Hadoop Map Reduce
Report Hadoop Map ReduceUrvashi Kataria
 
Learn what is Hadoop-and-BigData
Learn  what is Hadoop-and-BigDataLearn  what is Hadoop-and-BigData
Learn what is Hadoop-and-BigDataThanusha154
 
Foss4g2016 Geopaparazzi Workshop
Foss4g2016 Geopaparazzi WorkshopFoss4g2016 Geopaparazzi Workshop
Foss4g2016 Geopaparazzi WorkshopAndrea Antonello
 
Introduccion a Hadoop / Introduction to Hadoop
Introduccion a Hadoop / Introduction to HadoopIntroduccion a Hadoop / Introduction to Hadoop
Introduccion a Hadoop / Introduction to HadoopGERARDO BARBERENA
 
Understanding hadoop
Understanding hadoopUnderstanding hadoop
Understanding hadoopRexRamos9
 
Reduce Side Joins
Reduce Side Joins Reduce Side Joins
Reduce Side Joins Edureka!
 
Seminar_Report_hadoop
Seminar_Report_hadoopSeminar_Report_hadoop
Seminar_Report_hadoopVarun Narang
 
Learning How to Learn Hadoop
Learning How to Learn HadoopLearning How to Learn Hadoop
Learning How to Learn HadoopSilicon Halton
 
Sparse matrix computations in MapReduce
Sparse matrix computations in MapReduceSparse matrix computations in MapReduce
Sparse matrix computations in MapReduceDavid Gleich
 
2. Develop a MapReduce program to calculate the frequency of a given word in ...
2. Develop a MapReduce program to calculate the frequency of a given word in ...2. Develop a MapReduce program to calculate the frequency of a given word in ...
2. Develop a MapReduce program to calculate the frequency of a given word in ...Prof. Maulik Trivedi
 
Intermachine Parallelism
Intermachine ParallelismIntermachine Parallelism
Intermachine ParallelismSri Prasanna
 
Hadoop, MapReduce and R = RHadoop
Hadoop, MapReduce and R = RHadoopHadoop, MapReduce and R = RHadoop
Hadoop, MapReduce and R = RHadoopVictoria López
 

Similar to Hadoop and MapReduce (20)

Hadoop Mapreduce
Hadoop MapreduceHadoop Mapreduce
Hadoop Mapreduce
 
Hadoop ppt2
Hadoop ppt2Hadoop ppt2
Hadoop ppt2
 
Hadoop live online training
Hadoop live online trainingHadoop live online training
Hadoop live online training
 
Intro to BigData , Hadoop and Mapreduce
Intro to BigData , Hadoop and MapreduceIntro to BigData , Hadoop and Mapreduce
Intro to BigData , Hadoop and Mapreduce
 
Report Hadoop Map Reduce
Report Hadoop Map ReduceReport Hadoop Map Reduce
Report Hadoop Map Reduce
 
Learn what is Hadoop-and-BigData
Learn  what is Hadoop-and-BigDataLearn  what is Hadoop-and-BigData
Learn what is Hadoop-and-BigData
 
mapReduce.pptx
mapReduce.pptxmapReduce.pptx
mapReduce.pptx
 
Foss4g2016 Geopaparazzi Workshop
Foss4g2016 Geopaparazzi WorkshopFoss4g2016 Geopaparazzi Workshop
Foss4g2016 Geopaparazzi Workshop
 
Hadoop MapReduce
Hadoop MapReduceHadoop MapReduce
Hadoop MapReduce
 
Introduccion a Hadoop / Introduction to Hadoop
Introduccion a Hadoop / Introduction to HadoopIntroduccion a Hadoop / Introduction to Hadoop
Introduccion a Hadoop / Introduction to Hadoop
 
Understanding hadoop
Understanding hadoopUnderstanding hadoop
Understanding hadoop
 
Hadoop Seminar Report
Hadoop Seminar ReportHadoop Seminar Report
Hadoop Seminar Report
 
Reduce Side Joins
Reduce Side Joins Reduce Side Joins
Reduce Side Joins
 
Seminar_Report_hadoop
Seminar_Report_hadoopSeminar_Report_hadoop
Seminar_Report_hadoop
 
Learning How to Learn Hadoop
Learning How to Learn HadoopLearning How to Learn Hadoop
Learning How to Learn Hadoop
 
Sparse matrix computations in MapReduce
Sparse matrix computations in MapReduceSparse matrix computations in MapReduce
Sparse matrix computations in MapReduce
 
Hadoop Internals
Hadoop InternalsHadoop Internals
Hadoop Internals
 
2. Develop a MapReduce program to calculate the frequency of a given word in ...
2. Develop a MapReduce program to calculate the frequency of a given word in ...2. Develop a MapReduce program to calculate the frequency of a given word in ...
2. Develop a MapReduce program to calculate the frequency of a given word in ...
 
Intermachine Parallelism
Intermachine ParallelismIntermachine Parallelism
Intermachine Parallelism
 
Hadoop, MapReduce and R = RHadoop
Hadoop, MapReduce and R = RHadoopHadoop, MapReduce and R = RHadoop
Hadoop, MapReduce and R = RHadoop
 

More from Abhishek Dey

Automatic problem generation
Automatic problem generationAutomatic problem generation
Automatic problem generationAbhishek Dey
 
Handling High Energy Physics Data using Cloud Computing
Handling High Energy Physics Data using Cloud ComputingHandling High Energy Physics Data using Cloud Computing
Handling High Energy Physics Data using Cloud ComputingAbhishek Dey
 
Big Data Analysis on a Cloud Ecosystem-PATW 2013
Big Data Analysis on a Cloud Ecosystem-PATW 2013Big Data Analysis on a Cloud Ecosystem-PATW 2013
Big Data Analysis on a Cloud Ecosystem-PATW 2013Abhishek Dey
 
Cloud computing using Eucalyptus
Cloud computing using EucalyptusCloud computing using Eucalyptus
Cloud computing using EucalyptusAbhishek Dey
 
Introduction to cloud computing
Introduction to cloud computingIntroduction to cloud computing
Introduction to cloud computingAbhishek Dey
 

More from Abhishek Dey (6)

Automatic problem generation
Automatic problem generationAutomatic problem generation
Automatic problem generation
 
Cafaholic ppt
Cafaholic pptCafaholic ppt
Cafaholic ppt
 
Handling High Energy Physics Data using Cloud Computing
Handling High Energy Physics Data using Cloud ComputingHandling High Energy Physics Data using Cloud Computing
Handling High Energy Physics Data using Cloud Computing
 
Big Data Analysis on a Cloud Ecosystem-PATW 2013
Big Data Analysis on a Cloud Ecosystem-PATW 2013Big Data Analysis on a Cloud Ecosystem-PATW 2013
Big Data Analysis on a Cloud Ecosystem-PATW 2013
 
Cloud computing using Eucalyptus
Cloud computing using EucalyptusCloud computing using Eucalyptus
Cloud computing using Eucalyptus
 
Introduction to cloud computing
Introduction to cloud computingIntroduction to cloud computing
Introduction to cloud computing
 

Recently uploaded

General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSCeline George
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseAnaAcapella
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxcallscotland1987
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 

Recently uploaded (20)

General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 

Hadoop and MapReduce

  • 1. HADOOP Framework and Applications Prepared by: TEAM HADOOP slide1/22
  • 2. CONTENTS  WHY HADOOP?  INTRODUCTION TO MapReduce Prepared by: TEAM HADOOP slide 2/22
  • 3. WHAT? “... to create building blocks for programmers who just happen to have lots of data to store, lots of data to analyze, or lots of machines to coordinate, and who don‟t have the time, the skill, or the inclination to become distributed systems experts to build the infrastructure to handle it.” -Tom White Source: Hadoop: The Definitive Guide Prepared by: TEAM HADOOP slide 3/22
  • 4. WHAT?  Hadoop contains many subprojects:  Hadoop Common  Chukwa  HBase  ZooKeeper  Pig  Zombie  Hive  MapReduce We will focus on MapReduce Prepared by: TEAM HADOOP slide 4/22
  • 5. WHO & WHEN?  Pre-2004 : Cutting and Cafarella develop open source projects for web-scale indexing, crawling and search. Prepared by: TEAM HADOOP slide 5/22
  • 6. WHO & WHEN?  2004: Jeffrey Dean and Sanjay Ghemawat introduce map reduce model used internally at Google. Prepared by: TEAM HADOOP slide 6/22
  • 7. WHO & WHEN?  2006:Hadoop becomes official Apache project, Cutting joins Yahoo!Yahoo adopts Hadoop. Prepared by: TEAM HADOOP slide 7/22
  • 8. TRENDS Prepared by: TEAM HADOOP slide 8/22
  • 9. WHO USES IT? Prepared by: TEAM HADOOP slide 9/22
  • 10. Roughly how long to read 1TB from a commodity hard disk? Prepared by: TEAM HADOOP slide 10/22
  • 11. Roughly how long to read 1TB from a commodity hard disk? Around 4 hours WITH HADOOP.. 62 seconds… Prepared by: TEAM HADOOP slide 11/22
  • 12. INTRODUCTION TO MapReduce "Break large problem into smaller parts, solve in parallel, combine results." Prepared by: TEAM HADOOP slide 12/22
  • 13. Typical scenario  How many times is the word „IT‟ present? You‟ll probably count but in a 30k paged document, can you?? Prepared by: TEAM HADOOP slide 13/22
  • 14. Map Reduce Typical Illustration Prepared by: TEAM HADOOP slide 14/22
  • 15. Map Reduce paradigm Input Output Map Reduce Shuffle/Sort Prepared by: TEAM HADOOP slide 15/22
  • 16. Map Reduce paradigm  Map: transforms input record to intermediate (key, value) pair Prepared by: TEAM HADOOP slide 16/22
  • 17. Map Reduce paradigm  Reduce: transforms all records for given key to final output. Prepared by: TEAM HADOOP slide 17/22
  • 18. Map reduce principles Move code to data (local computation) Abstract away fault Allow programs to scale tolerance, synchronization, etc. transparently w.r.t size of input Prepared by: TEAM HADOOP slide 18/22
  • 19. Implementation: Hardware Prepared by: TEAM HADOOP sroy choudhury7@gmail.com slide 19/22
  • 20. Map Reduce: strengths  Batch, offline jobs  Write-once, read-many across full data set  Usually, though not always, simple computations  I/O bound by disk/network bandwidth Prepared by: TEAM HADOOP slide 20/22
  • 21. What it‟s not! What it‟s not:  High-performance parallel computing, e.g. MPI  Low-latency random access relational database  Always the right solution Prepared by: TEAM HADOOP slide 21/22
  • 22. THANK YOU! QUESTIONS? Prepared by: TEAM HADOOP slide 22/22