Big Data (HADOOP AND MAPREDUCE?)
What is Hadoop? The simple answer: Hadoop lets you store files that are larger than
what a single node or server can hold. You can store very large files, and many of
them, across multiple servers in a distributed fashion.
Advantages of Hadoop include affordability (it runs on industry-standard hardware) and
agility (store any data, run any analysis).
Hadoop is an Apache open source project that provides a parallel storage and
processing framework. Its primary purpose is to run MapReduce batch programs in
parallel on tens to thousands of server nodes.
Hadoop scales out to large clusters of servers and storage using the Hadoop Distributed
File System (HDFS) to manage huge data sets and spread them across the servers.
Hadoop comes with the libraries and utilities needed by other Hadoop modules. Hadoop
consists of the Hadoop Common package, which provides filesystem and OS-level
abstractions, and a MapReduce engine. The Hadoop Common package contains the
Java archive (JAR) files and scripts needed to start Hadoop. The package also provides
source code, documentation, and a contribution section that includes projects from
the Hadoop community.
The Hadoop Distributed File System (HDFS) stores data on commodity machines, providing
very high aggregate bandwidth across the cluster.
HDFS was designed to be a scalable, fault-tolerant, distributed storage system that
works closely with MapReduce. HDFS will “just work” under a variety of physical and
systemic circumstances. By distributing storage and computation across many servers,
the combined storage resource can grow with demand while remaining economical at
every size.
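To make the idea of spreading one large file over many servers concrete, here is a minimal sketch in Python. It is not HDFS code: the function names, the toy sizes, the node names, and the round-robin placement policy are all assumptions made for illustration. Real HDFS uses much larger blocks and a rack-aware placement policy.

```python
def split_into_blocks(file_size, block_size):
    """Return the sizes of the blocks that a file of file_size units is cut into."""
    full, rest = divmod(file_size, block_size)
    return [block_size] * full + ([rest] if rest else [])


def place_blocks(num_blocks, nodes, replication):
    """Assign every block to `replication` distinct nodes (toy round-robin policy)."""
    if replication > len(nodes):
        raise ValueError("replication factor exceeds the number of nodes")
    return {
        block_id: [nodes[(block_id + r) % len(nodes)] for r in range(replication)]
        for block_id in range(num_blocks)
    }


# Hypothetical numbers: a 350-unit file, 128-unit blocks, 4 nodes, 3 copies per block.
blocks = split_into_blocks(file_size=350, block_size=128)
placement = place_blocks(len(blocks), ["node1", "node2", "node3", "node4"], replication=3)

for block_id, holders in placement.items():
    print(f"block {block_id} ({blocks[block_id]} units) -> {holders}")
```

Because every block lives on several nodes, losing one server does not lose data, and adding servers adds both capacity and bandwidth.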
What is MapReduce?
MapReduce is a framework for processing data. The data is not moved across the
network in the conventional way, because that is slow for huge amounts of data and
media. MapReduce uses an approach better suited to big data sets: rather than
moving the data to the software, MapReduce moves the processing software to the
data.
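The "move the software to the data" idea can be sketched as a scheduling decision: run each map task on a node that already holds the block it needs. The scheduler below is a toy written for this explanation, not Hadoop's scheduler; the function name, block names, node names, and slot counts are all hypothetical.

```python
def schedule_map_tasks(block_locations, free_slots):
    """Greedy toy scheduler: prefer a node that already stores the block."""
    slots = dict(free_slots)  # copy so the caller's dict is not modified
    assignment = {}
    for block_id, holders in block_locations.items():
        # First choice: a node that holds the block and has a free slot.
        node = next((n for n in holders if slots.get(n, 0) > 0), None)
        is_local = node is not None
        if node is None:
            # Fallback: any node with a free slot; the block must cross the network.
            node = next((n for n, s in slots.items() if s > 0), None)
            if node is None:
                raise RuntimeError("no free slots left")
        slots[node] -= 1
        assignment[block_id] = (node, is_local)
    return assignment


# Hypothetical cluster state: which nodes store which block, one task slot per node.
block_locations = {
    "b0": ["node1", "node2"],
    "b1": ["node2", "node3"],
    "b2": ["node1", "node3"],
}
free_slots = {"node1": 1, "node2": 1, "node3": 1}
assignment = schedule_map_tasks(block_locations, free_slots)

for block_id, (node, is_local) in assignment.items():
    print(block_id, "->", node, "(local read)" if is_local else "(remote read)")
```

In this example every task lands on a node that already stores its block, so no input data has to travel over the network.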
MAP / REDUCE example. Output of the reduce phase:
KEY     TO   BE   OR   NOT
VALUE   2    2    1    1
MapReduce is a programming model for large-scale data processing. MapReduce
refers to the application modules written by a programmer that run in two phases: first
mapping the data (extract), then reducing it (transform).
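The two phases can be sketched in a few lines of plain Python, using the phrase "to be or not to be" from the KEY/VALUE example. This runs in a single process and only illustrates the model; the function names are made up for this sketch, and the `shuffle` step stands in for the grouping by key that a real framework performs between the two phases.

```python
from collections import defaultdict


def map_phase(text):
    """Map: emit a (word, 1) pair for every word in the input."""
    return [(word.upper(), 1) for word in text.split()]


def shuffle(pairs):
    """Group all values that share a key (done by the framework in practice)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: sum the values collected for each key."""
    return {key: sum(values) for key, values in groups.items()}


mapped = map_phase("to be or not to be")
counts = reduce_phase(shuffle(mapped))

print(mapped)   # six (word, 1) pairs, one per input word
print(counts)   # {'TO': 2, 'BE': 2, 'OR': 1, 'NOT': 1}
```

The map output has one pair per input word, each with value 1; the reduce output has one pair per distinct word with the summed count.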
One of Hadoop’s greatest benefits is the ability of programmers to write application
modules in almost any language and run them in parallel on the same cluster that
stores the data. With Hadoop, any programmer can harness the power and capacity of
thousands of CPUs and hard drives simultaneously.
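One way "almost any language" works in practice is Hadoop Streaming, a utility not named in the text above, which runs any executable that reads lines from standard input and writes tab-separated key/value lines to standard output. The sketch below shows a word-count mapper and reducer in that style. The function names are hypothetical, and the dry run at the end only imitates, on one machine, the sort that the framework performs between the two steps.

```python
import io
from itertools import groupby


def run_mapper(stdin, stdout):
    """Mapper: write 'word<TAB>1' for every word read from stdin."""
    for line in stdin:
        for word in line.split():
            stdout.write(f"{word.lower()}\t1\n")


def run_reducer(stdin, stdout):
    """Reducer: input arrives sorted by key, so equal keys are adjacent."""
    rows = (line.rstrip("\n").split("\t", 1) for line in stdin if line.strip())
    for word, group in groupby(rows, key=lambda row: row[0]):
        total = sum(int(count) for _, count in group)
        stdout.write(f"{word}\t{total}\n")


# Local dry run (assumption: sorted() stands in for the framework's sort step).
mapped = io.StringIO()
run_mapper(io.StringIO("to be or not to be\n"), mapped)
sorted_lines = sorted(mapped.getvalue().splitlines(keepends=True))

reduced = io.StringIO()
run_reducer(io.StringIO("".join(sorted_lines)), reduced)
result = reduced.getvalue()
print(result, end="")
```

In a real job the two functions would be separate scripts wired to `sys.stdin` and `sys.stdout`; keeping them as functions over file-like objects makes them easy to test locally first.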
Output of the map phase for the same phrase:
KEY     TO   BE   OR   NOT   TO   BE
VALUE   1    1    1    1     1    1
