Intro to MapReduce
Delhi/NCR HUG
Delhi Hadoop User Group MeetUp - 10th Sept. 2011 - Slides
1.
Intro to Map/Reduce
- Somil Asthana
2.
3.
Map / Reduce Pipe.
4.
5.
Works with a model where computation moves to the data, rather than data moving to the computing machine.
6.
Takes care of the issues that arise due to distributed computing.
7.
8.
Map Reduce Pipe
Raw Data -> Mapper (Key, Value format) -> Shuffle & Sort (based on Key) -> Reducer (for each Key, a list of Values) -> Output (Key, Value format)
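The pipe above can be sketched in a few lines of single-process Python. This is only an illustration of the data flow, not Hadoop's API: the names `run_pipe`, `word_mapper`, and `count_reducer`, and the word-count task itself, are assumptions introduced for the example.

```python
from itertools import groupby
from operator import itemgetter


def word_mapper(line):
    """Mapper: turn one raw record into (key, value) pairs."""
    for word in line.split():
        yield word.lower(), 1


def count_reducer(key, values):
    """Reducer: receive one key together with the list of all its values."""
    yield key, sum(values)


def run_pipe(raw_data, mapper, reducer):
    """Raw data -> mapper -> shuffle & sort -> reducer -> output."""
    # Map phase: every record is processed independently of the others.
    mapped = [pair for record in raw_data for pair in mapper(record)]
    # Shuffle & sort phase: order pairs by key so equal keys become adjacent.
    mapped.sort(key=itemgetter(0))
    output = []
    # Reduce phase: one reducer call per distinct key.
    for key, group in groupby(mapped, key=itemgetter(0)):
        values = [value for _, value in group]
        output.extend(reducer(key, values))
    return output


if __name__ == "__main__":
    lines = ["map reduce pipe", "map then reduce"]
    for key, value in run_pipe(lines, word_mapper, count_reducer):
        print(key, value)
```

In a real cluster the map and reduce calls run on many machines and the shuffle moves data over the network; here everything happens in one list so the three stages are easy to follow.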
9.
10.
The data is in 9-tuple format: <OrderID, EmailID, MobileNum, ProductID, PayableAmount, DeliveryCharges, ModeofPayment, OrderStatus, OrderSite>
11.
ModeofPayment = (COD, Credit, Check)
12.
OrderStatus = (Ordered, Clicked, Verified, Rejected, Dispatched, Returned from Client, Delivered)
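One way to model this record in code is a named tuple with two enumerations for the fixed value sets. The slides do not say how the data is stored, so the comma-separated input, the field types (amounts as floats, identifiers as strings), and the names `Order` and `parse_order` are all assumptions made for this sketch.

```python
from enum import Enum
from typing import NamedTuple


class ModeOfPayment(Enum):
    COD = "COD"
    CREDIT = "Credit"
    CHECK = "Check"


class OrderStatus(Enum):
    ORDERED = "Ordered"
    CLICKED = "Clicked"
    VERIFIED = "Verified"
    REJECTED = "Rejected"
    DISPATCHED = "Dispatched"
    RETURNED_FROM_CLIENT = "Returned from Client"
    DELIVERED = "Delivered"


class Order(NamedTuple):
    """The 9-tuple from the slides; field types are assumed."""
    order_id: str
    email_id: str
    mobile_num: str
    product_id: str
    payable_amount: float
    delivery_charges: float
    mode_of_payment: ModeOfPayment
    order_status: OrderStatus
    order_site: str


def parse_order(line, sep=","):
    """Parse one delimited line (assumed format) into an Order.

    Raises ValueError on a wrong field count, a non-numeric amount,
    or an unknown payment mode or order status.
    """
    fields = [field.strip() for field in line.split(sep)]
    if len(fields) != 9:
        raise ValueError(f"expected 9 fields, got {len(fields)}")
    return Order(
        order_id=fields[0],
        email_id=fields[1],
        mobile_num=fields[2],
        product_id=fields[3],
        payable_amount=float(fields[4]),
        delivery_charges=float(fields[5]),
        mode_of_payment=ModeOfPayment(fields[6]),
        order_status=OrderStatus(fields[7]),
        order_site=fields[8],
    )
```

Validating the two enumerated fields at parse time means a mapper can skip or count malformed records up front instead of failing later in the reducer.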
13.
14.
Cube Generation for ABC Company: Mapper, Reducer
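The transcript does not show what the cube's mapper and reducer actually do, so the following is only one plausible reading: the mapper emits every roll-up combination of a few dimensions for each order, and the reducer sums a measure per combination. The choice of dimensions, the use of PayableAmount as the measure, the `*` wildcard, the dictionary records, and the function names are all assumptions for illustration.

```python
from collections import defaultdict
from itertools import product

# Assumed cube dimensions and measure, taken from the 9-tuple field names.
DIMENSIONS = ("ProductID", "ModeofPayment", "OrderStatus", "OrderSite")
MEASURE = "PayableAmount"
ALL = "*"  # wildcard meaning "rolled up over this dimension"


def cube_mapper(order):
    """Emit one (key, value) pair per kept/rolled-up dimension combination."""
    for mask in product((True, False), repeat=len(DIMENSIONS)):
        key = tuple(
            order[dim] if keep else ALL
            for dim, keep in zip(DIMENSIONS, mask)
        )
        yield key, order[MEASURE]


def cube_reducer(key, values):
    """Sum the measure for one cube cell."""
    yield key, sum(values)


def build_cube(orders):
    """Run mapper, group by key, then reducer, all in one process."""
    grouped = defaultdict(list)
    for order in orders:
        for key, value in cube_mapper(order):
            grouped[key].append(value)
    cube = {}
    for key in sorted(grouped):
        for out_key, total in cube_reducer(key, grouped[key]):
            cube[out_key] = total
    return cube


if __name__ == "__main__":
    sample = [
        {"ProductID": "P1", "ModeofPayment": "COD",
         "OrderStatus": "Delivered", "OrderSite": "A", "PayableAmount": 100.0},
        {"ProductID": "P2", "ModeofPayment": "Credit",
         "OrderStatus": "Delivered", "OrderSite": "A", "PayableAmount": 50.0},
    ]
    for cell, total in build_cube(sample).items():
        print(cell, total)
```

With four dimensions each order produces 16 key-value pairs, so the mapper output grows exponentially with the number of dimensions; that cost is the usual trade-off of precomputing a full cube.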
15.