SlideShare a Scribd company logo
1 of 4
Apache Hadoop Big Data Training Course

Sample Session:
Training Course Details Session:


https://www.youtube.com/watch?v=ij3v4mYKxHw


Haddop Business Case:


https://www.youtube.com/watch?v=vh0GpSi8StU


https://www.youtube.com/watch?v=jYAv34PHEdc


Hadoop HDFS Lab:


https://www.youtube.com/watch?v=Pp8SV50S9HM




Introduction to HADOOP

               Distributed computing , cloud computing
               Big data Basics and Need for Parallel Processing
               How Hadoop works ?
               Introduction to HDFS and Map Reduce


 Hadoop Architecture Details

               Name Node
               Data Node
               Secondary Name Node
               Job Tracker
               Task Tracker


HDFS ( Hadoop - Distributed File System)

               Hadoop Distributed file system , Background, GFS
               Data Replication
               Data Storage
               Data Retrieval
               Additional HDFS commands


 MapReduce Programming

               MapReduce, Background
               Writing MapReduce Programs
               Writable and WritableComparable
               Input Format, Output Format
Input Split and Block size
       Combiner
       Partitioner
       Number of Mappers and Reducers
       Counters


Map Reduce Algorithms and Exercises

       Line Count and Word Count
       Distributed Search
       Sorting Data – Key Value Data Type
       Mathematical Transformation example
       Working with Counters exercise
       Distributed Cache exercise
       Zero Reducer based exercises




Hadoop Streaming

       Introduction to Hadoop Streaming
       Streaming API details and use cases
       Python Based Example for Streaming API
       Exercise for Hadoop Streaming ( XML Files ) Based.
       Exercises on Ruby
       Exercise on C# using MS-Azure.

Apache Pig

       Installation
       Execution Types
       Grunt Shell
       Pig Latin
       Data Processing
       Loading and Storing
       Data Filtering
       Grouping & Joining Operations
       Hands on Exercises


Apache HBase Installation and Details
HBase and NOSQL Introduction
       HBase Installation and Configuration.
       HBase and Java Based integration
       HBase Hadoop Integration Details.
       HBase basic exercises


Apache Hive Installation and Details

         Hive Installation on Single cluster Hadoop Node.
         Hive Services
         Hive Shell Description
         Hive Server·
         Meta store Details
         Hive QL Basics
         Working with Tables, Databases etc.
         Hive JDBC programming
         Hands on Exercises and Assignments



Introduction to Amazon Map Reduce (AWS-EMR)

       Hadoop using Amaozon Web Service
       AWS MapReduce and EC2
       AWS - S3 Service Model.
       AWS-MR Architecture.
       Streaming Exercise using EMR JobFlow.

Hadoop Infrastructure Planning

       Basic Hadoop hardware and software req
       Small , Medium and Large cluster
       Networking challenges in Hadoop Deployment
       Disaster Recovery ( DR ) in Hadoop .
       Performance Tuning a large cluster

Hadoop Industry Solutions

       EMC GreenPlum Introduction
       IBM BigInsight Details
       Oracle , Microsoft etc Hadoop Offerings
       Cloudera and HortonWorks Hadoop Package
Hadoop and Cloud Computing

       Using Cloud technologies for distributed processing
       Hadoop on Amazon Web Service.
       Hadoop in Oracle Cloud / RackSpace

============================================================


Medium: GotoMeeting

Duration: 35 Hours

Materials: 50+ Exercises-Fully Solved. Decks, VM Machine.

More Related Content

What's hot

Qubole Overview at the Fifth Elephant Conference
Qubole Overview at the Fifth Elephant ConferenceQubole Overview at the Fifth Elephant Conference
Qubole Overview at the Fifth Elephant Conference
Joydeep Sen Sarma
 
Hadoop applicationarchitectures
Hadoop applicationarchitecturesHadoop applicationarchitectures
Hadoop applicationarchitectures
Doug Chang
 
Big Data in the Microsoft Platform
Big Data in the Microsoft PlatformBig Data in the Microsoft Platform
Big Data in the Microsoft Platform
Jesus Rodriguez
 

What's hot (18)

Qubole Overview at the Fifth Elephant Conference
Qubole Overview at the Fifth Elephant ConferenceQubole Overview at the Fifth Elephant Conference
Qubole Overview at the Fifth Elephant Conference
 
Apache Pig
Apache PigApache Pig
Apache Pig
 
Practical Hadoop Big Data Training Course by Certified Architect
Practical Hadoop Big Data Training Course by Certified ArchitectPractical Hadoop Big Data Training Course by Certified Architect
Practical Hadoop Big Data Training Course by Certified Architect
 
F07-Cloud-Hadoop-BAM
F07-Cloud-Hadoop-BAMF07-Cloud-Hadoop-BAM
F07-Cloud-Hadoop-BAM
 
Introduction to Apache Hivemall v0.5.2 and v0.6
Introduction to Apache Hivemall v0.5.2 and v0.6Introduction to Apache Hivemall v0.5.2 and v0.6
Introduction to Apache Hivemall v0.5.2 and v0.6
 
Cloud Optimized Big Data
Cloud Optimized Big DataCloud Optimized Big Data
Cloud Optimized Big Data
 
A complete hadoop stack
A complete hadoop stackA complete hadoop stack
A complete hadoop stack
 
Introduction to Apache Pig
Introduction to Apache PigIntroduction to Apache Pig
Introduction to Apache Pig
 
Introduction to pig & pig latin
Introduction to pig & pig latinIntroduction to pig & pig latin
Introduction to pig & pig latin
 
Qubole @ AWS Meetup Bangalore - July 2015
Qubole @ AWS Meetup Bangalore - July 2015Qubole @ AWS Meetup Bangalore - July 2015
Qubole @ AWS Meetup Bangalore - July 2015
 
Hadoop big data
Hadoop   big dataHadoop   big data
Hadoop big data
 
Hopsworks in the cloud Berlin Buzzwords 2019
Hopsworks in the cloud Berlin Buzzwords 2019 Hopsworks in the cloud Berlin Buzzwords 2019
Hopsworks in the cloud Berlin Buzzwords 2019
 
The Meta of Hadoop - COMAD 2012
The Meta of Hadoop - COMAD 2012The Meta of Hadoop - COMAD 2012
The Meta of Hadoop - COMAD 2012
 
What Can HPC on AWS Do?
What Can HPC on AWS Do?What Can HPC on AWS Do?
What Can HPC on AWS Do?
 
Hadoop applicationarchitectures
Hadoop applicationarchitecturesHadoop applicationarchitectures
Hadoop applicationarchitectures
 
Hadoop pig
Hadoop pigHadoop pig
Hadoop pig
 
Big Data in the Microsoft Platform
Big Data in the Microsoft PlatformBig Data in the Microsoft Platform
Big Data in the Microsoft Platform
 
Facebook Retrospective - Big data-world-europe-2012
Facebook Retrospective - Big data-world-europe-2012Facebook Retrospective - Big data-world-europe-2012
Facebook Retrospective - Big data-world-europe-2012
 

Viewers also liked

PAMF 2014 Developer Challenge Webinar
PAMF 2014 Developer Challenge WebinarPAMF 2014 Developer Challenge Webinar
PAMF 2014 Developer Challenge Webinar
health2dev
 
World isflatsummary
World isflatsummaryWorld isflatsummary
World isflatsummary
BASANTY
 
Algoritmos parte 2
Algoritmos parte 2Algoritmos parte 2
Algoritmos parte 2
AlfredoDguez
 
Phrasal verbs list
Phrasal verbs listPhrasal verbs list
Phrasal verbs list
mariabandah
 
Ing y arquit.ppt2014
Ing y arquit.ppt2014Ing y arquit.ppt2014
Ing y arquit.ppt2014
palopilu
 
Grupo tv_cable_diapositivas
Grupo  tv_cable_diapositivasGrupo  tv_cable_diapositivas
Grupo tv_cable_diapositivas
Jorge Ramos
 
Colores primarios
Colores primariosColores primarios
Colores primarios
estacion3
 
Anexo ii portaria-n°-0511
Anexo ii portaria-n°-0511Anexo ii portaria-n°-0511
Anexo ii portaria-n°-0511
enfoquecultural
 

Viewers also liked (18)

Vfi mobiel fondsenwerven marcom 12 6-2013
Vfi mobiel fondsenwerven marcom 12 6-2013Vfi mobiel fondsenwerven marcom 12 6-2013
Vfi mobiel fondsenwerven marcom 12 6-2013
 
6 Big Ideas from SXSW Interactive: A Visual Recap
6 Big Ideas from SXSW Interactive: A Visual Recap6 Big Ideas from SXSW Interactive: A Visual Recap
6 Big Ideas from SXSW Interactive: A Visual Recap
 
ONC Market R&D Pilot challenge Webinar final
ONC Market R&D Pilot challenge Webinar finalONC Market R&D Pilot challenge Webinar final
ONC Market R&D Pilot challenge Webinar final
 
PAMF 2014 Developer Challenge Webinar
PAMF 2014 Developer Challenge WebinarPAMF 2014 Developer Challenge Webinar
PAMF 2014 Developer Challenge Webinar
 
World isflatsummary
World isflatsummaryWorld isflatsummary
World isflatsummary
 
Tourism!
Tourism!Tourism!
Tourism!
 
Ursa Major - RendezVIEW
Ursa Major - RendezVIEWUrsa Major - RendezVIEW
Ursa Major - RendezVIEW
 
Algoritmos parte 2
Algoritmos parte 2Algoritmos parte 2
Algoritmos parte 2
 
marcel duchamp
marcel duchampmarcel duchamp
marcel duchamp
 
Education
EducationEducation
Education
 
Phrasal verbs list
Phrasal verbs listPhrasal verbs list
Phrasal verbs list
 
Bigdata Hadoop project payment gateway domain
Bigdata Hadoop project payment gateway domainBigdata Hadoop project payment gateway domain
Bigdata Hadoop project payment gateway domain
 
Ing y arquit.ppt2014
Ing y arquit.ppt2014Ing y arquit.ppt2014
Ing y arquit.ppt2014
 
Tabela copinha ouro
Tabela copinha ouroTabela copinha ouro
Tabela copinha ouro
 
Grupo tv_cable_diapositivas
Grupo  tv_cable_diapositivasGrupo  tv_cable_diapositivas
Grupo tv_cable_diapositivas
 
Colores primarios
Colores primariosColores primarios
Colores primarios
 
Madre
MadreMadre
Madre
 
Anexo ii portaria-n°-0511
Anexo ii portaria-n°-0511Anexo ii portaria-n°-0511
Anexo ii portaria-n°-0511
 

Similar to Hadoop online training course

Hadoop and aws map reducecourse
Hadoop and aws map reducecourseHadoop and aws map reducecourse
Hadoop and aws map reducecourse
Samatha Kamuni
 
Haoop ppt
Haoop pptHaoop ppt
Haoop ppt
orsenit
 
Haoop ppt
Haoop pptHaoop ppt
Haoop ppt
orsenit
 
HADOOP ONLINE TRAINING
HADOOP ONLINE TRAININGHADOOP ONLINE TRAINING
HADOOP ONLINE TRAINING
training3
 
Playing with Hadoop (NPW2013)
Playing with Hadoop (NPW2013)Playing with Hadoop (NPW2013)
Playing with Hadoop (NPW2013)
Søren Lund
 
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Cloudera, Inc.
 

Similar to Hadoop online training course (20)

Hadoop and aws map reducecourse
Hadoop and aws map reducecourseHadoop and aws map reducecourse
Hadoop and aws map reducecourse
 
Hadoop and aws short
Hadoop and aws  shortHadoop and aws  short
Hadoop and aws short
 
Hadoop Training in Hyderabad | Online Training
Hadoop Training in Hyderabad | Online TrainingHadoop Training in Hyderabad | Online Training
Hadoop Training in Hyderabad | Online Training
 
Big-Data Hadoop Training Institutes in Pune | CloudEra Certification courses ...
Big-Data Hadoop Training Institutes in Pune | CloudEra Certification courses ...Big-Data Hadoop Training Institutes in Pune | CloudEra Certification courses ...
Big-Data Hadoop Training Institutes in Pune | CloudEra Certification courses ...
 
Hadoop MapReduce Fundamentals
Hadoop MapReduce FundamentalsHadoop MapReduce Fundamentals
Hadoop MapReduce Fundamentals
 
Hadoop content
Hadoop contentHadoop content
Hadoop content
 
Hadoop and Big Data: Revealed
Hadoop and Big Data: RevealedHadoop and Big Data: Revealed
Hadoop and Big Data: Revealed
 
Windows Azure HDInsight Service
Windows Azure HDInsight ServiceWindows Azure HDInsight Service
Windows Azure HDInsight Service
 
Hadoop Frameworks Panel__HadoopSummit2010
Hadoop Frameworks Panel__HadoopSummit2010Hadoop Frameworks Panel__HadoopSummit2010
Hadoop Frameworks Panel__HadoopSummit2010
 
Hadoop_arunam_ppt
Hadoop_arunam_pptHadoop_arunam_ppt
Hadoop_arunam_ppt
 
Hadoop online training
Hadoop online trainingHadoop online training
Hadoop online training
 
Haoop ppt
Haoop pptHaoop ppt
Haoop ppt
 
Haoop ppt
Haoop pptHaoop ppt
Haoop ppt
 
Best hadoop-online-training
Best hadoop-online-trainingBest hadoop-online-training
Best hadoop-online-training
 
HADOOP ONLINE TRAINING
HADOOP ONLINE TRAININGHADOOP ONLINE TRAINING
HADOOP ONLINE TRAINING
 
Playing with Hadoop (NPW2013)
Playing with Hadoop (NPW2013)Playing with Hadoop (NPW2013)
Playing with Hadoop (NPW2013)
 
Hadoop online training
Hadoop online training Hadoop online training
Hadoop online training
 
Datascience Training with Hadoop, Python Machine Learning & Scala, Spark
Datascience Training with Hadoop, Python Machine Learning & Scala, SparkDatascience Training with Hadoop, Python Machine Learning & Scala, Spark
Datascience Training with Hadoop, Python Machine Learning & Scala, Spark
 
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
 
Bisp developing-solutions-using-hadoop
Bisp developing-solutions-using-hadoopBisp developing-solutions-using-hadoop
Bisp developing-solutions-using-hadoop
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 

Hadoop online training course

  • 1. Apache Hadoop Big Data Training Course Sample Session: Training Course Details Session: https://www.youtube.com/watch?v=ij3v4mYKxHw Haddop Business Case: https://www.youtube.com/watch?v=vh0GpSi8StU https://www.youtube.com/watch?v=jYAv34PHEdc Hadoop HDFS Lab: https://www.youtube.com/watch?v=Pp8SV50S9HM Introduction to HADOOP Distributed computing , cloud computing Big data Basics and Need for Parallel Processing How Hadoop works ? Introduction to HDFS and Map Reduce Hadoop Architecture Details Name Node Data Node Secondary Name Node Job Tracker Task Tracker HDFS ( Hadoop - Distributed File System) Hadoop Distributed file system , Background, GFS Data Replication Data Storage Data Retrieval Additional HDFS commands MapReduce Programming MapReduce, Background Writing MapReduce Programs Writable and WritableComparable Input Format, Output Format
  • 2. Input Split and Block size Combiner Partitioner Number of Mappers and Reducers Counters Map Reduce Algorithms and Exercises Line Count and Word Count Distributed Search Sorting Data – Key Value Data Type Mathematical Transformation example Working with Counters exercise Distributed Cache exercise Zero Reducer based exercises Hadoop Streaming Introduction to Hadoop Streaming Streaming API details and use cases Python Based Example for Streaming API Exercise for Hadoop Streaming ( XML Files ) Based. Exercises on Ruby Exercise on C# using MS-Azure. Apache Pig Installation Execution Types Grunt Shell Pig Latin Data Processing Loading and Storing Data Filtering Grouping & Joining Operations Hands on Exercises Apache HBase Installation and Details
  • 3. HBase and NOSQL Introduction HBase Installation and Configuration. HBase and Java Based integration HBase Hadoop Integration Details. HBase basic exercises Apache Hive Installation and Details Hive Installation on Single cluster Hadoop Node. Hive Services Hive Shell Description Hive Server· Meta store Details Hive QL Basics Working with Tables, Databases etc. Hive JDBC programming Hands on Exercises and Assignments Introduction to Amazon Map Reduce (AWS-EMR) Hadoop using Amaozon Web Service AWS MapReduce and EC2 AWS - S3 Service Model. AWS-MR Architecture. Streaming Exercise using EMR JobFlow. Hadoop Infrastructure Planning Basic Hadoop hardware and software req Small , Medium and Large cluster Networking challenges in Hadoop Deployment Disaster Recovery ( DR ) in Hadoop . Performance Tuning a large cluster Hadoop Industry Solutions EMC GreenPlum Introduction IBM BigInsight Details Oracle , Microsoft etc Hadoop Offerings Cloudera and HortonWorks Hadoop Package
  • 4. Hadoop and Cloud Computing Using Cloud technologies for distributed processing Hadoop on Amazon Web Service. Hadoop in Oracle Cloud / RackSpace ============================================================ Medium: GotoMeeting Duration: 35 Hours Materials: 50+ Exercises-Fully Solved. Decks, VM Machine.