SlideShare uma empresa Scribd logo
1 de 13
Baixar para ler offline
@RMSSoftwareTech training@rmssoftwaretech.com
http://www.rmssoftwaretech.com
Apache Hadoop
(Big Data)
Big Data Training
2
© 2014 RMS Software Tech (rmssoftwaretech.com)
Logo’s & Trademarks
• Note: Any logos used in this presentation are owned by their
respective companies and are only used in this slide deck for
educational purposes. No other companies are responsible for or
provide attribution for any of the material in these slides.
• This slide deck is released under a Creative Commons License
and can be reused in your own presentations, however please
research the specific meanings of these symbols:
• You may be able to use the slide deck for purposes beyond the
CC license if you email me with the special request.
•  All third party trademark rights acknowledged
3
© 2014 RMS Software Tech (rmssoftwaretech.com)
Profile : RMS Software Technologies
•  About us :
◦  Leading provider of Software solutions, System integration Services and
Professional Training
◦  Based in San Jose, CA & Mumbai, India started in 2012.
•  Professional Courses :
◦  Expert Training team of 10 people who provide training in various leading
technologies like iPhone iOS, Android, Java, Big Data Hadoop, QA & Agile Scrum
Methodologies.
◦  Team of developers using AngularJS at the client projects.
◦  Focused on Interns and engineers looking to learn new exciting technologies.
•  Track Record:
◦  We work with emerging technologies to create mobile applications, rich-client desktop
software, and large-scale systems (CRM, ERP).
◦  We create high quality solutions for hard problems, to help our customers thrive
◦  We can share this expertise with you developers, in the form of workshop style, hands-on
training classes.
4
© 2014 RMS Software Tech (rmssoftwaretech.com)
Course : Prerequisites and Equipment
•  Prerequisites :
◦  Students should have experience with Database (DBMS) like Oracle,
Informix, Sybase. No prior experience of Big Data or NOSQL and Hadoop
is required for the course.
•  Equipment :
◦  Please use - Laptop (Windows, Linux, or Mac).
5
© 2014 RMS Software Tech (rmssoftwaretech.com)
Training Agenda
Training Schedule & Agenda
Week 1 : Hadoop Overview
Week 2 : HDFS Deep Dive
Week 3 : MapReduce and Pig
Week 4 : Hive and HBase
Week 5 : Zookeeper, Oozie, Flume, Talend
Week 6 : Practice Questions, Q & A with Final Project
We believe this curriculum covers the basics well, and positions students to use
Hadoop effectively and efficiently. It provides a good overview on Hadoop and Big
Data
6
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 1 : Hadoop Overview
•  Brief History of Hadoop
•  RDBMS/SQL vs. Hadoop
•  Structured vs. Unstructured data
•  Introduction to Hadoop Ecosystem (HDFS, MapReduce, Pig, Hive, HBase)
•  HDFS Overview (NameNode vs. DataNode)
•  MapReduce overview (JobTracker vs. TaskTracker)
•  Hadoop XML files for configuration
•  Hadoop Ecosystem (Hive, Pig, Hbase, Zookeeper, Mahout, Oozie, Talend,
Scoop, Flume)
•  Lab #1 Virtual Machine Setup
7
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 2 : HDFS Deep Dive
•  NameNode Architecture
•  DataNode Architecture
•  Write Pipeline
•  Read Pipeline
•  HDFS Disk space quotas and number of file quotas
•  Quick Intro to Java API interface
•  Lab #2.
8
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 3 : MapReduce and Pig
•  MapReduce Architecture
•  Combiner, Partitioner
•  JobTracker & TaskTracker
•  Job Scheduling
•  Distributed Cache
•  Counters
•  MapReduce configuration files
•  Simple MapReduce example : WordCount
•  Next Gen MapReduce : YARN.
•  Lab #3 : MapReduce
•  Lab #4 : Pig
9
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 4: Hive and HBase
•  Hive architecture.
•  Hive vs. RDBMS.
•  HiveQL and Hive. Shell
•  Managing Tables
•  Querying Data
•  Data Types and Schemas
•  Introduction to UDF (User Defined Functions)
•  HBase Architecture
•  HBase vs. Cassandra
10
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 4: HBase
•  Bloom Filters and Block indexes
•  Table Scans and Filters
•  Lab # Intro to HBase command line.
11
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 5: Zookeeper, Oozie, Flume, Sqoop, Talend
•  Flume overview
•  Flume usage
•  Sqoop overview
•  Sqoop usage
•  Hadoop workflow
•  Jobcontrol
•  Oozie
•  Talend
•  Sqoop
12
© 2014 RMS Software Tech (rmssoftwaretech.com)
Week 6: Project and Practice Questions
•  Sample Project
•  Practice Questions
•  Q & A
Foundation for tomorrow
@RMSSoftwareTech
training@rmssoftwaretech.com

Mais conteúdo relacionado

Destaque

Your Big Data Stack is Too Big!: Presented by Timothy Potter, Lucidworks
Your Big Data Stack is Too Big!: Presented by Timothy Potter, LucidworksYour Big Data Stack is Too Big!: Presented by Timothy Potter, Lucidworks
Your Big Data Stack is Too Big!: Presented by Timothy Potter, LucidworksLucidworks
 
07 Using Oracle-Supported Package in Application Development
07 Using Oracle-Supported Package in Application Development07 Using Oracle-Supported Package in Application Development
07 Using Oracle-Supported Package in Application Developmentrehaniltifat
 
06 Using More Package Concepts
06 Using More Package Concepts06 Using More Package Concepts
06 Using More Package Conceptsrehaniltifat
 
03 Writing Control Structures, Writing with Compatible Data Types Using Expli...
03 Writing Control Structures, Writing with Compatible Data Types Using Expli...03 Writing Control Structures, Writing with Compatible Data Types Using Expli...
03 Writing Control Structures, Writing with Compatible Data Types Using Expli...rehaniltifat
 
Big Data Technology Stack : Nutshell
Big Data Technology Stack : NutshellBig Data Technology Stack : Nutshell
Big Data Technology Stack : NutshellKhalid Imran
 

Destaque (7)

Your Big Data Stack is Too Big!: Presented by Timothy Potter, Lucidworks
Your Big Data Stack is Too Big!: Presented by Timothy Potter, LucidworksYour Big Data Stack is Too Big!: Presented by Timothy Potter, Lucidworks
Your Big Data Stack is Too Big!: Presented by Timothy Potter, Lucidworks
 
07 Using Oracle-Supported Package in Application Development
07 Using Oracle-Supported Package in Application Development07 Using Oracle-Supported Package in Application Development
07 Using Oracle-Supported Package in Application Development
 
06 Using More Package Concepts
06 Using More Package Concepts06 Using More Package Concepts
06 Using More Package Concepts
 
03 Writing Control Structures, Writing with Compatible Data Types Using Expli...
03 Writing Control Structures, Writing with Compatible Data Types Using Expli...03 Writing Control Structures, Writing with Compatible Data Types Using Expli...
03 Writing Control Structures, Writing with Compatible Data Types Using Expli...
 
Big Data Tech Stack
Big Data Tech StackBig Data Tech Stack
Big Data Tech Stack
 
Big Data Technology Stack : Nutshell
Big Data Technology Stack : NutshellBig Data Technology Stack : Nutshell
Big Data Technology Stack : Nutshell
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 

Semelhante a Big Data Hadoop Training Course

Hadoop 2.0-development
Hadoop 2.0-developmentHadoop 2.0-development
Hadoop 2.0-developmentKnowledgehut
 
Hadoop online training in india
Hadoop online training  in indiaHadoop online training  in india
Hadoop online training in indiaMadhu Trainer
 
Hadoop training kit from lcc infotech
Hadoop   training kit from lcc infotechHadoop   training kit from lcc infotech
Hadoop training kit from lcc infotechlccinfotech
 
Hadoop course content
Hadoop course contentHadoop course content
Hadoop course contentRS Trainings
 
Applications on Hadoop
Applications on HadoopApplications on Hadoop
Applications on Hadoopmarkgrover
 
Big data analytics_using_hadoop
Big data analytics_using_hadoopBig data analytics_using_hadoop
Big data analytics_using_hadoopKnowledgehut
 
Innovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseInnovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseDataWorks Summit
 
Sasmita bigdata resume
Sasmita bigdata resumeSasmita bigdata resume
Sasmita bigdata resumeSasmita Swain
 
What it takes to bring Hadoop to a production-ready state
What it takes to bring Hadoop to a production-ready stateWhat it takes to bring Hadoop to a production-ready state
What it takes to bring Hadoop to a production-ready stateClouderaUserGroups
 
Apache hadoop-administrator-training
Apache hadoop-administrator-trainingApache hadoop-administrator-training
Apache hadoop-administrator-trainingKnowledgehut
 

Semelhante a Big Data Hadoop Training Course (20)

Salesforce.com Training Course Agenda
Salesforce.com Training Course AgendaSalesforce.com Training Course Agenda
Salesforce.com Training Course Agenda
 
Spring Framework Training Course
Spring Framework Training Course Spring Framework Training Course
Spring Framework Training Course
 
DeepeshRehi
DeepeshRehiDeepeshRehi
DeepeshRehi
 
hadoop exp
hadoop exphadoop exp
hadoop exp
 
Hadoop 2.0-development
Hadoop 2.0-developmentHadoop 2.0-development
Hadoop 2.0-development
 
Hadoop online training in india
Hadoop online training  in indiaHadoop online training  in india
Hadoop online training in india
 
Sudhanshu kumar hadoop
Sudhanshu kumar hadoopSudhanshu kumar hadoop
Sudhanshu kumar hadoop
 
Hadoop training kit from lcc infotech
Hadoop   training kit from lcc infotechHadoop   training kit from lcc infotech
Hadoop training kit from lcc infotech
 
Hadoop course content
Hadoop course contentHadoop course content
Hadoop course content
 
Android Mobile Development Course
Android Mobile Development Course Android Mobile Development Course
Android Mobile Development Course
 
Resume
ResumeResume
Resume
 
Learn hadoop and big data technologies
Learn hadoop and big data technologiesLearn hadoop and big data technologies
Learn hadoop and big data technologies
 
Applications on Hadoop
Applications on HadoopApplications on Hadoop
Applications on Hadoop
 
Big data analytics_using_hadoop
Big data analytics_using_hadoopBig data analytics_using_hadoop
Big data analytics_using_hadoop
 
Angular JS Training Agenda
Angular JS Training AgendaAngular JS Training Agenda
Angular JS Training Agenda
 
Innovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseInnovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data Warehouse
 
Sasmita bigdata resume
Sasmita bigdata resumeSasmita bigdata resume
Sasmita bigdata resume
 
What it takes to bring Hadoop to a production-ready state
What it takes to bring Hadoop to a production-ready stateWhat it takes to bring Hadoop to a production-ready state
What it takes to bring Hadoop to a production-ready state
 
HimaBindu
HimaBinduHimaBindu
HimaBindu
 
Apache hadoop-administrator-training
Apache hadoop-administrator-trainingApache hadoop-administrator-training
Apache hadoop-administrator-training
 

Último

Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxAmita Gupta
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSCeline George
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxdhanalakshmis0310
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 

Último (20)

Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 

Big Data Hadoop Training Course

  • 2. 2 © 2014 RMS Software Tech (rmssoftwaretech.com) Logo’s & Trademarks • Note: Any logos used in this presentation are owned by their respective companies and are only used in this slide deck for educational purposes. No other companies are responsible for or provide attribution for any of the material in these slides. • This slide deck is released under a Creative Commons License and can be reused in your own presentations, however please research the specific meanings of these symbols: • You may be able to use the slide deck for purposes beyond the CC license if you email me with the special request. •  All third party trademark rights acknowledged
  • 3. 3 © 2014 RMS Software Tech (rmssoftwaretech.com) Profile : RMS Software Technologies •  About us : ◦  Leading provider of Software solutions, System integration Services and Professional Training ◦  Based in San Jose, CA & Mumbai, India started in 2012. •  Professional Courses : ◦  Expert Training team of 10 people who provide training in various leading technologies like iPhone iOS, Android, Java, Big Data Hadoop, QA & Agile Scrum Methodologies. ◦  Team of developers using AngularJS at the client projects. ◦  Focused on Interns and engineers looking to learn new exciting technologies. •  Track Record: ◦  We work with emerging technologies to create mobile applications, rich-client desktop software, and large-scale systems (CRM, ERP). ◦  We create high quality solutions for hard problems, to help our customers thrive ◦  We can share this expertise with you developers, in the form of workshop style, hands-on training classes.
  • 4. 4 © 2014 RMS Software Tech (rmssoftwaretech.com) Course : Prerequisites and Equipment •  Prerequisites : ◦  Students should have experience with Database (DBMS) like Oracle, Informix, Sybase. No prior experience of Big Data or NOSQL and Hadoop is required for the course. •  Equipment : ◦  Please use - Laptop (Windows, Linux, or Mac).
  • 5. 5 © 2014 RMS Software Tech (rmssoftwaretech.com) Training Agenda Training Schedule & Agenda Week 1 : Hadoop Overview Week 2 : HDFS Deep Dive Week 3 : MapReduce and Pig Week 4 : Hive and HBase Week 5 : Zookeeper, Oozie, Flume, Talend Week 6 : Practice Questions, Q & A with Final Project We believe this curriculum covers the basics well, and positions students to use Hadoop effectively and efficiently. It provides a good overview on Hadoop and Big Data
  • 6. 6 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 1 : Hadoop Overview •  Brief History of Hadoop •  RDBMS/SQL vs. Hadoop •  Structured vs. Unstructured data •  Introduction to Hadoop Ecosystem (HDFS, MapReduce, Pig, Hive, HBase) •  HDFS Overview (NameNode vs. DataNode) •  MapReduce overview (JobTracker vs. TaskTracker) •  Hadoop XML files for configuration •  Hadoop Ecosystem (Hive, Pig, Hbase, Zookeeper, Mahout, Oozie, Talend, Scoop, Flume) •  Lab #1 Virtual Machine Setup
  • 7. 7 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 2 : HDFS Deep Dive •  NameNode Architecture •  DataNode Architecture •  Write Pipeline •  Read Pipeline •  HDFS Disk space quotas and number of file quotas •  Quick Intro to Java API interface •  Lab #2.
  • 8. 8 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 3 : MapReduce and Pig •  MapReduce Architecture •  Combiner, Partitioner •  JobTracker & TaskTracker •  Job Scheduling •  Distributed Cache •  Counters •  MapReduce configuration files •  Simple MapReduce example : WordCount •  Next Gen MapReduce : YARN. •  Lab #3 : MapReduce •  Lab #4 : Pig
  • 9. 9 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 4: Hive and HBase •  Hive architecture. •  Hive vs. RDBMS. •  HiveQL and Hive. Shell •  Managing Tables •  Querying Data •  Data Types and Schemas •  Introduction to UDF (User Defined Functions) •  HBase Architecture •  HBase vs. Cassandra
  • 10. 10 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 4: HBase •  Bloom Filters and Block indexes •  Table Scans and Filters •  Lab # Intro to HBase command line.
  • 11. 11 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 5: Zookeeper, Oozie, Flume, Sqoop, Talend •  Flume overview •  Flume usage •  Sqoop overview •  Sqoop usage •  Hadoop workflow •  Jobcontrol •  Oozie •  Talend •  Sqoop
  • 12. 12 © 2014 RMS Software Tech (rmssoftwaretech.com) Week 6: Project and Practice Questions •  Sample Project •  Practice Questions •  Q & A