Many enterprises in the modern IT industry must process and distribute large amounts of data on a regular basis. Basic database management systems and tools become ineffective at processing and storing data at this scale, so knowledge of and expertise in Big Data management applications have become a necessity within the IT industry. CloudAce offers two separate Big Data Training programs that focus on Apache’s Hadoop platform.
CloudAce Technologies provides Big Data Training in partnership with leading company Cloud Enabled.
About Our Trainers
By participating in our Big Data Training programs, you will be placed under the guidance of a certified cloud computing professional who has worked with us as a Technical Lead for over 9 years, dealing extensively with Big Data analytics, development, and implementation. Our trainer holds the Hadoop developer and Hadoop administrator certifications and has a wealth of teaching experience. Our trainer also has intensive hands-on experience in implementing algorithms such as decision trees, support vector machines, random forests, naïve Bayes, neural networks, genetic algorithms, conjoint analysis, and principal component analysis.
Hadoop Administrator Training
The Hadoop Administrator Training program is designed to familiarize participants with the concepts of Big Data analytics and their applications within business processes. With the help of our fully certified trainer, participants will gain an understanding of how Apache’s Hadoop platform processes and effectively analyzes large data sets without the use of extensive hardware. Participants will be given the basic control and management guidelines required to use the platform. From writing MapReduce programs to basic cluster maintenance, participants will learn the tools required to become a Hadoop administrator. The program will also briefly cover some components of the Hadoop ecosystem.
The Hadoop Administrator Course will be conducted over a period of 3 consecutive days.
The Hadoop tutorial will be conducted in a classroom format.
The fee for the Hadoop Administrator course is 18,000 INR, exclusive of taxes.
Upon completion of this training, successful participants will receive a certification of Hadoop Administrator.
Hadoop Administrator Training
CLOUDACE TECHNOLOGIES, Regus Solitaire Business Centre (Hyderabad) Pvt Ltd, 4th Floor, Gumidelli Commercial Complex, 1-10-39 to 44, Old Airport Road, Begumpet, Hyderabad - 500016. Contact No. +91 9000798810, Email: trainings@cloudace.in, www.cloudace.in
Hadoop Administrator
Course: Hadoop Administrator
Duration: 3 Days
The agenda for the course is outlined below:
• Module 1 : Big Data – An Overview
o What is Cloud Computing
o What is Grid Computing
o What is Virtualization
o How the above three are interrelated
o What is Big Data
o Introduction to Analytics and the need for big data analytics
o Hadoop Solutions - Big Picture
o Hadoop distributions
o Comparing Hadoop vs. Traditional Systems
o Volunteer Computing
o Data Retrieval - Random Access vs. Sequential Access
o NoSQL Databases
• Module 2 : The Motivation of Hadoop
o Problems with traditional large-scale systems
o Requirements for a new approach
• Module 3 : Hadoop Basic Concepts
o What is Hadoop?
o The Hadoop Distributed File System
o How MapReduce Works
o Anatomy of a Hadoop Cluster
• Module 4 : Hadoop Daemons
o NameNode
o DataNode
o Secondary NameNode
o JobTracker
o TaskTracker
• Module 5 : Hadoop File System in Detail
o Blocks and Splits
o Replication
o Data high availability
o Data Integrity
o Cluster architecture and block placement
• Module 6 : Programming Practices and Performance Tuning
o Developing MapReduce Programs in Local Mode, Pseudo-distributed Mode, and Fully Distributed Mode
• Module 7 : Writing a MapReduce Program
o Examining a Sample MapReduce Program
o Basic API Concepts
o The Driver Code
o The Mapper
o The Reducer
o Hadoop's Streaming API
• Module 8 : Setup Hadoop Cluster
o Install and configure Apache Hadoop
o Make a fully distributed Hadoop cluster on a single laptop/desktop
o Install and configure Cloudera Hadoop distribution in fully distributed mode
o Install and configure Hortonworks Hadoop distribution in fully distributed mode
o Monitoring the cluster
o Getting used to the management consoles of Cloudera and Hortonworks
• Module 9 : Hadoop Security
o Why Hadoop Security is Important
o Hadoop’s Security System Components
o What Kerberos Is and How It Works
o Configuring Kerberos Security
o Integrating a secure Cluster with other Systems
• Module 10 : Managing and Scheduling Jobs
o Managing Running Jobs
o Hands-On Exercise
o The FIFO Scheduler
o The Fair Scheduler
o Configuring the Fair Scheduler
o Hands-on Exercise
• Module 11 : Cluster Maintenance
o Checking HDFS Status
o Hands-On Exercise
o Copying Data Between Clusters
o Adding and Removing Cluster Nodes
o Rebalancing the Cluster
o Hands-On Exercise
o NameNode Metadata Backup
• Module 12 : Cluster Monitoring and Troubleshooting
o General System Monitoring
o Managing Hadoop’s Log Files
o Using the NameNode and JobTracker Web UIs
o Hands-On Exercise
o Cluster Monitoring with Ganglia
o Common Troubleshooting Issues
o Benchmarking Your Cluster
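As a taste of what Module 7 covers, the following is a minimal sketch of a word count written in the style used with Hadoop's Streaming API, where the mapper and reducer exchange tab-separated text records. It is an illustration only and is not taken from the course material: the function names `mapper`, `reducer`, and `run_locally` are hypothetical, and `run_locally` merely stands in for the sort-and-shuffle step that the framework performs between the two phases on a real cluster.

```python
from itertools import groupby


def mapper(lines):
    """Map step: emit one 'word<TAB>1' record for every word seen."""
    for line in lines:
        for word in line.split():
            yield f"{word.lower()}\t1"


def reducer(sorted_records):
    """Reduce step: sum the counts of each run of identical keys.

    Assumes the records arrive sorted by key, which is how reducer
    input is normally delivered in a streaming job.
    """
    pairs = (record.rstrip("\n").split("\t", 1) for record in sorted_records)
    for word, group in groupby(pairs, key=lambda pair: pair[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


def run_locally(lines):
    """Hypothetical local stand-in for the framework: map, sort, reduce."""
    return list(reducer(sorted(mapper(lines))))
```

Calling `run_locally(["big data big", "Data hadoop"])` runs both phases in memory, which is a convenient way to check the logic before submitting anything to a cluster.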
Hadoop Ecosystem covered as part of Hadoop Administrator
• Ecosystem component: Ganglia
o Install and configure Ganglia on a cluster
o Configure and use Ganglia
o Use Ganglia for graphs
• Ecosystem component: Nagios
o Nagios concepts
o Install and configure Nagios on a cluster
o Use Nagios for sample alerts and monitoring
• Ecosystem component: Hive
o Hive concepts
o Install and configure Hive on a cluster
o Create a database and access it from the console
o Develop and run sample applications in Java/Python to access Hive
• Ecosystem component: Sqoop
o Install and configure Sqoop on a cluster
o Import data from Oracle/MySQL to Hive
• Overview of other ecosystem components:
o Oozie, Avro, Thrift, REST, Mahout, Cassandra, YARN, MR2, etc.
Training Duration - 3 Days Classroom Training
Course Fee - 18,000 INR + Service Taxes per Participant (excludes Exam Fees)
For further information, please email us at trainings@cloudace.in or call Mr. Rohit at 9000798810.