Ernestas Sysojevas. Hadoop Essentials and Ecosystem
1. Get Started on Your Journey to Hadoop
Hadoop Essentials and Ecosystem
Ernestas Sysojevas
DATA MINER
2. About me – Ernestas Sysojevas
Master degree in Computer Sciences (Information Systems)
>400 courses delivered in IT and IT Project Management area
>10 years of training experience, mainly in databases area (MS SQL, MYSQL)
2
Professional certifications held:
Cloudera Certified Administrator for Apache Hadoop (CCAH)
Cloudera Certified Developer for Apache Hadoop (CCDH)
Microsoft Certified Trainer (MCT)
Project Management Professional (PMP)
About 15 other IT certificates like MCDBA, MCITP SQL, MCSE, CEH, MCTS
Project and MCTS Sharepoint
Senior Trainer and Director at DATA MINER company
3. About our Company – DATA MINER
3
DATA MINER – 5 years old training and consultancy company in IT
business located in Vilnius, capital of Lithuania
That capital of foreign country is nearest to Minsk? Vilnius (188 km) – and only 2,5 hours by train
From Minsk to:
Riga – 484 km
Kiev – 527 km
Warsaw – 550 km
Moscow – 718 km
4. We as Cloudera Training Partner
Exclusive Cloudera Training Partner in Lithuania, Latvia, Estonia,
Russia, Ukraine and Kazakhstan ( Belarus in the nearest future)
4
Cloudera CDH distribution –
the most popular Hadoop
distribution in the world
With over 20,000 individuals trained, Cloudera is a leading
educator of Hadoop training and certification programs.
5. Global Hadoop market
The global Hadoop market was worth USD 1.5
billion in 2012 and is expected to reach USD 20.9
billion in 2018, growing at 55 % every year from
2012 to 2018.
Transparency Market Research "Hadoop Market - Global Industry Analysis, Size,
Share, Growth, Trends, and Forecast, 2012- 2018,„
7. The roles people play
Role Required Skills Responsibilities
System
Administrators
Strong Linux administration
skills
Install, configure, upgrade,
manage and monitor Hadoop
software and hardware
Developers Strong Java or scripting
capabilities, understanding of
MapReduce framework
Write, package and deploy
MapReduce, Hive, Pig, Impala
programs
Data Analysts SQL, understanding of data
analysis and data mining
Extracting intelligence from
the data, writing Hive/Pig,
Impala code
Data Scientists Knowledge of statistics, domain
knowledge
Answering questions you
didn‘t realize you wanted to
ask
Data Stewards Data modelling and ETL Managing data lifecycle