SlideShare uma empresa Scribd logo
1 de 22
Baixar para ler offline
Hadoop/Spark Webinar
Career Opportunities in Open Source Framework
Know your Trainer
www.edunextgen.com 2©2018 EduNextgen
Webinar Objective
What is Open Source Framework?
Open Source Framework Example
Difference from other types of Framework?
Importance of open source framework
What is Big Data?
Deep understanding of Hadoop and Spark framework
Job Analysis on Hadoop and Spark
www.edunextgen.com 3©2018 EduNextgen
What is Open Source Framework?
The term “Open Source" refers to something people can modify and share because its design is publicly accessible
Code Available publicly, so source code that anyone can inspect, modify, and enhance
www.edunextgen.com 4©2018 EduNextgen
Open Source Framework Example
www.edunextgen.com 5©2018 EduNextgen
Difference from other types of Framework
Open Source Framework:
Source code available publicly
Redistribute solutions
Can use in any way
Eliminates single point of failure
Democratic forum for action
No vendor lock-in
No guarantee for development will continue
Intellectual property (algorithms)
Support consistency
www.edunextgen.com 6
Proprietary Software:
Predictable releases
Entity to hold responsible for bugs, errors and updates
Consistent feature development
More stable framework
More consistent training options
Easier access to support
Single company releasing patches
Costs will be Higher for start-up
Vendor will have owns software
©2018 EduNextgen
Importance of open source framework
Control: People have more control. As per requirement can be change and modify as well. Experiment can be possible and what
ever suitable for the requirement can be implement.
Training: Open source code is publicly accessible, students can easily study it as they learn to make better software. It help to
develop the skill.
Security: More secure and stable, because anyone can view and modify, someone might spot and correct errors or omissions
that a program's original authors might have missed.
Stability: It can be possible to develop long-term projects. Because programmers publicly distribute the source code for open
source software, users relying on that software for critical tasks can be sure their tools won't disappear or fall into disrepair if their
original creators stop working on them.
www.edunextgen.com 7©2018 EduNextgen
What is Big Data?
Big Data is extremely large volume of data sets that may be trends or
associations, especially relating to human behavior and interactions
Big data is a term that describes the large volume (Terabytes or Petabytes) of
data – both structured and unstructured – that inundates a business on a day-
to-day basis
Big Data is a collection of huge amount of data set which is not possible to
handle in traditional way
Big Data include capturing data, data storage, data analysis, visualization,
querying etc.
www.edunextgen.com 8©2018 EduNextgen
Introduction to Hadoop
www.edunextgen.com 9
Hadoop is an Open-source Data Management Framework which
support, store and process big data
Hadoop is a part of apache project. It is being used by Google,
Yahoo, Facebook, Twitter, LinkedIn and many more
It allows the distributed processing of huge data across clusters
Hadoop was developed by Doug Cutting and Mike Cafarella in
year 2006
©2018 EduNextgen
Necessity of Hadoop
Make strategic, confident decisions based on solid data and advanced analytics
Gain valuable business insights that help you pinpoint weaknesses and discover
new opportunities
Earn higher profits by better understanding the business, processes and the
customers
Big Data flowing in at exponential rate
Increased number of Hadoop driven jobs
www.edunextgen.com 10©2018 EduNextgen
Hadoop Characteristics
www.edunextgen.com 11
Hadoop Is Easily Scalable
Hadoop Brings Flexibility In Data Processing
Hadoop Is Fault Tolerant
Hadoop Is Great At Faster Data Processing
Hadoop Ecosystem Is Robust:
Hadoop Is Very Cost Effective
©2018 EduNextgen
Introduction to Apache Spark
www.edunextgen.com 12
Apache Spark is an open-source cluster in-memory computing framework.
Apache Spark provides an interface for programming entire clusters with implicit data
parallelism and fault tolerance.
Apache Spark provides high-level APIs in Scala, Java, R and Python.
Spark is 100x faster then Hadoop MapReduce
Spark support streaming process for large dataset
Initial Release on May, 2014
Stable Version Release on July, 2017: v2.2.0 on
Spark is written in Scala, Java, Python
Operating System Support: Microsoft Windows, macOS, Linux
©2018 EduNextgen
Introduction to Apache Spark (Cont’d)
www.edunextgen.com 13©2018 EduNextgen
Why Spark?
Speed:
Spark is In-memory computations
It extends the MapReduce model and takes it to a whole other level
Spark is 100x times faster than Hadoop MapReduce
Generality:
Spark is able to handle wide range of workloads
Iterative algorithms
It provide interactive queries and streaming feature
Ease of use:
We have APIs as Scala, Python, Java in Spark
It contents libraries for ML, SQL, Streaming and Graph Processing
Spark runs on Hadoop clusters, Mesos Cassandra etc
www.edunextgen.com 14©2018 EduNextgen
Job Opportunities
www.edunextgen.com 15
After the U.S., India has the largest demand of analytics / big data / data science professionals. Amidst such
demand, people find themselves confused to select an appropriate job profile for the best future.
“A professional with working knowledge of data science and big data earns 8% more than with co-worker “
©2018 EduNextgen
Job Opportunities (Cont’d)
www.edunextgen.com 16
89% of hiring managers find it difficult to find talent
47% of employers are willing to pay for professional certifications, up from 33% in 2017
Positions they are looking for:
73% – Developers
60% – DevOps
53% – SysAdmins
Employers are seeking expertise in:
70% – Cloud
67% – Big Data
65% – Linux
©2018 EduNextgen
Job Opportunities (Cont’d)
www.edunextgen.com 17©2018 EduNextgen
Next Webinar: Execute your First Hive Project
What is Big Data?
Why do we need Big Data?
What is Hive?
Basic Hive Operations & Commands:
Create database, Show databases, Use, Create table, Show table, Describe, Data
Loading in Hive Table – From Local Filesystem, Inserting Data in Hive Table,
Select*
Retails domain project execution with the hive:
Use Case #1: Out of 20000 how many customers given product rating
Use Case #2: Find how many number of product available for below brand:
Puma
Regular
First Choice
Note: Show product details for "Puma“
www.edunextgen.com 18©2018 EduNextgen
Hadoop Kick-Starter Course
What is this course about?
Get insights into applications of Big Data and Hadoop along with learning about performing basic operation of HDFS,
MapReduce and Hive. The course is bundled with industry grade hands on assignments and project access provided
through VM environment to practice what you learn. A program to help you understand career path in Big Data and
available learning paths to advance your career options.
Duration: 6 Hrs.
Date: 20th & 21st January, 2018
Time: 07:30 PM to 10:30 PM
Price: ₹ 499
www.edunextgen.com 19©2018 EduNextgen
Participants will get access to
Course Content (LMS Access)
10+ Assignments
20+ Quizzes
Pre-Installed Hadoop environment (Plug and Play)
1 Project with 5 Use Cases
Hadoop Kick-Starter Course Curriculum
Day #1:
What is Big Data?
Why need More and More Data?
Big Data Characteristics:
Volume, Velocity, Variety, Veracity
Types of Data
Applications of Big Data
Industry who generate Big Data
Introduction to Hadoop
Why Hadoop?
Hadoop Ecosystem
YARN
www.edunextgen.com 20©2018 EduNextgen
Day #2:
Hive: Introduction
What is Hive and it’s Limitation?
Hive Architecture
Hive Components
Hive Data Types:
Primary Data Types
Complex Data Types
Various Hive Commands and Operations
Joins in Hive
Project Execution
Q&A
www.edunextgen.com 21©2018 EduNextgen
Hadoop and Spark Webinar: Career Opportunities in Big Data Frameworks

Mais conteúdo relacionado

Mais procurados

Manipulating data with Talend. Learn how?
Manipulating data with Talend. Learn how?Manipulating data with Talend. Learn how?
Manipulating data with Talend. Learn how?Edureka!
 
Predicting Consumer Behaviour via Hadoop
Predicting Consumer Behaviour via HadoopPredicting Consumer Behaviour via Hadoop
Predicting Consumer Behaviour via HadoopSkillspeed
 
DataOps, DevOps and the Developer: Treating Database Code Just Like App Code
DataOps, DevOps and the Developer: Treating Database Code Just Like App CodeDataOps, DevOps and the Developer: Treating Database Code Just Like App Code
DataOps, DevOps and the Developer: Treating Database Code Just Like App CodeDevOps.com
 
Talk to me Goose: Going beyond your regular Chatbot
Talk to me Goose: Going beyond your regular ChatbotTalk to me Goose: Going beyond your regular Chatbot
Talk to me Goose: Going beyond your regular ChatbotLuc Bors
 
Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!DataWorks Summit
 
Data Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
Data Science: Driving Smarter Finance and Workforce Decsions for the EnterpriseData Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
Data Science: Driving Smarter Finance and Workforce Decsions for the EnterpriseDataWorks Summit
 
CAG Corporate Dossier
CAG Corporate DossierCAG Corporate Dossier
CAG Corporate Dossiercagtorus
 
The Computer Architecture Group at the University of A Coruña, Spain
The Computer Architecture Group at the University of A Coruña, SpainThe Computer Architecture Group at the University of A Coruña, Spain
The Computer Architecture Group at the University of A Coruña, SpainKI sciencebiz
 
IoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected WorldIoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected WorldDataWorks Summit
 
Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869Edgar Alejandro Villegas
 
Talend webinar
Talend webinarTalend webinar
Talend webinarEdureka!
 

Mais procurados (15)

Manipulating data with Talend. Learn how?
Manipulating data with Talend. Learn how?Manipulating data with Talend. Learn how?
Manipulating data with Talend. Learn how?
 
Predicting Consumer Behaviour via Hadoop
Predicting Consumer Behaviour via HadoopPredicting Consumer Behaviour via Hadoop
Predicting Consumer Behaviour via Hadoop
 
DataOps, DevOps and the Developer: Treating Database Code Just Like App Code
DataOps, DevOps and the Developer: Treating Database Code Just Like App CodeDataOps, DevOps and the Developer: Treating Database Code Just Like App Code
DataOps, DevOps and the Developer: Treating Database Code Just Like App Code
 
Talk to me Goose: Going beyond your regular Chatbot
Talk to me Goose: Going beyond your regular ChatbotTalk to me Goose: Going beyond your regular Chatbot
Talk to me Goose: Going beyond your regular Chatbot
 
Hareesh
HareeshHareesh
Hareesh
 
RESUME_N
RESUME_NRESUME_N
RESUME_N
 
Resume (1)
Resume (1)Resume (1)
Resume (1)
 
Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!
 
Data Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
Data Science: Driving Smarter Finance and Workforce Decsions for the EnterpriseData Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
Data Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
 
CAG Corporate Dossier
CAG Corporate DossierCAG Corporate Dossier
CAG Corporate Dossier
 
The Computer Architecture Group at the University of A Coruña, Spain
The Computer Architecture Group at the University of A Coruña, SpainThe Computer Architecture Group at the University of A Coruña, Spain
The Computer Architecture Group at the University of A Coruña, Spain
 
PGDay Brasilia 2017
PGDay Brasilia 2017PGDay Brasilia 2017
PGDay Brasilia 2017
 
IoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected WorldIoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected World
 
Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869
 
Talend webinar
Talend webinarTalend webinar
Talend webinar
 

Semelhante a Hadoop and Spark Webinar: Career Opportunities in Big Data Frameworks

How to Become a Big Data Professional.pdf
How to Become a Big Data Professional.pdfHow to Become a Big Data Professional.pdf
How to Become a Big Data Professional.pdfCareervira
 
Top 10 renowned big data companies
Top 10 renowned big data companiesTop 10 renowned big data companies
Top 10 renowned big data companiesRobert Smith
 
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...Imam Raza
 
Big data courses
Big data coursesBig data courses
Big data coursespremav6
 
Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Pactera_US
 
All data accessible to all my organization - Presentation at OW2con'19, June...
 All data accessible to all my organization - Presentation at OW2con'19, June... All data accessible to all my organization - Presentation at OW2con'19, June...
All data accessible to all my organization - Presentation at OW2con'19, June...OW2
 
Execute your first hive project
Execute your first hive project Execute your first hive project
Execute your first hive project edunextgen
 
Transform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksTransform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksHortonworks
 
Big Data Companies and Apache Software
Big Data Companies and Apache SoftwareBig Data Companies and Apache Software
Big Data Companies and Apache SoftwareBob Marcus
 
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summitAnalysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summitSlim Baltagi
 
Coding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistanceCoding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistancephdAssistance1
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big DataDataWorks Summit
 

Semelhante a Hadoop and Spark Webinar: Career Opportunities in Big Data Frameworks (20)

sudipto_resume
sudipto_resumesudipto_resume
sudipto_resume
 
How to Become a Big Data Professional.pdf
How to Become a Big Data Professional.pdfHow to Become a Big Data Professional.pdf
How to Become a Big Data Professional.pdf
 
Top 10 renowned big data companies
Top 10 renowned big data companiesTop 10 renowned big data companies
Top 10 renowned big data companies
 
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
 
Big data courses
Big data coursesBig data courses
Big data courses
 
Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks
 
All data accessible to all my organization - Presentation at OW2con'19, June...
 All data accessible to all my organization - Presentation at OW2con'19, June... All data accessible to all my organization - Presentation at OW2con'19, June...
All data accessible to all my organization - Presentation at OW2con'19, June...
 
HimaBindu
HimaBinduHimaBindu
HimaBindu
 
Execute your first hive project
Execute your first hive project Execute your first hive project
Execute your first hive project
 
Transform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksTransform You Business with Big Data and Hortonworks
Transform You Business with Big Data and Hortonworks
 
BigData_Krishna Kumar Sharma
BigData_Krishna Kumar SharmaBigData_Krishna Kumar Sharma
BigData_Krishna Kumar Sharma
 
CSB_community
CSB_communityCSB_community
CSB_community
 
Apresentação Hadoop
Apresentação HadoopApresentação Hadoop
Apresentação Hadoop
 
Big Data Companies and Apache Software
Big Data Companies and Apache SoftwareBig Data Companies and Apache Software
Big Data Companies and Apache Software
 
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summitAnalysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
 
Coding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistanceCoding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - Phdassistance
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
 
Resume_Karthick
Resume_KarthickResume_Karthick
Resume_Karthick
 

Último

RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 

Último (20)

RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 

Hadoop and Spark Webinar: Career Opportunities in Big Data Frameworks

  • 1. Hadoop/Spark Webinar Career Opportunities in Open Source Framework
  • 3. Webinar Objective What is Open Source Framework? Open Source Framework Example Difference from other types of Framework? Importance of open source framework What is Big Data? Deep understanding of Hadoop and Spark framework Job Analysis on Hadoop and Spark www.edunextgen.com 3©2018 EduNextgen
  • 4. What is Open Source Framework? The term “Open Source" refers to something people can modify and share because its design is publicly accessible Code Available publicly, so source code that anyone can inspect, modify, and enhance www.edunextgen.com 4©2018 EduNextgen
  • 5. Open Source Framework Example www.edunextgen.com 5©2018 EduNextgen
  • 6. Difference from other types of Framework Open Source Framework: Source code available publicly Redistribute solutions Can use in any way Eliminates single point of failure Democratic forum for action No vendor lock-in No guarantee for development will continue Intellectual property (algorithms) Support consistency www.edunextgen.com 6 Proprietary Software: Predictable releases Entity to hold responsible for bugs, errors and updates Consistent feature development More stable framework More consistent training options Easier access to support Single company releasing patches Costs will be Higher for start-up Vendor will have owns software ©2018 EduNextgen
  • 7. Importance of open source framework Control: People have more control. As per requirement can be change and modify as well. Experiment can be possible and what ever suitable for the requirement can be implement. Training: Open source code is publicly accessible, students can easily study it as they learn to make better software. It help to develop the skill. Security: More secure and stable, because anyone can view and modify, someone might spot and correct errors or omissions that a program's original authors might have missed. Stability: It can be possible to develop long-term projects. Because programmers publicly distribute the source code for open source software, users relying on that software for critical tasks can be sure their tools won't disappear or fall into disrepair if their original creators stop working on them. www.edunextgen.com 7©2018 EduNextgen
  • 8. What is Big Data? Big Data is extremely large volume of data sets that may be trends or associations, especially relating to human behavior and interactions Big data is a term that describes the large volume (Terabytes or Petabytes) of data – both structured and unstructured – that inundates a business on a day- to-day basis Big Data is a collection of huge amount of data set which is not possible to handle in traditional way Big Data include capturing data, data storage, data analysis, visualization, querying etc. www.edunextgen.com 8©2018 EduNextgen
  • 9. Introduction to Hadoop www.edunextgen.com 9 Hadoop is an Open-source Data Management Framework which support, store and process big data Hadoop is a part of apache project. It is being used by Google, Yahoo, Facebook, Twitter, LinkedIn and many more It allows the distributed processing of huge data across clusters Hadoop was developed by Doug Cutting and Mike Cafarella in year 2006 ©2018 EduNextgen
  • 10. Necessity of Hadoop Make strategic, confident decisions based on solid data and advanced analytics Gain valuable business insights that help you pinpoint weaknesses and discover new opportunities Earn higher profits by better understanding the business, processes and the customers Big Data flowing in at exponential rate Increased number of Hadoop driven jobs www.edunextgen.com 10©2018 EduNextgen
  • 11. Hadoop Characteristics www.edunextgen.com 11 Hadoop Is Easily Scalable Hadoop Brings Flexibility In Data Processing Hadoop Is Fault Tolerant Hadoop Is Great At Faster Data Processing Hadoop Ecosystem Is Robust: Hadoop Is Very Cost Effective ©2018 EduNextgen
  • 12. Introduction to Apache Spark www.edunextgen.com 12 Apache Spark is an open-source cluster in-memory computing framework. Apache Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance. Apache Spark provides high-level APIs in Scala, Java, R and Python. Spark is 100x faster then Hadoop MapReduce Spark support streaming process for large dataset Initial Release on May, 2014 Stable Version Release on July, 2017: v2.2.0 on Spark is written in Scala, Java, Python Operating System Support: Microsoft Windows, macOS, Linux ©2018 EduNextgen
  • 13. Introduction to Apache Spark (Cont’d) www.edunextgen.com 13©2018 EduNextgen
  • 14. Why Spark? Speed: Spark is In-memory computations It extends the MapReduce model and takes it to a whole other level Spark is 100x times faster than Hadoop MapReduce Generality: Spark is able to handle wide range of workloads Iterative algorithms It provide interactive queries and streaming feature Ease of use: We have APIs as Scala, Python, Java in Spark It contents libraries for ML, SQL, Streaming and Graph Processing Spark runs on Hadoop clusters, Mesos Cassandra etc www.edunextgen.com 14©2018 EduNextgen
  • 15. Job Opportunities www.edunextgen.com 15 After the U.S., India has the largest demand of analytics / big data / data science professionals. Amidst such demand, people find themselves confused to select an appropriate job profile for the best future. “A professional with working knowledge of data science and big data earns 8% more than with co-worker “ ©2018 EduNextgen
  • 16. Job Opportunities (Cont’d) www.edunextgen.com 16 89% of hiring managers find it difficult to find talent 47% of employers are willing to pay for professional certifications, up from 33% in 2017 Positions they are looking for: 73% – Developers 60% – DevOps 53% – SysAdmins Employers are seeking expertise in: 70% – Cloud 67% – Big Data 65% – Linux ©2018 EduNextgen
  • 18. Next Webinar: Execute your First Hive Project What is Big Data? Why do we need Big Data? What is Hive? Basic Hive Operations & Commands: Create database, Show databases, Use, Create table, Show table, Describe, Data Loading in Hive Table – From Local Filesystem, Inserting Data in Hive Table, Select* Retails domain project execution with the hive: Use Case #1: Out of 20000 how many customers given product rating Use Case #2: Find how many number of product available for below brand: Puma Regular First Choice Note: Show product details for "Puma“ www.edunextgen.com 18©2018 EduNextgen
  • 19. Hadoop Kick-Starter Course What is this course about? Get insights into applications of Big Data and Hadoop along with learning about performing basic operation of HDFS, MapReduce and Hive. The course is bundled with industry grade hands on assignments and project access provided through VM environment to practice what you learn. A program to help you understand career path in Big Data and available learning paths to advance your career options. Duration: 6 Hrs. Date: 20th & 21st January, 2018 Time: 07:30 PM to 10:30 PM Price: ₹ 499 www.edunextgen.com 19©2018 EduNextgen Participants will get access to Course Content (LMS Access) 10+ Assignments 20+ Quizzes Pre-Installed Hadoop environment (Plug and Play) 1 Project with 5 Use Cases
  • 20. Hadoop Kick-Starter Course Curriculum Day #1: What is Big Data? Why need More and More Data? Big Data Characteristics: Volume, Velocity, Variety, Veracity Types of Data Applications of Big Data Industry who generate Big Data Introduction to Hadoop Why Hadoop? Hadoop Ecosystem YARN www.edunextgen.com 20©2018 EduNextgen Day #2: Hive: Introduction What is Hive and it’s Limitation? Hive Architecture Hive Components Hive Data Types: Primary Data Types Complex Data Types Various Hive Commands and Operations Joins in Hive Project Execution