SlideShare a Scribd company logo
1 of 13
Download to read offline
Sujay Chungath
Founder Director, Netscitus Corporation
Latest Trends in Big Data and
Career Opportunities
© Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
AGENDA
Time : 9 AM to 10 AM IST
Table of Contents
● Netscientium - Who are we ?
● What is Big Data and relevance
● Latest trends
● Career opportunities
● Q&A
● Our offerings
● Contact
© Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
NETSCIENTIUM, WHO ARE WE ?
● Netscientiun is the Knowledge
Initiative of Netscitus Corporation, a
company with base in India and USA
● Netscientium is specialized in giving
online and offline trainings in Big
Data Technologies
© Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
WHAT IS BIG DATA AND ITS RELEVANCE
Health information
exchange Gene sequencing,
Serialization,
Healthcare service
quality
improvements
Drug Safety
Banks and
Financial services
Modeling True Risk,
Threat Analysis, Fraud
Detection, Trade
Surveillance, Credit
Scoring And Analysis
Retail
Point of sales
Transaction
Analysis,
Customer Churn
Analysis,
Sentiment Analysis
© Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
LIMITATIONS OF EXISTING TECHNOLOGIES
A meagre 10%
of the 2PB
Data is
available for BI
BI Reports + Interactive Apps
RDBMS (Aggregated Data)
EPL Compute Grid
Storage
Storage only Grid (original Raw Data)
Processing
2. Moving data to compute doesn’t
scale.
1. Can’t
explore
original high
fidelity raw
data.
90% of the
2PB
archived
3.
Premature
data death
Mostly
Append
Collection
© Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
HADOOP ADVANTAGE
BI Reports + Interactive Apps
RDBMS (Aggregated Data)
Hadoop: Storage + Compute Grid
Both
Storage
And
Processing
No Data
Archiving
1. Data Exploration
& Advanced
analytics
3. Keep
Data Alive
forever
Mostly
Append
Collection
Entire 2PB
Data is
available for
processing
2. Scalable throughout for ETL &
aggregation
Instrumentation
© Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
CAREER OPPORTUNITIES - DATA SCIENTIST
Data Scientist
 The [big] data scientist needs to be able to program
 Python, R, Java, Ruby, Clojure, Matlab, Pig or SQL.
 They need to have an understanding of Hadoop, Hive and/or MapReduce.
 In addition the need to be familiar with disciplines such as:
 Natural Language Processing: the interactions between computers and humans;
 Machine learning: using computers to improve as well as develop algorithms;
 Conceptual modeling: to be able to share and articulate modelling;
 Statistical analysis: to understand and work around possible limitations in models;
 Predictive modeling: most of the big data problems are towards being able to predict future
outcomes;
© Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
CAREER OPPORTUNITIES-- BIG DATA ENGINEER
Role - Big Data Engineer / BigData Development / Bigdata Architect
• A software Engineer who is expert in Java / C / C++ => HADOOP (APIs, MR Coding, Ecosystem &
Admin ) => HIVE/PIG/IMPALA/ML => OOZIE Plus Monitoring.
• Architect, Design & Develop Bigdata based software from scratch / Upgrade / Mainitain.
• A software Engineer who is expert in ORACLE / PL/SQL/ MS SQL / TERRADATA / DATA WAREHOUSING
=> HADOOP (APIs, MR Coding, Ecosystem & Admin ) => HIVE/PIG/IMPALA/ML => OOZIE Plus
Monitoring tools.
• Architect, Design & Develop Bigdata based data ware house
© Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
CAREER OPPORTUNITIES - HADOOP DBA
• Role - Big Data DBA
 Design and Development of Data modelling.
 Hadoop ecosystem installation and configuration.
 DR / Cluster to Clysters - Database backup and recovery.
 Database connectivity and security.
 Performance monitoring and tuning ; Configuration based
 Disk space management.
 Software patches and upgrades for Unix as well as Hadoop
© Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
CAREER OPPORTUNITIES - HADOOP ADMINISTRATOR
● Role - Big Data Admin
• Good Linux and shell Scripting background
• Good knowledge of Hadoop Ecosystem and technologies.
• Understanding of Hadoop design principals and factors that affect distributed system
performance, including hardware and network considerations.
• Experience in providing Infrastructure Recommendations, Capacity Planning and develop
utilities to monitor cluster better
• Experience around managing large clusters with huge volumes of data
• Experience with cluster maintenance tasks such as creation and removal of nodes, cluster
monitoring and troubleshooting. Manage and review Hadoop log files.
• Experience installing and implementing security for Hadoop clusters.
• Installing Hadoop Updates, patches and version upgrades.
© Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
CAREER OPPORTUNITIES - HADOOP OPERATIONS
BigData – Production Support / Operations
• Good Linux and shell Scripting background
• Good knowledge of Hadoop Ecosystem and technologies.
• Cluster maintenance
• Job Management / Job failures / Investigation / Restart
• Autosys / Oozie integration
• Data analysis – Data recovery
• Cluster to Cluster data movement
• Escalations
• Operations management.
© Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
CONTACT US
❖ India
➢ Email
■ careermanager@netscientium.com
■ smitha@netscientium.com
■ Phone +91 9008587999
❖ USA
➢ Email
■ careermanager@netscientium.com
➢ Phone
 Website http://netscientium.com/
careermanager@netscientium.com
© Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
WRITE TO US FOR NEXT WEBINAR
Note : Please write a mail to us with your feedback and following details to get the updates on next
webinar
Use coupon code ‘WEBINAR-11’ and your mail id used today to avail Rs.2000/- off in our trainings in
August and September 2015.
© Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com

More Related Content

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

Featured

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Latest trends in big data and career opportunities

  • 1. Sujay Chungath Founder Director, Netscitus Corporation Latest Trends in Big Data and Career Opportunities © Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
  • 2. AGENDA Time : 9 AM to 10 AM IST Table of Contents ● Netscientium - Who are we ? ● What is Big Data and relevance ● Latest trends ● Career opportunities ● Q&A ● Our offerings ● Contact © Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
  • 3. NETSCIENTIUM, WHO ARE WE ? ● Netscientiun is the Knowledge Initiative of Netscitus Corporation, a company with base in India and USA ● Netscientium is specialized in giving online and offline trainings in Big Data Technologies © Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
  • 4. WHAT IS BIG DATA AND ITS RELEVANCE Health information exchange Gene sequencing, Serialization, Healthcare service quality improvements Drug Safety Banks and Financial services Modeling True Risk, Threat Analysis, Fraud Detection, Trade Surveillance, Credit Scoring And Analysis Retail Point of sales Transaction Analysis, Customer Churn Analysis, Sentiment Analysis © Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
  • 5. LIMITATIONS OF EXISTING TECHNOLOGIES A meagre 10% of the 2PB Data is available for BI BI Reports + Interactive Apps RDBMS (Aggregated Data) EPL Compute Grid Storage Storage only Grid (original Raw Data) Processing 2. Moving data to compute doesn’t scale. 1. Can’t explore original high fidelity raw data. 90% of the 2PB archived 3. Premature data death Mostly Append Collection © Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
  • 6. HADOOP ADVANTAGE BI Reports + Interactive Apps RDBMS (Aggregated Data) Hadoop: Storage + Compute Grid Both Storage And Processing No Data Archiving 1. Data Exploration & Advanced analytics 3. Keep Data Alive forever Mostly Append Collection Entire 2PB Data is available for processing 2. Scalable throughout for ETL & aggregation Instrumentation © Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
  • 7. CAREER OPPORTUNITIES - DATA SCIENTIST Data Scientist  The [big] data scientist needs to be able to program  Python, R, Java, Ruby, Clojure, Matlab, Pig or SQL.  They need to have an understanding of Hadoop, Hive and/or MapReduce.  In addition the need to be familiar with disciplines such as:  Natural Language Processing: the interactions between computers and humans;  Machine learning: using computers to improve as well as develop algorithms;  Conceptual modeling: to be able to share and articulate modelling;  Statistical analysis: to understand and work around possible limitations in models;  Predictive modeling: most of the big data problems are towards being able to predict future outcomes; © Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
  • 8. CAREER OPPORTUNITIES-- BIG DATA ENGINEER Role - Big Data Engineer / BigData Development / Bigdata Architect • A software Engineer who is expert in Java / C / C++ => HADOOP (APIs, MR Coding, Ecosystem & Admin ) => HIVE/PIG/IMPALA/ML => OOZIE Plus Monitoring. • Architect, Design & Develop Bigdata based software from scratch / Upgrade / Mainitain. • A software Engineer who is expert in ORACLE / PL/SQL/ MS SQL / TERRADATA / DATA WAREHOUSING => HADOOP (APIs, MR Coding, Ecosystem & Admin ) => HIVE/PIG/IMPALA/ML => OOZIE Plus Monitoring tools. • Architect, Design & Develop Bigdata based data ware house © Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
  • 9. CAREER OPPORTUNITIES - HADOOP DBA • Role - Big Data DBA  Design and Development of Data modelling.  Hadoop ecosystem installation and configuration.  DR / Cluster to Clysters - Database backup and recovery.  Database connectivity and security.  Performance monitoring and tuning ; Configuration based  Disk space management.  Software patches and upgrades for Unix as well as Hadoop © Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
  • 10. CAREER OPPORTUNITIES - HADOOP ADMINISTRATOR ● Role - Big Data Admin • Good Linux and shell Scripting background • Good knowledge of Hadoop Ecosystem and technologies. • Understanding of Hadoop design principals and factors that affect distributed system performance, including hardware and network considerations. • Experience in providing Infrastructure Recommendations, Capacity Planning and develop utilities to monitor cluster better • Experience around managing large clusters with huge volumes of data • Experience with cluster maintenance tasks such as creation and removal of nodes, cluster monitoring and troubleshooting. Manage and review Hadoop log files. • Experience installing and implementing security for Hadoop clusters. • Installing Hadoop Updates, patches and version upgrades. © Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
  • 11. CAREER OPPORTUNITIES - HADOOP OPERATIONS BigData – Production Support / Operations • Good Linux and shell Scripting background • Good knowledge of Hadoop Ecosystem and technologies. • Cluster maintenance • Job Management / Job failures / Investigation / Restart • Autosys / Oozie integration • Data analysis – Data recovery • Cluster to Cluster data movement • Escalations • Operations management. © Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
  • 12. CONTACT US ❖ India ➢ Email ■ careermanager@netscientium.com ■ smitha@netscientium.com ■ Phone +91 9008587999 ❖ USA ➢ Email ■ careermanager@netscientium.com ➢ Phone  Website http://netscientium.com/ careermanager@netscientium.com © Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com
  • 13. WRITE TO US FOR NEXT WEBINAR Note : Please write a mail to us with your feedback and following details to get the updates on next webinar Use coupon code ‘WEBINAR-11’ and your mail id used today to avail Rs.2000/- off in our trainings in August and September 2015. © Netscientium All Rights Reserved 2015 Email:careermanager@netscientium.com