1. INTRODUCTION TO DATA ANALYTICS
Utkarsh Sharma
Asst. Prof (CSE Dept.)
Jaypee University of Engineering & Technology
2. CONTENTS
What is Data Science, Big Data, Data Analytics?
Roles and Responsibilities of Data Scientist, Big Data Professional and Data Analyst
Required Skill set.
Understanding how data science, big data, and data analytics is used to drive the success of Netflix.
3. ROLE OF STATISTICAL LEARNING
Here are some examples of learning problems:
Predict whether a patient, hospitalized due to a heart attack, will have a second heart attack. The
prediction is to be based on demo-graphic, diet and clinical measurements for that patient.
Predict the price of a stock in 6 months from now, on the basis of company performance measures and
economic data.
Identify the numbers in a handwritten ZIP code, from a digitized image.
Estimate the amount of glucose in the blood of a diabetic person, from the infrared absorption spectrum
of that person’s blood.
Identify the risk factors for prostate cancer, based on clinical and demographic variables.
4. WHAT IS DATA SCIENCE?
Combination of mathematics, statistics and programming.
Context of problem being solved.
Ingenious way of capturing data which is not captured.
Ability to look at the things differently
5. WHAT IS BIG DATA?
Large amount of data from various source.
Traditional data processing system are incapable to deal.
In terms of Volume, Variety, Veracity, Velocity and value.
6. WHAT IS DATA ANALYTICS?
Discovering useful information from data.
Supports decision making.
Involves inspecting, cleansing, transforming and modelling data.
Uses qualitative and quantitative techniques.
7. WHAT DO DATA SCIENTISTS DO?
Predicts future based on past patterns using AI and machine learning.
Examines data from multiple sources.
Finding co-relations and hidden patterns from data.
8. WHAT DOES A BIG DATA PROFESSIONAL DO?
Architect distributed systems.
Build large scale data processing system.
Process the data using various big data tools.
9. WHAT DOES A DATA ANALYST DO?
Acquire, analyse and process the data.
Finding insights of captured data.
Create data report using various reporting tools.
14. AN EXAMPLE SCENARIO
Data Scientist
Understanding of
the impact of QoE
on User Behavior
Creating
personalized
streaming
experience
Optimizing content
caching
Improving content
quality