SlideShare a Scribd company logo
1 of 32
Download to read offline
Data Analytics
Tilani Gunawardena
PhD(UNIBAS), BSc.Eng(Pera), FHEA(UK), AMIE(SL)
2019/11/22
Why?
2
Data
Data- Data is a set of values of qualitative or quantitative
variables. It is information in raw or unorganized form. It may
be a fact, figure, characters, symbols etc.
Information- Meaningful or organized data is information,
comes from analyzing data
3
Analytics- Analytics is the discovery , interpretation, and communication
of meaningful patterns or summery in data.
Data Analytics (DA) is the process of examining data sets in order to draw
conclusion about the information it contains.
Analytics is not a tool or technology, rather it is the way of thinking and
acting on data.
Data Analytics
4
Data on its own is useless unless you can make
sense of it!
Primary Focus Areas for Analytics
5
Data Analytics - Example
 Business analytics
 Risk
 Fraud
 Health
 Web
5
6
Business Analytics
7
TheProcess of Statistical Analysis
Form
Hypotheses
IdentifyData
Source
Prove/Disprove
Hypothesis
When we have resource constraints, StatisticalAnalysis enables us to make quantitative
inferences based on an amount of information we can analyze (a sample).
8
Data Analytics Life Cycle
9
Descriptive Analytics
Predictive Analytics
Prescriptive Analytics
10
Types of Data Analytics
11
TypesofDataAnalysis
TypesofDataAnalysis
Predictive PrescriptiveDescriptive
• Aims to help
uncover valuable
insight from the
data beinganalyzed
• Answersthe
question
“Whathas
happened?”
• Helps forecast
behavior ofpeople
and markets
• Answers thequestion
“What couldhappen?”
• Suggests conclusions or
actions thatmay be
taken based on the
analysis
• Answers the
question
• “Whatshould be
done?”
• What Should we do
12
Types of Analytics
Prescriptive
Analytics
advice on possible outcomes
Predictive
Analytics
understanding the future
Descriptive
Analytics
insight into the past
Why do airline prices
change every hour?
How do grocery cashiers
know to hand you coupons
you might actually use?
How does Netflix
frequently recommend
just the right movie?
13
 Though the most simple type, it is used most
often.
 Two types of descriptiveanalysis:
1. Measures of central tendency (tells us about
the middle)
 Mean − theaverage
 Median− the midpoint of the
responses
 Mode− the response with the highest
frequency
2. Measures of dispersion
 Range −the min, the max and the
 distance between thetwo
 Variance − the average degree towhich each
of the points differ fromthe mean
 Standard Deviation − the most
common/standard way ofexpressing the
spread of data
Customer_ID ItemsPurchased AmountSpent
29304 1 1.09
28308 3 44.43
19962 21 218.58
30281 1 73.02
2
1
7
6
5
4
3
2
1
0
Mean Median Mode
Mean, Median and Mode Amounts
of ItemsPurchased
6.5
Descriptive DataAnalytics
14
Variability
Mean
Mean
Mean
No Variability in Cash Flow
Variability in Cash Flow Mean
15
16
 Some mistake predictive analysis to have exclusive relevance to predicting
future events.
− However, in cases such as sentiment analysis, existing data (e.g., the text
of a tweet) is used to predict non-existent data (whether the tweet is positive
or negative).
 Several of the models that can be used for predictive analysis are:
− Forecasting
− Simulation
− Regression
− Classification
− Clustering
Predictive DataAnalytics
17
 Decisions can be formulated from descriptive and predictive analysis
− If I need to cut a product and I know that product C is least preferred and least
profitable, I will cut productC.
 However, prescriptive analytics explicitly tell youthe decisions that should
be made.This can be done using a variety of techniques:
− Linear programming
− Integer programming
− Mixed integerprogramming
− Nonlinear programming
Prescriptive DataAnalytics
18
ComparingtheThreeTypesofDataAnalytics
 Descriptive analysis is mostcommon.
− Best practice to perform descriptive
analyses prior toprescriptive/predictive
 Understand that distribution, variance,
skew, etc.,may exclude certainmodels
 How to know which type of analysis to
pursue:
− How much time do you have?
− What resources are available toyou?
− How accurate is your data? How accurate
do you need the model/analysis to be?
− How popular/accepted is the model you are considering?
 Don’t subscribe to “that’show we’ve always done it,”but
remember to use a model that stakeholders will accept.
19
Why Big Data Analytics?
Why is Big Data Analytics important?
Big data analytics helps organizations harness
their data and use it to identify new
opportunities. That, in turn, leads to smarter
business moves, more efficient operations,
higher profits and happier customers.
20
Data Analytics Tools
Enter Data Scientists
Data Scientist:
The
SEXIEST
Job
In The 21ST
centuryHarvard Business Review, Oct 2012
A Business analyst is not able
to discover insights from huge
sets of data of different
domains.
Data scientists can work in co-
ordination with different
verticals of an organization
and find useful
patterns/insights for a
company to make tangible
business decisions.
INCREASE IN JOB POSTINGS FOR
DATA SCIENTISTS IN THE US
BETWEEN 2011-12
15,000%
23
USE CASES
Facebook
 2.5 billion monthly active users
 Deep learning: Facebook makes use of facial recognition and text
analysis
 Facial recognition, Facebook uses powerful neural networks to classify
faces in the photographs
 Uses its own text understanding engine called “DeepText” to understand
user sentences.
 For targeted advertising: Deep Learning
It uses the insights gained from the data to cluster users based on their
preferences and provides them with the advertisements that appeal to
them.
25
Amazon
Amazon heavily relies on predictive analytics to increase customer satisfaction.
Personalized recommendation system.
This also comes through the suggestions that are drawn from the other users who use similar products or provide
similar ratings.
This also comes through the suggestions that are drawn from the other users who use similar products or provide
similar ratings.
26
Amazon’s Recommendation Engine
Amazon
Fraud Detection
Amazon has its own novel ways and algorithms to
detect fraud sellers and fraudulent purchases.
27
Uber
Uber is a popular smartphone application that allows you to book
a cab.
Uber makes extensive use of Big Data.
Uber has to maintain a large database of drivers, customers, and
several other records.
Whenever you hail for a cab, Uber matches your profile with the
most suitable driver.
28
Bank of America - Using Data to Leverage Customer Experience
BoA are making use of Data Science and predictive analytics
Banking industries are able to detect frauds in payments and customer
information
Prevents frauds regarding insurances, credit cards, and accounting.
Banks employ data scientists to use their quantitative knowledge where
they apply algorithms like association, clustering, forecasting, and
classification.
Risk modeling - Using Machine Learning, banks are able to minimize risk
modeling.
29
Understanding their customers through an intelligent customer segmentation
approach: high-value and low-value segments.
Clustering, logistic regression, decision trees to help the banks to understand
the Customer Lifetime Value (CLV) and take group them in the appropriate
segments.
30
Bank of America - Using Data to Leverage Customer Experience
Airbnb
31
host accommodations as well as find them through its mobile app and website
Contain massive big data of customer and host information, homestays and
lodge records, as well as website traffic
Ex : In 2014, Airbnb found out that users from certain countries would click the
neighborhood link, browse the page and photos and not make any booking.
In order to mitigate this issue, Airbnb released a different version for the users
from those countries and replaced neighborhood links with the top travel
destinations. This saw a 10% improvement in the lift rate for those users.
Spotify
online music streaming giant
With over 100 million users, Spotify deals with a massive amount of big data
uses the 600 GBs of daily data generated by the users to build its algorithms
to boost user experience.
In the year 2017, Spotify used data science to gain insights about which
universities had the highest percentage of party playlists and which ones spent
the most time on it.
32

More Related Content

What's hot

Data mining slides
Data mining slidesData mining slides
Data mining slides
smj
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Simplilearn
 

What's hot (20)

Data science life cycle
Data science life cycleData science life cycle
Data science life cycle
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
 
Data analytics
Data analyticsData analytics
Data analytics
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analytics
 
Introduction to Data Science and Analytics
Introduction to Data Science and AnalyticsIntroduction to Data Science and Analytics
Introduction to Data Science and Analytics
 
Introduction To Analytics
Introduction To AnalyticsIntroduction To Analytics
Introduction To Analytics
 
Data mining slides
Data mining slidesData mining slides
Data mining slides
 
The Importance of Data Visualization
The Importance of Data VisualizationThe Importance of Data Visualization
The Importance of Data Visualization
 
Introduction to data science club
Introduction to data science clubIntroduction to data science club
Introduction to data science club
 
Predictive Analytics - An Overview
Predictive Analytics - An OverviewPredictive Analytics - An Overview
Predictive Analytics - An Overview
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
 
Introduction to Data Visualization
Introduction to Data VisualizationIntroduction to Data Visualization
Introduction to Data Visualization
 
Big data analytics in banking sector
Big data analytics in banking sectorBig data analytics in banking sector
Big data analytics in banking sector
 
Importance of data analytics for business
Importance of data analytics for businessImportance of data analytics for business
Importance of data analytics for business
 
Data visualization
Data visualizationData visualization
Data visualization
 
Exploratory data analysis with Python
Exploratory data analysis with PythonExploratory data analysis with Python
Exploratory data analysis with Python
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysis
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
 

Similar to Data analytics

Running title TRENDS IN COMPUTER INFORMATION SYSTEMS1TRENDS I.docx
Running title TRENDS IN COMPUTER INFORMATION SYSTEMS1TRENDS I.docxRunning title TRENDS IN COMPUTER INFORMATION SYSTEMS1TRENDS I.docx
Running title TRENDS IN COMPUTER INFORMATION SYSTEMS1TRENDS I.docx
anhlodge
 

Similar to Data analytics (20)

leewayhertz.com-Data analysis workflow using Scikit-learn.pdf
leewayhertz.com-Data analysis workflow using Scikit-learn.pdfleewayhertz.com-Data analysis workflow using Scikit-learn.pdf
leewayhertz.com-Data analysis workflow using Scikit-learn.pdf
 
what is ..how to process types and methods involved in data analysis
what is ..how to process types and methods involved in data analysiswhat is ..how to process types and methods involved in data analysis
what is ..how to process types and methods involved in data analysis
 
Unveiling the Power of Data Analytics Transforming Insights into Action.pdf
Unveiling the Power of Data Analytics Transforming Insights into Action.pdfUnveiling the Power of Data Analytics Transforming Insights into Action.pdf
Unveiling the Power of Data Analytics Transforming Insights into Action.pdf
 
Insurance value chain
Insurance value chainInsurance value chain
Insurance value chain
 
Data Mining
Data MiningData Mining
Data Mining
 
MB2208A- Business Analytics- unit-4.pptx
MB2208A- Business Analytics- unit-4.pptxMB2208A- Business Analytics- unit-4.pptx
MB2208A- Business Analytics- unit-4.pptx
 
Data Analysis and Analytics.pdf
Data Analysis and Analytics.pdfData Analysis and Analytics.pdf
Data Analysis and Analytics.pdf
 
Unveiling the Power of Data Analytics.pdf
Unveiling the Power of Data Analytics.pdfUnveiling the Power of Data Analytics.pdf
Unveiling the Power of Data Analytics.pdf
 
Applications of machine learning
Applications of machine learningApplications of machine learning
Applications of machine learning
 
Career Opportunities in Data Science
Career Opportunities in Data ScienceCareer Opportunities in Data Science
Career Opportunities in Data Science
 
Regression and correlation
Regression and correlationRegression and correlation
Regression and correlation
 
ForresterPredictiveWave
ForresterPredictiveWaveForresterPredictiveWave
ForresterPredictiveWave
 
Beginners_s_Guide_Data_Analytics_1661051664.pdf
Beginners_s_Guide_Data_Analytics_1661051664.pdfBeginners_s_Guide_Data_Analytics_1661051664.pdf
Beginners_s_Guide_Data_Analytics_1661051664.pdf
 
Datamining
DataminingDatamining
Datamining
 
Datamining
DataminingDatamining
Datamining
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Big data: What's the big deal?
Big data: What's the big deal?Big data: What's the big deal?
Big data: What's the big deal?
 
Running title TRENDS IN COMPUTER INFORMATION SYSTEMS1TRENDS I.docx
Running title TRENDS IN COMPUTER INFORMATION SYSTEMS1TRENDS I.docxRunning title TRENDS IN COMPUTER INFORMATION SYSTEMS1TRENDS I.docx
Running title TRENDS IN COMPUTER INFORMATION SYSTEMS1TRENDS I.docx
 
Data analytics course in chandigarh, mohali
Data analytics course in chandigarh, mohaliData analytics course in chandigarh, mohali
Data analytics course in chandigarh, mohali
 
BUSINESS_INTELLIGENT_AND_ANALYTICS.pptx
BUSINESS_INTELLIGENT_AND_ANALYTICS.pptxBUSINESS_INTELLIGENT_AND_ANALYTICS.pptx
BUSINESS_INTELLIGENT_AND_ANALYTICS.pptx
 

More from Tilani Gunawardena PhD(UNIBAS), BSc(Pera), FHEA(UK), CEng, MIESL

More from Tilani Gunawardena PhD(UNIBAS), BSc(Pera), FHEA(UK), CEng, MIESL (20)

BlockChain.pptx
BlockChain.pptxBlockChain.pptx
BlockChain.pptx
 
Introduction to data mining and machine learning
Introduction to data mining and machine learningIntroduction to data mining and machine learning
Introduction to data mining and machine learning
 
Introduction to cloud computing
Introduction to cloud computingIntroduction to cloud computing
Introduction to cloud computing
 
Hadoop Eco system
Hadoop Eco systemHadoop Eco system
Hadoop Eco system
 
Parallel Computing on the GPU
Parallel Computing on the GPUParallel Computing on the GPU
Parallel Computing on the GPU
 
evaluation and credibility-Part 2
evaluation and credibility-Part 2evaluation and credibility-Part 2
evaluation and credibility-Part 2
 
evaluation and credibility-Part 1
evaluation and credibility-Part 1evaluation and credibility-Part 1
evaluation and credibility-Part 1
 
Machine Learning and Data Mining
Machine Learning and Data MiningMachine Learning and Data Mining
Machine Learning and Data Mining
 
K Nearest Neighbors
K Nearest NeighborsK Nearest Neighbors
K Nearest Neighbors
 
Decision tree
Decision treeDecision tree
Decision tree
 
kmean clustering
kmean clusteringkmean clustering
kmean clustering
 
Covering algorithm
Covering algorithmCovering algorithm
Covering algorithm
 
Hierachical clustering
Hierachical clusteringHierachical clustering
Hierachical clustering
 
Assosiate rule mining
Assosiate rule miningAssosiate rule mining
Assosiate rule mining
 
Big data in telecom
Big data in telecomBig data in telecom
Big data in telecom
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
 
MapReduce
MapReduceMapReduce
MapReduce
 
Cheetah:Data Warehouse on Top of MapReduce
Cheetah:Data Warehouse on Top of MapReduceCheetah:Data Warehouse on Top of MapReduce
Cheetah:Data Warehouse on Top of MapReduce
 
Pig Experience
Pig ExperiencePig Experience
Pig Experience
 
Interpreting the Data:Parallel Analysis with Sawzall
Interpreting the Data:Parallel Analysis with SawzallInterpreting the Data:Parallel Analysis with Sawzall
Interpreting the Data:Parallel Analysis with Sawzall
 

Recently uploaded

Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
nirzagarg
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
gajnagarg
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
HyderabadDolls
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
chadhar227
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
Health
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
ahmedjiabur940
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
SayantanBiswas37
 

Recently uploaded (20)

Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat  9332606886Call Girls Advance Cash On Delivery Ser...Top Call Girls in Balaghat  9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
 
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
 

Data analytics

  • 1. Data Analytics Tilani Gunawardena PhD(UNIBAS), BSc.Eng(Pera), FHEA(UK), AMIE(SL) 2019/11/22
  • 3. Data Data- Data is a set of values of qualitative or quantitative variables. It is information in raw or unorganized form. It may be a fact, figure, characters, symbols etc. Information- Meaningful or organized data is information, comes from analyzing data 3
  • 4. Analytics- Analytics is the discovery , interpretation, and communication of meaningful patterns or summery in data. Data Analytics (DA) is the process of examining data sets in order to draw conclusion about the information it contains. Analytics is not a tool or technology, rather it is the way of thinking and acting on data. Data Analytics 4 Data on its own is useless unless you can make sense of it!
  • 5. Primary Focus Areas for Analytics 5
  • 6. Data Analytics - Example  Business analytics  Risk  Fraud  Health  Web 5 6
  • 8. TheProcess of Statistical Analysis Form Hypotheses IdentifyData Source Prove/Disprove Hypothesis When we have resource constraints, StatisticalAnalysis enables us to make quantitative inferences based on an amount of information we can analyze (a sample). 8
  • 12. TypesofDataAnalysis Predictive PrescriptiveDescriptive • Aims to help uncover valuable insight from the data beinganalyzed • Answersthe question “Whathas happened?” • Helps forecast behavior ofpeople and markets • Answers thequestion “What couldhappen?” • Suggests conclusions or actions thatmay be taken based on the analysis • Answers the question • “Whatshould be done?” • What Should we do 12
  • 13. Types of Analytics Prescriptive Analytics advice on possible outcomes Predictive Analytics understanding the future Descriptive Analytics insight into the past Why do airline prices change every hour? How do grocery cashiers know to hand you coupons you might actually use? How does Netflix frequently recommend just the right movie? 13
  • 14.  Though the most simple type, it is used most often.  Two types of descriptiveanalysis: 1. Measures of central tendency (tells us about the middle)  Mean − theaverage  Median− the midpoint of the responses  Mode− the response with the highest frequency 2. Measures of dispersion  Range −the min, the max and the  distance between thetwo  Variance − the average degree towhich each of the points differ fromthe mean  Standard Deviation − the most common/standard way ofexpressing the spread of data Customer_ID ItemsPurchased AmountSpent 29304 1 1.09 28308 3 44.43 19962 21 218.58 30281 1 73.02 2 1 7 6 5 4 3 2 1 0 Mean Median Mode Mean, Median and Mode Amounts of ItemsPurchased 6.5 Descriptive DataAnalytics 14
  • 15. Variability Mean Mean Mean No Variability in Cash Flow Variability in Cash Flow Mean 15
  • 16. 16
  • 17.  Some mistake predictive analysis to have exclusive relevance to predicting future events. − However, in cases such as sentiment analysis, existing data (e.g., the text of a tweet) is used to predict non-existent data (whether the tweet is positive or negative).  Several of the models that can be used for predictive analysis are: − Forecasting − Simulation − Regression − Classification − Clustering Predictive DataAnalytics 17
  • 18.  Decisions can be formulated from descriptive and predictive analysis − If I need to cut a product and I know that product C is least preferred and least profitable, I will cut productC.  However, prescriptive analytics explicitly tell youthe decisions that should be made.This can be done using a variety of techniques: − Linear programming − Integer programming − Mixed integerprogramming − Nonlinear programming Prescriptive DataAnalytics 18
  • 19. ComparingtheThreeTypesofDataAnalytics  Descriptive analysis is mostcommon. − Best practice to perform descriptive analyses prior toprescriptive/predictive  Understand that distribution, variance, skew, etc.,may exclude certainmodels  How to know which type of analysis to pursue: − How much time do you have? − What resources are available toyou? − How accurate is your data? How accurate do you need the model/analysis to be? − How popular/accepted is the model you are considering?  Don’t subscribe to “that’show we’ve always done it,”but remember to use a model that stakeholders will accept. 19
  • 20. Why Big Data Analytics? Why is Big Data Analytics important? Big data analytics helps organizations harness their data and use it to identify new opportunities. That, in turn, leads to smarter business moves, more efficient operations, higher profits and happier customers. 20
  • 22.
  • 23. Enter Data Scientists Data Scientist: The SEXIEST Job In The 21ST centuryHarvard Business Review, Oct 2012 A Business analyst is not able to discover insights from huge sets of data of different domains. Data scientists can work in co- ordination with different verticals of an organization and find useful patterns/insights for a company to make tangible business decisions. INCREASE IN JOB POSTINGS FOR DATA SCIENTISTS IN THE US BETWEEN 2011-12 15,000% 23
  • 25. Facebook  2.5 billion monthly active users  Deep learning: Facebook makes use of facial recognition and text analysis  Facial recognition, Facebook uses powerful neural networks to classify faces in the photographs  Uses its own text understanding engine called “DeepText” to understand user sentences.  For targeted advertising: Deep Learning It uses the insights gained from the data to cluster users based on their preferences and provides them with the advertisements that appeal to them. 25
  • 26. Amazon Amazon heavily relies on predictive analytics to increase customer satisfaction. Personalized recommendation system. This also comes through the suggestions that are drawn from the other users who use similar products or provide similar ratings. This also comes through the suggestions that are drawn from the other users who use similar products or provide similar ratings. 26 Amazon’s Recommendation Engine
  • 27. Amazon Fraud Detection Amazon has its own novel ways and algorithms to detect fraud sellers and fraudulent purchases. 27
  • 28. Uber Uber is a popular smartphone application that allows you to book a cab. Uber makes extensive use of Big Data. Uber has to maintain a large database of drivers, customers, and several other records. Whenever you hail for a cab, Uber matches your profile with the most suitable driver. 28
  • 29. Bank of America - Using Data to Leverage Customer Experience BoA are making use of Data Science and predictive analytics Banking industries are able to detect frauds in payments and customer information Prevents frauds regarding insurances, credit cards, and accounting. Banks employ data scientists to use their quantitative knowledge where they apply algorithms like association, clustering, forecasting, and classification. Risk modeling - Using Machine Learning, banks are able to minimize risk modeling. 29
  • 30. Understanding their customers through an intelligent customer segmentation approach: high-value and low-value segments. Clustering, logistic regression, decision trees to help the banks to understand the Customer Lifetime Value (CLV) and take group them in the appropriate segments. 30 Bank of America - Using Data to Leverage Customer Experience
  • 31. Airbnb 31 host accommodations as well as find them through its mobile app and website Contain massive big data of customer and host information, homestays and lodge records, as well as website traffic Ex : In 2014, Airbnb found out that users from certain countries would click the neighborhood link, browse the page and photos and not make any booking. In order to mitigate this issue, Airbnb released a different version for the users from those countries and replaced neighborhood links with the top travel destinations. This saw a 10% improvement in the lift rate for those users.
  • 32. Spotify online music streaming giant With over 100 million users, Spotify deals with a massive amount of big data uses the 600 GBs of daily data generated by the users to build its algorithms to boost user experience. In the year 2017, Spotify used data science to gain insights about which universities had the highest percentage of party playlists and which ones spent the most time on it. 32