SlideShare uma empresa Scribd logo
1 de 38
Baixar para ler offline
www.edureka.co/r-for-analytics
View Business Analytics with R course details at www.edureka.co/r-for-analytics
Business Analytics with R
www.edureka.co/r-for-analyticsSlide 2 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Objectives
What is data mining
What is Business Analytics
Stages of Analytics / data mining
What is R
overview of Machine Learning
 What is Clustering
What is K-means Clustering
Use-case
At the end of this session, you will be able to
Slide 3Slide 3 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Data mining ??
Generally, data mining is the process of studying data from maximum possible dimensions and summarizing it into
useful information
Technically, data mining is the process of finding correlations or patterns among dozens of fields in large data
generated from business
Or you can say, data mining is the process finding useful information from the data and then devising knowledge
out of it for improving future of our business
» Data ??
Data are any facts, numbers, or text is getting produced by existing system
» Information ??
The patterns, associations, or relationships among all this data can provide information
» Knowledge ??
Information can be converted into knowledge about historical patterns and future trends. For example summary of
sales in off season may help to start some offers in that period to increase sales
Slide 4Slide 4 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Business Analytics(BA)
Refers to the skills, technologies, practices for iterative study and investigation of historical business data to
gain insight and drive business planning
Study of data through statistical and operations analysis
Makes use of past data and statistical methods to understand business performance and hence makes us
take necessary steps to improve it
Injects intelligence into the business planning
Intersection of business and technology
Slide 5Slide 5 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Business Analytics
Why Business Analytics is getting popular these days ?
Cost of storing data Cost of processing data
Slide 6Slide 6 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Cross Industry standard Process for data mining ( CRISP – DM )
Stages of Analytics / Data Mining
Slide 7Slide 7 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Knowledge discovery and data mining ( KDD)
Stages of Analytics / Data Mining
Slide 8Slide 8 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
What is R : Programming Language
 You do data analysis in R by writing scripts and functions
in the R programming language.
 R has also quickly found the following because
statisticians, engineers and scientists without computer
programming skills find it easy to use.
R is Programming Language
Slide 9Slide 9 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
What is R : Data Analysis Software
 Data Scientists, Statisticians, Analysts, Quants, and
others who need to make sense of data use R for
statistical analysis, data visualization, and
predictive modelling.
 Rexer Analytics’s Annual Data Miner Survey is the
largest survey of data mining, data science, and
analytics professionals in the industry.
 It has concluded that R's popularity has increased
substantially in recent years.
R is Data Analysis Software
Slide 10Slide 10 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
What is R : Environment for Statistical Analysis
 R language consists of functions for almost every
data manipulation, statistical model, or chart that a
data analyst could ever need.
 For statisticians, however, R is particularly useful
because it contains a number of built-in mechanisms
for organizing data, running calculations on the
information and creating graphical representations of
data sets.
R is Environment for Statistical Analysis
Slide 11Slide 11 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
R : Characteristics
Effective and fast data handling and storage facility
A bunch of operators for calculations on arrays, lists, vectors etc
A large integrated collection of tools for data analysis, and visualization
Facilities for data analysis using graphs and display either directly at the computer or paper
A well implemented and effective programming language called ‘S’ on top of which R is built
A complete range of packages to extend and enrich the functionality of R
Slide 12Slide 12 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Data Visualization in R
This plot represents the
locations of all the traffic
signals in the city.
It is recognizable as
Toronto without any other
geographic data being
plotted - the structure of
the city comes out in the
data alone.
Slide 13Slide 13 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Who Uses R : Domains
 Telecom
 Pharmaceuticals
 Financial Services
 Life Sciences
 Education, etc
Slide 14Slide 14 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Machine Learning
We have so many algorithms for data mining which can be used to build systems that can read past data and can
generate a system that can accommodate any future data and derive useful insight from it
Such set of algorithms comes under machine learning
Machine learning focuses on the development of computer programs that can teach themselves to grow and change
when exposed to new data
Train data
ML
model
Algorithms
Slide 15Slide 15 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Types of Learning
Supervised Learning Unsupervised Learning
1. Uses a known dataset to make
predictions.
2. The training dataset includes
input data and response values.
3. From it, the supervised learning
algorithm builds a model to make
predictions of the response
values for a new dataset.
1. Draw inferences from datasets
consisting of input data without
labeled responses.
2. Used for exploratory data analysis
to find hidden patterns or grouping
in data
3. The most common unsupervised
learning method is cluster analysis.
Machine Learning
www.edureka.co/r-for-analyticsSlide 16 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Common Machine Learning Algorithms
Types of Learning
Supervised Learning
Unsupervised Learning
Algorithms
 Naïve Bayes
 Support Vector Machines
 Random Forests
 Decision Trees
Algorithms
 K-means
 Fuzzy Clustering
 Hierarchical Clustering
Gaussian mixture models
Self-organizing maps
www.edureka.co/r-for-analyticsSlide 17 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
What is Clustering?
Organizing data into clusters such that there is:
 High intra-cluster similarity
 Low inter-cluster similarity
 Informally, finding natural groupings among objects
http://en.wikipedia.org/wiki/Cluster_analysis
www.edureka.co/r-for-analyticsSlide 18 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
K-means Clustering
www.edureka.in/hadoopSlide 19
K-Means Clustering
The process by which objects are classified into
a number of groups so that they are as much
dissimilar as possible from one group to another
group, but as much similar as possible within
each group.
The objects in group 1 should be as similar as
possible.
But there should be much difference between an
object in group 1 and group 2.
The attributes of the objects are allowed to
determine which objects should be grouped
together.
Total population
Group 1
Group 2 Group 3
Group 4
www.edureka.co/r-for-analyticsSlide 20 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
How it works
1. Given n object set, randomly initialize k cluster centers from the existing set
2. Assign the objects from the set to these randomly selected cluster center based on closets Euclidean distance
from the center.
3. Set the position of each cluster to the mean of all data points belonging to that cluster
4. Repeat steps 2-3 until cluster center changes no more and cluster size remains constant
www.edureka.co/r-for-analyticsSlide 21 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
We have marks of 17 students in a class. Their ratings are :
{1,2,2,4,5,6,6,7,8,10,10,11,11,12,13,13,13}
Group the students in three categories i.e. good, average and bad.
K-means example with one dimensional data
www.edureka.co/r-for-analyticsSlide 22 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Randomly initialize 3 cluster centers:
Iteration 1
Good
(centroid=3)
Average
(centroid=2)
Bad
(centroid=1)
4,5,6,6,7,8,
10,10,11,11,
12,13,13,13
2,2 1
www.edureka.co/r-for-analyticsSlide 23 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Iteration 1 summary
Cluster 1 (Good):
No 0f items = 14
Sum of items = 129
mean = 129/14 = 9
Cluster 1 (Average):
No 0f items = 2
Sum of items = 4
mean = 4/2 = 2
Cluster 1 (Bad):
No 0f items = 1
Sum of items = 1
mean = 1/1 = 1
Change
detected
Good Average Bad
(centroid=9) (centroid=2) (centroid=1)
New cluster center after iteration 1
www.edureka.co/r-for-analyticsSlide 24 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Good
(centroid=9)
Average
(centroid=2)
Bad
(centroid=1)
6,6,7,8,
10,10,11,11,
12,13,13,13
2,2,4,5 1
Iteration 2
www.edureka.co/r-for-analyticsSlide 25 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Cluster 1 (Good):
No 0f items = 12
Sum of items = 120
mean = 120/12 = 10
Cluster 1 (Average):
No 0f items = 4
Sum of items = 13
mean = 13/4= 3
Cluster 1 (Bad):
No 0f items = 1
Sum of items = 1
mean = 1/1 = 1
Change
detected
Good Average Bad
(centroid=10) (centroid=3) (centroid=1)
New cluster center after iteration 2
Change
detected
Iteration 2 summary
www.edureka.co/r-for-analyticsSlide 26 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Good
(centroid=10)
Average
(centroid=3)
Bad
(centroid=1)
7,8,
10,10,11,11,
12,13,13,13
6,6,2,2,4,5 1
Iteration 3
www.edureka.co/r-for-analyticsSlide 27 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Cluster 1 (Good):
No 0f items = 10
Sum of items = 108
mean = 108/11 = 11
Cluster 1 (Average):
No 0f items = 6
Sum of items = 25
mean = 13/4= 4
Cluster 1 (Bad):
No 0f items = 1
Sum of items = 1
mean = 1/1 = 1
Change
detected
Good Average Bad
(centroid=11) (centroid=4) (centroid=1)
New cluster center after iteration 3
Change
detected
Iteration 3 summary
www.edureka.co/r-for-analyticsSlide 28 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Good
(centroid=11)
Average
(centroid=4)
Bad
(centroid=1)
8,
10,10,11,11,
12,13,13,13
7,6,6,4,5 1,2,2
Iteration 4 summary
www.edureka.co/r-for-analyticsSlide 29 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Cluster 1 (Good):
No 0f items = 9
Sum of items = 101
mean = 108/11 = 11
Cluster 1 (Average):
No 0f items = 5
Sum of items = 28
mean = 28/5= 6
Cluster 1 (Bad):
No 0f items = 3
Sum of items = 5
mean = 5/3 = 2
No Change
detected
Good Average Bad
(centroid=11) (centroid=6) (centroid=2)
New cluster center after iteration 4
Change
detected
Change
detected
Iteration 4 summary
www.edureka.co/r-for-analyticsSlide 30 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Good
(centroid=11)
Average
(centroid=6)
Bad
(centroid=2)
10,10,
11,11,
12,13,13,13
8,7,6,6,4,5 1,2,2
Iteration 5
www.edureka.co/r-for-analyticsSlide 31 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Cluster 1 (Good):
No 0f items = 8
Sum of items = 93
mean = 93/8 = 12
Cluster 1 (Average):
No 0f items = 6
Sum of items = 36
mean = 36/6= 6
Cluster 1 (Bad):
No 0f items = 3
Sum of items = 5
mean = 5/3 = 2
Change
detected
Good Average Bad
(centroid=12) (centroid=6) (centroid=2)
New cluster center after iteration 5
No Change
detected
No Change
detected
Iteration 5 summary
www.edureka.co/r-for-analyticsSlide 32 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Good
(centroid=12)
Average
(centroid=6)
Bad
(centroid=2)
10,10,
11,11,
12,13,13,13
8,7,6,6,4,5 1,2,2
Iteration 6
www.edureka.co/r-for-analyticsSlide 33 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Cluster 1 (Good):
No 0f items = 8
Sum of items = 93
mean = 93/8 = 12
Cluster 1 (Average):
No 0f items = 6
Sum of items = 36
mean = 36/6= 6
Cluster 1 (Bad):
No 0f items = 3
Sum of items = 5
mean = 5/3 = 2
No Change
detected
Good Average Bad
(centroid=12) (centroid=6) (centroid=2)
New cluster center after iteration 6
No Change
detected
No Change
detected
Iteration 6 summary
www.edureka.co/r-for-analyticsSlide 34 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
G
O
O
d
A
v
g
B
a
d
10 10
11
11
12
13
13
13
4
5
6
6
7
8
1
2
2
Slide 35Slide 35 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Use Cases
Slide 36 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions
Demo
More Information on R setup and applications at:
http://www.edureka.in/blog/category/business-analytics-with-r/
Slide 37 www.edureka.co/r-for-analytics
 Module 1
» Introduction to Business Analytics
 Module 2
» Introduction to R Programming
 Module 3
» Data Manipulation in R
 Module 4
» Data Import Techniques in R
 Module 5
» Exploratory Data Analysis
 Module 6
» Data Visualization in R
Course Topics
 Module 7
» Data mining: Clustering Techniques
 Module 8
» Data Mining: Association rule mining and
Sentiment analysis
 Module 9
» Linear and Logistic Regression
 Module 10
» Annova and Predictive Analysis
 Module 11
» Data Mining: Decision Trees and Random forest
 Module 12
» Final Project Business Analytics with R class –
Census Data
Slide 38 www.edureka.co/r-for-analytics

Mais conteúdo relacionado

Mais procurados

Introduction to Apache Mahout
Introduction to Apache MahoutIntroduction to Apache Mahout
Introduction to Apache Mahout
Edureka!
 
Not Just Dots on a Map: How Geography Influences Real World Decisions Part 4
Not Just Dots on a Map: How Geography Influences Real World Decisions Part 4Not Just Dots on a Map: How Geography Influences Real World Decisions Part 4
Not Just Dots on a Map: How Geography Influences Real World Decisions Part 4
American Public Gardens Association
 
Carma internet research module detecting bad data
Carma internet research module   detecting bad dataCarma internet research module   detecting bad data
Carma internet research module detecting bad data
Syracuse University
 

Mais procurados (20)

Webinar: Data Visualization-How to Make Sense of Data
Webinar: Data Visualization-How to Make Sense of DataWebinar: Data Visualization-How to Make Sense of Data
Webinar: Data Visualization-How to Make Sense of Data
 
Application of Clustering in Data Science using Real-life Examples
Application of Clustering in Data Science using Real-life Examples Application of Clustering in Data Science using Real-life Examples
Application of Clustering in Data Science using Real-life Examples
 
Python webinar 4th june
Python webinar 4th junePython webinar 4th june
Python webinar 4th june
 
Webinar: Mastering Python - An Excellent tool for Web Scraping and Data Anal...
Webinar:  Mastering Python - An Excellent tool for Web Scraping and Data Anal...Webinar:  Mastering Python - An Excellent tool for Web Scraping and Data Anal...
Webinar: Mastering Python - An Excellent tool for Web Scraping and Data Anal...
 
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
 
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
 
Introduction to Apache Mahout
Introduction to Apache MahoutIntroduction to Apache Mahout
Introduction to Apache Mahout
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
Predictive analytics for E-commerce
Predictive analytics for E-commerce Predictive analytics for E-commerce
Predictive analytics for E-commerce
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
 
Python for Data Science
Python for Data SciencePython for Data Science
Python for Data Science
 
5 Benefits of Predictive Analytics for E-Commerce
5 Benefits of Predictive Analytics for E-Commerce5 Benefits of Predictive Analytics for E-Commerce
5 Benefits of Predictive Analytics for E-Commerce
 
Linked Data Entity Summarization (PhD defense)
Linked Data Entity Summarization (PhD defense)Linked Data Entity Summarization (PhD defense)
Linked Data Entity Summarization (PhD defense)
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine Learning
 
Data science with Perl & Raku
Data science with Perl & RakuData science with Perl & Raku
Data science with Perl & Raku
 
Introduction To Data Science With Python
Introduction To Data Science With PythonIntroduction To Data Science With Python
Introduction To Data Science With Python
 
Not Just Dots on a Map: How Geography Influences Real World Decisions Part 4
Not Just Dots on a Map: How Geography Influences Real World Decisions Part 4Not Just Dots on a Map: How Geography Influences Real World Decisions Part 4
Not Just Dots on a Map: How Geography Influences Real World Decisions Part 4
 
Carma internet research module detecting bad data
Carma internet research module   detecting bad dataCarma internet research module   detecting bad data
Carma internet research module detecting bad data
 
Programming for data science in python
Programming for data science in pythonProgramming for data science in python
Programming for data science in python
 
Sharing about my data science journey and what I do at Lazada
Sharing about my data science journey and what I do at LazadaSharing about my data science journey and what I do at Lazada
Sharing about my data science journey and what I do at Lazada
 

Destaque

Presentation_Netinfo_6.2015
Presentation_Netinfo_6.2015Presentation_Netinfo_6.2015
Presentation_Netinfo_6.2015
Hristo Hristov
 
Hisham Muhamed Tilbete - Mrekullia e Kur'anit në çështje të astronomisë
Hisham Muhamed Tilbete - Mrekullia e Kur'anit në çështje të astronomisëHisham Muhamed Tilbete - Mrekullia e Kur'anit në çështje të astronomisë
Hisham Muhamed Tilbete - Mrekullia e Kur'anit në çështje të astronomisë
Libra Islame
 

Destaque (20)

Philomena Detailed CV
Philomena Detailed CVPhilomena Detailed CV
Philomena Detailed CV
 
Structurarea unei cotatii in mai multe sectiuni
Structurarea unei cotatii in mai multe sectiuniStructurarea unei cotatii in mai multe sectiuni
Structurarea unei cotatii in mai multe sectiuni
 
صداقت حضرت مسیح موعود علیہ السلام از احادیث
صداقت حضرت مسیح موعود علیہ السلام از احادیثصداقت حضرت مسیح موعود علیہ السلام از احادیث
صداقت حضرت مسیح موعود علیہ السلام از احادیث
 
CANENERO Advertising - Gilberto Chiacchiera
CANENERO Advertising - Gilberto ChiacchieraCANENERO Advertising - Gilberto Chiacchiera
CANENERO Advertising - Gilberto Chiacchiera
 
Presentation_Netinfo_6.2015
Presentation_Netinfo_6.2015Presentation_Netinfo_6.2015
Presentation_Netinfo_6.2015
 
Jeremy thake introducing alm to share point development implementations (ap...
Jeremy thake   introducing alm to share point development implementations (ap...Jeremy thake   introducing alm to share point development implementations (ap...
Jeremy thake introducing alm to share point development implementations (ap...
 
Wie Startups unsere Welt verändern (2015)
Wie Startups unsere Welt verändern (2015) Wie Startups unsere Welt verändern (2015)
Wie Startups unsere Welt verändern (2015)
 
Il Web E Le Reti Di Vendita
Il Web E Le Reti Di Vendita Il Web E Le Reti Di Vendita
Il Web E Le Reti Di Vendita
 
Roma solo fotos
Roma solo fotosRoma solo fotos
Roma solo fotos
 
Pedagogical Strategies for Worthwhile Learning in Online Environments
Pedagogical Strategies for Worthwhile Learning in Online EnvironmentsPedagogical Strategies for Worthwhile Learning in Online Environments
Pedagogical Strategies for Worthwhile Learning in Online Environments
 
Kauno miesto savivaldybės 100 dienų ataskaita
Kauno miesto savivaldybės 100 dienų ataskaitaKauno miesto savivaldybės 100 dienų ataskaita
Kauno miesto savivaldybės 100 dienų ataskaita
 
Povezovanje kemijske panoge in delo z mladimi, KOCKE, Ziga Lampe, Drzava za g...
Povezovanje kemijske panoge in delo z mladimi, KOCKE, Ziga Lampe, Drzava za g...Povezovanje kemijske panoge in delo z mladimi, KOCKE, Ziga Lampe, Drzava za g...
Povezovanje kemijske panoge in delo z mladimi, KOCKE, Ziga Lampe, Drzava za g...
 
Η αγαπημένη μου πόλη
Η αγαπημένη μου πόληΗ αγαπημένη μου πόλη
Η αγαπημένη μου πόλη
 
最高の自分に進化する方法【コンサル起業実践講座】
最高の自分に進化する方法【コンサル起業実践講座】最高の自分に進化する方法【コンサル起業実践講座】
最高の自分に進化する方法【コンサル起業実践講座】
 
MvSM: 7) Co tam dávat - tvorba obsahu pro sociální média
MvSM: 7) Co tam dávat - tvorba obsahu pro sociální médiaMvSM: 7) Co tam dávat - tvorba obsahu pro sociální média
MvSM: 7) Co tam dávat - tvorba obsahu pro sociální média
 
YoonSeo Link
YoonSeo LinkYoonSeo Link
YoonSeo Link
 
Set
SetSet
Set
 
Digital Marketing
Digital MarketingDigital Marketing
Digital Marketing
 
Hisham Muhamed Tilbete - Mrekullia e Kur'anit në çështje të astronomisë
Hisham Muhamed Tilbete - Mrekullia e Kur'anit në çështje të astronomisëHisham Muhamed Tilbete - Mrekullia e Kur'anit në çështje të astronomisë
Hisham Muhamed Tilbete - Mrekullia e Kur'anit në çështje të astronomisë
 
Buyer Persona - Key to B2B online marketing success
Buyer Persona - Key to B2B online marketing successBuyer Persona - Key to B2B online marketing success
Buyer Persona - Key to B2B online marketing success
 

Semelhante a Business Analytics with R

"Introduction to R Programming and Machine Learning"
"Introduction to R Programming and Machine Learning""Introduction to R Programming and Machine Learning"
"Introduction to R Programming and Machine Learning"
Edureka!
 
Business Analytics with R - Webinar
Business Analytics with R - WebinarBusiness Analytics with R - Webinar
Business Analytics with R - Webinar
Edureka!
 
A Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptxA Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptx
RajSingh512965
 

Semelhante a Business Analytics with R (20)

"Introduction to R Programming and Machine Learning"
"Introduction to R Programming and Machine Learning""Introduction to R Programming and Machine Learning"
"Introduction to R Programming and Machine Learning"
 
Linear Regression with R programming.pptx
Linear Regression with R programming.pptxLinear Regression with R programming.pptx
Linear Regression with R programming.pptx
 
Business Analytics with R - Webinar
Business Analytics with R - WebinarBusiness Analytics with R - Webinar
Business Analytics with R - Webinar
 
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
 
Self Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docxSelf Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docx
 
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
 
A Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptxA Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptx
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
 
Data Analytics Course Curriculum_ What to Expect and How to Prepare in 2023.pdf
Data Analytics Course Curriculum_ What to Expect and How to Prepare in 2023.pdfData Analytics Course Curriculum_ What to Expect and How to Prepare in 2023.pdf
Data Analytics Course Curriculum_ What to Expect and How to Prepare in 2023.pdf
 
Data Science Training Course in Gurgaon.pptx
Data Science Training Course in Gurgaon.pptxData Science Training Course in Gurgaon.pptx
Data Science Training Course in Gurgaon.pptx
 
Using R for Classification of Large Social Network Data
Using R for Classification of Large Social Network DataUsing R for Classification of Large Social Network Data
Using R for Classification of Large Social Network Data
 
MODULE 1_Introduction to Data analytics and life cycle..pptx
MODULE 1_Introduction to Data analytics and life cycle..pptxMODULE 1_Introduction to Data analytics and life cycle..pptx
MODULE 1_Introduction to Data analytics and life cycle..pptx
 
Data Science.pptx
Data Science.pptxData Science.pptx
Data Science.pptx
 
resume_MH
resume_MHresume_MH
resume_MH
 
FDS_dept_ppt.pptx
FDS_dept_ppt.pptxFDS_dept_ppt.pptx
FDS_dept_ppt.pptx
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysis
 
fINAL Lesson_1_Course_Introduction_v1.pptx
fINAL Lesson_1_Course_Introduction_v1.pptxfINAL Lesson_1_Course_Introduction_v1.pptx
fINAL Lesson_1_Course_Introduction_v1.pptx
 
Data Science
Data ScienceData Science
Data Science
 
Twitter Sentiment Analysis: An Unsupervised Approach
Twitter Sentiment Analysis: An Unsupervised ApproachTwitter Sentiment Analysis: An Unsupervised Approach
Twitter Sentiment Analysis: An Unsupervised Approach
 

Mais de Edureka!

Mais de Edureka! (20)

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
 

Último

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Último (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 

Business Analytics with R

  • 1. www.edureka.co/r-for-analytics View Business Analytics with R course details at www.edureka.co/r-for-analytics Business Analytics with R
  • 2. www.edureka.co/r-for-analyticsSlide 2 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Objectives What is data mining What is Business Analytics Stages of Analytics / data mining What is R overview of Machine Learning  What is Clustering What is K-means Clustering Use-case At the end of this session, you will be able to
  • 3. Slide 3Slide 3 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Data mining ?? Generally, data mining is the process of studying data from maximum possible dimensions and summarizing it into useful information Technically, data mining is the process of finding correlations or patterns among dozens of fields in large data generated from business Or you can say, data mining is the process finding useful information from the data and then devising knowledge out of it for improving future of our business » Data ?? Data are any facts, numbers, or text is getting produced by existing system » Information ?? The patterns, associations, or relationships among all this data can provide information » Knowledge ?? Information can be converted into knowledge about historical patterns and future trends. For example summary of sales in off season may help to start some offers in that period to increase sales
  • 4. Slide 4Slide 4 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Business Analytics(BA) Refers to the skills, technologies, practices for iterative study and investigation of historical business data to gain insight and drive business planning Study of data through statistical and operations analysis Makes use of past data and statistical methods to understand business performance and hence makes us take necessary steps to improve it Injects intelligence into the business planning Intersection of business and technology
  • 5. Slide 5Slide 5 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Business Analytics Why Business Analytics is getting popular these days ? Cost of storing data Cost of processing data
  • 6. Slide 6Slide 6 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Cross Industry standard Process for data mining ( CRISP – DM ) Stages of Analytics / Data Mining
  • 7. Slide 7Slide 7 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Knowledge discovery and data mining ( KDD) Stages of Analytics / Data Mining
  • 8. Slide 8Slide 8 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions What is R : Programming Language  You do data analysis in R by writing scripts and functions in the R programming language.  R has also quickly found the following because statisticians, engineers and scientists without computer programming skills find it easy to use. R is Programming Language
  • 9. Slide 9Slide 9 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions What is R : Data Analysis Software  Data Scientists, Statisticians, Analysts, Quants, and others who need to make sense of data use R for statistical analysis, data visualization, and predictive modelling.  Rexer Analytics’s Annual Data Miner Survey is the largest survey of data mining, data science, and analytics professionals in the industry.  It has concluded that R's popularity has increased substantially in recent years. R is Data Analysis Software
  • 10. Slide 10Slide 10 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions What is R : Environment for Statistical Analysis  R language consists of functions for almost every data manipulation, statistical model, or chart that a data analyst could ever need.  For statisticians, however, R is particularly useful because it contains a number of built-in mechanisms for organizing data, running calculations on the information and creating graphical representations of data sets. R is Environment for Statistical Analysis
  • 11. Slide 11Slide 11 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions R : Characteristics Effective and fast data handling and storage facility A bunch of operators for calculations on arrays, lists, vectors etc A large integrated collection of tools for data analysis, and visualization Facilities for data analysis using graphs and display either directly at the computer or paper A well implemented and effective programming language called ‘S’ on top of which R is built A complete range of packages to extend and enrich the functionality of R
  • 12. Slide 12Slide 12 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Data Visualization in R This plot represents the locations of all the traffic signals in the city. It is recognizable as Toronto without any other geographic data being plotted - the structure of the city comes out in the data alone.
  • 13. Slide 13Slide 13 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Who Uses R : Domains  Telecom  Pharmaceuticals  Financial Services  Life Sciences  Education, etc
  • 14. Slide 14Slide 14 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Machine Learning We have so many algorithms for data mining which can be used to build systems that can read past data and can generate a system that can accommodate any future data and derive useful insight from it Such set of algorithms comes under machine learning Machine learning focuses on the development of computer programs that can teach themselves to grow and change when exposed to new data Train data ML model Algorithms
  • 15. Slide 15Slide 15 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Types of Learning Supervised Learning Unsupervised Learning 1. Uses a known dataset to make predictions. 2. The training dataset includes input data and response values. 3. From it, the supervised learning algorithm builds a model to make predictions of the response values for a new dataset. 1. Draw inferences from datasets consisting of input data without labeled responses. 2. Used for exploratory data analysis to find hidden patterns or grouping in data 3. The most common unsupervised learning method is cluster analysis. Machine Learning
  • 16. www.edureka.co/r-for-analyticsSlide 16 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Common Machine Learning Algorithms Types of Learning Supervised Learning Unsupervised Learning Algorithms  Naïve Bayes  Support Vector Machines  Random Forests  Decision Trees Algorithms  K-means  Fuzzy Clustering  Hierarchical Clustering Gaussian mixture models Self-organizing maps
  • 17. www.edureka.co/r-for-analyticsSlide 17 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions What is Clustering? Organizing data into clusters such that there is:  High intra-cluster similarity  Low inter-cluster similarity  Informally, finding natural groupings among objects http://en.wikipedia.org/wiki/Cluster_analysis
  • 18. www.edureka.co/r-for-analyticsSlide 18 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions K-means Clustering
  • 19. www.edureka.in/hadoopSlide 19 K-Means Clustering The process by which objects are classified into a number of groups so that they are as much dissimilar as possible from one group to another group, but as much similar as possible within each group. The objects in group 1 should be as similar as possible. But there should be much difference between an object in group 1 and group 2. The attributes of the objects are allowed to determine which objects should be grouped together. Total population Group 1 Group 2 Group 3 Group 4
  • 20. www.edureka.co/r-for-analyticsSlide 20 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions How it works 1. Given n object set, randomly initialize k cluster centers from the existing set 2. Assign the objects from the set to these randomly selected cluster center based on closets Euclidean distance from the center. 3. Set the position of each cluster to the mean of all data points belonging to that cluster 4. Repeat steps 2-3 until cluster center changes no more and cluster size remains constant
  • 21. www.edureka.co/r-for-analyticsSlide 21 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions We have marks of 17 students in a class. Their ratings are : {1,2,2,4,5,6,6,7,8,10,10,11,11,12,13,13,13} Group the students in three categories i.e. good, average and bad. K-means example with one dimensional data
  • 22. www.edureka.co/r-for-analyticsSlide 22 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Randomly initialize 3 cluster centers: Iteration 1 Good (centroid=3) Average (centroid=2) Bad (centroid=1) 4,5,6,6,7,8, 10,10,11,11, 12,13,13,13 2,2 1
  • 23. www.edureka.co/r-for-analyticsSlide 23 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Iteration 1 summary Cluster 1 (Good): No 0f items = 14 Sum of items = 129 mean = 129/14 = 9 Cluster 1 (Average): No 0f items = 2 Sum of items = 4 mean = 4/2 = 2 Cluster 1 (Bad): No 0f items = 1 Sum of items = 1 mean = 1/1 = 1 Change detected Good Average Bad (centroid=9) (centroid=2) (centroid=1) New cluster center after iteration 1
  • 24. www.edureka.co/r-for-analyticsSlide 24 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Good (centroid=9) Average (centroid=2) Bad (centroid=1) 6,6,7,8, 10,10,11,11, 12,13,13,13 2,2,4,5 1 Iteration 2
  • 25. www.edureka.co/r-for-analyticsSlide 25 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Cluster 1 (Good): No 0f items = 12 Sum of items = 120 mean = 120/12 = 10 Cluster 1 (Average): No 0f items = 4 Sum of items = 13 mean = 13/4= 3 Cluster 1 (Bad): No 0f items = 1 Sum of items = 1 mean = 1/1 = 1 Change detected Good Average Bad (centroid=10) (centroid=3) (centroid=1) New cluster center after iteration 2 Change detected Iteration 2 summary
  • 26. www.edureka.co/r-for-analyticsSlide 26 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Good (centroid=10) Average (centroid=3) Bad (centroid=1) 7,8, 10,10,11,11, 12,13,13,13 6,6,2,2,4,5 1 Iteration 3
  • 27. www.edureka.co/r-for-analyticsSlide 27 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Cluster 1 (Good): No 0f items = 10 Sum of items = 108 mean = 108/11 = 11 Cluster 1 (Average): No 0f items = 6 Sum of items = 25 mean = 13/4= 4 Cluster 1 (Bad): No 0f items = 1 Sum of items = 1 mean = 1/1 = 1 Change detected Good Average Bad (centroid=11) (centroid=4) (centroid=1) New cluster center after iteration 3 Change detected Iteration 3 summary
  • 28. www.edureka.co/r-for-analyticsSlide 28 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Good (centroid=11) Average (centroid=4) Bad (centroid=1) 8, 10,10,11,11, 12,13,13,13 7,6,6,4,5 1,2,2 Iteration 4 summary
  • 29. www.edureka.co/r-for-analyticsSlide 29 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Cluster 1 (Good): No 0f items = 9 Sum of items = 101 mean = 108/11 = 11 Cluster 1 (Average): No 0f items = 5 Sum of items = 28 mean = 28/5= 6 Cluster 1 (Bad): No 0f items = 3 Sum of items = 5 mean = 5/3 = 2 No Change detected Good Average Bad (centroid=11) (centroid=6) (centroid=2) New cluster center after iteration 4 Change detected Change detected Iteration 4 summary
  • 30. www.edureka.co/r-for-analyticsSlide 30 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Good (centroid=11) Average (centroid=6) Bad (centroid=2) 10,10, 11,11, 12,13,13,13 8,7,6,6,4,5 1,2,2 Iteration 5
  • 31. www.edureka.co/r-for-analyticsSlide 31 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Cluster 1 (Good): No 0f items = 8 Sum of items = 93 mean = 93/8 = 12 Cluster 1 (Average): No 0f items = 6 Sum of items = 36 mean = 36/6= 6 Cluster 1 (Bad): No 0f items = 3 Sum of items = 5 mean = 5/3 = 2 Change detected Good Average Bad (centroid=12) (centroid=6) (centroid=2) New cluster center after iteration 5 No Change detected No Change detected Iteration 5 summary
  • 32. www.edureka.co/r-for-analyticsSlide 32 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Good (centroid=12) Average (centroid=6) Bad (centroid=2) 10,10, 11,11, 12,13,13,13 8,7,6,6,4,5 1,2,2 Iteration 6
  • 33. www.edureka.co/r-for-analyticsSlide 33 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Cluster 1 (Good): No 0f items = 8 Sum of items = 93 mean = 93/8 = 12 Cluster 1 (Average): No 0f items = 6 Sum of items = 36 mean = 36/6= 6 Cluster 1 (Bad): No 0f items = 3 Sum of items = 5 mean = 5/3 = 2 No Change detected Good Average Bad (centroid=12) (centroid=6) (centroid=2) New cluster center after iteration 6 No Change detected No Change detected Iteration 6 summary
  • 34. www.edureka.co/r-for-analyticsSlide 34 Twitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions G O O d A v g B a d 10 10 11 11 12 13 13 13 4 5 6 6 7 8 1 2 2
  • 35. Slide 35Slide 35 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Use Cases
  • 36. Slide 36 www.edureka.co/r-for-analyticsTwitter @edurekaIN, Facebook /edurekaIN, use #AskEdureka for Questions Demo More Information on R setup and applications at: http://www.edureka.in/blog/category/business-analytics-with-r/
  • 37. Slide 37 www.edureka.co/r-for-analytics  Module 1 » Introduction to Business Analytics  Module 2 » Introduction to R Programming  Module 3 » Data Manipulation in R  Module 4 » Data Import Techniques in R  Module 5 » Exploratory Data Analysis  Module 6 » Data Visualization in R Course Topics  Module 7 » Data mining: Clustering Techniques  Module 8 » Data Mining: Association rule mining and Sentiment analysis  Module 9 » Linear and Logistic Regression  Module 10 » Annova and Predictive Analysis  Module 11 » Data Mining: Decision Trees and Random forest  Module 12 » Final Project Business Analytics with R class – Census Data