Analytical Tools
By : Aniket Joshi
• A GUI (Graphical User Interface) based analytics
software
• Versatile, powerful & user friendly
• Handles large amounts of data easily
• Easy to learn
• Popularity of SAS E-Miner doesn’t match its
capability because of its high price tag
• One of the most expensive software products; very
few companies can afford it
SAS Enterprise Miner
SAS Enterprise Miner
FEATURES
• Data Preparation, Summarization & Exploration
i. Access & integrate structured and unstructured
data sources
ii. Outlier filtering
iii. Data Partitioning
iv. Integration with R
v. Merge & Append Tools
vi. Univariate & Bivariate statistics and plots
vii. Interactive Variable Binning
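As a rough illustration of two of the preparation steps listed above (outlier filtering and data partitioning), here is a minimal Python sketch. It is not how SAS Enterprise Miner implements these nodes: the interquartile-range rule, the 70/30 split, the function names `filter_outliers_iqr` and `partition`, and the sample data are all assumptions chosen for illustration.

```python
import random
import statistics


def filter_outliers_iqr(values, k=1.5):
    """Keep values inside [Q1 - k*IQR, Q3 + k*IQR] (assumed rule, not SAS's)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if low <= v <= high]


def partition(rows, train_fraction=0.7, seed=0):
    """Shuffle rows reproducibly, then split into training and validation sets."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Hypothetical sample: one obviously extreme value (95).
data = [10, 12, 11, 13, 12, 11, 10, 95, 12, 13]
clean = filter_outliers_iqr(data)
train, valid = partition(clean)
print(len(clean), len(train), len(valid))
```

A GUI tool exposes the same choices (cutoff rule, split ratio, random seed) as node properties rather than function arguments.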
• Advanced Predictive & Descriptive Modeling
i. Clustering & Self Organizing Maps
ii. Market Basket Analysis
iii. Dimension Reduction Techniques
iv. Linear & Logistic Regression
v. Decision Trees
vi. Neural Networks
vii. Time Series Data Mining
viii. Survival Analysis
SAS Enterprise Miner
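To make one of the modeling techniques listed above more concrete, the following Python sketch computes the two basic measures used in Market Basket Analysis: support and confidence. It is only a sketch under stated assumptions; the function names and the sample baskets are hypothetical and do not come from any SAS product.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimated P(consequent | antecedent) over the transactions."""
    both = support(transactions, set(antecedent) | set(consequent))
    base = support(transactions, antecedent)
    return both / base if base else 0.0


# Hypothetical shopping baskets.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

print(support(baskets, {"bread", "milk"}))       # share of baskets with both items
print(confidence(baskets, {"bread"}, {"milk"}))  # how often bread buyers also buy milk
```

Rules with high support and high confidence are the usual candidates for cross-selling decisions.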
BASE SAS +
• Analytics software preloaded with functions to
perform statistical analysis
• Not user friendly, involves coding
• Cheaper than SAS Enterprise
Miner but still expensive
• SAS+ comprises BASE SAS, SAS STAT & SAS Access
to ODBC
WPS
• An analytics software which reads, understands
and executes the language of SAS
• Comes loaded with built in functions & procedures
to perform statistical analysis
• Inspired by BASE SAS
• Interface similar to BASE SAS
• Cheaper than BASE SAS +
• Version 3 (WPS) offers the WPS Workbench user
interface to connect to & run programs in server,
cluster and cloud environments
IBM’s SPSS
• IBM’s SPSS is an equivalent of BASE SAS
• Popular Software in Market Research
• Can handle small to mid size data sets
• IBM’s SPSS Modeler is an equivalent of SAS
Enterprise Miner
• SPSS Modeler offers features such as :
i. Accessing Data
ii. Data Exploration
iii. Summarization & Preparation
iv. Predictive Modeling Techniques such as
regression, clustering, decision trees, neural
networks, self organizing maps etc
IBM’s SPSS Modeler/Clementine
R
• World’s most popular open source analytics tool
• Evolved from a language called S, which was also
commercialized as a product called S+ (GUI based)
• R offers more than 3000 packages
• An R package is a collection of functions which enable:
i. Computations in descriptive statistics
ii. Data Manipulation
iii. Regression Analysis
iv. Advanced Visualization
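The kinds of functions listed above can be sketched briefly. The example below is written in Python rather than R, uses only the standard library, and shows descriptive statistics plus a simple least-squares regression. The helper names `describe` and `simple_linear_regression` and the sample data are assumptions for illustration, not functions from any R package.

```python
import statistics


def describe(values):
    """Basic descriptive statistics for a list of numbers."""
    return {
        "mean": statistics.fmean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
    }


def simple_linear_regression(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    mean_x, mean_y = statistics.fmean(xs), statistics.fmean(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data lying exactly on the line y = 2x + 1.
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]

summary = describe(ys)
slope, intercept = simple_linear_regression(xs, ys)
print(summary, slope, intercept)
```

In R itself, comparable results would typically come from built-in or packaged functions rather than hand-written helpers like these.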
• R 3.4.3 (latest version at the time of writing)
• Developed by practitioners themselves
• Like BASE SAS, R involves coding (in the R language); interface is similar to
BASE SAS
• R language is concise and elegant, uses pre developed
packages
• Not easy to learn, steep learning curve
• Performs super complex statistical analytics quickly
R
• Excellent statistical & visualization capability
• Faces problems in handling large data sets
• Integration of R with HADOOP can handle large data
sets
• Adoption of R has increased due to use by Facebook,
Google, Bing, Mozilla etc
• Graphs can be created with several layers, scales,
coordinate systems, smoothing curves
R
Apache HADOOP
• Open source data management software
• Helps companies in analyzing massive data volumes
(Structured & Unstructured)
• Used by eBay, Yahoo, Facebook
• One of the most desired technical skills in the
industry
MICROSTRATEGY
• Business Intelligence product with limited analytics
capability
• Easy to learn tool
• Excellent Visualization
• Advanced Integration Capability with R & HADOOP
STATISTICA
• Statistics & Analytics software package developed
by STATSOFT
• Features are data analysis, data management,
statistics, data mining, data visualization etc
• GUI based product similar to SAS Enterprise Miner
• Procedure involves :
i. Loading table of data
ii. Applying statistical functions from drop down
menus
• User friendly, Easy to learn
• Advanced analytics capabilities (with large data)
KXEN
• KXEN = Knowledge Extraction Engines
• Automated Analytics
• Reduces work of analysts
• Products are based on algorithms developed by
Russian Mathematician Vladimir Vapnik
• Easy to use, easy to learn, fast & can handle large
data sets
• Can produce a large number of models quickly
• Works like a Black Box
• KXEN Software Packages offer:
i. Data Manipulation
ii. Classification
iii. Regression
iv. Clustering
v. Variable Importance
vi. Segmentation
vii. Time Series
viii. Association Rules
ix. Data Fusion
KXEN
TABLEAU
• GUI based data visualization product similar to
MICROSTRATEGY, focused on Business Intelligence
• Drag & Drop feature offered to analyze data
• Visualizes data & creates interactive dashboards
• Easy to learn
• Gives a good understanding of data
• Not capable of Predictive Analytics
Comparison of Analytics Tools
• Measures used to compare the popularity of
Analytics tools are:
i. Level of Activity on E-mails or Discussion
Lists devoted to these tools
ii. Number of Users (Data Analytics
Competitions)
iii. Languages used in Data Mining or Analysis
Based on Level of Activity on E-Mails/Discussion Lists
• R & SAS Tools are most popular
• R has dominated in the last few years
• R shows a decline in 2011 due to:
i. Migration to other forums
ii. Emergence of easy-to-use user interfaces for R such
as R Commander, Deducer (a GUI for R) & Rattle (a
GUI for Data Mining using R)
Based on Level of Activity on E-Mails/Discussion Lists
Based on Number of Users (Data Analytics
Competitions)
• “Kaggle.com” sponsors data analysis contests
• Companies post Data Analytics Problems with
prize money
• R is the most preferred language in these competitions
and even outside them
• 50% of the contest winners were found to be using R
• Other tools often have prohibitions, due to licenses etc,
so R is naturally preferred
Based on Number of Users (Data Analytics
Competitions)
Based on Languages used in Data Mining or
Analytics
Based on Software used in Data Mining or
Analytics
• R is the leader among programming languages used,
followed by Python (2015)
• R is the leader among the software used, followed by
RapidMiner (2015)
• The usage of these languages and software has a
direct relationship with the number of jobs which
have a programming language as their requirement
Based on Languages & Software used in
Data Mining or Analytics
Thank You


Editor's Notes

  1. Most analytics tools involve coding, and hence it is a trade off between user friendliness and scalability. GUI based analytics products work well with limited data but become unviable for large data. But SAS Enterprise Miner can handle large amounts of data.
  2. Results coming out of these models still need to be interpreted and insights derived by an analyst who understands the business.
  3. Results coming out of these models still need to be interpreted and insights derived by an analyst who understands the business.
  4. Works like a Black Box – if one needs to explain the algorithm or methodology to an analyst or end user, it is unexplainable and secret. It only gives results but doesn’t offer the explanation behind reaching those inferences or conclusions.