SlideShare a Scribd company logo
1 of 51
Download to read offline
Power of Ensembles______
Amit Kapoor
@amitkaps
Bargava Subramanian
@bargava
The Blind Men & the Elephant
___
“And so these men of Indostan
Disputed loud and long,
Each in his own opinion
Exceeding stiff and strong,
Though each was partly in the right,
And all were in the wrong.”
— John Godfrey Saxe
Model Abstraction
___
"All models are wrong, but some are
useful"
— George Box
Building Many Models
___
"All models are wrong, but some are
useful, and their combination may be
better"
Ensembles
___
(noun) /änˈsämbəl/
a group of items viewed as a whole
rather than individually.
Machine Learning Process
___
Frame: Problem definition
Acquire: Data ingestion
Refine: Data wrangling
Transform: Feature creation
Explore: Feature selection
Model: Model selection
Insight: Solution communication
Machine Learning Model Pipeline
___
Data --- Feature --- Model --- Model
Space Space Algorithm Parameters
(n) (f) (m) (p)
Machine Learning Challenge
___
"...the task of searching through a
hypotheses space to find a suitable
hypothesis that will make good
predictions for a particular problem"
Hypotheses Space
___
Data --- Feature --- Model --- Model
Space Space Algorithm Parameters
(n) (f) (m) (p)
|_________________________|
Hypotheses Space
Search Hypotheses Space
___
The key question is how do we efficiently
search this Hypotheses Space?
Illustrative Example
___
Binary Classification Problem
Classes: c = 2
Features: f = x1, x2, x3
Observations: n = 1,000
Dimensionality Reduction
___
"Easier to visualise the data space"
Principal Component Analysis
Dimensions = 2 --> (pc1, pc2)
Approach to Build Models
___
1. Train a model
2. Predict the probabilities for each
class
3. Score the model on AUC via 5-Fold
Cross-Validation
4. Show the Decision Space
Simple Model with Different Features
___
Data --- Feature --- Model --- Model
Space Space Algorithm Parameters
(n) (f) (m) (p)
100% x1,x2 Logistic C=1
100% x2,x3 Logistic C=1
100% x1,x3 Logistic C=1
100% pc1,pc2 Logistic C=1
Tune the Model Parameters
___
Data --- Feature --- Model --- Model
Space Space Algorithm Parameters
(n) (f) (m) (p)
100% pc1,pc2 Logistic C=1
100% pc1,pc2 Logistic C=1e-1
100% pc1,pc2 Logistic C=1e-3
100% pc1,pc2 Logistic C=1e-5
Make Many Simple Models
___
Data --- Feature --- Model --- Model
Space Space Algorithm Parameters
(n) (f) (m) (p)
100% pc1,pc2 Logistic C=1
100% pc1,pc2 SVM-Linear prob=True
100% pc1,pc2 Decision Tree d=full
100% pc1,pc2 KNN nn=3
Hypotheses Search Approach
___
Exhaustive search is impossible
Compute as a proxy for human IQ
Clever algorithmic way to search the
solution space
Ensemble Thought Process
___
"The goal of ensemble methods is to
combine the predictions of several base
estimators built with (one or multiple)
model algorithms in order to improve
generalisation and robustness over a
single estimator."
Ensemble Approach
___
Different training sets
Different feature sampling
Different algorithms
Different (hyper) parameters
Clever aggregation
Ensemble Methods
___
[1] Averaging
[2] Boosting
[3] Voting
[4] Stacking
[1] Averaging: Concept
___
"Build several estimators independently
and then average their predictions to
reduce model variance"
To ensemble several good models to
produce a less variance model.
[1] Averaging: Approaches
[1] Averaging: Simple Models
___
Data --- Feature --- Model --- Model
Space Space Algorithm Parameters
(n) (f) (m) (p)
R[50%] pc1,pc2 Decision Tree n_est=10
R[50%,r] pc1,pc2 Decision Tree n_est=10
100% R[50%] Decision Tree n_est=10
R[50%] R[50%] Decision Tree n_est=10
[1] Averaging: Extended Models
___
Use perturb-and-combine techniques
Bagging: Bootstrap Aggregation
Random Forest: Best split amongst random
features
Extremely Randomised: Random threshold
for split
[1] Averaging: Extended Models
___
Data --- Feature --- Model --- Model
Space Space Algorithm Parameters
(n) (f) (m) (p)
100% pc1,pc2 Decision Tree d=full
R[50%,r] pc1,pc2 Decision Tree n_est=10
R[50%,r] R[Split] Decision Tree n_est=10
R[50%,r] R[Split,thresh] Decision Tree n_est=10
[2] Boosting: Concept
___
"Build base estimators sequentially and
then try to reduce the bias of the
combined estimator."
Combine several weak models to produce a
powerful ensemble.
[2] Boosting: Concept
___
Source: Wang Tan
[2] Boosting: Models
___
Data --- Feature --- Model --- Model
Space Space Algorithm Parameters
(n) (f) (m) (p)
100% pc1,pc2 Decision Tree d=full
100% pc1,pc2 DTree Boost n_est=10
100% pc1,pc2 DTree Boost n_est=10, d=1
100% pc1,pc2 DTree Boost n_est=10, d=2
[3] Voting: Concept
___
"To use conceptually different model
algorithms and use majority vote or soft
vote (average probabilities) to
combine."
To ensemble a set of equally well
performing models in order to balance
out their individual weaknesses.
[3] Voting: Approach
[3] Voting: Approach
___
Hard Voting: the majority (mode) of the
class labels predicted
Soft Voting: the argmax of the sum of
predicted probabilities
Weighted Voting: the argmax of the
weighted sum of predicted probabilities*
[3] Voting: Models
___
Data --- Feature --- Model --- Model
Space Space Algorithm Parameters
(n) (f) (m) (p)
100% pc1,pc2 Gaussian -
100% pc1,pc2 SVM - RBF gamma=0.5, C=1
100% pc1,pc2 Gradient Boost n_est=10, d=2
----- ------- -------------- --------------
100% pc1,pc2 Voting Soft
[4] Stacking
___
"To use output of different model
algorithms as feature inputs for a meta
classifier."
To ensemble a set of well performing
models in order to uncover higher order
interaction effects
[4] Stacking: Approach
[4] Stacking: Models
___
Data --- Feature --- Model --- Model
Space Space Algorithm Parameters
(n) (f) (m) (p)
100% pc1,pc2 Gaussian -
100% pc1,pc2 SVM - RBF gamma=0.5, C=1
100% pc1,pc2 Gradient Boost n_est=10, d=2
----- ------- -------------- --------------
100% pc1,pc2 Stacking Logistic()
Ensemble Advantages
___
Improved accuracy
Robustness and better generalisation
Parallelization
Ensemble Disadvantages
___
Model human readability isn't great
Time/effort trade-off to improve accuracy
may not make sense
Advanced Ensemble Approach
___
— Grid Search for voting weights
— Bayesian Optimization for weights &
hyper-parameters
— Feature Engineering e.g. tree or
cluster embedding
Ensembles
___
"It is the harmony of the diverse parts,
their symmetry, their happy balance; in
a word it is all that introduces order,
all that gives unity, that permits us to
see clearly and to comprehend at once
both the ensemble and the details."
- Henri Poincare
Code and Slides
____
All the code and slides are available at
https://github.com/amitkaps/ensemble
Contact
______
Amit Kapoor
@amitkaps
amitkaps.com
Bargava Subramanian
@bargava

More Related Content

What's hot

Introduction To Generative Adversarial Networks GANs
Introduction To Generative Adversarial Networks GANsIntroduction To Generative Adversarial Networks GANs
Introduction To Generative Adversarial Networks GANsHichem Felouat
 
Genetic programming
Genetic programmingGenetic programming
Genetic programmingOmar Ghazi
 
Rabbit challenge 3 DNN Day2
Rabbit challenge 3 DNN Day2Rabbit challenge 3 DNN Day2
Rabbit challenge 3 DNN Day2TOMMYLINK1
 
Introduction of Xgboost
Introduction of XgboostIntroduction of Xgboost
Introduction of Xgboostmichiaki ito
 
NIPS読み会2013: One-shot learning by inverting a compositional causal process
NIPS読み会2013: One-shot learning by inverting  a compositional causal processNIPS読み会2013: One-shot learning by inverting  a compositional causal process
NIPS読み会2013: One-shot learning by inverting a compositional causal processnozyh
 
GBM package in r
GBM package in rGBM package in r
GBM package in rmark_landry
 
Predict future time series forecasting
Predict future time series forecastingPredict future time series forecasting
Predict future time series forecastingHichem Felouat
 
Side Notes on Practical Natural Language Processing: Bootstrap Test
Side Notes on Practical Natural Language Processing: Bootstrap TestSide Notes on Practical Natural Language Processing: Bootstrap Test
Side Notes on Practical Natural Language Processing: Bootstrap TestHarithaGangavarapu
 
Comparison Study of Decision Tree Ensembles for Regression
Comparison Study of Decision Tree Ensembles for RegressionComparison Study of Decision Tree Ensembles for Regression
Comparison Study of Decision Tree Ensembles for RegressionSeonho Park
 
Machine Learning with Go
Machine Learning with GoMachine Learning with Go
Machine Learning with GoJames Bowman
 
Vc dimension in Machine Learning
Vc dimension in Machine LearningVc dimension in Machine Learning
Vc dimension in Machine LearningVARUN KUMAR
 
XGBoost: the algorithm that wins every competition
XGBoost: the algorithm that wins every competitionXGBoost: the algorithm that wins every competition
XGBoost: the algorithm that wins every competitionJaroslaw Szymczak
 
Tree models with Scikit-Learn: Great models with little assumptions
Tree models with Scikit-Learn: Great models with little assumptionsTree models with Scikit-Learn: Great models with little assumptions
Tree models with Scikit-Learn: Great models with little assumptionsGilles Louppe
 
Machine Learning Algorithms
Machine Learning AlgorithmsMachine Learning Algorithms
Machine Learning AlgorithmsHichem Felouat
 
Machine learning
Machine learningMachine learning
Machine learningShreyas G S
 
Anomaly detection using deep one class classifier
Anomaly detection using deep one class classifierAnomaly detection using deep one class classifier
Anomaly detection using deep one class classifier홍배 김
 
Gradient boosting in practice: a deep dive into xgboost
Gradient boosting in practice: a deep dive into xgboostGradient boosting in practice: a deep dive into xgboost
Gradient boosting in practice: a deep dive into xgboostJaroslaw Szymczak
 

What's hot (20)

Introduction To Generative Adversarial Networks GANs
Introduction To Generative Adversarial Networks GANsIntroduction To Generative Adversarial Networks GANs
Introduction To Generative Adversarial Networks GANs
 
Genetic programming
Genetic programmingGenetic programming
Genetic programming
 
Rabbit challenge 3 DNN Day2
Rabbit challenge 3 DNN Day2Rabbit challenge 3 DNN Day2
Rabbit challenge 3 DNN Day2
 
Introduction of Xgboost
Introduction of XgboostIntroduction of Xgboost
Introduction of Xgboost
 
Xgboost
XgboostXgboost
Xgboost
 
NIPS読み会2013: One-shot learning by inverting a compositional causal process
NIPS読み会2013: One-shot learning by inverting  a compositional causal processNIPS読み会2013: One-shot learning by inverting  a compositional causal process
NIPS読み会2013: One-shot learning by inverting a compositional causal process
 
GBM package in r
GBM package in rGBM package in r
GBM package in r
 
Predict future time series forecasting
Predict future time series forecastingPredict future time series forecasting
Predict future time series forecasting
 
Side Notes on Practical Natural Language Processing: Bootstrap Test
Side Notes on Practical Natural Language Processing: Bootstrap TestSide Notes on Practical Natural Language Processing: Bootstrap Test
Side Notes on Practical Natural Language Processing: Bootstrap Test
 
Comparison Study of Decision Tree Ensembles for Regression
Comparison Study of Decision Tree Ensembles for RegressionComparison Study of Decision Tree Ensembles for Regression
Comparison Study of Decision Tree Ensembles for Regression
 
Machine Learning with Go
Machine Learning with GoMachine Learning with Go
Machine Learning with Go
 
Vc dimension in Machine Learning
Vc dimension in Machine LearningVc dimension in Machine Learning
Vc dimension in Machine Learning
 
XGBoost: the algorithm that wins every competition
XGBoost: the algorithm that wins every competitionXGBoost: the algorithm that wins every competition
XGBoost: the algorithm that wins every competition
 
Tree models with Scikit-Learn: Great models with little assumptions
Tree models with Scikit-Learn: Great models with little assumptionsTree models with Scikit-Learn: Great models with little assumptions
Tree models with Scikit-Learn: Great models with little assumptions
 
Machine Learning Algorithms
Machine Learning AlgorithmsMachine Learning Algorithms
Machine Learning Algorithms
 
Ml presentation
Ml presentationMl presentation
Ml presentation
 
Xgboost
XgboostXgboost
Xgboost
 
Machine learning
Machine learningMachine learning
Machine learning
 
Anomaly detection using deep one class classifier
Anomaly detection using deep one class classifierAnomaly detection using deep one class classifier
Anomaly detection using deep one class classifier
 
Gradient boosting in practice: a deep dive into xgboost
Gradient boosting in practice: a deep dive into xgboostGradient boosting in practice: a deep dive into xgboost
Gradient boosting in practice: a deep dive into xgboost
 

Viewers also liked

Python Visualisation for Data Science
Python Visualisation for Data SciencePython Visualisation for Data Science
Python Visualisation for Data ScienceAmit Kapoor
 
Kontrak kuliah pengetahuan lingkungan
Kontrak kuliah pengetahuan lingkunganKontrak kuliah pengetahuan lingkungan
Kontrak kuliah pengetahuan lingkunganHotnida D'kanda
 
Sanjeet kumar social media profile
Sanjeet kumar social media profileSanjeet kumar social media profile
Sanjeet kumar social media profileSanjeet Kumar
 
Co to jest psychologia?
Co to jest psychologia?Co to jest psychologia?
Co to jest psychologia?Paula Pilarska
 
Cuadro comparativo linux, windows y android
Cuadro comparativo linux, windows y androidCuadro comparativo linux, windows y android
Cuadro comparativo linux, windows y androidSthefy Zavala
 
презентація сучасний кабінет інструментарій педагога в підготовці кваліфіко...
презентація сучасний кабінет   інструментарій педагога в підготовці кваліфіко...презентація сучасний кабінет   інструментарій педагога в підготовці кваліфіко...
презентація сучасний кабінет інструментарій педагога в підготовці кваліфіко...Stepan Revin
 
SARK Trading & Marketing Intl Trade Show Brochure
SARK Trading & Marketing Intl Trade Show BrochureSARK Trading & Marketing Intl Trade Show Brochure
SARK Trading & Marketing Intl Trade Show BrochureSadik R Khan
 
Visualising Big Data
Visualising Big DataVisualising Big Data
Visualising Big DataAmit Kapoor
 
Telling Stories with Data - Using Story Spine
Telling Stories with Data - Using Story SpineTelling Stories with Data - Using Story Spine
Telling Stories with Data - Using Story SpineAmit Kapoor
 
Atmosfer dan pencemaran udara
Atmosfer dan pencemaran udaraAtmosfer dan pencemaran udara
Atmosfer dan pencemaran udaraHotnida D'kanda
 
Ensemble Learning: The Wisdom of Crowds (of Machines)
Ensemble Learning: The Wisdom of Crowds (of Machines)Ensemble Learning: The Wisdom of Crowds (of Machines)
Ensemble Learning: The Wisdom of Crowds (of Machines)Lior Rokach
 
Five Things I Wish I Knew the First Day I Used Tableau
Five Things I Wish I Knew the First Day I Used TableauFive Things I Wish I Knew the First Day I Used Tableau
Five Things I Wish I Knew the First Day I Used TableauRyan Sleeper
 
SORACOM Bootcamp Rec1 - SORACOM Air (1)
SORACOM Bootcamp Rec1 - SORACOM Air (1)SORACOM Bootcamp Rec1 - SORACOM Air (1)
SORACOM Bootcamp Rec1 - SORACOM Air (1)SORACOM,INC
 
Data driven storytelling tips from an iron viz champion ryan sleeper
Data driven storytelling tips from an iron viz champion   ryan sleeperData driven storytelling tips from an iron viz champion   ryan sleeper
Data driven storytelling tips from an iron viz champion ryan sleeperRyan Sleeper
 
SORACOM Bootcamp Rec5 - SORACOM Funnel
SORACOM Bootcamp Rec5 - SORACOM FunnelSORACOM Bootcamp Rec5 - SORACOM Funnel
SORACOM Bootcamp Rec5 - SORACOM FunnelSORACOM,INC
 
Crafting Visual Stories with Data
Crafting Visual Stories with DataCrafting Visual Stories with Data
Crafting Visual Stories with DataAmit Kapoor
 
Learning the Craft of Data Visualisation
Learning the Craft of Data VisualisationLearning the Craft of Data Visualisation
Learning the Craft of Data VisualisationAmit Kapoor
 
Visualising Multi Dimensional Data
Visualising Multi Dimensional DataVisualising Multi Dimensional Data
Visualising Multi Dimensional DataAmit Kapoor
 

Viewers also liked (20)

Python Visualisation for Data Science
Python Visualisation for Data SciencePython Visualisation for Data Science
Python Visualisation for Data Science
 
Kontrak kuliah pengetahuan lingkungan
Kontrak kuliah pengetahuan lingkunganKontrak kuliah pengetahuan lingkungan
Kontrak kuliah pengetahuan lingkungan
 
Sanjeet kumar social media profile
Sanjeet kumar social media profileSanjeet kumar social media profile
Sanjeet kumar social media profile
 
Co to jest psychologia?
Co to jest psychologia?Co to jest psychologia?
Co to jest psychologia?
 
Cuadro comparativo linux, windows y android
Cuadro comparativo linux, windows y androidCuadro comparativo linux, windows y android
Cuadro comparativo linux, windows y android
 
What is psychology
What is psychologyWhat is psychology
What is psychology
 
презентація сучасний кабінет інструментарій педагога в підготовці кваліфіко...
презентація сучасний кабінет   інструментарій педагога в підготовці кваліфіко...презентація сучасний кабінет   інструментарій педагога в підготовці кваліфіко...
презентація сучасний кабінет інструментарій педагога в підготовці кваліфіко...
 
SARK Trading & Marketing Intl Trade Show Brochure
SARK Trading & Marketing Intl Trade Show BrochureSARK Trading & Marketing Intl Trade Show Brochure
SARK Trading & Marketing Intl Trade Show Brochure
 
2016 04-07 презентация
2016 04-07 презентация2016 04-07 презентация
2016 04-07 презентация
 
Visualising Big Data
Visualising Big DataVisualising Big Data
Visualising Big Data
 
Telling Stories with Data - Using Story Spine
Telling Stories with Data - Using Story SpineTelling Stories with Data - Using Story Spine
Telling Stories with Data - Using Story Spine
 
Atmosfer dan pencemaran udara
Atmosfer dan pencemaran udaraAtmosfer dan pencemaran udara
Atmosfer dan pencemaran udara
 
Ensemble Learning: The Wisdom of Crowds (of Machines)
Ensemble Learning: The Wisdom of Crowds (of Machines)Ensemble Learning: The Wisdom of Crowds (of Machines)
Ensemble Learning: The Wisdom of Crowds (of Machines)
 
Five Things I Wish I Knew the First Day I Used Tableau
Five Things I Wish I Knew the First Day I Used TableauFive Things I Wish I Knew the First Day I Used Tableau
Five Things I Wish I Knew the First Day I Used Tableau
 
SORACOM Bootcamp Rec1 - SORACOM Air (1)
SORACOM Bootcamp Rec1 - SORACOM Air (1)SORACOM Bootcamp Rec1 - SORACOM Air (1)
SORACOM Bootcamp Rec1 - SORACOM Air (1)
 
Data driven storytelling tips from an iron viz champion ryan sleeper
Data driven storytelling tips from an iron viz champion   ryan sleeperData driven storytelling tips from an iron viz champion   ryan sleeper
Data driven storytelling tips from an iron viz champion ryan sleeper
 
SORACOM Bootcamp Rec5 - SORACOM Funnel
SORACOM Bootcamp Rec5 - SORACOM FunnelSORACOM Bootcamp Rec5 - SORACOM Funnel
SORACOM Bootcamp Rec5 - SORACOM Funnel
 
Crafting Visual Stories with Data
Crafting Visual Stories with DataCrafting Visual Stories with Data
Crafting Visual Stories with Data
 
Learning the Craft of Data Visualisation
Learning the Craft of Data VisualisationLearning the Craft of Data Visualisation
Learning the Craft of Data Visualisation
 
Visualising Multi Dimensional Data
Visualising Multi Dimensional DataVisualising Multi Dimensional Data
Visualising Multi Dimensional Data
 

Similar to The Power of Ensembles in Machine Learning

Two methods for optimising cognitive model parameters
Two methods for optimising cognitive model parametersTwo methods for optimising cognitive model parameters
Two methods for optimising cognitive model parametersUniversity of Huddersfield
 
Computational Intelligence Assisted Engineering Design Optimization (using MA...
Computational Intelligence Assisted Engineering Design Optimization (using MA...Computational Intelligence Assisted Engineering Design Optimization (using MA...
Computational Intelligence Assisted Engineering Design Optimization (using MA...AmirParnianifard1
 
(Machine Learning) Ensemble learning
(Machine Learning) Ensemble learning (Machine Learning) Ensemble learning
(Machine Learning) Ensemble learning Omkar Rane
 
Lecture7 cross validation
Lecture7 cross validationLecture7 cross validation
Lecture7 cross validationStéphane Canu
 
Machine learning (5)
Machine learning (5)Machine learning (5)
Machine learning (5)NYversity
 
Transfer Learning for Improving Model Predictions in Highly Configurable Soft...
Transfer Learning for Improving Model Predictions in Highly Configurable Soft...Transfer Learning for Improving Model Predictions in Highly Configurable Soft...
Transfer Learning for Improving Model Predictions in Highly Configurable Soft...Pooyan Jamshidi
 
Evaluation of a hybrid method for constructing multiple SVM kernels
Evaluation of a hybrid method for constructing multiple SVM kernelsEvaluation of a hybrid method for constructing multiple SVM kernels
Evaluation of a hybrid method for constructing multiple SVM kernelsinfopapers
 
Summary.ppt
Summary.pptSummary.ppt
Summary.pptbutest
 
[PR12] PR-036 Learning to Remember Rare Events
[PR12] PR-036 Learning to Remember Rare Events[PR12] PR-036 Learning to Remember Rare Events
[PR12] PR-036 Learning to Remember Rare EventsTaegyun Jeon
 
PPT - AutoML-Zero: Evolving Machine Learning Algorithms From Scratch
PPT - AutoML-Zero: Evolving Machine Learning Algorithms From ScratchPPT - AutoML-Zero: Evolving Machine Learning Algorithms From Scratch
PPT - AutoML-Zero: Evolving Machine Learning Algorithms From ScratchJisang Yoon
 
Deep Learning for Developers
Deep Learning for DevelopersDeep Learning for Developers
Deep Learning for DevelopersJulien SIMON
 
The Concurrent Constraint Programming Research Programmes -- Redux (part2)
The Concurrent Constraint Programming Research Programmes -- Redux (part2)The Concurrent Constraint Programming Research Programmes -- Redux (part2)
The Concurrent Constraint Programming Research Programmes -- Redux (part2)Pierre Schaus
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_financeStefan Duprey
 
Intro to Machine Learning for non-Data Scientists
Intro to Machine Learning for non-Data ScientistsIntro to Machine Learning for non-Data Scientists
Intro to Machine Learning for non-Data ScientistsParinaz Ameri
 
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at GoogleDataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at GoogleHakka Labs
 
This is a heavily data-oriented
This is a heavily data-orientedThis is a heavily data-oriented
This is a heavily data-orientedbutest
 
This is a heavily data-oriented
This is a heavily data-orientedThis is a heavily data-oriented
This is a heavily data-orientedbutest
 
An Adaptive Masker for the Differential Evolution Algorithm
An Adaptive Masker for the Differential Evolution AlgorithmAn Adaptive Masker for the Differential Evolution Algorithm
An Adaptive Masker for the Differential Evolution AlgorithmIOSR Journals
 

Similar to The Power of Ensembles in Machine Learning (20)

Two methods for optimising cognitive model parameters
Two methods for optimising cognitive model parametersTwo methods for optimising cognitive model parameters
Two methods for optimising cognitive model parameters
 
Computational Intelligence Assisted Engineering Design Optimization (using MA...
Computational Intelligence Assisted Engineering Design Optimization (using MA...Computational Intelligence Assisted Engineering Design Optimization (using MA...
Computational Intelligence Assisted Engineering Design Optimization (using MA...
 
(Machine Learning) Ensemble learning
(Machine Learning) Ensemble learning (Machine Learning) Ensemble learning
(Machine Learning) Ensemble learning
 
Lecture7 cross validation
Lecture7 cross validationLecture7 cross validation
Lecture7 cross validation
 
Machine learning (5)
Machine learning (5)Machine learning (5)
Machine learning (5)
 
Transfer Learning for Improving Model Predictions in Highly Configurable Soft...
Transfer Learning for Improving Model Predictions in Highly Configurable Soft...Transfer Learning for Improving Model Predictions in Highly Configurable Soft...
Transfer Learning for Improving Model Predictions in Highly Configurable Soft...
 
Evaluation of a hybrid method for constructing multiple SVM kernels
Evaluation of a hybrid method for constructing multiple SVM kernelsEvaluation of a hybrid method for constructing multiple SVM kernels
Evaluation of a hybrid method for constructing multiple SVM kernels
 
Summary.ppt
Summary.pptSummary.ppt
Summary.ppt
 
[PR12] PR-036 Learning to Remember Rare Events
[PR12] PR-036 Learning to Remember Rare Events[PR12] PR-036 Learning to Remember Rare Events
[PR12] PR-036 Learning to Remember Rare Events
 
MyStataLab Assignment Help
MyStataLab Assignment HelpMyStataLab Assignment Help
MyStataLab Assignment Help
 
PPT - AutoML-Zero: Evolving Machine Learning Algorithms From Scratch
PPT - AutoML-Zero: Evolving Machine Learning Algorithms From ScratchPPT - AutoML-Zero: Evolving Machine Learning Algorithms From Scratch
PPT - AutoML-Zero: Evolving Machine Learning Algorithms From Scratch
 
Machine learning
Machine learningMachine learning
Machine learning
 
Deep Learning for Developers
Deep Learning for DevelopersDeep Learning for Developers
Deep Learning for Developers
 
The Concurrent Constraint Programming Research Programmes -- Redux (part2)
The Concurrent Constraint Programming Research Programmes -- Redux (part2)The Concurrent Constraint Programming Research Programmes -- Redux (part2)
The Concurrent Constraint Programming Research Programmes -- Redux (part2)
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_finance
 
Intro to Machine Learning for non-Data Scientists
Intro to Machine Learning for non-Data ScientistsIntro to Machine Learning for non-Data Scientists
Intro to Machine Learning for non-Data Scientists
 
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at GoogleDataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
 
This is a heavily data-oriented
This is a heavily data-orientedThis is a heavily data-oriented
This is a heavily data-oriented
 
This is a heavily data-oriented
This is a heavily data-orientedThis is a heavily data-oriented
This is a heavily data-oriented
 
An Adaptive Masker for the Differential Evolution Algorithm
An Adaptive Masker for the Differential Evolution AlgorithmAn Adaptive Masker for the Differential Evolution Algorithm
An Adaptive Masker for the Differential Evolution Algorithm
 

More from Amit Kapoor

Deep Learning for NLP
Deep Learning for NLPDeep Learning for NLP
Deep Learning for NLPAmit Kapoor
 
Model Visualisation
Model VisualisationModel Visualisation
Model VisualisationAmit Kapoor
 
Storytelling with Data - Approach | Skills
Storytelling with Data - Approach | SkillsStorytelling with Data - Approach | Skills
Storytelling with Data - Approach | SkillsAmit Kapoor
 
Tools & Resources for Data Visualisation
Tools & Resources for Data VisualisationTools & Resources for Data Visualisation
Tools & Resources for Data VisualisationAmit Kapoor
 
Fifth Elephant 2014 talk - Crafting Visual Stories with Data
Fifth Elephant 2014 talk - Crafting Visual Stories with DataFifth Elephant 2014 talk - Crafting Visual Stories with Data
Fifth Elephant 2014 talk - Crafting Visual Stories with DataAmit Kapoor
 
Storytelling with Data - See | Show | Tell | Engage
Storytelling with Data - See | Show | Tell | EngageStorytelling with Data - See | Show | Tell | Engage
Storytelling with Data - See | Show | Tell | EngageAmit Kapoor
 
Business Process Improvement - A Strategic and Supply Chain Perspective
Business Process Improvement - A Strategic and Supply Chain Perspective Business Process Improvement - A Strategic and Supply Chain Perspective
Business Process Improvement - A Strategic and Supply Chain Perspective Amit Kapoor
 
What makes a data-story work?
What makes a data-story work?What makes a data-story work?
What makes a data-story work?Amit Kapoor
 
What is Strategy - Thinking like a Strategist
What is Strategy - Thinking like a StrategistWhat is Strategy - Thinking like a Strategist
What is Strategy - Thinking like a StrategistAmit Kapoor
 
Story Structure and Modern Storytelling
Story Structure and Modern StorytellingStory Structure and Modern Storytelling
Story Structure and Modern StorytellingAmit Kapoor
 
Targeting the Moment of Truth - Using Big Data in Retail
Targeting the Moment of Truth - Using Big Data in RetailTargeting the Moment of Truth - Using Big Data in Retail
Targeting the Moment of Truth - Using Big Data in RetailAmit Kapoor
 
Storytelling - Gutenberg
Storytelling - GutenbergStorytelling - Gutenberg
Storytelling - GutenbergAmit Kapoor
 
Analytics in Consulting
Analytics in ConsultingAnalytics in Consulting
Analytics in ConsultingAmit Kapoor
 
Retail Pricing Perspective
Retail Pricing PerspectiveRetail Pricing Perspective
Retail Pricing PerspectiveAmit Kapoor
 

More from Amit Kapoor (14)

Deep Learning for NLP
Deep Learning for NLPDeep Learning for NLP
Deep Learning for NLP
 
Model Visualisation
Model VisualisationModel Visualisation
Model Visualisation
 
Storytelling with Data - Approach | Skills
Storytelling with Data - Approach | SkillsStorytelling with Data - Approach | Skills
Storytelling with Data - Approach | Skills
 
Tools & Resources for Data Visualisation
Tools & Resources for Data VisualisationTools & Resources for Data Visualisation
Tools & Resources for Data Visualisation
 
Fifth Elephant 2014 talk - Crafting Visual Stories with Data
Fifth Elephant 2014 talk - Crafting Visual Stories with DataFifth Elephant 2014 talk - Crafting Visual Stories with Data
Fifth Elephant 2014 talk - Crafting Visual Stories with Data
 
Storytelling with Data - See | Show | Tell | Engage
Storytelling with Data - See | Show | Tell | EngageStorytelling with Data - See | Show | Tell | Engage
Storytelling with Data - See | Show | Tell | Engage
 
Business Process Improvement - A Strategic and Supply Chain Perspective
Business Process Improvement - A Strategic and Supply Chain Perspective Business Process Improvement - A Strategic and Supply Chain Perspective
Business Process Improvement - A Strategic and Supply Chain Perspective
 
What makes a data-story work?
What makes a data-story work?What makes a data-story work?
What makes a data-story work?
 
What is Strategy - Thinking like a Strategist
What is Strategy - Thinking like a StrategistWhat is Strategy - Thinking like a Strategist
What is Strategy - Thinking like a Strategist
 
Story Structure and Modern Storytelling
Story Structure and Modern StorytellingStory Structure and Modern Storytelling
Story Structure and Modern Storytelling
 
Targeting the Moment of Truth - Using Big Data in Retail
Targeting the Moment of Truth - Using Big Data in RetailTargeting the Moment of Truth - Using Big Data in Retail
Targeting the Moment of Truth - Using Big Data in Retail
 
Storytelling - Gutenberg
Storytelling - GutenbergStorytelling - Gutenberg
Storytelling - Gutenberg
 
Analytics in Consulting
Analytics in ConsultingAnalytics in Consulting
Analytics in Consulting
 
Retail Pricing Perspective
Retail Pricing PerspectiveRetail Pricing Perspective
Retail Pricing Perspective
 

Recently uploaded

PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.pptamreenkhanum0307
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 

Recently uploaded (20)

PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.ppt
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 

The Power of Ensembles in Machine Learning

  • 1. Power of Ensembles______ Amit Kapoor @amitkaps Bargava Subramanian @bargava
  • 2. The Blind Men & the Elephant ___ “And so these men of Indostan Disputed loud and long, Each in his own opinion Exceeding stiff and strong, Though each was partly in the right, And all were in the wrong.” — John Godfrey Saxe
  • 3. Model Abstraction ___ "All models are wrong, but some are useful" — George Box
  • 4. Building Many Models ___ "All models are wrong, but some are useful, and their combination may be better"
  • 5. Ensembles ___ (noun) /änˈsämbəl/ a group of items viewed as a whole rather than individually.
  • 6. Machine Learning Process ___ Frame: Problem definition Acquire: Data ingestion Refine: Data wrangling Transform: Feature creation Explore: Feature selection Model: Model selection Insight: Solution communication
  • 7. Machine Learning Model Pipeline ___ Data --- Feature --- Model --- Model Space Space Algorithm Parameters (n) (f) (m) (p)
  • 8. Machine Learning Challenge ___ "...the task of searching through a hypotheses space to find a suitable hypothesis that will make good predictions for a particular problem"
  • 9. Hypotheses Space ___ Data --- Feature --- Model --- Model Space Space Algorithm Parameters (n) (f) (m) (p) |_________________________| Hypotheses Space
  • 10. Search Hypotheses Space ___ The key question is how do we efficiently search this Hypotheses Space?
  • 11. Illustrative Example ___ Binary Classification Problem Classes: c = 2 Features: f = x1, x2, x3 Observations: n = 1,000
  • 12.
  • 13. Dimensionality Reduction ___ "Easier to visualise the data space" Principal Component Analysis Dimensions = 2 --> (pc1, pc2)
  • 14.
  • 15. Approach to Build Models ___ 1. Train a model 2. Predict the probabilities for each class 3. Score the model on AUC via 5-Fold Cross-Validation 4. Show the Decision Space
  • 16. Simple Model with Different Features ___ Data --- Feature --- Model --- Model Space Space Algorithm Parameters (n) (f) (m) (p) 100% x1,x2 Logistic C=1 100% x2,x3 Logistic C=1 100% x1,x3 Logistic C=1 100% pc1,pc2 Logistic C=1
  • 17.
  • 18. Tune the Model Parameters ___ Data --- Feature --- Model --- Model Space Space Algorithm Parameters (n) (f) (m) (p) 100% pc1,pc2 Logistic C=1 100% pc1,pc2 Logistic C=1e-1 100% pc1,pc2 Logistic C=1e-3 100% pc1,pc2 Logistic C=1e-5
  • 19.
  • 20. Make Many Simple Models ___ Data --- Feature --- Model --- Model Space Space Algorithm Parameters (n) (f) (m) (p) 100% pc1,pc2 Logistic C=1 100% pc1,pc2 SVM-Linear prob=True 100% pc1,pc2 Decision Tree d=full 100% pc1,pc2 KNN nn=3
  • 21.
  • 22. Hypotheses Search Approach ___ Exhaustive search is impossible Compute as a proxy for human IQ Clever algorithmic way to search the solution space
  • 23. Ensemble Thought Process ___ "The goal of ensemble methods is to combine the predictions of several base estimators built with (one or multiple) model algorithms in order to improve generalisation and robustness over a single estimator."
  • 24. Ensemble Approach ___ Different training sets Different feature sampling Different algorithms Different (hyper) parameters Clever aggregation
  • 25. Ensemble Methods ___ [1] Averaging [2] Boosting [3] Voting [4] Stacking
  • 26. [1] Averaging: Concept ___ "Build several estimators independently and then average their predictions to reduce model variance" To ensemble several good models to produce a less variance model.
  • 28. [1] Averaging: Simple Models ___ Data --- Feature --- Model --- Model Space Space Algorithm Parameters (n) (f) (m) (p) R[50%] pc1,pc2 Decision Tree n_est=10 R[50%,r] pc1,pc2 Decision Tree n_est=10 100% R[50%] Decision Tree n_est=10 R[50%] R[50%] Decision Tree n_est=10
  • 29.
  • 30. [1] Averaging: Extended Models ___ Use perturb-and-combine techniques Bagging: Bootstrap Aggregation Random Forest: Best split amongst random features Extremely Randomised: Random threshold for split
  • 31. [1] Averaging: Extended Models ___ Data --- Feature --- Model --- Model Space Space Algorithm Parameters (n) (f) (m) (p) 100% pc1,pc2 Decision Tree d=full R[50%,r] pc1,pc2 Decision Tree n_est=10 R[50%,r] R[Split] Decision Tree n_est=10 R[50%,r] R[Split,thresh] Decision Tree n_est=10
  • 32.
  • 33. [2] Boosting: Concept ___ "Build base estimators sequentially and then try to reduce the bias of the combined estimator." Combine several weak models to produce a powerful ensemble.
  • 35. [2] Boosting: Models ___ Data --- Feature --- Model --- Model Space Space Algorithm Parameters (n) (f) (m) (p) 100% pc1,pc2 Decision Tree d=full 100% pc1,pc2 DTree Boost n_est=10 100% pc1,pc2 DTree Boost n_est=10, d=1 100% pc1,pc2 DTree Boost n_est=10, d=2
  • 36.
  • 37. [3] Voting: Concept ___ "To use conceptually different model algorithms and use majority vote or soft vote (average probabilities) to combine." To ensemble a set of equally well performing models in order to balance out their individual weaknesses.
  • 39. [3] Voting: Approach ___ Hard Voting: the majority (mode) of the class labels predicted Soft Voting: the argmax of the sum of predicted probabilities Weighted Voting: the argmax of the weighted sum of predicted probabilities*
  • 40. [3] Voting: Models ___ Data --- Feature --- Model --- Model Space Space Algorithm Parameters (n) (f) (m) (p) 100% pc1,pc2 Gaussian - 100% pc1,pc2 SVM - RBF gamma=0.5, C=1 100% pc1,pc2 Gradient Boost n_est=10, d=2 ----- ------- -------------- -------------- 100% pc1,pc2 Voting Soft
  • 41.
  • 42. [4] Stacking ___ "To use output of different model algorithms as feature inputs for a meta classifier." To ensemble a set of well performing models in order to uncover higher order interaction effects
  • 44. [4] Stacking: Models ___ Data --- Feature --- Model --- Model Space Space Algorithm Parameters (n) (f) (m) (p) 100% pc1,pc2 Gaussian - 100% pc1,pc2 SVM - RBF gamma=0.5, C=1 100% pc1,pc2 Gradient Boost n_est=10, d=2 ----- ------- -------------- -------------- 100% pc1,pc2 Stacking Logistic()
  • 45.
  • 46. Ensemble Advantages ___ Improved accuracy Robustness and better generalisation Parallelization
  • 47. Ensemble Disadvantages ___ Model human readability isn't great Time/effort trade-off to improve accuracy may not make sense
  • 48. Advanced Ensemble Approach ___ — Grid Search for voting weights — Bayesian Optimization for weights & hyper-parameters — Feature Engineering e.g. tree or cluster embedding
  • 49. Ensembles ___ "It is the harmony of the diverse parts, their symmetry, their happy balance; in a word it is all that introduces order, all that gives unity, that permits us to see clearly and to comprehend at once both the ensemble and the details." - Henri Poincare
  • 50. Code and Slides ____ All the code and slides are available at https://github.com/amitkaps/ensemble