SlideShare uma empresa Scribd logo
1 de 20
Baixar para ler offline
Some Take-Home Messages (THM) about ML....
Data Science Meetup
Gianluca Bontempi
Interuniversity Institute of Bioinformatics in Brussels, (IB)2
Machine Learning Group,
Computer Science Department, ULB
mlg.ulb.ac.be, ibsquare.be
May 20, 2016
Introducing myself
1992: Computer science engineer (Politecnico di Milano, Italy),
1994: Researcher in robotics in IRST, Trento, Italy,
1995: Researcher in IRIDIA, ULB, Brussels,
1996-97: Researcher in IDSIA, Lugano, Switzerland,
1998-2000: Marie Curie fellowship in IRIDIA, ULB,
2000-2001: Scientist in Philips Research, Eindhoven, The
Netherlands,
2001-2002: Scientist in IMEC, Microelectronics Institute,
Leuven, Belgium,
since 2002: professor in Machine Learning, Modeling and
Simulation, Bioinformatics in ULB Computer Science Dept.,
since 2004: head of the ULB Machine Learning Group (MLG).
since 2013: director of the Interuniversity Institute of
Bioinformatics in Brussels (IB)2, ibsquare.be.
What is machine learning?
Machine learning is that domain of computational intelligence
which is concerned with the question of how to construct computer
programs that automatically improve with experience. (Mitchell,
97)
Reductionist attitude: ML is just a buzzword which equates to
statistics plus marketing
Positive attitude: ML paved the way to the treatment of real
problems related to data analysis, sometimes
overlooked by statisticians (nonlinearity, classification,
pattern recognition, missing variables, adaptivity,
optimization, massive datasets, data management,
causality, representation of knowledge, parallelisation)
Interdisciplinary attitude: ML should have its roots on statistics
and complements it by focusing on: algorithmic
issues, computational efficiency, data engineering.
Prediction is pervasive ...
Prediction is pervasive ...
Predict
whether you will like a book/movie (collaborative filtering)
credit applicants as low, medium, or high risk.
which home telephone lines are used for Internet access.
which customers are likely to stop being customers (churn).
the value of a piece of real estate
which telephone subscribers will order a 4G service
which CARREFOUR clients will be more interested to a
discount in Italian products.
the probability that a company is employing black workers
(anti-fraud detection)
the survival risk of a patient on the basis of a genetic signature
the probability of a crime in an urban area.
the key of a cryptographic algorithm on the basis of power
consumption
Supervised learning
First assumption: learning is essentially about prediction !
Second assumption: reality is stochastic, dependency and
uncertainty are well described by conditional probability.
PREDICTION
TARGET
TRAINING
DATASET
INPUT OUTPUT
ERROR
PREDICTION
MODEL
measurable features (inputs)
measurable target variables (outputs) and accuracy criteria
data (in God we trust, all the others must bring data)
THM1: formalizing a problem as a prediction problem is often the
most important contribution of a data scientist!
It is all about ...
1 Probabilistic modeling
it formalizes uncertainty and dependency (regression function)
notions of entropy and information
relevant and irrelevant features (e.g. Markov blanket notion)
Bayesian networks, causal reasoning
2 Estimation
bias/variance notions
generalization issues: underfitting vs overfitting
Bayesian, frequentist, decision theory
validation
combination/averaging of estimators (bagging, boosting)
3 Optimization
Maximum likelihood, least squares, backpropagation
Dual problems (SVM)
L1, L2 norm (lasso)
4 Computer science
implementation, algorithms
parallelism, scalability
data management
So ... how to teach machine learning?
Focus on ...
Formalism ?
Algorithms ?
Coding ?
Applications ?
Of course all is important but what is the essence, what is common
to the exploding number of algorithms, techniques, fancy
applications?
Estimation
STOCHASTIC PHENOMENON
DATA
LEARNER
DATA DATA
MODEL,
PREDICTION
LEARNER
MODEL,
PREDICTION
LEARNER
MODEL,
PREDICTION
THM2: a predictor is an estimator, i.e. an algorithm (black-box)
which takes data and returns a prediction.
THM3: reality is stochastic, so data is stochastic and prediction is
stochastic.
Assessing in an un uncertain world (Baggio, 1998)
non aver paura di sbagliare un calcio di rigore, non è mica da questi
particolari che si giudica un giocatore (De Gregori, 1982)).
Assessing a learner
The goal of learning is to find a model which is able to
generalize, i.e. able to return good predictions in contexts
with the same distribution but independent of the training set
How to estimate the quality of a model?
It is always possible to find models with such a complicate
structure that they have null training errors. Are these models
good?
Typically NOT. Since doing very well on the training set could
mean doing badly on new data.
This is the phenomenon of overfitting.
THM4: learning is challenging since data have to be used 1) for
creating prediction models and 2) for assessing them.
Bias and variance of a model
Estimation theory: mean-squared-error (a measure of the
generalization quality) can be written as
MSE = σ2
w + squared bias + variance
where
noise concerns the reality alone,
bias reflects the relation between reality and the learning
algorithm
variance concerns the learning algorithm alone.
This is purely theoretical since these quantities cannot be
measured ....
.. but useful to understand why and in which circumstances
learners work.
The bias/variance dilemma
Noise is all that cannot be learned from data
Bias measures the lack of representational power of the class
of hypotheses.
Too simple model ⇒ large bias ⇒ underfitting
Variance warns us against an excessive complexity of the
approximator.
Too complex model ⇒ large variance ⇒ overfitting
A neural network is less biased than a linear model but
inevitably more variant.
Averaging (e.g. bagging, boosting, random forests) is a good
cure for variance.
Bias/variance trade-off
complexity
generalization
error
Bias
Variance
Underfitting Overfitting
THM5: think in terms of bias/variance tradeoff. Think to your
preferred learning algorithm and discover how bias/variance is
managed.
The Ockam’s Razor (1825)
THM6: "Pluralitas non est ponenda sine neccesitate" i.e. one
should not increase, beyond what is necessary, the number of
entities required to explain anything.
This is the medieval rule of parsimony, or principle of
economy, known as Ockham’s razor.
In other terms the principle states that one should not make
more assumptions than the minimum needed.
It underlies all scientific modeling and theory building. It
admonishes us to choose from a set of otherwise equivalent
models the simplest one.
Be simple: "shave off" those concepts, variables or constructs
that are not really needed to explain the phenomenon.
Does the best exist?
Given a finite number of samples, are there any reasons to
prefer one learning algorithm over another?
If we make no assumption about the nature of the learning
task, can we expect any learning method to be superior or
inferior overall?
Can we even find an algorithm that is overall superior to (or
inferior to) random guessing?
The No Free Lunch Theorem answers NO to these questions.
No Free Lunch theorem
If the goal is to obtain good generalization performance, there
are no context-independent or usage-independent reasons
to favor one learning method over another.
If one algorithm seems to outperform another in a particular
situation, it is a consequence of its fit to the particular pattern
recognition problem, not the general superiority of the
algorithm.
The theorem also justifies the skeptiscism about studies that
demonstrate the overall superiority of a particular learning or
recognition algorithm.
If a learning method performs well over some set of problems,
then it must perform worse than average elsewhere. No
method can perform well throughout the full set of functions.
THM7: Every learning algorithm makes assumptions (most of the
times in implicit manner) and these make the difference.
Conclusion
Popper claimed that, if a theory is falsifiable (i.e. it can be
contradicted by an observation or the outcome of a physical
experiment), then it is scientific. Since prediction is the most
falsifiable aspect of science it is also the most scientific one.
Effective machine learning is an extension of statistics, in no
way an alternative.
Simplest (i.e. linear) model first.
Modelling is more an art than an automatic process... then
experience data analysts are more valuable than expensive
tools.
Expert knowledge matters..., data too
Understanding what is predictable is as important as trying to
predict it.
All models are wrong, some of them are useful.
All that we did not discuss...
Dimensionality reduction and feature selection
Causal inference
Unsupervised learning
Active learning
Spatio-temporal prediction
Nonstationary problems
Scalable machine learning
Control and robotics
Libraries and platforms (R, python, Weka)
Resources
A biased list ...:-)
Scoop-it
www.scoop.it/t/machine-learning-by-gianluca-bontempi
on machine learning
Scoop-it
www.scoop.it/t/probabilistic-reasoning-and-statistics
on Probabilistic reasoning, causal inference and statistics
MLG mlg.ulb.ac.be
MA course INFO-F-422 Statistical foundations of machine
learning
Handbook available on https://www.otexts.org

Mais conteúdo relacionado

Mais procurados

Machine learning presentation (razi)
Machine learning presentation (razi)Machine learning presentation (razi)
Machine learning presentation (razi)Rizwan Shaukat
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Marina Santini
 
Hedging Predictions in Machine Learning
Hedging Predictions in Machine LearningHedging Predictions in Machine Learning
Hedging Predictions in Machine Learningbutest
 
Classification of Machine Learning Algorithms
Classification of Machine Learning AlgorithmsClassification of Machine Learning Algorithms
Classification of Machine Learning AlgorithmsAM Publications
 
Soft computing
Soft computingSoft computing
Soft computingCSS
 
Classification and Regression
Classification and RegressionClassification and Regression
Classification and RegressionMegha Sharma
 
Simplified Fuzzy ARTMAP
Simplified Fuzzy ARTMAPSimplified Fuzzy ARTMAP
Simplified Fuzzy ARTMAPPradipBankar
 
Machine Learning
Machine LearningMachine Learning
Machine LearningRahul Kumar
 
Selecting the Right Type of Algorithm for Various Applications - Phdassistance
Selecting the Right Type of Algorithm for Various Applications - PhdassistanceSelecting the Right Type of Algorithm for Various Applications - Phdassistance
Selecting the Right Type of Algorithm for Various Applications - PhdassistancePhD Assistance
 
Automated Education Propositional Logic Tool (AEPLT): Used For Computation in...
Automated Education Propositional Logic Tool (AEPLT): Used For Computation in...Automated Education Propositional Logic Tool (AEPLT): Used For Computation in...
Automated Education Propositional Logic Tool (AEPLT): Used For Computation in...CSCJournals
 
Machine learning ppt
Machine learning ppt Machine learning ppt
Machine learning ppt Poojamanic
 
AI Use Cases: Special Attention on Semantic Segmentation
AI Use Cases: Special Attention on Semantic SegmentationAI Use Cases: Special Attention on Semantic Segmentation
AI Use Cases: Special Attention on Semantic SegmentationFrederick Apina
 
Applied Artificial Intelligence Unit 4 Semester 3 MSc IT Part 2 Mumbai Univer...
Applied Artificial Intelligence Unit 4 Semester 3 MSc IT Part 2 Mumbai Univer...Applied Artificial Intelligence Unit 4 Semester 3 MSc IT Part 2 Mumbai Univer...
Applied Artificial Intelligence Unit 4 Semester 3 MSc IT Part 2 Mumbai Univer...Madhav Mishra
 
Selecting the Right Type of Algorithm for Various Applications - Phdassistance
Selecting the Right Type of Algorithm for Various Applications - PhdassistanceSelecting the Right Type of Algorithm for Various Applications - Phdassistance
Selecting the Right Type of Algorithm for Various Applications - PhdassistancePhD Assistance
 
Introduction to AI - Second Lecture
Introduction to AI - Second LectureIntroduction to AI - Second Lecture
Introduction to AI - Second LectureWouter Beek
 

Mais procurados (20)

Machine learning presentation (razi)
Machine learning presentation (razi)Machine learning presentation (razi)
Machine learning presentation (razi)
 
Machine learning
Machine learningMachine learning
Machine learning
 
Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?Lecture 1: What is Machine Learning?
Lecture 1: What is Machine Learning?
 
Hedging Predictions in Machine Learning
Hedging Predictions in Machine LearningHedging Predictions in Machine Learning
Hedging Predictions in Machine Learning
 
Classification of Machine Learning Algorithms
Classification of Machine Learning AlgorithmsClassification of Machine Learning Algorithms
Classification of Machine Learning Algorithms
 
Soft computing
Soft computingSoft computing
Soft computing
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Classification and Regression
Classification and RegressionClassification and Regression
Classification and Regression
 
Simplified Fuzzy ARTMAP
Simplified Fuzzy ARTMAPSimplified Fuzzy ARTMAP
Simplified Fuzzy ARTMAP
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Selecting the Right Type of Algorithm for Various Applications - Phdassistance
Selecting the Right Type of Algorithm for Various Applications - PhdassistanceSelecting the Right Type of Algorithm for Various Applications - Phdassistance
Selecting the Right Type of Algorithm for Various Applications - Phdassistance
 
Automated Education Propositional Logic Tool (AEPLT): Used For Computation in...
Automated Education Propositional Logic Tool (AEPLT): Used For Computation in...Automated Education Propositional Logic Tool (AEPLT): Used For Computation in...
Automated Education Propositional Logic Tool (AEPLT): Used For Computation in...
 
Machine learning ppt
Machine learning ppt Machine learning ppt
Machine learning ppt
 
AI Use Cases: Special Attention on Semantic Segmentation
AI Use Cases: Special Attention on Semantic SegmentationAI Use Cases: Special Attention on Semantic Segmentation
AI Use Cases: Special Attention on Semantic Segmentation
 
International Journal of Engineering Inventions (IJEI)
International Journal of Engineering Inventions (IJEI)International Journal of Engineering Inventions (IJEI)
International Journal of Engineering Inventions (IJEI)
 
Applied Artificial Intelligence Unit 4 Semester 3 MSc IT Part 2 Mumbai Univer...
Applied Artificial Intelligence Unit 4 Semester 3 MSc IT Part 2 Mumbai Univer...Applied Artificial Intelligence Unit 4 Semester 3 MSc IT Part 2 Mumbai Univer...
Applied Artificial Intelligence Unit 4 Semester 3 MSc IT Part 2 Mumbai Univer...
 
Launching into machine learning
Launching into machine learningLaunching into machine learning
Launching into machine learning
 
Selecting the Right Type of Algorithm for Various Applications - Phdassistance
Selecting the Right Type of Algorithm for Various Applications - PhdassistanceSelecting the Right Type of Algorithm for Various Applications - Phdassistance
Selecting the Right Type of Algorithm for Various Applications - Phdassistance
 
Reasoning in AI
Reasoning in AIReasoning in AI
Reasoning in AI
 
Introduction to AI - Second Lecture
Introduction to AI - Second LectureIntroduction to AI - Second Lecture
Introduction to AI - Second Lecture
 

Destaque

Power of Code: What you don’t know about what you know
Power of Code: What you don’t know about what you knowPower of Code: What you don’t know about what you know
Power of Code: What you don’t know about what you knowcdathuraliya
 
07 history of cv vision paradigms - system - algorithms - applications - eva...
07  history of cv vision paradigms - system - algorithms - applications - eva...07  history of cv vision paradigms - system - algorithms - applications - eva...
07 history of cv vision paradigms - system - algorithms - applications - eva...zukun
 
Applying Reinforcement Learning for Network Routing
Applying Reinforcement Learning for Network RoutingApplying Reinforcement Learning for Network Routing
Applying Reinforcement Learning for Network Routingbutest
 
Graphical Models for chains, trees and grids
Graphical Models for chains, trees and gridsGraphical Models for chains, trees and grids
Graphical Models for chains, trees and gridspotaters
 
One Size Doesn't Fit All: The New Database Revolution
One Size Doesn't Fit All: The New Database RevolutionOne Size Doesn't Fit All: The New Database Revolution
One Size Doesn't Fit All: The New Database Revolutionmark madsen
 
Streamlining Technology to Reduce Complexity and Improve Productivity
Streamlining Technology to Reduce Complexity and Improve ProductivityStreamlining Technology to Reduce Complexity and Improve Productivity
Streamlining Technology to Reduce Complexity and Improve ProductivityKevin Fream
 
Machine Learning techniques
Machine Learning techniques Machine Learning techniques
Machine Learning techniques Jigar Patel
 
Pattern Recognition and Machine Learning : Graphical Models
Pattern Recognition and Machine Learning : Graphical ModelsPattern Recognition and Machine Learning : Graphical Models
Pattern Recognition and Machine Learning : Graphical Modelsbutest
 
Les outils de modélisation des Big Data
Les outils de modélisation des Big DataLes outils de modélisation des Big Data
Les outils de modélisation des Big DataKezhan SHI
 
graphical models for the Internet
graphical models for the Internetgraphical models for the Internet
graphical models for the Internetantiw
 
Nearest Neighbor Customer Insight
Nearest Neighbor Customer InsightNearest Neighbor Customer Insight
Nearest Neighbor Customer InsightMapR Technologies
 
Web Crawling and Reinforcement Learning
Web Crawling and Reinforcement LearningWeb Crawling and Reinforcement Learning
Web Crawling and Reinforcement LearningFrancesco Gadaleta
 
A real-time big data architecture for glasses detection using computer vision...
A real-time big data architecture for glasses detection using computer vision...A real-time big data architecture for glasses detection using computer vision...
A real-time big data architecture for glasses detection using computer vision...Alberto Fernandez Villan
 
A system to filter unwanted messages from osn user walls
A system to filter unwanted messages from osn user wallsA system to filter unwanted messages from osn user walls
A system to filter unwanted messages from osn user wallsIEEEFINALYEARPROJECTS
 
[PRML 3.1~3.2] Linear Regression / Bias-Variance Decomposition
[PRML 3.1~3.2] Linear Regression / Bias-Variance Decomposition [PRML 3.1~3.2] Linear Regression / Bias-Variance Decomposition
[PRML 3.1~3.2] Linear Regression / Bias-Variance Decomposition DongHyun Kwak
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spacesMounia Lalmas-Roelleke
 
Big Data Paradigm - Analysis, Application and Challenges
Big Data Paradigm - Analysis, Application and ChallengesBig Data Paradigm - Analysis, Application and Challenges
Big Data Paradigm - Analysis, Application and ChallengesUyoyo Edosio
 
Sourcing talent a key recruiting differentiator part 2 - the (Big) Data Lands...
Sourcing talent a key recruiting differentiator part 2 - the (Big) Data Lands...Sourcing talent a key recruiting differentiator part 2 - the (Big) Data Lands...
Sourcing talent a key recruiting differentiator part 2 - the (Big) Data Lands...Alexander Crépin
 
On cascading small decision trees
On cascading small decision treesOn cascading small decision trees
On cascading small decision treesJulià Minguillón
 

Destaque (20)

Supervised Approach to Extract Sentiments from Unstructured Text
Supervised Approach to Extract Sentiments from Unstructured TextSupervised Approach to Extract Sentiments from Unstructured Text
Supervised Approach to Extract Sentiments from Unstructured Text
 
Power of Code: What you don’t know about what you know
Power of Code: What you don’t know about what you knowPower of Code: What you don’t know about what you know
Power of Code: What you don’t know about what you know
 
07 history of cv vision paradigms - system - algorithms - applications - eva...
07  history of cv vision paradigms - system - algorithms - applications - eva...07  history of cv vision paradigms - system - algorithms - applications - eva...
07 history of cv vision paradigms - system - algorithms - applications - eva...
 
Applying Reinforcement Learning for Network Routing
Applying Reinforcement Learning for Network RoutingApplying Reinforcement Learning for Network Routing
Applying Reinforcement Learning for Network Routing
 
Graphical Models for chains, trees and grids
Graphical Models for chains, trees and gridsGraphical Models for chains, trees and grids
Graphical Models for chains, trees and grids
 
One Size Doesn't Fit All: The New Database Revolution
One Size Doesn't Fit All: The New Database RevolutionOne Size Doesn't Fit All: The New Database Revolution
One Size Doesn't Fit All: The New Database Revolution
 
Streamlining Technology to Reduce Complexity and Improve Productivity
Streamlining Technology to Reduce Complexity and Improve ProductivityStreamlining Technology to Reduce Complexity and Improve Productivity
Streamlining Technology to Reduce Complexity and Improve Productivity
 
Machine Learning techniques
Machine Learning techniques Machine Learning techniques
Machine Learning techniques
 
Pattern Recognition and Machine Learning : Graphical Models
Pattern Recognition and Machine Learning : Graphical ModelsPattern Recognition and Machine Learning : Graphical Models
Pattern Recognition and Machine Learning : Graphical Models
 
Les outils de modélisation des Big Data
Les outils de modélisation des Big DataLes outils de modélisation des Big Data
Les outils de modélisation des Big Data
 
graphical models for the Internet
graphical models for the Internetgraphical models for the Internet
graphical models for the Internet
 
Nearest Neighbor Customer Insight
Nearest Neighbor Customer InsightNearest Neighbor Customer Insight
Nearest Neighbor Customer Insight
 
Web Crawling and Reinforcement Learning
Web Crawling and Reinforcement LearningWeb Crawling and Reinforcement Learning
Web Crawling and Reinforcement Learning
 
A real-time big data architecture for glasses detection using computer vision...
A real-time big data architecture for glasses detection using computer vision...A real-time big data architecture for glasses detection using computer vision...
A real-time big data architecture for glasses detection using computer vision...
 
A system to filter unwanted messages from osn user walls
A system to filter unwanted messages from osn user wallsA system to filter unwanted messages from osn user walls
A system to filter unwanted messages from osn user walls
 
[PRML 3.1~3.2] Linear Regression / Bias-Variance Decomposition
[PRML 3.1~3.2] Linear Regression / Bias-Variance Decomposition [PRML 3.1~3.2] Linear Regression / Bias-Variance Decomposition
[PRML 3.1~3.2] Linear Regression / Bias-Variance Decomposition
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spaces
 
Big Data Paradigm - Analysis, Application and Challenges
Big Data Paradigm - Analysis, Application and ChallengesBig Data Paradigm - Analysis, Application and Challenges
Big Data Paradigm - Analysis, Application and Challenges
 
Sourcing talent a key recruiting differentiator part 2 - the (Big) Data Lands...
Sourcing talent a key recruiting differentiator part 2 - the (Big) Data Lands...Sourcing talent a key recruiting differentiator part 2 - the (Big) Data Lands...
Sourcing talent a key recruiting differentiator part 2 - the (Big) Data Lands...
 
On cascading small decision trees
On cascading small decision treesOn cascading small decision trees
On cascading small decision trees
 

Semelhante a Some Take-Home Message about Machine Learning

On Machine Learning and Data Mining
On Machine Learning and Data MiningOn Machine Learning and Data Mining
On Machine Learning and Data Miningbutest
 
Introduction AI ML& Mathematicals of ML.pdf
Introduction AI ML& Mathematicals of ML.pdfIntroduction AI ML& Mathematicals of ML.pdf
Introduction AI ML& Mathematicals of ML.pdfGandhiMathy6
 
Machine learning interview questions and answers
Machine learning interview questions and answersMachine learning interview questions and answers
Machine learning interview questions and answerskavinilavuG
 
Machine Learning basics
Machine Learning basicsMachine Learning basics
Machine Learning basicsNeeleEilers
 
A Machine Learning Primer,
A Machine Learning Primer,A Machine Learning Primer,
A Machine Learning Primer,Eirini Ntoutsi
 
notes as .ppt
notes as .pptnotes as .ppt
notes as .pptbutest
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401butest
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401butest
 
TPCMFinalACone
TPCMFinalAConeTPCMFinalACone
TPCMFinalAConeAdam Cone
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)butest
 
Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationAnkit Gupta
 
machine learning
machine learningmachine learning
machine learningMounisha A
 
A Few Useful Things to Know about Machine Learning
A Few Useful Things to Know about Machine LearningA Few Useful Things to Know about Machine Learning
A Few Useful Things to Know about Machine Learningnep_test_account
 
introduction to machin learning
introduction to machin learningintroduction to machin learning
introduction to machin learningnilimapatel6
 
White-Paper-the-AI-behind-vectra-AI.pdf
White-Paper-the-AI-behind-vectra-AI.pdfWhite-Paper-the-AI-behind-vectra-AI.pdf
White-Paper-the-AI-behind-vectra-AI.pdfBoris647814
 

Semelhante a Some Take-Home Message about Machine Learning (20)

On Machine Learning and Data Mining
On Machine Learning and Data MiningOn Machine Learning and Data Mining
On Machine Learning and Data Mining
 
Introduction AI ML& Mathematicals of ML.pdf
Introduction AI ML& Mathematicals of ML.pdfIntroduction AI ML& Mathematicals of ML.pdf
Introduction AI ML& Mathematicals of ML.pdf
 
Machine learning interview questions and answers
Machine learning interview questions and answersMachine learning interview questions and answers
Machine learning interview questions and answers
 
PREDICT 422 - Module 1.pptx
PREDICT 422 - Module 1.pptxPREDICT 422 - Module 1.pptx
PREDICT 422 - Module 1.pptx
 
AI Presentation 1
AI Presentation 1AI Presentation 1
AI Presentation 1
 
Machine Learning basics
Machine Learning basicsMachine Learning basics
Machine Learning basics
 
A Machine Learning Primer,
A Machine Learning Primer,A Machine Learning Primer,
A Machine Learning Primer,
 
DEFENSE
DEFENSEDEFENSE
DEFENSE
 
notes as .ppt
notes as .pptnotes as .ppt
notes as .ppt
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401
 
Time ser
Time serTime ser
Time ser
 
TPCMFinalACone
TPCMFinalAConeTPCMFinalACone
TPCMFinalACone
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)
 
Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning Presentation
 
machine learning
machine learningmachine learning
machine learning
 
A Few Useful Things to Know about Machine Learning
A Few Useful Things to Know about Machine LearningA Few Useful Things to Know about Machine Learning
A Few Useful Things to Know about Machine Learning
 
introduction to machin learning
introduction to machin learningintroduction to machin learning
introduction to machin learning
 
i2ml3e-chap1.pptx
i2ml3e-chap1.pptxi2ml3e-chap1.pptx
i2ml3e-chap1.pptx
 
White-Paper-the-AI-behind-vectra-AI.pdf
White-Paper-the-AI-behind-vectra-AI.pdfWhite-Paper-the-AI-behind-vectra-AI.pdf
White-Paper-the-AI-behind-vectra-AI.pdf
 

Mais de Gianluca Bontempi

A statistical criterion for reducing indeterminacy in linear causal modeling
A statistical criterion for reducing indeterminacy in linear causal modelingA statistical criterion for reducing indeterminacy in linear causal modeling
A statistical criterion for reducing indeterminacy in linear causal modelingGianluca Bontempi
 
Adaptive model selection in Wireless Sensor Networks
Adaptive model selection in Wireless Sensor NetworksAdaptive model selection in Wireless Sensor Networks
Adaptive model selection in Wireless Sensor NetworksGianluca Bontempi
 
Combining Lazy Learning, Racing and Subsampling for Effective Feature Selection
Combining Lazy Learning, Racing and Subsampling for Effective Feature SelectionCombining Lazy Learning, Racing and Subsampling for Effective Feature Selection
Combining Lazy Learning, Racing and Subsampling for Effective Feature SelectionGianluca Bontempi
 
A model-based relevance estimation approach for feature selection in microarr...
A model-based relevance estimation approach for feature selection in microarr...A model-based relevance estimation approach for feature selection in microarr...
A model-based relevance estimation approach for feature selection in microarr...Gianluca Bontempi
 
Machine Learning Strategies for Time Series Prediction
Machine Learning Strategies for Time Series PredictionMachine Learning Strategies for Time Series Prediction
Machine Learning Strategies for Time Series PredictionGianluca Bontempi
 
Feature selection and microarray data
Feature selection and microarray dataFeature selection and microarray data
Feature selection and microarray dataGianluca Bontempi
 
A Monte Carlo strategy for structure multiple-step-head time series prediction
A Monte Carlo strategy for structure multiple-step-head time series predictionA Monte Carlo strategy for structure multiple-step-head time series prediction
A Monte Carlo strategy for structure multiple-step-head time series predictionGianluca Bontempi
 
FP7 evaluation & selection: the point of view of an evaluator
FP7 evaluation & selection: the point of view of an evaluatorFP7 evaluation & selection: the point of view of an evaluator
FP7 evaluation & selection: the point of view of an evaluatorGianluca Bontempi
 
Local modeling in regression and time series prediction
Local modeling in regression and time series predictionLocal modeling in regression and time series prediction
Local modeling in regression and time series predictionGianluca Bontempi
 
Perspective of feature selection in bioinformatics
Perspective of feature selection in bioinformaticsPerspective of feature selection in bioinformatics
Perspective of feature selection in bioinformaticsGianluca Bontempi
 
Computational Intelligence for Time Series Prediction
Computational Intelligence for Time Series PredictionComputational Intelligence for Time Series Prediction
Computational Intelligence for Time Series PredictionGianluca Bontempi
 

Mais de Gianluca Bontempi (11)

A statistical criterion for reducing indeterminacy in linear causal modeling
A statistical criterion for reducing indeterminacy in linear causal modelingA statistical criterion for reducing indeterminacy in linear causal modeling
A statistical criterion for reducing indeterminacy in linear causal modeling
 
Adaptive model selection in Wireless Sensor Networks
Adaptive model selection in Wireless Sensor NetworksAdaptive model selection in Wireless Sensor Networks
Adaptive model selection in Wireless Sensor Networks
 
Combining Lazy Learning, Racing and Subsampling for Effective Feature Selection
Combining Lazy Learning, Racing and Subsampling for Effective Feature SelectionCombining Lazy Learning, Racing and Subsampling for Effective Feature Selection
Combining Lazy Learning, Racing and Subsampling for Effective Feature Selection
 
A model-based relevance estimation approach for feature selection in microarr...
A model-based relevance estimation approach for feature selection in microarr...A model-based relevance estimation approach for feature selection in microarr...
A model-based relevance estimation approach for feature selection in microarr...
 
Machine Learning Strategies for Time Series Prediction
Machine Learning Strategies for Time Series PredictionMachine Learning Strategies for Time Series Prediction
Machine Learning Strategies for Time Series Prediction
 
Feature selection and microarray data
Feature selection and microarray dataFeature selection and microarray data
Feature selection and microarray data
 
A Monte Carlo strategy for structure multiple-step-head time series prediction
A Monte Carlo strategy for structure multiple-step-head time series predictionA Monte Carlo strategy for structure multiple-step-head time series prediction
A Monte Carlo strategy for structure multiple-step-head time series prediction
 
FP7 evaluation & selection: the point of view of an evaluator
FP7 evaluation & selection: the point of view of an evaluatorFP7 evaluation & selection: the point of view of an evaluator
FP7 evaluation & selection: the point of view of an evaluator
 
Local modeling in regression and time series prediction
Local modeling in regression and time series predictionLocal modeling in regression and time series prediction
Local modeling in regression and time series prediction
 
Perspective of feature selection in bioinformatics
Perspective of feature selection in bioinformaticsPerspective of feature selection in bioinformatics
Perspective of feature selection in bioinformatics
 
Computational Intelligence for Time Series Prediction
Computational Intelligence for Time Series PredictionComputational Intelligence for Time Series Prediction
Computational Intelligence for Time Series Prediction
 

Último

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...gajnagarg
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823
 
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night StandCall Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...gajnagarg
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachBoston Institute of Analytics
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Pooja Nehwal
 
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...gajnagarg
 

Último (20)

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night StandCall Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning Approach
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...
 

Some Take-Home Message about Machine Learning

  • 1. Some Take-Home Messages (THM) about ML.... Data Science Meetup Gianluca Bontempi Interuniversity Institute of Bioinformatics in Brussels, (IB)2 Machine Learning Group, Computer Science Department, ULB mlg.ulb.ac.be, ibsquare.be May 20, 2016
  • 2. Introducing myself 1992: Computer science engineer (Politecnico di Milano, Italy), 1994: Researcher in robotics in IRST, Trento, Italy, 1995: Researcher in IRIDIA, ULB, Brussels, 1996-97: Researcher in IDSIA, Lugano, Switzerland, 1998-2000: Marie Curie fellowship in IRIDIA, ULB, 2000-2001: Scientist in Philips Research, Eindhoven, The Netherlands, 2001-2002: Scientist in IMEC, Microelectronics Institute, Leuven, Belgium, since 2002: professor in Machine Learning, Modeling and Simulation, Bioinformatics in ULB Computer Science Dept., since 2004: head of the ULB Machine Learning Group (MLG). since 2013: director of the Interuniversity Institute of Bioinformatics in Brussels (IB)2, ibsquare.be.
  • 3. What is machine learning? Machine learning is that domain of computational intelligence which is concerned with the question of how to construct computer programs that automatically improve with experience. (Mitchell, 97) Reductionist attitude: ML is just a buzzword which equates to statistics plus marketing Positive attitude: ML paved the way to the treatment of real problems related to data analysis, sometimes overlooked by statisticians (nonlinearity, classification, pattern recognition, missing variables, adaptivity, optimization, massive datasets, data management, causality, representation of knowledge, parallelisation) Interdisciplinary attitude: ML should have its roots on statistics and complements it by focusing on: algorithmic issues, computational efficiency, data engineering.
  • 5. Prediction is pervasive ... Predict whether you will like a book/movie (collaborative filtering) credit applicants as low, medium, or high risk. which home telephone lines are used for Internet access. which customers are likely to stop being customers (churn). the value of a piece of real estate which telephone subscribers will order a 4G service which CARREFOUR clients will be more interested to a discount in Italian products. the probability that a company is employing black workers (anti-fraud detection) the survival risk of a patient on the basis of a genetic signature the probability of a crime in an urban area. the key of a cryptographic algorithm on the basis of power consumption
  • 6. Supervised learning First assumption: learning is essentially about prediction ! Second assumption: reality is stochastic, dependency and uncertainty are well described by conditional probability. PREDICTION TARGET TRAINING DATASET INPUT OUTPUT ERROR PREDICTION MODEL measurable features (inputs) measurable target variables (outputs) and accuracy criteria data (in God we trust, all the others must bring data) THM1: formalizing a problem as a prediction problem is often the most important contribution of a data scientist!
  • 7. It is all about ... 1 Probabilistic modeling it formalizes uncertainty and dependency (regression function) notions of entropy and information relevant and irrelevant features (e.g. Markov blanket notion) Bayesian networks, causal reasoning 2 Estimation bias/variance notions generalization issues: underfitting vs overfitting Bayesian, frequentist, decision theory validation combination/averaging of estimators (bagging, boosting) 3 Optimization Maximum likelihood, least squares, backpropagation Dual problems (SVM) L1, L2 norm (lasso) 4 Computer science implementation, algorithms parallelism, scalability data management
  • 8. So ... how to teach machine learning? Focus on ... Formalism ? Algorithms ? Coding ? Applications ? Of course all is important but what is the essence, what is common to the exploding number of algorithms, techniques, fancy applications?
  • 9. Estimation STOCHASTIC PHENOMENON DATA LEARNER DATA DATA MODEL, PREDICTION LEARNER MODEL, PREDICTION LEARNER MODEL, PREDICTION THM2: a predictor is an estimator, i.e. an algorithm (black-box) which takes data and returns a prediction. THM3: reality is stochastic, so data is stochastic and prediction is stochastic.
  • 10. Assessing in an un uncertain world (Baggio, 1998) non aver paura di sbagliare un calcio di rigore, non è mica da questi particolari che si giudica un giocatore (De Gregori, 1982)).
  • 11. Assessing a learner The goal of learning is to find a model which is able to generalize, i.e. able to return good predictions in contexts with the same distribution but independent of the training set How to estimate the quality of a model? It is always possible to find models with such a complicate structure that they have null training errors. Are these models good? Typically NOT. Since doing very well on the training set could mean doing badly on new data. This is the phenomenon of overfitting. THM4: learning is challenging since data have to be used 1) for creating prediction models and 2) for assessing them.
  • 12. Bias and variance of a model Estimation theory: mean-squared-error (a measure of the generalization quality) can be written as MSE = σ2 w + squared bias + variance where noise concerns the reality alone, bias reflects the relation between reality and the learning algorithm variance concerns the learning algorithm alone. This is purely theoretical since these quantities cannot be measured .... .. but useful to understand why and in which circumstances learners work.
  • 13. The bias/variance dilemma Noise is all that cannot be learned from data Bias measures the lack of representational power of the class of hypotheses. Too simple model ⇒ large bias ⇒ underfitting Variance warns us against an excessive complexity of the approximator. Too complex model ⇒ large variance ⇒ overfitting A neural network is less biased than a linear model but inevitably more variant. Averaging (e.g. bagging, boosting, random forests) is a good cure for variance.
  • 14. Bias/variance trade-off complexity generalization error Bias Variance Underfitting Overfitting THM5: think in terms of bias/variance tradeoff. Think to your preferred learning algorithm and discover how bias/variance is managed.
  • 15. The Ockam’s Razor (1825) THM6: "Pluralitas non est ponenda sine neccesitate" i.e. one should not increase, beyond what is necessary, the number of entities required to explain anything. This is the medieval rule of parsimony, or principle of economy, known as Ockham’s razor. In other terms the principle states that one should not make more assumptions than the minimum needed. It underlies all scientific modeling and theory building. It admonishes us to choose from a set of otherwise equivalent models the simplest one. Be simple: "shave off" those concepts, variables or constructs that are not really needed to explain the phenomenon.
  • 16. Does the best exist? Given a finite number of samples, are there any reasons to prefer one learning algorithm over another? If we make no assumption about the nature of the learning task, can we expect any learning method to be superior or inferior overall? Can we even find an algorithm that is overall superior to (or inferior to) random guessing? The No Free Lunch Theorem answers NO to these questions.
  • 17. No Free Lunch theorem If the goal is to obtain good generalization performance, there are no context-independent or usage-independent reasons to favor one learning method over another. If one algorithm seems to outperform another in a particular situation, it is a consequence of its fit to the particular pattern recognition problem, not the general superiority of the algorithm. The theorem also justifies the skeptiscism about studies that demonstrate the overall superiority of a particular learning or recognition algorithm. If a learning method performs well over some set of problems, then it must perform worse than average elsewhere. No method can perform well throughout the full set of functions. THM7: Every learning algorithm makes assumptions (most of the times in implicit manner) and these make the difference.
  • 18. Conclusion Popper claimed that, if a theory is falsifiable (i.e. it can be contradicted by an observation or the outcome of a physical experiment), then it is scientific. Since prediction is the most falsifiable aspect of science it is also the most scientific one. Effective machine learning is an extension of statistics, in no way an alternative. Simplest (i.e. linear) model first. Modelling is more an art than an automatic process... then experience data analysts are more valuable than expensive tools. Expert knowledge matters..., data too Understanding what is predictable is as important as trying to predict it. All models are wrong, some of them are useful.
  • 19. All that we did not discuss... Dimensionality reduction and feature selection Causal inference Unsupervised learning Active learning Spatio-temporal prediction Nonstationary problems Scalable machine learning Control and robotics Libraries and platforms (R, python, Weka)
  • 20. Resources A biased list ...:-) Scoop-it www.scoop.it/t/machine-learning-by-gianluca-bontempi on machine learning Scoop-it www.scoop.it/t/probabilistic-reasoning-and-statistics on Probabilistic reasoning, causal inference and statistics MLG mlg.ulb.ac.be MA course INFO-F-422 Statistical foundations of machine learning Handbook available on https://www.otexts.org