2. Disclaimer
Images used in this presentation are collated from
different internet sources. The content is used
only for the sake of understanding the concepts of
Machine Learning. Exercise your own judgement
regarding the suitability of the content.
2
17. What is Learning?
• Learning is any process by which a system improves performance
from experience- Herbert Simon
• Learning is not knowing what you do not know but
strengthening/enforcing what you already know.
• Learning is a repetitive process, process of refinement/
improvement.
• Learning is trying to understand (concept), recognize patterns,
trends, anomalies, summarizing, quantifying, qualifying a given
situation.
• Learning is through experience, through example, presented as
data.
17
21. What is Machine Learning?
21
Machine Learning is an application of AI that provides systems the ability
to automatically learn and improve from experience without being
explicitly programmed.
A computer program is said to learn from Experience E with
respect to some Task T and some Performance measure P, if
its performance on T, as measured by P, improves with
experience E.
Eg. Smart Homes
T: Estimate the desired Temperature
E: Learning from temperature dataset
P: Accuracy of the desired temperature
22. Defining the Learning Task
Improve on task T, with respect to Performance metric P, based on Experience E
22
Sr. No.
TASK (T) PERFORMANCE (P) EXPERIENCE (E)
1 Playing Checkers % of games won against an
arbitrary opponent.
Playing practice games against itself.
2 Recognizing hand written words % of words correctly
classified.
Databased of human-labelled images of
handwritten words.
3 Driving on four-lane highways
using vision sensors
Average distance travelled
before a human-judged
error.
A sequence of images and steering
commands recorded while observing a
human driver.
4 Categorize email messages as
spam or legitimate
% of email messages
correctly classified.
Database of emails, some with human-
given labels.
27. Dimensionality Reduction
27
The number of training examples required increases
exponentially with dimensionality.
• The classifier’s performance usually will degrade for a large number of
features!
29. Discrete Data
29
•You can count the data. It is usually units counted in whole
numbers.
•The values cannot be divided into smaller pieces and add
additional meaning.
•You cannot measure the data. By nature, discrete data cannot be
measured at all. For example, you can measure your weight with the
help of a scale. So, your weight is not a discrete data.
•It has a limited number of possible values e.g. days of the month.
•Discrete data is graphically displayed by a bar graph.
31. Continuous Data
31
• In general, continuous variables are not counted.
• The values can be subdivided into smaller and smaller
pieces and they have additional meaning.
• The continuous data is measurable.
• It has an infinite number of possible values within an
interval.
• Continuous data is graphically displayed by histograms.
36. 3 Key Components of Machine Learning
Representation: how to represent knowledge ? (Select a
model to represent data)
Examples include decision trees, sets of rules, instances,
graphical models, neural networks, support vector
machines, model ensembles..
Evaluation: optimize the objective function, given
different values for the model parameters
Confusion matrix, accuracy, prediction and recall,
squared error, likelihood, posterior probability, cost,
margin, entropy
Optimization: estimate model parameters using
optimization methods-Greedy Search
Variables, parameters, or generalized loss/error
36
43. Reinforcement Learning
43
Reinforcement Learning is learning what to do and how to map
situations to actions. The end result is to maximize the numerical
reward signal. The learner is not told which action to take, but
instead must discover which action will yield the maximum
reward.
46. 46
ERROR = noise2 + bias2 + variance
Unavoidable
error
Error due to
incorrect
assumptions
Error due to
variance of
training samples
High Bias leads to Underfitting
57. Classification Accuracy: Estimating Error Rates
57
Partition: Training-and-testing
use two independent data sets, e.g., training set (2/3), test set(1/3)
used for data set with large number of samples
Cross-validation
divide the data set into k subsamples
use k-1 subsamples as training data and one sub-sample as test
data --- k-fold cross-validation
for data set with moderate size
Bootstrapping (leave-one-out)
for small size data
60. Classification Accuracy: Estimating Error Rates
60
• There are several factors affecting the performance:
Types of training provided
The form and extent of any initial background knowledge
The type of feedback provided
The learning algorithms used
• Two important factors:
Modeling
Optimization