SlideShare uma empresa Scribd logo
1 de 19
Baixar para ler offline
Introduction to Machine
       Learning
                        Lecture 8
 Instance Based Learning and Case-Based Reasoning


                     Albert Orriols i Puig
                    aorriols@salle.url.edu
                        i l @ ll       ld

           Artificial Intelligence – Machine Learning
               Enginyeria i Arquitectura La Salle
                   gy           q
                      Universitat Ramon Llull
Recap of Lecture 7
        kNN
                             15-NN                      1-NN




        Key aspects
                Value of k
                Distance functions


                                                               Slide 2
Artificial Intelligence              Machine Learning
Recap of Lecture 7
        Where is learning in kNN?
                        g
                Retrieval system
                No l b l
                N global model
                           dl
                No generalization
                …
                No learning!
                 o ea     g


        But till i bl t
        B t still, it is able to create accurate classification
                                     t        tl      ifi ti
        models




                                                                  Slide 3
Artificial Intelligence             Machine Learning
Today’s Agenda


        Formalizing the framework: From kNN to CBR
        Incorporating learning in different phases:
                Learn prototypes
                Organize the memory in clusters
                Learn the best distance function
                Provide explanations




                                                      Slide 4
Artificial Intelligence            Machine Learning
From kNN to CBR

        kNN provides a retrieving system
        Much work on different phases of kNN
                Prototype selection
                Distance function selection
                …
        CBR provides a general framework based on kNN




                                                          Slide 5
Artificial Intelligence                Machine Learning
Schema of CBR

 CBR cycle
                                                      Select a
(Aamodt &
                                                      solution
Plaza,
Plaza 1994)
                                                      Reuse
                           Similarity
                            function
                                                                                 Revise the
                                                                                  solution



                                                                                  Solution
                                                                  Revise
           Problem             Retrieve                Case
                                                      Memory




                                                      Retain



  Coherence and
                              Structure and                        Retain the
 relevance of the
                          agrupation of the cases                new knowledge
     attributes




                                                                                  Slide 6
Artificial Intelligence                      Machine Learning
Phases of CBR
     Five key phases
             Preprocess the training instance
                      So that it meets the requirements of the system
             Retrieve
                      Use
                      U kNN with the selected distance function
                             ih h      l    d di       f    i
             Reuse
                      Vote-based scheme
             Revise
                      Adapt the solution if necessary
             Retain
                      Remove examples from or add examples to
                      the case memory


                                                                        Slide 7
Artificial Intelligence                   Machine Learning
Challenges in CBR

    Hot areas
            Reduce the cost of matching
                     Reduce the total number of examples in the case memory
                     Organize the case memory in clusters and only consult examples
                     O     i th               ilt           dl          lt      l
                     of some clusters
            Automatically create distance functions that are suited to your
            problem
            Extraction of explanations:
                     CBR does not extract legible models (actually, does not learn any
                     model)
                          )




                                                                               Slide 8
Artificial Intelligence                   Machine Learning
Prototype Selection
        Training data sets contain a large number of instances
               g                        g
                Increase the prediction time
                May
                M contain noisy i t
                     ti     i instances


        Prototype selection
                Select the representative examples to form the case base
                Remove all the other examples


        How?
                Learn which examples are the ones that maximize CBR
                accuracy


                                                                           Slide 9
Artificial Intelligence                Machine Learning
Prototype Selection
                                                               Possible sets of prototypes



                                                                                …
                              Training             Sel.              Sel.              Sel.
Training
                              Data set            Proto 1           Proto 2           Proto 3
Data set
                 Split the
               training set
                                                           How do we know which
                                                           is th b t S l ti
                                                           i the best Selection of
                                                                                 f
                                                                 Prototypes?
Validation
    set

                                           KNN
                           Test data set


         Does it sound familiar to you?
                 Problem: Search for the best SP
                 It s
                 It’s just an optimization problem
                 For robustness, use cross-validation or similar validation procedures
                                                                                                Slide 10
 Artificial Intelligence                    Machine Learning
Prototype Selection
        Optimization methods used so far
         p
                Genetic algorithms (Holland, 75)
                Genetic Programming (Koza et al., 1989)
                G   ti P        i (K       tl
                Grammar Evolution (Ryan & O’Neill, 1998)




                                                           Slide 11
Artificial Intelligence               Machine Learning
Case-Based Memory Clustering
        Training data sets contain a large number of instances
               g                        g
                Clustering: Place instances in different clusters
                Only t i
                O l retrieve from the same cluster or clusters that are
                             f    th        lt         lt      th t
                close to you




                                                                          Slide 12
Artificial Intelligence                 Machine Learning
Case-Based Memory Clustering
Retrieve phase
                                          Reuse          Reuse phase
1. Compare with all the prototypes
                                                         Propose a solution with the
2. Compare only with the examples
                                                         retrieved cases
   of the closest cluster




                                         Case
                   Retrieve
                   R ti                                                Revise
                                                                       Ri
                                        Memory




 Retain phase                                           Revise phase
 Update the organization.                               Revise if the solution is
 It may imply the update of the
      y py         p                                    p
                                                        potentially valid
                                                                  y
                                          Retain
 clusters
                                                                                    Slide 13
   Artificial Intelligence           Machine Learning
Generation of Distance Functions
        How does the distance function influences learning?
                                                         g




                It may be the key between success and failure!


                                                                 Slide 14
Artificial Intelligence               Machine Learning
Generation of Distance Functions
        Can I find a distance function that makes kNN perform
                                                      p
        the best in all cases?
                No way Actually, NFL announces it (Wolpert 1992)
                   way. Actually                  (Wolpert,
                Different distances suited for different domains


        May I try to create a new distance function for each
        specific problem?
                Of course. Again, an optimization problem




                                                                   Slide 15
Artificial Intelligence                Machine Learning
Generation of Distance Functions
        Split the training data set into
                Training t’
                T i i set’
                                                                           Optimization problem
                Validation set
                                                                             Assume a parametric form
                                                                             Optimize the parameters of the
                                   Validation                                underlying function
                                       set
                                                                           Being more ambitious?
                     Dist.                 Dist.         Dist.
                   function1             function2     functionn
                                                                             Do not assume any parametric
                                                …                            form
                                                                             Optimize both the function
                                                                             structure and the parameters
                                                kNN
                          Training
                                                                           Examples:
                          Data set‘
                                                                             (
                                                                             (Fornells et al., 2005)
                                                                                             ,     )
                                                                             (Camps et al., 2003)
                          error1     error2           errorn



                                                                                                       Slide 16
Artificial Intelligence                                 Machine Learning
Extraction of Explanations
        One of the main drawbacks of CBR is that it does not provide
                                                             p
        any explanation
                Prediction based on nearest neighbors


        New techniques to provide explanations
                Based on used instances
                Building of partial models


        Not studied in more detail here




                                                                 Slide 17
Artificial Intelligence                      Machine Learning
Next Class

        Probabilistic-based learning




                                                 Slide 18
Artificial Intelligence       Machine Learning
Introduction to Machine
       Learning
                       Lecture 8
Instance Based Learning and Case-Based Reasoning


                    Albert Orriols i Puig
                   aorriols@salle.url.edu
                       i l @ ll       ld

          Artificial Intelligence – Machine Learning
              Enginyeria i Arquitectura La Salle
                  gy           q
                     Universitat Ramon Llull

Mais conteúdo relacionado

Mais procurados

Lecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART IILecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART IIAlbert Orriols-Puig
 
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...Albert Orriols-Puig
 
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...Albert Orriols-Puig
 
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSHIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSAlbert Orriols-Puig
 
Lecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryLecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryAlbert Orriols-Puig
 
A New Model for Credit Approval Problems a Neuro Genetic System with Quantum ...
A New Model for Credit Approval Problems a Neuro Genetic System with Quantum ...A New Model for Credit Approval Problems a Neuro Genetic System with Quantum ...
A New Model for Credit Approval Problems a Neuro Genetic System with Quantum ...Anderson Pinho
 
SOIAM (SOINN-AM)
SOIAM (SOINN-AM)SOIAM (SOINN-AM)
SOIAM (SOINN-AM)SOINN Inc.
 
MATHEON D-Day: Numerical simulation of integrated circuits for future chip ge...
MATHEON D-Day: Numerical simulation of integrated circuits for future chip ge...MATHEON D-Day: Numerical simulation of integrated circuits for future chip ge...
MATHEON D-Day: Numerical simulation of integrated circuits for future chip ge...Dagmar Monett
 

Mais procurados (20)

Lecture18
Lecture18Lecture18
Lecture18
 
Lecture24
Lecture24Lecture24
Lecture24
 
Lecture11 - neural networks
Lecture11 - neural networksLecture11 - neural networks
Lecture11 - neural networks
 
Lecture5 - C4.5
Lecture5 - C4.5Lecture5 - C4.5
Lecture5 - C4.5
 
Lecture20
Lecture20Lecture20
Lecture20
 
Lecture17
Lecture17Lecture17
Lecture17
 
Lecture1 - Machine Learning
Lecture1 - Machine LearningLecture1 - Machine Learning
Lecture1 - Machine Learning
 
Lecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART IILecture15 - Advances topics on association rules PART II
Lecture15 - Advances topics on association rules PART II
 
Lecture2 - Machine Learning
Lecture2 - Machine LearningLecture2 - Machine Learning
Lecture2 - Machine Learning
 
Lecture12 - SVM
Lecture12 - SVMLecture12 - SVM
Lecture12 - SVM
 
Lecture19
Lecture19Lecture19
Lecture19
 
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...
 
Lecture23
Lecture23Lecture23
Lecture23
 
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
 
Lecture21
Lecture21Lecture21
Lecture21
 
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCSHIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
 
Lecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-TheoryLecture9 - Bayesian-Decision-Theory
Lecture9 - Bayesian-Decision-Theory
 
A New Model for Credit Approval Problems a Neuro Genetic System with Quantum ...
A New Model for Credit Approval Problems a Neuro Genetic System with Quantum ...A New Model for Credit Approval Problems a Neuro Genetic System with Quantum ...
A New Model for Credit Approval Problems a Neuro Genetic System with Quantum ...
 
SOIAM (SOINN-AM)
SOIAM (SOINN-AM)SOIAM (SOINN-AM)
SOIAM (SOINN-AM)
 
MATHEON D-Day: Numerical simulation of integrated circuits for future chip ge...
MATHEON D-Day: Numerical simulation of integrated circuits for future chip ge...MATHEON D-Day: Numerical simulation of integrated circuits for future chip ge...
MATHEON D-Day: Numerical simulation of integrated circuits for future chip ge...
 

Destaque

HAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsHAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsAlbert Orriols-Puig
 
Lecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rulesLecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rulesAlbert Orriols-Puig
 
Lecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART IIILecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART IIIAlbert Orriols-Puig
 
Lecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceLecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceAlbert Orriols-Puig
 
Incremental Learning using WEKA
Incremental Learning using WEKAIncremental Learning using WEKA
Incremental Learning using WEKAvrohit13
 
Radial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and DhanashriRadial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and Dhanashrisheetal katkar
 
Neural Networks: Radial Bases Functions (RBF)
Neural Networks: Radial Bases Functions (RBF)Neural Networks: Radial Bases Functions (RBF)
Neural Networks: Radial Bases Functions (RBF)Mostafa G. M. Mostafa
 
Machine Learning and Data Mining: 13 Nearest Neighbor and Bayesian Classifiers
Machine Learning and Data Mining: 13 Nearest Neighbor and Bayesian ClassifiersMachine Learning and Data Mining: 13 Nearest Neighbor and Bayesian Classifiers
Machine Learning and Data Mining: 13 Nearest Neighbor and Bayesian ClassifiersPier Luca Lanzi
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligenceu053675
 

Destaque (16)

HAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasetsHAIS09-BeyondHomemadeArtificialDatasets
HAIS09-BeyondHomemadeArtificialDatasets
 
Lecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rulesLecture14 - Advanced topics in association rules
Lecture14 - Advanced topics in association rules
 
Lecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART IIILecture16 - Advances topics on association rules PART III
Lecture16 - Advances topics on association rules PART III
 
Lecture10 - Naïve Bayes
Lecture10 - Naïve BayesLecture10 - Naïve Bayes
Lecture10 - Naïve Bayes
 
Lecture13 - Association Rules
Lecture13 - Association RulesLecture13 - Association Rules
Lecture13 - Association Rules
 
Lecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligenceLecture1 AI1 Introduction to artificial intelligence
Lecture1 AI1 Introduction to artificial intelligence
 
Lecture22
Lecture22Lecture22
Lecture22
 
Incremental Learning using WEKA
Incremental Learning using WEKAIncremental Learning using WEKA
Incremental Learning using WEKA
 
Radial Basis Function
Radial Basis FunctionRadial Basis Function
Radial Basis Function
 
Decision tree Using c4.5 Algorithm
Decision tree Using c4.5 AlgorithmDecision tree Using c4.5 Algorithm
Decision tree Using c4.5 Algorithm
 
Radial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and DhanashriRadial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and Dhanashri
 
Data mining with weka
Data mining with wekaData mining with weka
Data mining with weka
 
Neural Networks: Radial Bases Functions (RBF)
Neural Networks: Radial Bases Functions (RBF)Neural Networks: Radial Bases Functions (RBF)
Neural Networks: Radial Bases Functions (RBF)
 
Machine Learning and Data Mining: 13 Nearest Neighbor and Bayesian Classifiers
Machine Learning and Data Mining: 13 Nearest Neighbor and Bayesian ClassifiersMachine Learning and Data Mining: 13 Nearest Neighbor and Bayesian Classifiers
Machine Learning and Data Mining: 13 Nearest Neighbor and Bayesian Classifiers
 
KNN
KNN KNN
KNN
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 

Semelhante a Lecture8 - From CBR to IBk

GECCO'2006: Bounding XCS’s Parameters for Unbalanced Datasets
GECCO'2006: Bounding XCS’s Parameters for Unbalanced DatasetsGECCO'2006: Bounding XCS’s Parameters for Unbalanced Datasets
GECCO'2006: Bounding XCS’s Parameters for Unbalanced DatasetsAlbert Orriols-Puig
 
Ph d colloquium 2
Ph d colloquium 2Ph d colloquium 2
Ph d colloquium 2Rishi Roy
 
Machine Learning Live
Machine Learning LiveMachine Learning Live
Machine Learning LiveMike Anderson
 
Multi-core programming talk for weekly biostat seminar
Multi-core programming talk for weekly biostat seminarMulti-core programming talk for weekly biostat seminar
Multi-core programming talk for weekly biostat seminarUSC
 
Modeling XCS in class imbalances: Population sizing and parameter settings
Modeling XCS in class imbalances: Population sizing and parameter settingsModeling XCS in class imbalances: Population sizing and parameter settings
Modeling XCS in class imbalances: Population sizing and parameter settingskknsastry
 
DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101Felipe Prado
 
Developing and Visualizing Live Model Queries
Developing and Visualizing Live Model QueriesDeveloping and Visualizing Live Model Queries
Developing and Visualizing Live Model QueriesZoltán Ujhelyi
 
Presentation
PresentationPresentation
Presentationbutest
 
A Survey on in-a-box parallel computing and its implications on system softwa...
A Survey on in-a-box parallel computing and its implications on system softwa...A Survey on in-a-box parallel computing and its implications on system softwa...
A Survey on in-a-box parallel computing and its implications on system softwa...ChangWoo Min
 
Model remodeling with modern deep learning frameworks
Model remodeling with modern deep learning frameworksModel remodeling with modern deep learning frameworks
Model remodeling with modern deep learning frameworksrosentep
 
Deep Learning Overview Classification Types Examples And Limitations
Deep Learning Overview Classification Types Examples And LimitationsDeep Learning Overview Classification Types Examples And Limitations
Deep Learning Overview Classification Types Examples And LimitationsSlideTeam
 
Pr 005 qa_workshop
Pr 005 qa_workshopPr 005 qa_workshop
Pr 005 qa_workshopFrank Gielen
 
5 Essential Machine Learning Ideas
5 Essential Machine Learning Ideas5 Essential Machine Learning Ideas
5 Essential Machine Learning IdeasCarl Dawson
 
Obscenity Detection in Images
Obscenity Detection in ImagesObscenity Detection in Images
Obscenity Detection in ImagesAnil Kumar Gupta
 
Distributed Deep Learning on AWS with Apache MXNet
Distributed Deep Learning on AWS with Apache MXNetDistributed Deep Learning on AWS with Apache MXNet
Distributed Deep Learning on AWS with Apache MXNetAmazon Web Services
 
Memory Leak Profiling with NetBeans and HotSpot
Memory Leak Profiling with NetBeans and HotSpotMemory Leak Profiling with NetBeans and HotSpot
Memory Leak Profiling with NetBeans and HotSpotjavapsyche
 
Application scenarios in streaming oriented embedded-system design
Application scenarios in streaming oriented embedded-system designApplication scenarios in streaming oriented embedded-system design
Application scenarios in streaming oriented embedded-system designMr. Chanuwan
 

Semelhante a Lecture8 - From CBR to IBk (20)

GECCO'2006: Bounding XCS’s Parameters for Unbalanced Datasets
GECCO'2006: Bounding XCS’s Parameters for Unbalanced DatasetsGECCO'2006: Bounding XCS’s Parameters for Unbalanced Datasets
GECCO'2006: Bounding XCS’s Parameters for Unbalanced Datasets
 
Ph d colloquium 2
Ph d colloquium 2Ph d colloquium 2
Ph d colloquium 2
 
Machine Learning Live
Machine Learning LiveMachine Learning Live
Machine Learning Live
 
Multi-core programming talk for weekly biostat seminar
Multi-core programming talk for weekly biostat seminarMulti-core programming talk for weekly biostat seminar
Multi-core programming talk for weekly biostat seminar
 
Modeling XCS in class imbalances: Population sizing and parameter settings
Modeling XCS in class imbalances: Population sizing and parameter settingsModeling XCS in class imbalances: Population sizing and parameter settings
Modeling XCS in class imbalances: Population sizing and parameter settings
 
SED-ML Presentation
SED-ML PresentationSED-ML Presentation
SED-ML Presentation
 
DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101
 
Developing and Visualizing Live Model Queries
Developing and Visualizing Live Model QueriesDeveloping and Visualizing Live Model Queries
Developing and Visualizing Live Model Queries
 
Presentation
PresentationPresentation
Presentation
 
A Survey on in-a-box parallel computing and its implications on system softwa...
A Survey on in-a-box parallel computing and its implications on system softwa...A Survey on in-a-box parallel computing and its implications on system softwa...
A Survey on in-a-box parallel computing and its implications on system softwa...
 
Model remodeling with modern deep learning frameworks
Model remodeling with modern deep learning frameworksModel remodeling with modern deep learning frameworks
Model remodeling with modern deep learning frameworks
 
MIRU2014 SLAC
MIRU2014 SLACMIRU2014 SLAC
MIRU2014 SLAC
 
Deep Learning Overview Classification Types Examples And Limitations
Deep Learning Overview Classification Types Examples And LimitationsDeep Learning Overview Classification Types Examples And Limitations
Deep Learning Overview Classification Types Examples And Limitations
 
Pr 005 qa_workshop
Pr 005 qa_workshopPr 005 qa_workshop
Pr 005 qa_workshop
 
5 Essential Machine Learning Ideas
5 Essential Machine Learning Ideas5 Essential Machine Learning Ideas
5 Essential Machine Learning Ideas
 
Obscenity Detection in Images
Obscenity Detection in ImagesObscenity Detection in Images
Obscenity Detection in Images
 
ML_in_QM_JC_02-10-18
ML_in_QM_JC_02-10-18ML_in_QM_JC_02-10-18
ML_in_QM_JC_02-10-18
 
Distributed Deep Learning on AWS with Apache MXNet
Distributed Deep Learning on AWS with Apache MXNetDistributed Deep Learning on AWS with Apache MXNet
Distributed Deep Learning on AWS with Apache MXNet
 
Memory Leak Profiling with NetBeans and HotSpot
Memory Leak Profiling with NetBeans and HotSpotMemory Leak Profiling with NetBeans and HotSpot
Memory Leak Profiling with NetBeans and HotSpot
 
Application scenarios in streaming oriented embedded-system design
Application scenarios in streaming oriented embedded-system designApplication scenarios in streaming oriented embedded-system design
Application scenarios in streaming oriented embedded-system design
 

Último

Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 

Último (20)

Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 

Lecture8 - From CBR to IBk

  • 1. Introduction to Machine Learning Lecture 8 Instance Based Learning and Case-Based Reasoning Albert Orriols i Puig aorriols@salle.url.edu i l @ ll ld Artificial Intelligence – Machine Learning Enginyeria i Arquitectura La Salle gy q Universitat Ramon Llull
  • 2. Recap of Lecture 7 kNN 15-NN 1-NN Key aspects Value of k Distance functions Slide 2 Artificial Intelligence Machine Learning
  • 3. Recap of Lecture 7 Where is learning in kNN? g Retrieval system No l b l N global model dl No generalization … No learning! o ea g But till i bl t B t still, it is able to create accurate classification t tl ifi ti models Slide 3 Artificial Intelligence Machine Learning
  • 4. Today’s Agenda Formalizing the framework: From kNN to CBR Incorporating learning in different phases: Learn prototypes Organize the memory in clusters Learn the best distance function Provide explanations Slide 4 Artificial Intelligence Machine Learning
  • 5. From kNN to CBR kNN provides a retrieving system Much work on different phases of kNN Prototype selection Distance function selection … CBR provides a general framework based on kNN Slide 5 Artificial Intelligence Machine Learning
  • 6. Schema of CBR CBR cycle Select a (Aamodt & solution Plaza, Plaza 1994) Reuse Similarity function Revise the solution Solution Revise Problem Retrieve Case Memory Retain Coherence and Structure and Retain the relevance of the agrupation of the cases new knowledge attributes Slide 6 Artificial Intelligence Machine Learning
  • 7. Phases of CBR Five key phases Preprocess the training instance So that it meets the requirements of the system Retrieve Use U kNN with the selected distance function ih h l d di f i Reuse Vote-based scheme Revise Adapt the solution if necessary Retain Remove examples from or add examples to the case memory Slide 7 Artificial Intelligence Machine Learning
  • 8. Challenges in CBR Hot areas Reduce the cost of matching Reduce the total number of examples in the case memory Organize the case memory in clusters and only consult examples O i th ilt dl lt l of some clusters Automatically create distance functions that are suited to your problem Extraction of explanations: CBR does not extract legible models (actually, does not learn any model) ) Slide 8 Artificial Intelligence Machine Learning
  • 9. Prototype Selection Training data sets contain a large number of instances g g Increase the prediction time May M contain noisy i t ti i instances Prototype selection Select the representative examples to form the case base Remove all the other examples How? Learn which examples are the ones that maximize CBR accuracy Slide 9 Artificial Intelligence Machine Learning
  • 10. Prototype Selection Possible sets of prototypes … Training Sel. Sel. Sel. Training Data set Proto 1 Proto 2 Proto 3 Data set Split the training set How do we know which is th b t S l ti i the best Selection of f Prototypes? Validation set KNN Test data set Does it sound familiar to you? Problem: Search for the best SP It s It’s just an optimization problem For robustness, use cross-validation or similar validation procedures Slide 10 Artificial Intelligence Machine Learning
  • 11. Prototype Selection Optimization methods used so far p Genetic algorithms (Holland, 75) Genetic Programming (Koza et al., 1989) G ti P i (K tl Grammar Evolution (Ryan & O’Neill, 1998) Slide 11 Artificial Intelligence Machine Learning
  • 12. Case-Based Memory Clustering Training data sets contain a large number of instances g g Clustering: Place instances in different clusters Only t i O l retrieve from the same cluster or clusters that are f th lt lt th t close to you Slide 12 Artificial Intelligence Machine Learning
  • 13. Case-Based Memory Clustering Retrieve phase Reuse Reuse phase 1. Compare with all the prototypes Propose a solution with the 2. Compare only with the examples retrieved cases of the closest cluster Case Retrieve R ti Revise Ri Memory Retain phase Revise phase Update the organization. Revise if the solution is It may imply the update of the y py p p potentially valid y Retain clusters Slide 13 Artificial Intelligence Machine Learning
  • 14. Generation of Distance Functions How does the distance function influences learning? g It may be the key between success and failure! Slide 14 Artificial Intelligence Machine Learning
  • 15. Generation of Distance Functions Can I find a distance function that makes kNN perform p the best in all cases? No way Actually, NFL announces it (Wolpert 1992) way. Actually (Wolpert, Different distances suited for different domains May I try to create a new distance function for each specific problem? Of course. Again, an optimization problem Slide 15 Artificial Intelligence Machine Learning
  • 16. Generation of Distance Functions Split the training data set into Training t’ T i i set’ Optimization problem Validation set Assume a parametric form Optimize the parameters of the Validation underlying function set Being more ambitious? Dist. Dist. Dist. function1 function2 functionn Do not assume any parametric … form Optimize both the function structure and the parameters kNN Training Examples: Data set‘ ( (Fornells et al., 2005) , ) (Camps et al., 2003) error1 error2 errorn Slide 16 Artificial Intelligence Machine Learning
  • 17. Extraction of Explanations One of the main drawbacks of CBR is that it does not provide p any explanation Prediction based on nearest neighbors New techniques to provide explanations Based on used instances Building of partial models Not studied in more detail here Slide 17 Artificial Intelligence Machine Learning
  • 18. Next Class Probabilistic-based learning Slide 18 Artificial Intelligence Machine Learning
  • 19. Introduction to Machine Learning Lecture 8 Instance Based Learning and Case-Based Reasoning Albert Orriols i Puig aorriols@salle.url.edu i l @ ll ld Artificial Intelligence – Machine Learning Enginyeria i Arquitectura La Salle gy q Universitat Ramon Llull