SlideShare uma empresa Scribd logo
1 de 22
The Compact Classifier System:
Scalability Analysis and First Results

      Xavier Llorà, Kumara Sastry, & David E. Goldberg

                  Illinois Genetic Algorithms Lab
             University of Illinois at Urbana-Champaign


            {xllora,kumara,deg}@illigal.ge.uiuc.edu
Motivation
    • Pittsburgh classifier systems
    • Can we apply Wilson’s ideas for evolving rule sets
      formed only by maximally accurate and general rules?
    • Bottom up approach for evolving such rules
             – The compact classifier system
    • Previous Multiobjective (Llorà, Goldberg, Traus, Bernadó,
      2003) approaches were top down
             – Explicitly address accuracy and generality
             – Use it to push and product compact rule sets
    • Side product:
             – Scalability challenge of De Jong & Spears (1991) representation

IWLCS 2005                          Llorà, Sastry, & Goldberg, 2005              2
Binary Rule Encoding
    • De Jong & Spears (1991)
    • Widely used in Pittsburgh classifiers
    • GALE, MOLS, GAssist have used it
                     color              shape                                 size
        red green blue white     round square                      huge large medium small
             1   1     1     1      0              1                0     1          1   0


    • A rule is expressed as (1111|01|0110)
    • Equivalent to Holland's (1975) representation (#11,#12)
    • A rule set is a disjunction of such rules


IWLCS 2005                       Llorà, Sastry, & Goldberg, 2005                             3
Previous Efforts based using
              Multiobjective Optimization




                                                 (Llorà, Goldberg, Traus, Bernadó, 2003)
IWLCS 2005     Llorà, Sastry, & Goldberg, 2005                                             4
Maximally Accurate and General Rules
    • Accuracy and generality can be computed using data set

                        n t + (r) + n t− (r)                         n t + (r)
                 α(r) =                                       ε(r) =
                                 nt                                    nm
     • Fitness should combine accuracy and generality
                              f (r) = α(r) ⋅ ε(r)γ

     • Such measure can be either applied to rules or a rule sets
     • The compact classifier systems uses this fitness and a
       compact genetic algorithm (cGA) to evolve such rules
     • Each cGA run use a different initial perturbed probability
       vector
IWLCS 2005                      Llorà, Sastry, & Goldberg, 2005                  5
The Compact Genetic Algorithm Can Make It

    • Rules may be obtained optimizing

                                    f (r) = α(r) ⋅ ε(r)γ

             The basic cGA scheme
    •
                                 px i = 0.5
                                  0
             1. Initialization
             2. Model sampling (two individuals are generated)
             3. Evaluation (f(r))
             4. Selection (tournament selection)
             5. Probabilistic model updation
             6. Repeat steps 2-5 until termination criteria are met



IWLCS 2005                            Llorà, Sastry, & Goldberg, 2005   6
cGAModel Perturbation
    • Facilitate the evolution of different rules
    • Explore the frequency of appearance of each optimal
      rule
    • Initial model perturbation

                        px i = 0.5 + U(−0.4,0.4)
                         0



     • Experiments using the 3-input multiplexer
     • 1,000 independent runs
     • Visualize the pair-wise relations of the genes



IWLCS 2005                  Llorà, Sastry, & Goldberg, 2005   7
Initial Perturbed Vectors Leading to rule
                                         100111(01#)
       Problem
       structure




IWLCS 2005                Llorà, Sastry, & Goldberg, 2005   8
Initial Perturbed Vectors Leading to rule
                                         011101(1#1)
       Problem
       structure




IWLCS 2005                Llorà, Sastry, & Goldberg, 2005   9
Perturbation Summary
    • 97% of the runs lead to a maximally general and
      accurate rule
    • The provability of evolving each of the optimal rules was
      roughly 1/3
    • The initial perturbed probability vectors that lead to an
      optimal rule show pair-wise relations among genes
    • The pair-wise relations reflect the problem structure




IWLCS 2005                 Llorà, Sastry, & Goldberg, 2005        10
But One Rule Is Not Enough
    • Model perturbation in cGA evolve different rules
    • The goal: evolve population of rules that solve the
      problem together
    • The fitness measure (f(r)) can be also be applied to rule
      sets
             Two mechanisms:
    •
             – Spawn a population until the solution is meet
             – Fusing populations when they represent the same rule




IWLCS 2005                          Llorà, Sastry, & Goldberg, 2005   11
Spawning and Fusing Populations of Rules




IWLCS 2005      Llorà, Sastry, & Goldberg, 2005   12
Experiments & Scalability
    • Analysis using multiplexer problems (3-, 6-, and 11-input)
    • The number of rules in [O] grow exponentially
             – 2i, where i is the number of inputs
    • The CGA success as a function of the problem size
             – 3-input: 97%
             – 6-input: 73.93%
             – 11-input:43.03%
    • Scalability over 10,000 independent runs




IWLCS 2005                           Llorà, Sastry, & Goldberg, 2005   13
Scalability of CCS




IWLCS 2005   Llorà, Sastry, & Goldberg, 2005              14
Unmatchable Rules: A Byproduct
    • A rule is unmatchable if:
             – At least one attribute in the contain have all its possible values
               set to 0
                     color                  shape                                 size
        red green blue white         round square                      huge large medium small
             1   1     1     1          0              0                0     1          1   0


    • The rule (1111|00|0110) force the shape to be neither
      round or square
    • Hence no data instance will ever match it
    • Direct impact on the scalability of LCS/GBML system using
      it (as simple experiments with the multiplexer show)
IWLCS 2005                           Llorà, Sastry, & Goldberg, 2005                             15
3-Input Multiplexer




                                               37 unmatchable rules (57.8%)




IWLCS 2005   Llorà, Sastry, & Goldberg, 2005                                  16
6-Input Multiplexer




                                               3,367 unmatchable rules (82.2%)




IWLCS 2005   Llorà, Sastry, & Goldberg, 2005                                     17
11-Input Multiplexer



                                               4,017,157 unmatchable rules (95.7%)




IWLCS 2005   Llorà, Sastry, & Goldberg, 2005                                         18
Growth Ratio of Unmatchable Rules (I/III)
    • An unmatchable rule has of all attribute values set to 0
    • Analysis for problems with binary attributes (worst case)
    • The total number of rules
                                 Σ(l) = 2 l
     • Number of rules matchable rules (all attributes set to
       either 01, 11, & 11)         l
                                 Ψ(l) = 3 2
    • Size of the unmatchable rule set plateau
                                                              l
                     Φ(l) = Σ(l) − Ψ(l) = 2 l − 3             2



IWLCS 2005                  Llorà, Sastry, & Goldberg, 2005       19
Growth Ratio of Unmatchable Rules (II/III)
    • Growth ratio of unmatchable rules
                               Φ(l) 2 l
                        ρ(l) =      = l −1
                               Ψ(l)
                                     32
    • It can be approximated by
                               ρ(l) ≈ e cl
                               ⎛2⎞
                         c = ln⎜ ⎟ = 0.143
                               ⎝ 3⎠

     • The growth ratio (ρ) for this representation grows
       exponentially

IWLCS 2005                 Llorà, Sastry, & Goldberg, 2005   20
Growth Ratio of Unmatchable Rules (III/III)




IWLCS 2005         Llorà, Sastry, & Goldberg, 2005   21
Growth Ratio of Unmatchable Rules (III/III)




IWLCS 2005         Llorà, Sastry, & Goldberg, 2005   21

Mais conteúdo relacionado

Destaque

The role organizational rewards on employees motivation
The role organizational rewards on employees motivationThe role organizational rewards on employees motivation
The role organizational rewards on employees motivation
Awais Sargana
 
Questionnaire on training
Questionnaire on trainingQuestionnaire on training
Questionnaire on training
swati_tiwari
 
Time and space complexity
Time and space complexityTime and space complexity
Time and space complexity
Ankit Katiyar
 
The Secret To Employee Motivation
The Secret To Employee MotivationThe Secret To Employee Motivation
The Secret To Employee Motivation
D B
 
Questionnaire on training and development
Questionnaire on training and developmentQuestionnaire on training and development
Questionnaire on training and development
Susmitha Chowdary
 

Destaque (6)

Complexity of Algorithm
Complexity of AlgorithmComplexity of Algorithm
Complexity of Algorithm
 
The role organizational rewards on employees motivation
The role organizational rewards on employees motivationThe role organizational rewards on employees motivation
The role organizational rewards on employees motivation
 
Questionnaire on training
Questionnaire on trainingQuestionnaire on training
Questionnaire on training
 
Time and space complexity
Time and space complexityTime and space complexity
Time and space complexity
 
The Secret To Employee Motivation
The Secret To Employee MotivationThe Secret To Employee Motivation
The Secret To Employee Motivation
 
Questionnaire on training and development
Questionnaire on training and developmentQuestionnaire on training and development
Questionnaire on training and development
 

Semelhante a The compact classifier system: Motivation, analysis and first results

IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
Albert Orriols-Puig
 
Evolution of regression ols to gps to mars
Evolution of regression   ols to gps to marsEvolution of regression   ols to gps to mars
Evolution of regression ols to gps to mars
Salford Systems
 

Semelhante a The compact classifier system: Motivation, analysis and first results (9)

Linkage Learning for Pittsburgh LCS: Making Problems Tractable
Linkage Learning for Pittsburgh LCS: Making Problems TractableLinkage Learning for Pittsburgh LCS: Making Problems Tractable
Linkage Learning for Pittsburgh LCS: Making Problems Tractable
 
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...
Do not Match, Inherit: Fitness Surrogates for Genetics-Based Machine Learning...
 
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
IWLCS'2008: First Approach toward Online Evolution of Association Rules wit...
 
Evolution of regression ols to gps to mars
Evolution of regression   ols to gps to marsEvolution of regression   ols to gps to mars
Evolution of regression ols to gps to mars
 
Extending R-squared beyond ordinary least-squares linear regression
Extending R-squared beyond ordinary least-squares linear regressionExtending R-squared beyond ordinary least-squares linear regression
Extending R-squared beyond ordinary least-squares linear regression
 
Reporting.ppt
Reporting.pptReporting.ppt
Reporting.ppt
 
Revisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural NetworksRevisiting the Calibration of Modern Neural Networks
Revisiting the Calibration of Modern Neural Networks
 
Pittsburgh Learning Classifier Systems for Protein Structure Prediction: Sca...
Pittsburgh Learning Classifier Systems for Protein  Structure Prediction: Sca...Pittsburgh Learning Classifier Systems for Protein  Structure Prediction: Sca...
Pittsburgh Learning Classifier Systems for Protein Structure Prediction: Sca...
 
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
 

Mais de Xavier Llorà

Large Scale Data Mining using Genetics-Based Machine Learning
Large Scale Data Mining using   Genetics-Based Machine LearningLarge Scale Data Mining using   Genetics-Based Machine Learning
Large Scale Data Mining using Genetics-Based Machine Learning
Xavier Llorà
 

Mais de Xavier Llorà (20)

Meandre 2.0 Alpha Preview
Meandre 2.0 Alpha PreviewMeandre 2.0 Alpha Preview
Meandre 2.0 Alpha Preview
 
Soaring the Clouds with Meandre
Soaring the Clouds with MeandreSoaring the Clouds with Meandre
Soaring the Clouds with Meandre
 
From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0
From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0
From Galapagos to Twitter: Darwin, Natural Selection, and Web 2.0
 
Large Scale Data Mining using Genetics-Based Machine Learning
Large Scale Data Mining using   Genetics-Based Machine LearningLarge Scale Data Mining using   Genetics-Based Machine Learning
Large Scale Data Mining using Genetics-Based Machine Learning
 
Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study us...
Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study us...Data-Intensive Computing for  Competent Genetic Algorithms:  A Pilot Study us...
Data-Intensive Computing for Competent Genetic Algorithms: A Pilot Study us...
 
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new Trends
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new TrendsScalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new Trends
Scalabiltity in GBML, Accuracy-based Michigan Fuzzy LCS, and new Trends
 
Towards a Theoretical Towards a Theoretical Framework for LCS Framework fo...
Towards a Theoretical  Towards a Theoretical  Framework for LCS  Framework fo...Towards a Theoretical  Towards a Theoretical  Framework for LCS  Framework fo...
Towards a Theoretical Towards a Theoretical Framework for LCS Framework fo...
 
Learning Classifier Systems for Class Imbalance Problems
Learning Classifier Systems  for Class Imbalance  ProblemsLearning Classifier Systems  for Class Imbalance  Problems
Learning Classifier Systems for Class Imbalance Problems
 
A Retrospective Look at A Retrospective Look at Classifier System ResearchCl...
A Retrospective Look at  A Retrospective Look at  Classifier System ResearchCl...A Retrospective Look at  A Retrospective Look at  Classifier System ResearchCl...
A Retrospective Look at A Retrospective Look at Classifier System ResearchCl...
 
XCS: Current capabilities and future challenges
XCS: Current capabilities and future  challengesXCS: Current capabilities and future  challenges
XCS: Current capabilities and future challenges
 
Negative Selection for Algorithm for Anomaly Detection
Negative Selection for Algorithm for Anomaly DetectionNegative Selection for Algorithm for Anomaly Detection
Negative Selection for Algorithm for Anomaly Detection
 
Searle, Intentionality, and the Future of Classifier Systems
Searle, Intentionality, and the  Future of Classifier SystemsSearle, Intentionality, and the  Future of Classifier Systems
Searle, Intentionality, and the Future of Classifier Systems
 
Computed Prediction: So far, so good. What now?
Computed Prediction:  So far, so good. What now?Computed Prediction:  So far, so good. What now?
Computed Prediction: So far, so good. What now?
 
NIGEL 2006 welcome
NIGEL 2006 welcomeNIGEL 2006 welcome
NIGEL 2006 welcome
 
Meandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the CloudsMeandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the Clouds
 
ZigZag: The Meandring Language
ZigZag: The Meandring LanguageZigZag: The Meandring Language
ZigZag: The Meandring Language
 
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...
HUMIES 2007 Bronze Winner: Towards Better than Human Capability in Diagnosing...
 
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...
Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infr...
 
The DISCUS project
The DISCUS projectThe DISCUS project
The DISCUS project
 
Visualizing content in metadata stores
Visualizing content in metadata storesVisualizing content in metadata stores
Visualizing content in metadata stores
 

Último

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Último (20)

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

The compact classifier system: Motivation, analysis and first results

  • 1. The Compact Classifier System: Scalability Analysis and First Results Xavier Llorà, Kumara Sastry, & David E. Goldberg Illinois Genetic Algorithms Lab University of Illinois at Urbana-Champaign {xllora,kumara,deg}@illigal.ge.uiuc.edu
  • 2. Motivation • Pittsburgh classifier systems • Can we apply Wilson’s ideas for evolving rule sets formed only by maximally accurate and general rules? • Bottom up approach for evolving such rules – The compact classifier system • Previous Multiobjective (Llorà, Goldberg, Traus, Bernadó, 2003) approaches were top down – Explicitly address accuracy and generality – Use it to push and product compact rule sets • Side product: – Scalability challenge of De Jong & Spears (1991) representation IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 2
  • 3. Binary Rule Encoding • De Jong & Spears (1991) • Widely used in Pittsburgh classifiers • GALE, MOLS, GAssist have used it color shape size red green blue white round square huge large medium small 1 1 1 1 0 1 0 1 1 0 • A rule is expressed as (1111|01|0110) • Equivalent to Holland's (1975) representation (#11,#12) • A rule set is a disjunction of such rules IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 3
  • 4. Previous Efforts based using Multiobjective Optimization (Llorà, Goldberg, Traus, Bernadó, 2003) IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 4
  • 5. Maximally Accurate and General Rules • Accuracy and generality can be computed using data set n t + (r) + n t− (r) n t + (r) α(r) = ε(r) = nt nm • Fitness should combine accuracy and generality f (r) = α(r) ⋅ ε(r)γ • Such measure can be either applied to rules or a rule sets • The compact classifier systems uses this fitness and a compact genetic algorithm (cGA) to evolve such rules • Each cGA run use a different initial perturbed probability vector IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 5
  • 6. The Compact Genetic Algorithm Can Make It • Rules may be obtained optimizing f (r) = α(r) ⋅ ε(r)γ The basic cGA scheme • px i = 0.5 0 1. Initialization 2. Model sampling (two individuals are generated) 3. Evaluation (f(r)) 4. Selection (tournament selection) 5. Probabilistic model updation 6. Repeat steps 2-5 until termination criteria are met IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 6
  • 7. cGAModel Perturbation • Facilitate the evolution of different rules • Explore the frequency of appearance of each optimal rule • Initial model perturbation px i = 0.5 + U(−0.4,0.4) 0 • Experiments using the 3-input multiplexer • 1,000 independent runs • Visualize the pair-wise relations of the genes IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 7
  • 8. Initial Perturbed Vectors Leading to rule 100111(01#) Problem structure IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 8
  • 9. Initial Perturbed Vectors Leading to rule 011101(1#1) Problem structure IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 9
  • 10. Perturbation Summary • 97% of the runs lead to a maximally general and accurate rule • The provability of evolving each of the optimal rules was roughly 1/3 • The initial perturbed probability vectors that lead to an optimal rule show pair-wise relations among genes • The pair-wise relations reflect the problem structure IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 10
  • 11. But One Rule Is Not Enough • Model perturbation in cGA evolve different rules • The goal: evolve population of rules that solve the problem together • The fitness measure (f(r)) can be also be applied to rule sets Two mechanisms: • – Spawn a population until the solution is meet – Fusing populations when they represent the same rule IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 11
  • 12. Spawning and Fusing Populations of Rules IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 12
  • 13. Experiments & Scalability • Analysis using multiplexer problems (3-, 6-, and 11-input) • The number of rules in [O] grow exponentially – 2i, where i is the number of inputs • The CGA success as a function of the problem size – 3-input: 97% – 6-input: 73.93% – 11-input:43.03% • Scalability over 10,000 independent runs IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 13
  • 14. Scalability of CCS IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 14
  • 15. Unmatchable Rules: A Byproduct • A rule is unmatchable if: – At least one attribute in the contain have all its possible values set to 0 color shape size red green blue white round square huge large medium small 1 1 1 1 0 0 0 1 1 0 • The rule (1111|00|0110) force the shape to be neither round or square • Hence no data instance will ever match it • Direct impact on the scalability of LCS/GBML system using it (as simple experiments with the multiplexer show) IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 15
  • 16. 3-Input Multiplexer 37 unmatchable rules (57.8%) IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 16
  • 17. 6-Input Multiplexer 3,367 unmatchable rules (82.2%) IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 17
  • 18. 11-Input Multiplexer 4,017,157 unmatchable rules (95.7%) IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 18
  • 19. Growth Ratio of Unmatchable Rules (I/III) • An unmatchable rule has of all attribute values set to 0 • Analysis for problems with binary attributes (worst case) • The total number of rules Σ(l) = 2 l • Number of rules matchable rules (all attributes set to either 01, 11, & 11) l Ψ(l) = 3 2 • Size of the unmatchable rule set plateau l Φ(l) = Σ(l) − Ψ(l) = 2 l − 3 2 IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 19
  • 20. Growth Ratio of Unmatchable Rules (II/III) • Growth ratio of unmatchable rules Φ(l) 2 l ρ(l) = = l −1 Ψ(l) 32 • It can be approximated by ρ(l) ≈ e cl ⎛2⎞ c = ln⎜ ⎟ = 0.143 ⎝ 3⎠ • The growth ratio (ρ) for this representation grows exponentially IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 20
  • 21. Growth Ratio of Unmatchable Rules (III/III) IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 21
  • 22. Growth Ratio of Unmatchable Rules (III/III) IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 21
  • 23. Conclusions • Initial steps to evolve rule sets formed formed only by maximally accurate and general rules using Pittsburgh systems • Using a cGA and the appropriate fitness function (CCS) we can evolve such rules • Rule representation has a direct connection to the scalability of any GBML system – A wrong choice makes the problem extremely hard • Further analysis for different representations is needed (Stone, 2004) IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 22
  • 24. Conclusions • Initial steps to evolve rule sets formed formed only by maximally accurate and general rules using Pittsburgh systems • Using a cGA and the appropriate fitness function (CCS) we can evolve such rules • Rule representation has a direct connection to the scalability of any GBML system – A wrong choice makes the problem extremely hard • Further analysis for different representations is needed (Stone, 2004) IWLCS 2005 Llorà, Sastry, & Goldberg, 2005 22