SlideShare uma empresa Scribd logo
1 de 26
Baixar para ler offline
Modeling decision making deficits in frontostriatal disorders
                            Michael Frank
             Laboratory for Neural Computation and Cognition
                             Brown University
Computational Psychiatry and...
                   Neurogenocomputomics
• Many disorders broadly characterized by changes in motivation
• Several fronto-striatal disorders have substantial genetic heritability
• Individual differences in reinforcement learning?
Computational Psychiatry and...
                   Neurogenocomputomics
• Many disorders broadly characterized by changes in motivation
• Several fronto-striatal disorders have substantial genetic heritability
• Individual differences in reinforcement learning?
• But... Candidate gene effects are generally small
• Which genes? Which task? Which measure?
Computational Psychiatry and...
                   Neurogenocomputomics
• Many disorders broadly characterized by changes in motivation
• Several fronto-striatal disorders have substantial genetic heritability
• Individual differences in reinforcement learning?
• But... Candidate gene effects are generally small
• Which genes? Which task? Which measure?
• Need theoretical model! (and converging pharmacology/imaging)
                                 Frank & Fossella, 2011; Maia & Frank, 2011; Huys et al, 2011
Reinforcement learning and dopamine: prediction errors
                           Positive PE:                    Negative PE:
dopamine:
  Montague, Dayan & Sejnowksi 96; Doya, 2002; O’Reilly, Frank, Hazy & Watz 06...
                                             ˆ           ˆ
                             δ(t) = r(t) + γ V (t + 1) − V (t)
Reinforcement learning and dopamine: prediction errors
                           Positive PE:                    Negative PE:
dopamine:
  Montague, Dayan & Sejnowksi 96; Doya, 2002; O’Reilly, Frank, Hazy & Watz 06...
                                             ˆ           ˆ
                             δ(t) = r(t) + γ V (t + 1) − V (t)
D1 effects on striatal learning: Positive PE
D1 effects on striatal learning: Positive PE
                    Three factor learning: presynaptic, postsynaptic and DA
D2 effects on striatal learning: Negative PE
                                               Frank 2005
Neural model of basal ganglia and dopamine
Integrates a wide range of data into a single coherent framework
Separate Go and NoGo populations integrate statistics of reinforcement
                             preSMA
Input
            Striatum                                                        γ [Vm− Θ]
                                             cVm = gege[E Vm]      y j ≈ γ [V − ] + 1
                                                                                     +
                                                                              m Θ+
                                                         e
                                                 + g g [E V ]
                                                    i i  i m
                                                 + g g [E Vm]                                β
                                                    l l l           net = ge ≈ <x i w ij > +
                                                                                             N
                                 STN             + ...
                                                            w ij
                       GPe
                                                           xi
Go         NoGo                   Thalamus
                                                      p p      t t
                                             ∆wij ≈ (xi yj )−(xi yj )
     SNc          GPi/SNr
                                                      Frank, 2005, 2006 J Cog Neurosci, Neural Networks
Maximizing Reward via RT Adaptation:
                   Temporal Utility Integration Task
                        Reward Frequency                                                                              Reward Magnitude
              1.0                                                                                        350
              0.9                                                    CEV                                                              CEV
                                                                     DEV                                 300                          DEV
              0.8                                                    IEV                                                              IEV
              0.7                                                    CEVR              # Points Gained   250                          CEVR
Probability




              0.6                                                                                        200
              0.5
              0.4                                                                                        150
              0.3                                                                                        100
              0.2
                                                                                                         50
              0.1
              0.0                                                                                         0
                 0   1000   2000        3000                          4000     5000                        0     1000   2000   3000   4000   5000
                             Time (ms)                                                                                    Time (ms)
                                                                               Expected Value
                                                               60
                                   Expected Value (freq*mag)
                                                               55
                                                               50
                                                               45
                                                               40
                                                               35
                                                               30
                                                               25
                                                               20                      CEV
                                                               15                      DEV
                                                               10                      IEV
                                                                5                      CEVR
                                                                0
                                                                 0      1000    2000         3000              4000   5000
                                                                                  Time (ms)
RL model: Fit to data across all subjects
RL model : adjust RTs as a function of reward prediction errors
                                    Frank, Doll, Oas-Terpstra & Moreno (2009, Nature Neuroscience)
Neurogenetic and pharmacological modulation of
      reinforcement learning parameters
                                        Frank & Fossella, 2011
Single subject Data...
                  Single Subject CEV                                  Single Subject DEV
          5000                                                5000
          4500                                                4500
          4000                                                4000
          3500                                                3500
RT (ms)




                                                    RT (ms)
          3000                                                3000
          2500                                                2500
          2000                                                2000
          1500                                                1500
          1000                                                1000
           500                                                 500
             0                                                   0
              0   10    20           30   40   50                 0   10    20           30   40   50
                             Trial                                               Trial
                   Single Subject IEV                                 Single Subject CEVR
          5000                                                5000
          4500                                                4500
          4000                                                4000
          3500                                                3500
RT (ms)




                                                    RT (ms)
          3000                                                3000
          2500                                                2500
          2000                                                2000
          1500                                                1500
          1000                                                1000
           500                                                 500
             0                                                   0
              0   10    20           30   40   50                 0   10    20           30   40   50
                             Trial                                               Trial
Exploration vs Exploitation
• By exploiting learned strategies, we know we can get a certain amount
  of reward
• But don’t know how good it can get. ⇒ Need to Explore
• Theory: Explore based on relative uncertainty about whether other
  actions might yield better outcomes than status quo (Dayan & Sejnowksi 96)
Exploration vs Exploitation
• By exploiting learned strategies, we know we can get a certain amount
  of reward
• But don’t know how good it can get. ⇒ Need to Explore
• Theory: Explore based on relative uncertainty about whether other
  actions might yield better outcomes than status quo (Dayan & Sejnowksi 96)
Uncertainty-Based Exploration
                                     Exploration
                4000
                                                              Model Exp term
                3000                                          RT diff
                2000
RT Diff (ms)




                1000
                   0
               −1000
               −2000
               −3000
                                           Single Subject, CEV
               −4000
                       5   10   15    20     25     30   35      40   45   50
                                            Trial
PFC Gene-Dose Effect on Uncertainty-Based Exploration
                        COMT gene-dose effects
                        Uncertainty-exploration parameter
                 0.50
                 0.45       val/val
                 0.40       val/met
       (x 1e4)




                            met/met
                 0.35
                 0.30
                 0.25
      ε


                 0.20
                 0.15
                 0.10
                 0.05
                 0.00
                          Frank, Doll, Oas-Terpstra & Moreno (2009, Nature Neuroscience)
Does the brain track relative uncertainty for exploration?
Does the brain track relative uncertainty for exploration?
             ǫ > 0 (’explorers’)   explorers > non-explorers
                                           Badre, Doll, Long & Frank, under review
EEG reveals temporal dynamics
EEG reveals temporal dynamics
Relative uncertainty represented prior to choice, and more so in exploratory trials
                                                 Cavanagh, Cohen, Figueroa & Frank, under review
Negative symptoms in schizophrenia:
                      Uncertainty-Based Exploration
                                                                        Anhedonia & Exploration
                    Uncertainty-driven exploration
                                                             0.8
             0.40
                                                             0.6
             0.35      SZ
                       CN                                    0.4
 ε (x 1e4)




             0.30                                            0.2
                                                               0
                                                     ε (x1e4)
             0.25
             0.20                                           -0.2
                              **                            -0.4
             0.15                                           -0.6
             0.10                                           -0.8   r = -.44, p = .002
             0.05                                           -1.0
             0.00                                           -1.2
                                                                   0         1          2     3        4
                              ε(uncert)                                      Global Anhedonia
• Anhedonia = behavioral component of reward seeking (e.g., initiating
  social/recreational activities) not capacity to experience pleasure
• Anhedonia related to exploration and not learning from reward prediction errors
                                                                        Strauss et al, 2011, Biological Psychiatry
Obsessive Compulsive Disorder: Aversion to Uncertainty
                                     Uncertainty-driven exploration
                              0.6
                                                                      CN
                              0.4                                     OCD
                  ε (x 1e4)




                              0.2
                              0.0
                              -0.2
                              -0.4
                                     gains                 losses
preliminary data, N=17 per group
                                           with Mascha van ’t Wout, Ben Greenberg, Steve Rasmussen
Summary
• Dopamine modulates reinforcement learning and choice based on
  positive and negative outcomes: patients, pharmacology, genetics,
  imaging
• Prefrontal cortex tracks outcome uncertainty so as to reduce it
• Disruption of these mechanisms is associated with fronto-striatal
  disorders, Parkinson’s, schizophrenia, OCD
• Models integrate between multiple levels of analysis:
  neural mechanism to abstract computation (see Thomas Wiecki
  demonstration tomorrow!).
Thanks To...
Bradley Doll
Christina Figueroa
Jim Cavanagh
David Badre
Jeff Cockburn
Anne Collins
Thomas Wiecki
Jim Gold
Kent Hutchison
Mascha van ’t Wout
Nicole Long
Mike Cohen
Ahmed Moustafa
Scott Sherman        Lab for Neural Computation and Cognition
The patients

Mais conteúdo relacionado

Mais procurados

Finite Element Analysis Made Easy Lr
Finite Element Analysis Made Easy LrFinite Element Analysis Made Easy Lr
Finite Element Analysis Made Easy Lr
guesta32562
 
Bfi_barcelona08
Bfi_barcelona08Bfi_barcelona08
Bfi_barcelona08
Carlo Chiorri
 
修士論文発表会
修士論文発表会修士論文発表会
修士論文発表会
Keikusl
 
Real time information reconstruction -The Prediction Market
Real time information reconstruction -The Prediction MarketReal time information reconstruction -The Prediction Market
Real time information reconstruction -The Prediction Market
raghavr186
 

Mais procurados (11)

Finite Element Analysis Made Easy Lr
Finite Element Analysis Made Easy LrFinite Element Analysis Made Easy Lr
Finite Element Analysis Made Easy Lr
 
Peyman
PeymanPeyman
Peyman
 
Bfi_barcelona08
Bfi_barcelona08Bfi_barcelona08
Bfi_barcelona08
 
Computing Loops
Computing LoopsComputing Loops
Computing Loops
 
Optimal Finite Difference Grids for Elliptic and Parabolic PDEs with Applicat...
Optimal Finite Difference Grids for Elliptic and Parabolic PDEs with Applicat...Optimal Finite Difference Grids for Elliptic and Parabolic PDEs with Applicat...
Optimal Finite Difference Grids for Elliptic and Parabolic PDEs with Applicat...
 
修士論文発表会
修士論文発表会修士論文発表会
修士論文発表会
 
Session 5
Session 5Session 5
Session 5
 
Further Advanced Methods from Mathematical Optimization
Further Advanced Methods from Mathematical OptimizationFurther Advanced Methods from Mathematical Optimization
Further Advanced Methods from Mathematical Optimization
 
Faults and Regression Testing - Fault interaction and its repercussions
Faults and Regression Testing - Fault interaction and its repercussionsFaults and Regression Testing - Fault interaction and its repercussions
Faults and Regression Testing - Fault interaction and its repercussions
 
Real time information reconstruction -The Prediction Market
Real time information reconstruction -The Prediction MarketReal time information reconstruction -The Prediction Market
Real time information reconstruction -The Prediction Market
 
Prediction of Financial Processes
Prediction of Financial ProcessesPrediction of Financial Processes
Prediction of Financial Processes
 

Semelhante a Frank_NeuroInformatics11.pdf

Semelhante a Frank_NeuroInformatics11.pdf (10)

Gregoire H&N
Gregoire H&NGregoire H&N
Gregoire H&N
 
Free SPICE Model of 1S1887 in SPICE PARK
Free SPICE Model of 1S1887 in SPICE PARKFree SPICE Model of 1S1887 in SPICE PARK
Free SPICE Model of 1S1887 in SPICE PARK
 
SPICE MODEL of 30WQ04FN (Professional Model) in SPICE PARK
SPICE MODEL of 30WQ04FN (Professional Model) in SPICE PARKSPICE MODEL of 30WQ04FN (Professional Model) in SPICE PARK
SPICE MODEL of 30WQ04FN (Professional Model) in SPICE PARK
 
Free SPICE Model of 1SS397 in SPICE PARK
Free SPICE Model of 1SS397 in SPICE PARKFree SPICE Model of 1SS397 in SPICE PARK
Free SPICE Model of 1SS397 in SPICE PARK
 
SPICE MODEL of RURG8060 , TC=110degree (Standard Model) in SPICE PARK
SPICE MODEL of RURG8060 , TC=110degree (Standard Model) in SPICE PARKSPICE MODEL of RURG8060 , TC=110degree (Standard Model) in SPICE PARK
SPICE MODEL of RURG8060 , TC=110degree (Standard Model) in SPICE PARK
 
SPICE MODEL of SF3L60U (Standard Model) in SPICE PARK
SPICE MODEL of SF3L60U (Standard Model) in SPICE PARKSPICE MODEL of SF3L60U (Standard Model) in SPICE PARK
SPICE MODEL of SF3L60U (Standard Model) in SPICE PARK
 
SPICE MODEL of M1FS4 (Standard Model) in SPICE PARK
SPICE MODEL of M1FS4 (Standard Model) in SPICE PARKSPICE MODEL of M1FS4 (Standard Model) in SPICE PARK
SPICE MODEL of M1FS4 (Standard Model) in SPICE PARK
 
Free SPICE Model of 1SS187 in SPICE PARK
Free SPICE Model of 1SS187 in SPICE PARKFree SPICE Model of 1SS187 in SPICE PARK
Free SPICE Model of 1SS187 in SPICE PARK
 
SPICE MODEL of 1SS187 (Professional Model) in SPICE PARK
SPICE MODEL of 1SS187 (Professional Model) in SPICE PARKSPICE MODEL of 1SS187 (Professional Model) in SPICE PARK
SPICE MODEL of 1SS187 (Professional Model) in SPICE PARK
 
SPICE MODEL of 1SS193 (Professional Model) in SPICE PARK
SPICE MODEL of 1SS193 (Professional Model) in SPICE PARKSPICE MODEL of 1SS193 (Professional Model) in SPICE PARK
SPICE MODEL of 1SS193 (Professional Model) in SPICE PARK
 

Último

Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Último (20)

All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
 
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
 
Call Girls Raipur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Raipur Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Raipur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Raipur Just Call 9630942363 Top Class Call Girl Service Available
 
Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...
Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...
Independent Call Girls In Jaipur { 8445551418 } ✔ ANIKA MEHTA ✔ Get High Prof...
 
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
 
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
 
8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
 
Russian Call Girls Service Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
Russian Call Girls Service  Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...Russian Call Girls Service  Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
Russian Call Girls Service Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
 
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
 
Call Girls Kolkata Kalikapur 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...
Call Girls Kolkata Kalikapur 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...Call Girls Kolkata Kalikapur 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...
Call Girls Kolkata Kalikapur 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...
 
Call Girls Hosur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Hosur Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Hosur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Hosur Just Call 9630942363 Top Class Call Girl Service Available
 
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
 
Call Girls Madurai Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Madurai Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Madurai Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Madurai Just Call 9630942363 Top Class Call Girl Service Available
 
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
 
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
 
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
 
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service AvailableTrichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
 
Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...
Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...
Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...
 
Most Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on WhatsappMost Beautiful Call Girl in Bangalore Contact on Whatsapp
Most Beautiful Call Girl in Bangalore Contact on Whatsapp
 

Frank_NeuroInformatics11.pdf

  • 1. Modeling decision making deficits in frontostriatal disorders Michael Frank Laboratory for Neural Computation and Cognition Brown University
  • 2. Computational Psychiatry and... Neurogenocomputomics • Many disorders broadly characterized by changes in motivation • Several fronto-striatal disorders have substantial genetic heritability • Individual differences in reinforcement learning?
  • 3. Computational Psychiatry and... Neurogenocomputomics • Many disorders broadly characterized by changes in motivation • Several fronto-striatal disorders have substantial genetic heritability • Individual differences in reinforcement learning? • But... Candidate gene effects are generally small • Which genes? Which task? Which measure?
  • 4. Computational Psychiatry and... Neurogenocomputomics • Many disorders broadly characterized by changes in motivation • Several fronto-striatal disorders have substantial genetic heritability • Individual differences in reinforcement learning? • But... Candidate gene effects are generally small • Which genes? Which task? Which measure? • Need theoretical model! (and converging pharmacology/imaging) Frank & Fossella, 2011; Maia & Frank, 2011; Huys et al, 2011
  • 5. Reinforcement learning and dopamine: prediction errors Positive PE: Negative PE: dopamine: Montague, Dayan & Sejnowksi 96; Doya, 2002; O’Reilly, Frank, Hazy & Watz 06... ˆ ˆ δ(t) = r(t) + γ V (t + 1) − V (t)
  • 6. Reinforcement learning and dopamine: prediction errors Positive PE: Negative PE: dopamine: Montague, Dayan & Sejnowksi 96; Doya, 2002; O’Reilly, Frank, Hazy & Watz 06... ˆ ˆ δ(t) = r(t) + γ V (t + 1) − V (t)
  • 7. D1 effects on striatal learning: Positive PE
  • 8. D1 effects on striatal learning: Positive PE Three factor learning: presynaptic, postsynaptic and DA
  • 9. D2 effects on striatal learning: Negative PE Frank 2005
  • 10. Neural model of basal ganglia and dopamine Integrates a wide range of data into a single coherent framework Separate Go and NoGo populations integrate statistics of reinforcement preSMA Input Striatum γ [Vm− Θ] cVm = gege[E Vm] y j ≈ γ [V − ] + 1 + m Θ+ e + g g [E V ] i i i m + g g [E Vm] β l l l net = ge ≈ <x i w ij > + N STN + ... w ij GPe xi Go NoGo Thalamus p p t t ∆wij ≈ (xi yj )−(xi yj ) SNc GPi/SNr Frank, 2005, 2006 J Cog Neurosci, Neural Networks
  • 11. Maximizing Reward via RT Adaptation: Temporal Utility Integration Task Reward Frequency Reward Magnitude 1.0 350 0.9 CEV CEV DEV 300 DEV 0.8 IEV IEV 0.7 CEVR # Points Gained 250 CEVR Probability 0.6 200 0.5 0.4 150 0.3 100 0.2 50 0.1 0.0 0 0 1000 2000 3000 4000 5000 0 1000 2000 3000 4000 5000 Time (ms) Time (ms) Expected Value 60 Expected Value (freq*mag) 55 50 45 40 35 30 25 20 CEV 15 DEV 10 IEV 5 CEVR 0 0 1000 2000 3000 4000 5000 Time (ms)
  • 12. RL model: Fit to data across all subjects RL model : adjust RTs as a function of reward prediction errors Frank, Doll, Oas-Terpstra & Moreno (2009, Nature Neuroscience)
  • 13. Neurogenetic and pharmacological modulation of reinforcement learning parameters Frank & Fossella, 2011
  • 14. Single subject Data... Single Subject CEV Single Subject DEV 5000 5000 4500 4500 4000 4000 3500 3500 RT (ms) RT (ms) 3000 3000 2500 2500 2000 2000 1500 1500 1000 1000 500 500 0 0 0 10 20 30 40 50 0 10 20 30 40 50 Trial Trial Single Subject IEV Single Subject CEVR 5000 5000 4500 4500 4000 4000 3500 3500 RT (ms) RT (ms) 3000 3000 2500 2500 2000 2000 1500 1500 1000 1000 500 500 0 0 0 10 20 30 40 50 0 10 20 30 40 50 Trial Trial
  • 15. Exploration vs Exploitation • By exploiting learned strategies, we know we can get a certain amount of reward • But don’t know how good it can get. ⇒ Need to Explore • Theory: Explore based on relative uncertainty about whether other actions might yield better outcomes than status quo (Dayan & Sejnowksi 96)
  • 16. Exploration vs Exploitation • By exploiting learned strategies, we know we can get a certain amount of reward • But don’t know how good it can get. ⇒ Need to Explore • Theory: Explore based on relative uncertainty about whether other actions might yield better outcomes than status quo (Dayan & Sejnowksi 96)
  • 17. Uncertainty-Based Exploration Exploration 4000 Model Exp term 3000 RT diff 2000 RT Diff (ms) 1000 0 −1000 −2000 −3000 Single Subject, CEV −4000 5 10 15 20 25 30 35 40 45 50 Trial
  • 18. PFC Gene-Dose Effect on Uncertainty-Based Exploration COMT gene-dose effects Uncertainty-exploration parameter 0.50 0.45 val/val 0.40 val/met (x 1e4) met/met 0.35 0.30 0.25 ε 0.20 0.15 0.10 0.05 0.00 Frank, Doll, Oas-Terpstra & Moreno (2009, Nature Neuroscience)
  • 19. Does the brain track relative uncertainty for exploration?
  • 20. Does the brain track relative uncertainty for exploration? ǫ > 0 (’explorers’) explorers > non-explorers Badre, Doll, Long & Frank, under review
  • 22. EEG reveals temporal dynamics Relative uncertainty represented prior to choice, and more so in exploratory trials Cavanagh, Cohen, Figueroa & Frank, under review
  • 23. Negative symptoms in schizophrenia: Uncertainty-Based Exploration Anhedonia & Exploration Uncertainty-driven exploration 0.8 0.40 0.6 0.35 SZ CN 0.4 ε (x 1e4) 0.30 0.2 0 ε (x1e4) 0.25 0.20 -0.2 ** -0.4 0.15 -0.6 0.10 -0.8 r = -.44, p = .002 0.05 -1.0 0.00 -1.2 0 1 2 3 4 ε(uncert) Global Anhedonia • Anhedonia = behavioral component of reward seeking (e.g., initiating social/recreational activities) not capacity to experience pleasure • Anhedonia related to exploration and not learning from reward prediction errors Strauss et al, 2011, Biological Psychiatry
  • 24. Obsessive Compulsive Disorder: Aversion to Uncertainty Uncertainty-driven exploration 0.6 CN 0.4 OCD ε (x 1e4) 0.2 0.0 -0.2 -0.4 gains losses preliminary data, N=17 per group with Mascha van ’t Wout, Ben Greenberg, Steve Rasmussen
  • 25. Summary • Dopamine modulates reinforcement learning and choice based on positive and negative outcomes: patients, pharmacology, genetics, imaging • Prefrontal cortex tracks outcome uncertainty so as to reduce it • Disruption of these mechanisms is associated with fronto-striatal disorders, Parkinson’s, schizophrenia, OCD • Models integrate between multiple levels of analysis: neural mechanism to abstract computation (see Thomas Wiecki demonstration tomorrow!).
  • 26. Thanks To... Bradley Doll Christina Figueroa Jim Cavanagh David Badre Jeff Cockburn Anne Collins Thomas Wiecki Jim Gold Kent Hutchison Mascha van ’t Wout Nicole Long Mike Cohen Ahmed Moustafa Scott Sherman Lab for Neural Computation and Cognition The patients