How cortico-basal ganglia-thalamic subnetworks can shift decision policies to increase reward rate
Fig 5
Suboptimal and optimal choices modulate control ensembles in opposite directions.
(A) The modulation of control ensembles associated with various reward sequences encountered in two initial trials with cortico-striatal plasticity. U represents “Unrewarded" and R represents “Rewarded" trials. (B) The reward rate changes obtained by simulation of networks with synaptic weights frozen after various reward sequences occurred on two initial trials.