Reward-driven changes in striatal pathway competition shape evidence evaluation in decision-making
Fig 2
Corticostriatal synaptic weights with probabilistic reward feedback.
First column: pL = 0.65; second column: pL = 0.75; third column: pL = 0.85. A, B, and C: Weights averaged over each of four specific populations of neurons: dMSNs selecting action L (solid black), dMSNs selecting action R (solid red), iMSNs countering action L (dashed black), and iMSNs countering action R (dashed red). D, E, and F: Evolution of the values of actions L (QL) and R (QR) estimated by Q-learning, compared with the ratio of the corticostriatal weights onto the dMSNs that facilitate each action to the weights onto the iMSNs that interfere with it. Both the weights and the ratios have been averaged over 8 different realizations. A small jump occurs in the QR trace for pL = 0.65 and is joined by a dashed line; this comes from the time discretization and averaging.
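To make the Q-learning traces in panels D, E, and F concrete, the following is a minimal sketch of how action-value estimates QL and QR could be computed under probabilistic reward and averaged over 8 realizations. It is not the authors' implementation. The learning rate `alpha`, the epsilon-greedy choice rule, the binary reward, the trial count, and the assumption that action R is rewarded with probability 1 - pL are all illustrative choices that the caption does not specify, and the function names are hypothetical.

```python
import random


def run_q_learning(p_left, n_trials=500, alpha=0.1, epsilon=0.1, seed=0):
    """Return the per-trial (QL, QR) estimates for one realization.

    Assumptions (not taken from the caption): reward is 1 or 0, action R is
    rewarded with probability 1 - p_left, and choices are epsilon-greedy.
    """
    rng = random.Random(seed)
    p_reward = {"L": p_left, "R": 1.0 - p_left}
    q = {"L": 0.0, "R": 0.0}
    history = []
    for _ in range(n_trials):
        # Explore with probability epsilon, otherwise pick the higher estimate.
        if rng.random() < epsilon:
            action = rng.choice(["L", "R"])
        else:
            action = "L" if q["L"] >= q["R"] else "R"
        # Probabilistic reward feedback for the chosen action.
        reward = 1.0 if rng.random() < p_reward[action] else 0.0
        # Standard incremental update: move the estimate toward the reward.
        q[action] += alpha * (reward - q[action])
        history.append((q["L"], q["R"]))
    return history


def average_over_realizations(p_left, n_realizations=8, **kwargs):
    """Average the (QL, QR) traces trial by trial over independent runs."""
    runs = [run_q_learning(p_left, seed=s, **kwargs) for s in range(n_realizations)]
    n_trials = len(runs[0])
    return [
        (
            sum(run[t][0] for run in runs) / n_realizations,
            sum(run[t][1] for run in runs) / n_realizations,
        )
        for t in range(n_trials)
    ]


if __name__ == "__main__":
    # One averaged trace per column of the figure.
    for p_left in (0.65, 0.75, 0.85):
        q_left, q_right = average_over_realizations(p_left)[-1]
        print(f"pL = {p_left}: final QL = {q_left:.3f}, QR = {q_right:.3f}")
```

Averaging a small number of discrete-time runs in this way can leave visible steps in the less frequently chosen action's trace, since its estimate changes only on the trials where that action is sampled.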