Predictive reward-prediction errors of climbing fiber inputs integrate modular reinforcement learning with supervised learning

doi:10.1371/journal.pcbi.1012899

Predictive reward-prediction errors of climbing fiber inputs integrate modular reinforcement learning with supervised learning

Fig 2

CS activities to cues and their correlations with reward and sensorimotor variables.

A: Panels showed PSTHs of CSs in 8 aldolase-C zones (7+ to 4b-, columns) in the four cue-response conditions (rows). Blue, green and red traces are for 1st, 2nd and 3rd learning stages, respectively. The horizontal lines indicate cue onset. Dashed horizontal lines represent the boundary between lateral vs. medial parts of the left Crus II. B: Bars showed the variable-importance-in-prediction (VIP) scores of 10 reinforcement-learning and sensorimotor-control variables (from left to right, R, Q, δQ, Go ✕ ELick, No-go ✕ ELick, Go ✕ RLick, No-go ✕ RLick, Go ✕ LLick, No-go ✕ LLick and latency fluctuation) for spiking activity of neurons in 8 aldolase-C zones. Dashed lines indicated VIP score = 1, which is considered a threshold of importance. See the inset for color codes of the 10 explanatory variables.

doi: https://doi.org/10.1371/journal.pcbi.1012899.g002