Predictive reward-prediction errors of climbing fiber inputs integrate modular reinforcement learning with supervised learning

doi:10.1371/journal.pcbi.1012899

Predictive reward-prediction errors of climbing fiber inputs integrate modular reinforcement learning with supervised learning

Fig 6

Spiking neural network model of the cerebellum with 5,000 neurons in Go/No-go tasks.

A: The model consists of two groups of neurons in the PC–CN–IO circuitry, each corresponding to TC1 & TC3 (TC_Go: PC_Go–CN_Go–IO_Go) and TC2 & TC4 (TC_Nogo: PC_Nogo–CN_Nogo–IO_Nogo). Sensory input to the PC_Go and PC_Nogo were transmitted via mossy fibers (MFs) to granule cells for Go (GrC_Go) and No-go (GrC_Nogo), respectively. Note that the two neuronal groups received shared mossy fiber input, which is represented by equal connection of GrC_Go and GrC_Nogo to both PC_Go and PC_Nogo. In this model, LTP and LTD are assumed to occur at PF-PC synapses of TC_Go and TC_Nogo, when IO firing is lower and higher than the threshold, respectively. For each group, PCs, CN, and IO designated by green, yellow and blue discs contained 100 simulated neurons each, and we prepared 2000 GrCs for both Go (GrC_Go) and No-go (GrC_Nogo) cues. B: The lick rate is modeled as a sigmoid function of the combined firing rates of CN_Go and CN_Nogo neurons, with the maximum lick rate (rate_max) set at 6 Hz. C: The error rates of Go and No-go trials, defined by the difference between the target lick rate (rate_max for Go and 0 for No-Go trials) and the actual lick rate, are transformed into the rate of Poisson spike generator inputs Err_Go and Err_Nogo to IO_Go and IO_Nogo neurons, respectively. This reproduces the established negative correlations between δQ and CSs in Go trials for TC_Go (blue region) and No-go trials for TC_Nogo (red region). D: A lattice structure with 10x10 IO neurons for each of TC_Go and TC_Nogo is modeled, where the effective coupling strength between neurons is proportional to their relative distance. In each trial, the effective coupling strength was determined by the firing rate of CN neurons (see Methods for details).

doi: https://doi.org/10.1371/journal.pcbi.1012899.g006