Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts

doi:10.1371/journal.pcbi.1011950

Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts

Fig 13

Hysteresis parameters with exponential or nonparametric models.

The fitted parameters of the GRL model with either exponential or 4-back hysteresis are plotted as repetition weights (or alternation if negative)—simply β_n for n-back models or the corresponding weights β₁λ_H^n-1 in the exponential function. Action-specific effects are better illuminated here by explicitly factoring out effects of RL and GRL within the comprehensive model. There is close correspondence between these parametric (2E1 and 2CE1) and nonparametric (2N4 and 2CN4) implementations of hysteresis for at least the first two trials back. The need for a scope extending beyond 1-back demands more than one free parameter, and a proper hysteresis trace with exponential decay yields an even better fit than a scope of 2-back due to subtle effects from 3-back and beyond. As further evidence of interactions among parameters, omission of constant bias (2E1 or 2N4) consistently inflated the modeled repetition weights as they were forced to attempt to mimic the necessary third parameter for constant bias. Altogether, the CE1 adjunct is essential. Error bars indicate standard errors of the means.

doi: https://doi.org/10.1371/journal.pcbi.1011950.g013