Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts

doi:10.1371/journal.pcbi.1011950

Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts

Table 3

Parameters of the 2CE1 model.

Fitted parameters for the preferred 2CE1 model are listed for each participant group based on learning performance. To characterize the dimensions of distinct behavioral profiles for each participant, the signs of individual fits are categorized as “discriminative” (-1 ≤ g_A < 0) or “none” (g_A = 0) for action generalization; “discriminative” (-1 ≤ g_S < 0), “none” (g_S = 0), or “associative” (0 < g_S ≤ 1) for state generalization; “leftward” or (β_R < 0) “rightward” (β_R > 0) for constant bias; and “alternation” (β₁ < 0) or “repetition” (β₁ > 0) for hysteretic bias. Also listed are metrics for absolute constant bias |β_R|, absolute hysteretic bias |β₁|, and overall bias |β_R|+|β₁|, which is inversely related to the probability of a correct response (p < 0.05). The residual deviance D_df (with degrees of freedom in the subscript) corresponds to the 2CE1 model’s improvement in fit relative to either the XC model with only constant bias or the complete nonlearning model XCE1 adding exponential hysteresis. Standard deviations are listed in parentheses below corresponding means.

doi: https://doi.org/10.1371/journal.pcbi.1011950.t003