Towards a more general understanding of the algorithmic utility of recurrent connections
Table 3
Hybrid models considered in Fig 4C. Models were included in the plot only if adding neurons decreased the loss. Each model was assumed to be given its allowed computation time at initialization, enabling it to switch optimally.