## Correction

15 Jul 2015: Tsetsos K, Pfeffer T, Jentgens P, Donner TH (2015) Correction: Action Planning and the Timescale of Evidence Accumulation. PLOS ONE 10(7): e0133332. https://doi.org/10.1371/journal.pone.0133332 View correction

## Figures

## Abstract

Perceptual decisions are based on the temporal integration of sensory evidence for different states of the outside world. The timescale of this integration process varies widely across behavioral contexts and individuals, and it is diagnostic for the underlying neural mechanisms. In many situations, the decision-maker knows the required mapping between perceptual evidence and motor response (henceforth termed “sensory-motor contingency”) before decision formation. Here, the integrated evidence can be directly translated into a motor plan and, indeed, neural signatures of the integration process are evident as build-up activity in premotor brain regions. In other situations, however, the sensory-motor contingencies are unknown at the time of decision formation. We used behavioral psychophysics and computational modeling to test if knowledge about sensory-motor contingencies affects the timescale of perceptual evidence integration. We asked human observers to perform the same motion discrimination task, with or without trial-to-trial variations of the mapping between perceptual choice and motor response. When the mapping varied, it was either instructed before or after the stimulus presentation. We quantified the timescale of evidence integration under these different sensory-motor mapping conditions by means of two approaches. First, we analyzed subjects’ discrimination threshold as a function of stimulus duration. Second, we fitted a dynamical decision-making model to subjects’ choice behavior. The results from both approaches indicated that observers (i) integrated motion information for several hundred ms, (ii) used a shorter than optimal integration timescale, and (iii) used the same integration timescale under all sensory-motor mappings. We conclude that the mechanisms limiting the timescale of perceptual decisions are largely independent from long-term learning (under fixed mapping) or rapid acquisition (under variable mapping) of sensory-motor contingencies. This conclusion has implications for neurophysiological and neuroimaging studies of perceptual decision-making.

**Citation: **Tsetsos K, Pfeffer T, Jentgens P, Donner TH (2015) Action Planning and the Timescale of Evidence Accumulation. PLoS ONE 10(6):
e0129473.
https://doi.org/10.1371/journal.pone.0129473

**Academic Editor: **James A.R. Marshall, University of Sheffield, UNITED KINGDOM

**Received: **May 10, 2014; **Accepted: **May 12, 2015; **Published: ** June 12, 2015

**Copyright: ** © 2015 Tsetsos et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited

**Data Availability: **All raw data files in Ascii format, and a brief description of how the data is structured, are available from the Open Science Framework database: (https://osf.io/ju95a/).

**Funding: **This work was funded by the Amsterdam Brain and Cognition Priority Program (Grant number: ABC2014-01, http://abc.uva.nl/research/nav) and the European Union FP7, Human Brain Project (Grant number: Call-ID FP7-ICT-2013-FET-F, Project ID: 604102(v1), http://www.humanbrainproject.eu). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

**Competing interests: ** The authors have declared that no competing interests exist.

## Introduction

A hallmark of perceptual decision-making is the integration of evidence for different states of the world [1]. Imagine driving your car on a rainy day and reading a street sign to decide whether to turn left or right. Since the “sensory evidence” you are trying to interpret is noisy (i.e., it fluctuates randomly), you can improve your judgment by integrating evidence over time [1,2,3].

The timescale of this integration process is a key psychophysical parameter quantifying perceptual decision-making as it reflects the network mechanisms underlying the accumulation of sensory information in the brain [4–7]. While many studies of non-sensory (top-down) effects in perceptual decision-making have focused on strategic adjustments of the decision threshold that terminates the decision process [3,6,8,9,10], only few previous studies have investigated direct top-down effects on the evidence integration process *per se*, as indicated by the integration timescale [11–14]. Only two of these studies were conducted in human observers [11,13] whose integration mechanisms may differ from those of other species [15]. Here, we examined the effect of one important top-down factor that has not been previously examined: knowledge of sensory-motor contingencies.

In the above example, the integrated evidence is continuously mapped onto a plan to select and execute a motor movement. The same holds for most previous neurophysiological laboratory studies of perceptual decision-making [1,16,17,18]. Under such conditions, build-up signatures of evidence integration are found in brain regions involved in action planning. In particular, when perceptual choices are reported as saccades [12,19,20] or hand movements [21,22] choice-specific activity ramps up in the corresponding (pre-)motor brain regions. These premotor build-up signatures are not evident if the sensory-motor contingencies are broken up by instructing the mapping between perceptual choice and motor response only after stimulus presentation [12], or by eliminating the motor response altogether (in a covert counting task) [23]. This raises the question whether knowledge about sensory-motor contingencies might also improve (i.e., prolong) integration timescales observed behaviorally. Further, it has been shown that learning of fixed sensory-motor contingencies (over hundreds of trials or more) improves the selectivity of the read-out of sensory information by the association cortex [24,25]. But it remains unknown whether such learning also improves the integration timescale.

We addressed these questions in six human observers performing the same motion discrimination task under three different sensory-motor mapping conditions. In one experiment, the mapping between decision outcome and motor response varied on a trial-by-trial basis. In different conditions, this mapping was instructed before or after stimulus presentation. While the first condition allowed the integration of evidence directly towards action plans, the second did not. We found that integration timescales were generally shorter than the maximum stimulus duration, and thus shorter than the timescale required to maximize the fraction of correct choices in the task. But integration timescales were indistinguishable between conditions. We then reasoned that sensory-motor mapping might only improve integration timescales if fixed over many trials, due to a slow learning process. Thus, we asked the same observers to perform the task under fixed mapping in another experiment. Again, despite extensive practice, integration timescales were indistinguishable from the other two conditions. We conclude that the integration of perceptual evidence does not depend on sensory-motor contingencies.

## Materials and Methods

### Ethics Statement

The ethical committee of the University of Amsterdam approved the study (reference number 2011-OP-1588). Written informed consent was obtained from all participants.

### Observers

Six healthy human observers were recruited for this study (2 males, mean age: 25; range: 22–29 years). All observers had normal or corrected-to-normal vision. The pool included four observers who were naïve with respect to the purpose of the experiment, and two authors (P.J. and T.P). Observers received either course credits or were paid a small amount of money (€10/hour) for their participation.

### Stimuli

We used an established psychophysical approach for quantifying the perceptual evidence integration timescale [12,13,26–29], which entailed the following two aspects of the sensory input: First, we systematically manipulated the duration of the stimulus (i.e., the maximal evidence integration time available to the observer) and prompted the response after that time (“interrogation protocol”). Second, we systematically manipulated the strength of the perceptual evidence to estimate the observer’s perceptual discrimination threshold for each stimulus duration. This enabled us to quantify discrimination thresholds as function of stimulus duration (see below).

Using this general approach, we performed two experiments. Below, we first describe all general aspects, followed by the specifics of each experiment. Motion stimuli consisted of “random dot kinematograms” (RDK), consisting of 785 white dots (on average) within a circular aperture 9.1° in diameter (dot density: 12.07 dots per deg^{2}), centered on a red fixation cross (0.4° x 0.4°), and displayed against a black background. Individual dots subtended 0.04° x 0.04°. On each frame, the dots were randomly assigned to either a population of “signal dots” or of “noise dots”. The signal dots were randomly selected on each frame and were displaced from frame to frame with a fixed spatiotemporal offset, creating a coherent motion signal with upward or downward direction (separated by 180°) and a speed of 2.6°/s. We used “random position” noise. That is, the noise dots were re-drawn on a randomly selected position, creating spatiotemporal white noise, which comprised a mixture of directions and speeds [30]. On each trial, three different “sets” of RDKs of the selected direction and coherence were plotted in an interleaved fashion, where the dot pattern from each set was shown for one frame and followed by the next pattern from the same set only after three successive video frames, and so forth. This version of the RDK stimulus corresponds to the one used in many of the seminal monkey physiology studies on temporal integration of visual motion information (e.g. [31]). This was to encourage integration of motion information across space and time. The percentage of coherently moving dots (“motion coherence”; 0.05, 1.26, 3.15, 7.92, 19.91, and 50%), viewing duration (150, 300, 600, 1200, 2400, and 4800 ms), direction (“up/ down”), and “decision-to-response-mapping” (“DR-mapping”; e.g. upward motion left button and downward motion right button) were randomly chosen on each trial, under the constraint that each combination of these parameters occurred equally often within a block of 144 trials. The six coherence levels listed above were determined in extensive pilot sessions, tailored to sample the full psychometric function for all stimulus durations. New stimuli were generated for each experimental block, including six different variations of the interleaved RDK sequences.

Stimuli were displayed on a 22-inch CRT monitor (resolution: 800 x 600 pixels) at a rate of 100 Hz. The viewing distance was 68 cm. Experiments were conducted in a dimly illuminated room. Subjects were seated in an adjustable chair with their chin resting comfortably in a chin cup and additional support was provided by a head restraint mounted on the table. The height of the monitor placed the center of the display at approximately eye level.

### Task and procedure

Throughout all experimental conditions, subjects were required to fixate the red cross in the center of the screen and classify the net motion in a stimulus as upward (50% of trials) or downward motion by pressing one of two buttons (left or the right index finger) when a response prompt was provided. The RDK presentation was followed by a variable delay interval, after which the response was prompted. Observers were under no time pressure to respond. Auditory feedback of 100 ms duration (a 1000 Hz tone) was provided for incorrect responses.

#### Variable DR-mapping experiment.

The decision-response mapping (e.g., left-hand button press for indicating “upward” choice) varied randomly from trial to trial. We will henceforth abbreviate this as “DR-mapping”. The experiment consisted of two conditions, which differed only in the timing of the cue instructing observers about the DR-mapping, relative to presentation of the RDK stimuli (Fig 1A and 1B). The cue indicated the motion direction corresponding to each response button in terms of two white arrows (one upward, one downward pointing) presented on the left and right half of the screen (11° from fixation).

On each trial, the observer was required to discriminate the direction (upward or downward) of the random dot kinematogram (RDK), while fixating the central crosshair. The RDK was presented for one of a number of different durations and levels of motion strength **(A)** Variable “Pre” DR-mapping condition. The DR-mapping cue (two arrows mapping up/down motion directions onto left/right hand button presses) was presented before the RDK (separated by a variable delay), and it varied randomly from trial to trial. After another variable delay, a color switch of the fixation cross prompted the observer to indicate the choice with a button press. **(B)** Variable”Post” DR-mapping condition. Identical to “Pre”, except that the DR-mapping cue was presented after the RDK (“Post”). **(C)** “Fixed” DR-mapping condition. Identical to “Pre”, except that the DR-mapping cue was kept constant throughout the experiment, enabling long-term learning of sensory-motor associations.

In the pre-cueing (“Pre”) condition (Fig 1A), the arrows were presented after a variable period of fixation for 1000 ms, and were followed by a second random delay (200–400 ms) and the onset of the motion stimulus (150–4800 ms). After stimulus offset and another random delay (200–400 ms) the fixation cross turned green, which prompted the subject to report their choice.

The post-cueing (“Post”) condition was identical, except that DR-mapping cue was presented after the motion stimulus and second delay (Fig 1B). In both conditions, the inter-trial intervals were 900 ms. “Pre”- and “Post”-conditions were conducted in alternating blocks of 144 trials (see *General design* below).

#### Fixed DR-mapping experiment.

This experiment consisted of a single condition, which was identical to the “Pre”-condition, with the exception that the DR-mapping was kept constant across all trials (Fig 1C). Although the DR-mapping was instructed at the start of the experiment and remained constant thereafter, the DR-cue was shown at the beginning of each trial to keep the visual input and trial duration identical to the “Pre”- from the variable DR-mapping experiment. Each subject was first trained on the task for a minimum of 432 trials and then completed between 2448 and 7632 trials (distributed over 3–9 experimental sessions), which were used for the analyses reported in this paper.

#### General design.

All statistical analyses reported in this paper were performed within individual observers. Given the large number of trials required from each observer per condition (minimum: 2016) and the clear effect evident in Subjects 1–3 who participated in all conditions (see Results), the remaining three observers were asked to only participate in a subset of conditions. Subjects 4 and 5 were used to replicate the comparison between “Pre” and “Post”. Subject 6 was used to replicate the comparison between the variable and “Fixed” mappings.

The experimental conditions were arranged as follows. Subjects 1–5 first performed the Variable DR-mapping experiment, in which they alternated between “Pre” and “Post” conditions in a pseudorandom order. Each experimental session consisted of 6 blocks: 3 blocks of each condition (“Pre” and “Post”). Subjects 1–3 then performed the Fixed DR-mapping experiment. General order effects would have (if anything) predicted an improvement (prolongation) of integration timescales in the “Fixed” condition, due to the extensive practice of the task (under variable and fixed mapping). By contrast, we found no change of integration timescale in these subjects. Nonetheless, to rule out any order effects, we flipped the order in one additional observer (subject 6), who started with “Fixed”, followed by “Post”. Each subject was first trained on the task for a minimum of 432 trials per condition. Subjects then completed between 2110 and 5760 trials of each condition (distributed over 3–7 experimental sessions in total).

### Model-independent analysis of integration timescale

#### Quantifying psychophysical thresholds.

Given the task design (interrogation protocol with different levels of stimulus strength), our analyses focused on proportion correct data, rather than response times [3]. We fitted the observers’ proportion of correct choices as a function of motion coherence, denoted P(C) below, by means of a cumulative Weibull function [32], separately for each experimental condition and stimulus duration. See Fig 2 for example fits from one subject in both conditions of Experiment 2. The cumulative Weibull distribution function was defined as
(1)
where *λ*, *α* and *β* are free parameters and the value 0.5 represents chance performance. The lapse rate (λ) represents stimulus-independent errors, corresponding to the fraction of incorrect choices at the highest motion strength and longest viewing duration. *λ* was determined by fitting Eq (1) to data from the longest viewing duration in each experiment, and then inserted in Eq 2 to find best fits of *α*, and *β* to the data from all conditions. The value of *α* is the psychophysical threshold corresponding to the coherence level that elicits 82% correct responses when asymptotic performance is perfect (i.e., *λ = 0*). Parameter *β* determines the steepness of the psychometric function for a particular threshold. Best-fitting values for the free parameters were obtained by means of a maximum likelihood procedure [33]. To obtain the best-fitting values for the free parameters, we minimized the negative log-likelihood, which yields the exact same parameters as maximizing the likelihood.

Solid curves are maximum likelihood fits of cumulative Weibull functions to the proportion correct data. Vertical dashed lines represent estimated threshold parameters (solid horizontal lines at bottom, 95% confidence intervals). The horizontal dashed line represents the lapse rate. **(A)** “Pre”- condition. The performance data and psychometric functions are shown separately for all stimulus durations. **(B)** As in A, but for “Post”. **(C)** As in A, but for “Fixed”.

Confidence intervals for the proportion of correct choices were obtained by employing a non-parametric bootstrap [34], in which the original set of trials was resampled with replacement a large number of times (N = 10,000) and the proportion of correct responses was computed for each iteration. The confidence intervals of the parameters of the cumulative Weibull functions and of the regression-based threshold vs. duration functions (see below, eqs 2–5) were obtained by means of a parametric bootstrap procedure [35]. We used a binary process to generate a new set of data based on the binomially distributed noise and estimated Weibull parameters from the observed data set. We repeated the maximum likelihood procedure for each bootstrap iteration to find the best parameter fits for the “mock” data set and calculated the corresponding parameters anew. The resulting distributions indicated the likely spread of all parameters for the original data set.

#### Fitting threshold versus duration functions.

Perfect integration predicts a linear decrease in threshold with duration with a slope of -0.5 in log-log coordinates. A lack of integration predicts a flat line (slope of 0). Hence, to analyze the dependence of thresholds on duration, we fitted a bilinear function to the log of the best fits of *α* and viewing duration [26–28]. The slope of the first line was constrained to -0.5 and the slope of the second line was constrained to 0. We determined the best fitting value for the intercept *β*_{0} of the linear function. The general fit was evaluated by calculating the sum of squared errors (SSE) and the best fit of the bilinear function was determined by means of an iterative least squares method [36].

With a number of durations *n*, the relationship between thresholds and viewing duration for each *i* = 1,…*n* was expressed as
and
(2)
where *β*_{0} correspond to the *y*-intercept of the first line, corresponds to the *y*-intercept of the second line, *A* represents the abscissa of the joint point. In order to ensure that the two lines join at the value *x* = *A*, we applied the following restriction:
(3)

The error terms (i.e. *e*_{i}’s for *i* = 1,…*n*) were independent and identically distributed normal random variables with mean equal to zero and constant variance. Due to the restriction from Eq (3), the abscissa of the joint point could be estimated as
(4)

Eq (4) held only for cases in which *A* lied between two consecutive values of log(*t*), log(*t*_{j}) and log(*t*_{j+1}). In this case, the first line included *n*_{j} number of log(*t*_{i}) and log(*a*_{i}) pairs up to and including log(*t*_{j}) and log(*a*_{j}), whereas the second line included all values from log(*t*_{j+1}) and log(*a*_{j+1}) on to . If the estimated joint point was indeed log(*t*_{j}), and log(*t*_{j+1}), we computed the ordinary least squares solutions and determined the joint point by substituting the observed *β*’s with the estimated *β*’s. The SSE was computed as the sum of the individual SSEs from the two lines
(5)

If the estimate of the joint point was not between log(*t*_{j}), and log(*t*_{j+1}), we modified the computation such that *A* occurred exactly at log(*t*_{j}). Since the minimization of SSE in this case is constrained to only one possible joint point log(*t*_{j}), this constrained least squares solution was computed as a modification of the two separate ordinary least squares solutions [36].

In a control analysis, we relaxed the constraints on the slopes of the bi-linear fits, by allowing the first slope to be in the range from -0.5 to 0, and the second slope was constrained to any value larger than the first slope. These fits required two additional constraints. 1. If the estimated joint point exceeded the longest signal duration in the experiment, it was set to the longest signal duration (4.8 s). 2. If the first slope was smaller than 0.001, we assumed that participants did not integrate and set the joint point to the shortest signal duration of 0.15 s.

Given the six stimulus durations, the joint point could only be estimated for the four intermediate durations, which corresponded to the limited interval 0.3–2.4 s. We verified that in all cases, the bi-linear fits provided a significantly better match to the data than a single linear fit, with or without slopes constrained to -0.5 (data not shown). Further, in all cases were the joint point estimates significantly shorter than the longest possible timescale estimates of 2.4 s (see Results, Fig 3). Thus, it is unlikely that the short timescale estimates obtained are due to the limitations of the procedure.

Threshold vs. duration functions from all conditions. Circles represent psychophysical thresholds for each stimulus duration. Solid lines: best fitting bilinear function, with the slopes constrained to -0.5 (first branch) and 0 (second branch; see text for details). Error bars, 60% confidence intervals (bootstrap). Inset: bar graphs of the joint point estimates of the best fitting bilinear function. Error bars, 95% confidence intervals (bootstrap).

#### Statistical comparisons of joint points.

To compare integration time scales and lapse rates between conditions within individual observers (i.e., “Pre”- versus “Post”-conditions; “Post” vs. “Fixed”; “Post” vs. “Fixed”), we compared the joint point estimates *A* by means of non-parametric permutation tests [34]. For each permutation, all trials were combined into one set of N_{pre}/N_{fixed} + N_{post} trials and shuffled 10,000 times. Then the shuffled set was split into 2 sets of N_{pre}/N_{fixed} and N_{post} trials, the proportion correct was recalculated and the Weibull functions were fitted to determine the new thresholds and lapse rates. Based on these, *A* was iteratively computed to obtain the permutated difference between the two sets. Finally, we compared the observed difference lapse rates and in *A* with the permutated differences. P-values were obtained by calculating the fractions of repetitions (10,000), in which the absolute value of the permutated difference was larger than the absolute value of the measured difference.

### Computational modeling of psychophysical performance

#### General model description.

We fitted the leaky-competing accumulator model (LCA) [37,38] to the behavioral data of each individual observer. The LCA is a neurophysiologically inspired sequential sampling model of decision-making and has successfully accounted for behavioral data from a wide variety of perceptual tasks [2]. Similar to other models, LCA assumes that noisy evidence for different hypotheses is accumulated towards a decision criterion. A separate accumulator, corresponding to the pooled neuronal activity of a dedicated population, represents each competing hypothesis. Thus, in choices between two options, two pools of neurons integrate the momentary evidence for each alternative and compete with each other via lateral inhibition, while their activity is subject to slow decay (“leak”). The activation states of the accumulators are described by the following finite difference equations: (6)

In the above equation *x*_{i} corresponds to the activation states of the accumulator associated to alternative *i*. The *max* function prevents activation states (which correspond to population firing rates) from going below a predefined value, implementing in this case a lower reflecting boundary at 0. Constant input to both accumulators (controlling for the degree of non-linearity in the activation states) is denoted by *I*_{0} while Δ is the momentary change of each accumulator on each time step. This change or increment is driven by three factors: i) the external input in favor of the corresponding accumulator, ii) the activation state of the accumulator on the preceding moment and iii) the activation state of the competing accumulator. The external momentary input, denoted by *I*_{i}, is subject to Gaussian noise fluctuations with zero mean and standard deviation *σ*. The accumulators compete with each other via lateral inhibition of strength *β*, and their activation is subject to leak of κ. For the interrogation protocol used in this study, the activation states of the two accumulators are read out at the end of the trial (corresponding to the response cue presentation), and the alternative with the highest activation state, up to that point, is chosen.

In the simulations presented here, we fixed the standard deviation of noise to σ = 1. We further assumed that *I*_{1} = *C* × *s*, with *C* corresponding to the motion coherence for the corresponding alternative and *s* being the sensitivity parameter that modulates the signal to noise ratio. We let *I*_{0} be a free parameter, and we assumed that motion coherence is subject to power-law saturation with exponent *m*: *I*_{1} = *C*^{m} × *s*. The experimental time units were converted into simulation time steps with 1 second corresponding to 250 time-steps. The coherence level (*C*) was determined by the experimental condition and ranged from 0 (no coherent motion) to 1 (all dots move coherently towards a given direction). The input to the first unit (*I*_{1}) was always proportional to the coherence level *c*, while the input to the second unit (*I*_{2}) was always set to 0. In sum, the decision-making model had five free parameters: inhibition (*β*), leak (κ), sensitivity (*s*), constant activity (*I*_{0}) and coherence saturation (*m*).

#### Model fits to behavioral data.

We fitted the model described above to the data of each individual observer. Specifically, the model was fitted to the individual proportion correct data for *N* coherencies x *M* duration levels (*N* = 5 and *M* = 6). We excluded the lowest coherence level (0.05) because performance indistinguishable from chance for all durations at that coherence and we found that including this coherence yielded worse fits. Assuming that the correct responses follow a binomial distribution, we computed the likelihood for a given parameterization of the model, for the *K* = *N x M* data points as:
(7)
where *n*_{i} was the number of trials for the *i-th* data point, *y*_{i} was the corresponding number of correct responses and *p*_{i} the probability of correct response predicted by the model (obtained by running 5000 iterations of the model for the given condition/ parameterization). The cost function was the negative logarithm of L:
(8)
and was minimized using SUBPLEX minimization routine [39]. For each subject and each model we ran the optimization 400 times with starting points randomly sampled from uniform distributions within a parameter-specific range.

In order to assess the goodness of fit for the best parameters of a given model, we calculated the chi-square statistic as follows:
(9)
where *K* was the number of bins corresponding to the experimental conditions, *O*_{i} was the observed frequency of correct responses at condition *i*, and *E*_{i} the corresponding frequency predicted by the model. Because the number of experimental and simulated trials was different, *E*_{i} was calculated by multiplying *p*_{i} (the probability of correct response predicted by the model) by the number of experimental trials. The chi-square statistic had *K*-1 degrees of freedom. P-values indicated the probability that the chi-square statistic is at least as extreme as the obtained one, under the null hypothesis that the data and the predictions of the model follow the same distribution. We rejected the null hypothesis at a significance level of *α* = 0.05.

#### Model-based estimation of integration time constants.

In order to obtain a model-based estimate of the time constant of evidence integration, we fitted shifted exponential functions [40] to the d’ transformed psychometric functions of the LCA model fits of each observer. The shifted exponential function has been shown to accurately track the stimulus sensitivity increase as a function of time of human observers [40]. Fitting shifted exponential functions to the simulated (rather than the measured) d’ vs. duration functions provided more robust time constant estimates (Fig 4) by discounting noise in the behavioral data. We applied this procedure after ensuring that the LCA provided a reasonable fit to the behavioral (proportion correct) data of each subject (MLE, χ^{2} and *p* values for assessment of goodness of fit). We fitted the following shifted exponential function to the predicted average sensitivity across all coherences d' (*d*′ = Φ^{-1}(0.99·*p*_{i})) vs. duration (Fig 4):
(10)
with *D′* denoting the average asymptotic sensitivity level for all coherences, *t*_{0} the period during which sensitivity was zero and *τ* the time constant (Fig 5).

Simulated d’ vs. duration functions, and exponential fits. Filled dots are the best-fitting LCA model estimates for each subject; solid curves: shifted exponential functions on the predicted average sensitivity across all coherences; blue stars: measured average sensitivity across all coherences. **(A)** “Pre”. **(B)** “Post”. **(C)** “Fixed”.

Summary of integration times under the three DR-mappings tested. The gray horizontal line marks the median, the upper and lower edges of the box mark the 25^{th} and 75^{th} percentiles, and the whiskers extend to the most extreme data points, excluding outliers. **(A)** Joint points of the bilinear fit to threshold vs. duration functions. **(B)** Time constants derived from LCA model fits.

## Results

We examined the impact of sensory-motor contingencies [41,42] on the timescale of perceptual evidence integration in a total of six human observers. To this end, we used a standard psychophysical task, coarse discrimination of visual motion direction, and quantified the integration timescale under variable (Fig 1A and 1B) or fixed (Fig 1C) DR-mapping. For the variable mapping, the mapping was instructed either before (“Pre”; Fig 1A) or after decision formation (“Post”; Fig 1B). Subjects 1–3 were asked to perform all three mapping conditions, to establish the main result in terms of three independent within-subject comparisons. Subjects 4 and 5 were measured to replicate the results of the “Pre” vs. “Post” comparison. Subject 6 was measured to replicate the result of the variable (only “Post”) vs. fixed comparison.

### Overall performance

In all observers and experimental conditions, performance depended lawfully on both, the strength and the duration of the visual motion signal (see Fig 2 for an example observer). The proportion of correct choices was generally about chance-level (~0.5) for the lowest level of coherently moving dots (0.05) and about perfect (~1) for the highest coherence level (0.5). This was true for all stimulus durations, except for the shortest (150 ms), at which even the highest coherence did not yield perfect performance in some subjects. We quantified the dependency on motion coherence by fitting cumulative Weibull functions to the proportion correct data. The psychometric functions generally shifted leftwards with increasing stimulus duration, as reflected by the decrease of the threshold parameter (inverse of sensitivity; Fig 2), meaning that observers could integrate the motion information over time and even discriminate patterns with a low percentage of coherently moving dots. The thresholds reached less then 0.1 motion coherence in all observers for the longer durations (Fig 3). This decrease of psychophysical thresholds with stimulus duration is an index of temporal integration of stimulus information (see below).

To assess whether observers made more “non-perceptual” errors (i.e., choosing the incorrect buttons irrespective of signal strength) in any of the three experimental conditions, we compared the “lapse rate” estimates (i.e., the upper asymptote of best-fitting cumulative Weibull functions) between conditions. Lapse rates quantify processes independent of the perceptual decision per se, such as lapses of attention or motor errors. Lapse rates were generally negligible in all three conditions (Table 1), and there were no systematic differences between the three conditions in most observers. One observer (subject 5) exhibited a significant difference in lapse rates between “Pre” and “Post” (p = 0.02), but also evident in the joint point (see below). There were also no differences in the overall threshold levels between the conditions when tested separately for all but the first (0.15 s) stimulus durations (Table 2).

In sum, all six observers exhibited a high sensitivity and were able to adjust the sensory-motor associations and even rapidly (on a trial-by-trial basis) under the variable DR-mappings. There were no systematic differences in the occurrence of motor errors, as well as in overall performance, between the three conditions. Finally, the lawful shift in psychometric functions with stimulus duration indicates that observers indeed integrated stimulus information over time and thus became better in discriminating motion directions with low coherences the longer the observation period [12,13,26–28]. We next quantified the timescale of this integration process, and its dependence on DR-mapping, based on two complementary approaches: a model-independent and a model-based approach.

### Model-independent characterization of integration timescales

The model-independent approach was based on the following rationale. The integral of the “signal” embedded in the noisy stimulus increases linearly with the stimulus duration, while the “noise” (i.e., standard deviation) increases with the square root of the stimulus duration. Consequently, the signal-to-noise ratio increases with the square root of duration and perfect integration of the sensory evidence implies that the observers’ psychophysical threshold (the inverse of their signal-to-noise ratio) decreases as a function of the square root of stimulus duration, yielding a straight line with a slope of -0.5 on log-log axes. Conversely, no integration of sensory evidence implies that the observers’ threshold does not change as a function of stimulus duration (i.e., a straight line with a slope of 0). Consequently, integration of evidence across a limited time window predicts an initial linear decrease of the threshold vs. duration function with a slope of -0.5, followed by gradual deceleration towards a second (asymptotic) linear portion with a slope of 0. Note that such a timescale limitation may be due to leaky integration [37,38], or perfect (i.e., without leak) integration towards absorbing bounds [28] (see Discussion). Based on this rationale, the duration situated in between the -0.5- and 0-slope portions of the threshold vs. duration function can be used as an estimate of the integration timescale. We fitted bilinear functions to the individual threshold vs. duration functions, whereby the slope of the first line was constrained to -0.5 and the slope of the second to 0 and used the joint point between both lines as time scale estimate.

In all three conditions, the threshold vs. duration functions decreased with stimulus duration, but only for a limited range (Fig 3). The individual joint points ranged from 300 to 870 ms for “Pre” (presumably involving the highest short-term memory demands, see Discussion), from 300 to 610 ms for “Post”, and from 310 to 580 ms for “Fixed”. Any two frames of one of the three interleaved sequences of coherent motion were separated by 30 ms (at the monitor refresh rate of 100 Hz; see Materials and Methods), across which the observers’ visual motion system could pair dots to extract motion. Thus, the physical evidence fluctuated over 30 ms. Under the assumption that visual cortical regions like MT that encode visual motion track this stimulus information with high temporal precision [43], the shortest integration timescale observed here (300 ms) implies integration of ten samples of sensory evidence provided by visual cortex into the decision. The longest timescale of 870 ms corresponds to integration of close to 30 samples of sensory evidence.

Given the experimental design and fitting procedure, possible estimates of the joint point were confined to the interval 0.3–2.4 s (see Materials and Methods). All joint point estimates in the insets Fig 3 were shorter than the upper bound of 2.4 s, and the 95% confidence intervals of these joint point also estimates excluded 2.4 s. Thus, in all cases were the timescale significantly shorter than the one that would have maximized performance in this task entailing the maximum stimulus duration of 4.8 s.

In sum, all subjects showed temporal integration of perceptual evidence, but their integration timescale were consistently smaller than the optimal timescale for this task (defined as the timescale that would have maximized the overall fraction of correct choices). Possible mechanistic accounts of the timescale limitation are described in Discussion. We next explored if and how the timescale was affected by our manipulations of the sensory-motor contingencies.

#### Integration timescale for “Pre” vs. “Post” under variable DR-mapping.

If the integration timescales depended on subjects’ ability to directly translate the integrated evidence into an action plan, then precluding the sensory-motor contingency in the “Post”-condition might be expected to shorten the integration timescale, relative to the “Pre”-condition. We found no consistent evidence for this scenario. Only one observer (subject 5 who also had a difference in lapse rates; Table 1) exhibited a significant difference in joint points between “Pre”- and “Post”-conditions (p < 0.01; two-tailed permutation test). In the remaining four observers (subjects 1–4) the joint points did not differ significantly between “Pre”- and “Post”-(range of p-values: 0.07–0.37; two-tailed permutation test).

Separate bi-linear fits with relaxed constraints on the two slopes (see Materials and Methods) yielded qualitatively identical results. The joint point estimates obtained from this procedure were generally less precise (larger confidence intervals, data not shown). Importantly, however, there was again no robust difference between joint points from the different conditions in all of the five observers (range of p-values across subjects 1–5: 0.09–0.77; two-tailed permutation test). Taken together, the results suggest that the evidence integration timescale is largely independent of the “Pre” vs. “Post” condition.

#### Integration timescale for fixed vs. variable DR-mapping.

In previous studies reporting neural signatures of evidence integration in brain regions involved in motor planning [12,19–22,44,45], subjects were typically practiced with one specific sensory-motor contingency for at least hundreds of trials. It is possible that sensory-motor contingencies only improve the integration timescale after extensive practice, due to slow learning mechanisms. The trial-to-trial variation of DR-mapping in the “Pre-”condition of the previous experiment may have not have enabled such learning and, therefore, no improvement in integration timescale. To test this idea, we next explored the effect of long-term practice of one specific fixed DR-mapping on the integration timescale (Fig 1C), and compared this with the timescale under the variable DR-mapping.

The joint points obtained from the constrained fits (first slope: -0.5, second slope: 0) were statistically indistinguishable between the “Fixed” and “Pre” (range of p-values across three observers: 0.07–0.19; two-tailed permutation test) and the “Fixed” and “Post” conditions (range of p-values across four observers: 0.31–0.70; two-tailed permutation test) and. The bi-linear fits with relaxed constraints on the slopes yielded similar results: There was no significant difference in the joint points for two out of three observers in the “Fixed” vs. “Pre” comparison (subject 1: p = 0.81; subject 2: p = 0.02; subject 3: p = 0.21) and three out of four observers in the “Fixed” vs. “Post” comparison (subject 1: p = 0.41; subject 2: p = 0.68, subject 3: p = 0.66; subject 6 p < 0.01. Thus, even extensive practice of a specific sensory-motor association across several thousands of trials did not seem to robustly improve the temporal integration process.

#### Model-based characterization of integration timescales.

One concern may be that our model-independent assessment of integration timescale may not have been sufficiently sensitive to reveal subtle effects of the sensory-motor contingencies. To address this, and to obtain a more theoretically motivated estimate of the integration timescale, we used a computational model of the decision process to fit the behavioral performance data (see Materials and Methods). The LCA is a neurophysiologically inspired model of the collective dynamics of two populations of “decision neurons”, which has been successfully applied to behavioral data from a wide variety of decision tasks [2,37,38,40]. For our interrogation protocol, the model assumed that the observers kept integrating for the whole stimulus interval and made a decision in favor of the alternative with the largest integrated evidence after that interval. Limitations of integration time scale resulted from the balance between “leaking away” of past evidence (biasing choices towards the most recent evidence) and mutual inhibition (biasing choices towards the early evidence).

The LCA model provided a reasonable fit to the behavioral performance of all observers in all three experimental conditions (Table 3). The goodness of fit was assessed by means of comparing the empirical and model-predicted psychometric functions using the chi-square statistic (see Materials and Methods). For an alpha value of 0.05, the null hypothesis that the two compared distributions (e.g. psychometric functions) are the same was not rejected for any of the participants (*p* >0.95 for all participants, see Table 3).

We used the model with the best-fitting parameters for each observer to generate accuracy (d') vs. duration functions and fitted the average (across coherence levels) sensitivity with a shifted exponential function (Fig 4). The LCA fit of subject 4 in the “Pre”-condition was invariant to increases of duration and thus meaningful exponential fits could not be obtained. For the remaining subjects and conditions the obtained time constants were typically around 200 ms (Table 4, Fig 5B), consistently shorter than the around 500 ms timescale estimates obtained from the bi-linear fits (compare Fig 5A and 5B). This difference is expected since the time constant of the exponential fit represents the time it takes for the sensitivity to reach ~62% of its asymptotic value while the joint point indicates the exact moment of asymptotic saturation; by simulating a leaky integrator model we confirmed that the bilinear joint point is expected to be 2–2.5 times higher than the exponential time constant. The exact, analytical correspondence between the two measures of integration timescale should be addressed in future theoretical work. Despite the quantitative difference between the two timescale measures, there was a consistent qualitative correspondence across subjects (compare insets in Fig 3 with Table 4).

Most importantly, in line with the analysis of joint points, comparison of the model-based integration time constants between conditions did not yield any significant difference between DR-mapping contexts, as assessed by comparing the 95% confidence intervals between conditions (Table 4).

To assess the sensitivity of the model-based time constant estimation, we simulated a leaky integration model to generate many psychometric functions and repeated the same time constant estimation procedure (i.e., based on exponential fits as described in Materials and Methods) on these psychometric functions. In a simple leaky integration model, the integration time constant is known precisely (inverse of the leak parameter), whereas the time constant is a combination of leak and mutual inhibition in the more complex non-linear LCA described above (see Materials and Methods and [37]). We used three different levels of leak corresponding to time constants of 200, 400, and 600 ms, whereby 200 ms corresponds roughly to the median time constant estimate obtained from our behavioral data in all conditions (Fig 5B). Using three motion coherence levels (7%, 11%, 15%) and six stimulus durations (as in our behavioral experiments) we generated 300 simulated model responses per condition and repeated the time constant estimation process 10,000 times, for each of the three different time constants. The 95% confidence intervals of the resulting distributions of estimated time constants did not overlap (Fig 6). We conclude that our estimation procedure was sufficiently sensitive for distinguishing differences in time constants of 200 ms.

Sensitivity of the model-based timescale estimation. Three distributions of estimated time constants obtained by simulating leaky integrator models with three different time constants (0.2, 0.4, 0.6 s). Vertical lines indicate the 2.5% and 97.5% quantiles. The overlap between the distributions is small (<5%).

## Discussion

Here, we examined how knowledge of sensory-motor contingencies [41,42] affects the timescale of perceptual evidence integration into visual motion discrimination decisions, on a short-term (trial-by-trial) or long-term (across thousands of trials) basis. We found that all subjects integrated visual motion signals across time, and that their behavioral performance was well accounted by a leaky competing accumulator [37] model. However, integration timescales were generally shorter than optimal for the task (median joint points of ~450 ms), in which subject had to integrate evidence for up to 4.8 s. Finally, we found that neither subjects’ knowledge of a (variable) sensory-motor contingency, nor their long-term learning of one (fixed) contingency, had a significant and consistent effect on their integration timescale. Although we used two independent analytic approaches (model-independent and model-based) for estimating integration timescales which yielded consistent results, and established the sensitivity of our analyses, there was no evidence for a difference in integration timescales between the experimental conditions. We, therefore, conclude that long-term learning (under fixed mapping) or rapid acquisition (under variable mapping) of sensory-motor contingencies have only a small, if any, effect on the mechanisms limiting the timescale of perceptual decisions.

Many studies have used free response protocols, in which the observer controls the decision time, to study the dynamics of perceptual decision-making [8,9,31,37,46]. Evidence integration is then inferred from fitting a decision model to the reaction time distributions [2,3,8,37]. Reaction times do not only reflect the integration process, but also the observer’s speed-accuracy tradeoff [3,6,8,9,47]. By contrast, in the interrogation protocol used in the present study, the optimal strategy is to integrate all available evidence (perhaps subject to inevitable leak), and then choose the option that is best supported by the integrated evidence when the response is prompted [3,37]. Thus, the improvement of threshold as a function of stimulus duration should directly reflect the evidence integration process.

Previous studies of perceptual decision-making in humans and animals using this approach yielded a wide range of integration timescales. Some studies of human motion discrimination found timescales on the order of several seconds [26,27]. One study in rats and humans showed close-to-perfect integration, within a range of ~1 s using a task where participants had to discriminate the relative frequency of discrete events [29]. Another human study showed threshold decreases throughout a range of 900 ms—crucially, this decrease was steeper when subjects expected longer signals compared to when they expected shorter signals [13]. The relatively short timescales observed in the current study are in line with previous results from monkeys [28] in motion discrimination: In particular, using stimuli and tasks analogous to our fixed DR-mapping condition, Kiani et al. [28] found that monkeys exhibited a joint point of ~420 ms, just like our current human results. The human studies with larger samples sizes performing the same task also observed a substantial inter-individual variability in integration timescales [13]. Taken together, these results may suggest that integration timescale may, just like short-term memory capacity, be an individual trait with an upper limit, which can only be adapted to task demands within that limited range [13]. Additionally, the length of the temporal integration window may differ across different tasks.

Limited integration timescales are consistent with two mechanistically distinct scenarios. First, the decision process may terminate prematurely, once the integrated evidence has reached an implicit absorbing bound (termed “bounded diffusion”) [28]. This strategy can be compared to “closing the eyes” after an initial decision is made, thus eliminating the impact of the subsequently presented evidence. Second, the limit may be explained using a leaky competing accumulation process, as postulated by the LCA model that we used here [38,48]. If the leak parameter exceeds inhibition then the model implements a stable Ornstein-Uhlenbeck (leaky integration) process, with integration taking place only until the stable state is reached [3]. When the inhibition parameter is larger than leak, the model has unstable dynamics and is maximally sensitive to early evidence, in close resemblance to the bounded diffusion model. Both the implicit boundary and LCA dynamics might also work in concert, and to different extent in different individuals. The current findings establish that, whatever the mechanism limiting the integration time scale, this mechanism seems unaffected by sensory-motor mapping.

The variable DR-mapping forced observers to establish and remember a new association between decision and response on each trial, whereas the fixed mapping allowed them to establish an automatic sensory-motor transformation, presumably after a few hundred trials of practice [49]. Specifically, maintaining a new DR-mapping rule online during the decision formation (“Pre”-condition) increased short-term memory load [49,50], which may have interfered with the integration process. This difference in task demand seemed not to affect the behavioral performance measures–neither the mean integration timescales, nor the frequency of simple motor errors (lapse rates; Table 1).

Our results have a number of implications for neurophysiological studies of perceptual decision-making. First, several studies into the neural basis of decision-making have used a manipulation analogous to our “Post-condition” to decouple decision-making from action planning [12,23,42,51–54]. Our present results indicate that these studies, in fact, probe decision dynamics analogous to those occurring in the classical tasks with fixed mapping. Second, our observation of limited integration time scales question an assumption that has been implicit in several fMRI and neurophysiological studies using interrogation protocols [16,20,21,30,50,53–55]: that observers integrate all sensory information provided, even for extended (> 1 s) stimulus durations. Our results indicate that this assumption should be verified for each experimental condition and subject. Finally, our findings shed new light on the build-up activity commonly observed during decision formation in motor structures of the human brain [21,22,23,45,56]: Albeit providing a useful neural marker of the evolving integration process, this activity seems to be a downstream consequence of the integration process rather than a direct correlate of that process. Our results are consistent with a growing body of physiological evidence [23,53,57–59] indicating that evidence integration during decision-making is distinct from action planning.

## Author Contributions

Conceived and designed the experiments: TP PJ THD. Performed the experiments: TP PJ. Analyzed the data: KT TP PJ. Contributed reagents/materials/analysis tools: PJ KT TP THD. Wrote the paper: KT TP PJ THD.

## References

- 1. Gold JI, Shadlen MN (2007) The neural basis of decision making. Annu Rev Neurosci 30:535–574. pmid:17600525
- 2. Smith P, Ratcliff R (2004) Psychology and neurobiology of simple decisions. Trends in Neurosci 27:161–168. pmid:15036882
- 3. Bogacz R, Brown E, Moehlis J, Holmes P, Cohen JD (2006) The physics of optimal decision making: A formal analysis of models of performance in two-alternative forced-choice tasks. Psychol Rev 113:700–765. pmid:17014301
- 4. Honey CJ, Thesen T, Donner TH, Silbert LJ, Carlson CE, Devinsky O, et al. (2012) Slow cortical dynamics and the accumulation of information over long time scales. Neuron 76:423–434. pmid:23083743
- 5. Brody CD, Romo R, Kepecs A (2003) Basic mechanisms for graded persistent activity: discrete attractors, continuous attractors, and dynamic representations. Curr Opin Neurobiol 13:204–11. pmid:12744975
- 6. Uchida N, Kepecs A, Mainen ZF (2006) Seeing at a glance, smelling in a whiff: rapid forms of perceptual decision making. Nat Rev Neurosci 7:485–491. pmid:16715056
- 7. Wang XJ (2002) Probabilistic decision making by slow reverberation in cortical circuits. Neuron 36:955–968. pmid:12467598
- 8. Ratcliff R (1978) A theory of memory retrieval. Psychol Rev 85:59–108.
- 9. Palmer J, Huk AC, Shadlen MN (2005) The effect of stimulus strength on the speed and accuracy of perceptual decisions. J Vis 5:376–404. pmid:16097871
- 10. Bogacz R, Wagenmakers EJ, Forstmann BU, Nieuwenhuis S (2010) The neural basis of speed-accuracy tradeoff. TICS 33: 10–16.
- 11. Carrasco M, McElree B (2001) Covert attention accelerates the rate of visual information processing. Proc Natl Acad Sci U S A. 98:5363–7 pmid:11309485
- 12. Gold JI, Shadlen MN (2003) The influence of behavioral context on the representation of a perceptual decision in developing oculomotor commands. J Neurosci 23:632–651. pmid:12533623
- 13. Ossmy O, Moran O, Pfeffer T, Tsetsos K, Usher M, Donner TH (2013) The timescale of perceptual evidence integration can be adapted to the environment. Curr Biol. 23: 981–6. pmid:23684972
- 14. Zariwala HA, Kepecs A, Uchida N, Hirokawa J, Mainen ZF (2013) The Limits of deliberation in a perceptual decision task. Neuron 78:339–351. pmid:23541901
- 15. Rorie AE, Newsome WT (2005) A general mechanism for decision-making in the human brain? Trends Cogn Sci. 9:41–43. pmid:15668095
- 16. Heekeren HR, Marrett S, Bandettini PA, Ungeleider LG (2004) A general mechanism for perceptual decision-making in the human brain. Nature 431:859–862. pmid:15483614
- 17. Siegel M, Engel AK, Donner TH (2011) Cortical network dynamics of perceptual decision-making in the human brain. Front Hum Neurosci 5:21. pmid:21427777
- 18. Siegel M, Donner TH, Engel AK (2012) Spectral fingerprints of large-scale neuronal interactions. Nat Rev Neurosci 13:121–134. pmid:22233726
- 19. Schall JD, Thompson KG (1999) Neural selection and control of visually guided eye movements. Annu Rev Neurosci 22:241–259. pmid:10202539
- 20. Horwitz GD, Newsome WT (2001) Target selection for saccadic eye movements: prelude activity in the superior colliculus during a direction discrimination task. J Neurophysiol 86:2543–2558. pmid:11698541
- 21. Donner TH, Siegel M, Fries P, Engel AK (2009) Build-up of choice-predictive activity in human motor cortex during perceptual decision-making, Curr Biol 19:1581–1585. pmid:19747828
- 22.
de Lange F, Dobromir Rahnev D, Donner TH, Lau H (2013) Pre-stimulus oscillatory activity over motor cortex reflects perceptual expectations.
*J Neurosci*. 33:1400–1410. - 23. O'Connell RG, Dockree PM, Kelly SP (2012) A supramodal accumulation-to-bound signal that determines perceptual decisions in humans. Nat Neurosci. 15: 1729–35. pmid:23103963
- 24. Law CT, Gold JI (2008) Neural correlates of perceptual learning in a sensory-motor, but not a sensory, cortical area. Nat Neurosci 11:505–513. pmid:18327253
- 25. Connolly PM, Bennur S, Gold JI (2009) Correlates of perceptual learning in an oculomotor decision variable. J Neurosci 29:2136–2150. pmid:19228966
- 26. Watamanuik SNJ, Sekuler R (1992) Temporal and spatial integration in dynamic random-dot stimuli. Vision Res 32:2341–2347. pmid:1288010
- 27. Burr DC, Santoro L (2001) Temporal integration of optic flow, measured by contrast and coherence thresholds. Vision Res 41:1891–1899. pmid:11412882
- 28. Kiani R, Hanks TD, Shadlen MN (2008) Bounded integration in parietal cortex underlies decisions even when viewing duration is dictated by the environment. J Neurosci 28:3017–3029. pmid:18354005
- 29. Brunton BW, Botvinick MM, Brody CD (2013) Rats and humans can optimally accumulate evidence for decision-making. Science 6128: 95–98. pmid:23559254
- 30. Britten KH, Shadlen MN, Newsome WT, Movshon JA (1992) The analysis of visual motion: a comparison of neuronal and psychophysical performance. J Neurosci 12:4745–4765. pmid:1464765
- 31. Roitman JD, Shadlen MN (2002) Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task, J Neurosci 22:9475–9489.
- 32. Quick RF Jr (1974) A vector-magnitude model of contrast detection. Kybernetik 16:65–67. pmid:4453110
- 33. Wichmann FA, Hill JN (2001) The psychometric function: I Fitting, sampling, and goodness of fit. Perception & Psychophysics 63:1314–1329.
- 34.
Efron B, Tibshirani R (1993) An Introduction to the Bootstrap. New York: Chapman and Hall.
- 35. Wichmann FA, Hill JN (2001) The psychometric function: II Bootstrap-based confidence intervals and sampling. Perception & Psychophysics 63:1314–1329.
- 36. Nickerson DM, Douglas EF, Grossman GD (1989) Estimating physiological thresholds with continuous two-phase-regression. Physiological Zoology 62:866–887.
- 37. Usher M, McClelland JL (2001) The time course of perceptual choice: the leaky, competing accumulator model. Psychol Rev 108:550–592. pmid:11488378
- 38. Tsetsos K, Gao J, McClelland JL, Usher M (2012) Using time-varying evidence to test models of decision dynamics: bounded diffusion vs. the leaky competing accumulator model. Front Decis Neurosci 6:79.
- 39. Bogacz R, Cohen JD (2004) Parameterization of connectionist models. Behav Res Methods Instrum Comput 36:732–741. pmid:15641419
- 40. Gao J, Tortell R, McClelland JL (2011) Dynamic Integration of Reward and Stimulus Information in Perceptual Decision-Making. PLoS One 6: e16749. pmid:21390225
- 41. Bennur S, Gold JI (2011) Distinct representations of perceptual decisions and the asssociated oculomotor plan in the monkey lateral ontraparietal area. J Neurosci 31: 913–921. pmid:21248116
- 42. Freedman DJ, Assad JA (2011) A proposed common neural mechanism for categorization and perceptual decisions. Nat Neurosci 14: 143–146. pmid:21270782
- 43. Bair W, Koch C (1996) Temporal precision of spike trains in extrastriate cortex of the behaving macaque monkey. Neural Comput 15:1185–1202. pmid:8845004
- 44. Horwitz GD, Batista AP, Newsome WT (2004) Representation of an abstract perceptual decision in macaque superior colliculus. J Neurosci 91:2281–2296.
- 45. Wyart V, de Gardelle V, Scholl J, Summerfield C (2012) Rhythmic fluctuations in evidence accumulation during decision making in the human brain. Neuron 76(4): 847–858 pmid:23177968
- 46.
Luce RD (1986) Response times: Their role in inferring elementary mental organization. Belfast, UK: Oxford UP.
- 47. Marshall JAR, Bogacz R, Gilchrist ID (2012) Consistent implementation of decisions in the brain. PLoS ONE 7(9): e43443. pmid:22984425
- 48. Tsetsos K, Usher M, McClelland JL (2011) Testing multi-alternative decision models with non-stationary evidence. Front Decis Neurosci 5:63.
- 49. Miller EK, Cohen JD (2001) An integrative theory of prefrontal cortex function. Ann Rev Neurosci 14:167–202.
- 50. Schneider W, Shiffrin RM (1977) Controlled and automatic human information processing: I. Detection, search, and attention. Psychol Rev 84:1–66.
- 51. Rahnev D, Lau H, de Lange FP (2011) Prior Expectation modulates the interaction between sensory and prefrontal regions in the human brain. J Neurosci 31:10741–10748. pmid:21775617
- 52.
Merten K, Nieder A (2012) Active encoding of decisions about stimulus absence in primate prefrontal cortex neurons. Proc Natl Acad Sci
*U S A*. 109: 6289–94. pmid:22460793 - 53. Hebart MN, Donner TH, Haynes JD (2012) Human visual and parietal cortex encode visual choices independent of motor plans. NeuroImage. 63: 1392–1403.
- 54. Hebart MN, Schriever Y, Donner TH, Haynes JD (2014) The relationship between perceptual decision variables and confidence in the human brain. Cereb Cortex. pii: bhu181. [Epub ahead of print].
- 55. Donner TH, Siegel M, Oostenveld R, Fries P, Bauer M, Engel AK (2007) Population activity in the human dorsal pathway predicts the accuracy of visual motion Detection. J Neurophysiol 98, 345–359 pmid:17493916
- 56. Gould IC, Nobre AC, Wyart V, Rushworth MF (2012) Effects of decision variables and intraparietal stimulation on sensorimotor oscillatory activity in the human brain. J Neurosci. 32:13805–18. pmid:23035092
- 57. Liu T, Pleskac TJ (2011) Neural correlates of evidence accumulation in a perceptual decision task. J Neurophysiol. 106:2383–98. pmid:21849612
- 58. Filimon F, Philiastides MG, Nelson JD, Kloosterman NA, Heekeren HR (2013) How embodied is perceptual decision making? Evidence for separate processing of perceptual and motor decisions. J Neurosci 33:2121–36. pmid:23365248
- 59. Hanks TD, Kopec CD, Brunton BW, Duan CA, Erlich JC, Brody CD (2015) Distinct relationships of parietal and prefrontal cortices to evidence accumulation. Nature 520:220–3. pmid:25600270