Less Is More: Latent Learning Is Maximized by Shorter Training Sessions in Auditory Perceptual Learning

Katharine Molloy; David R. Moore; Ediz Sohoglu; Sygal Amitay

doi:10.1371/journal.pone.0036929

Abstract

Background

The time course and outcome of perceptual learning can be affected by the length and distribution of practice, but the training regimen parameters that govern these effects have received little systematic study in the auditory domain. We asked whether there was a minimum requirement on the number of trials within a training session for learning to occur, whether there was a maximum limit beyond which additional trials became ineffective, and whether multiple training sessions provided benefit over a single session.

Methodology/Principal Findings

We investigated the efficacy of different regimens that varied in the distribution of practice across training sessions and in the overall amount of practice received on a frequency discrimination task. While learning was relatively robust to variations in regimen, the group with the shortest training sessions (∼8 min) had significantly faster learning in early stages of training than groups with longer sessions. In later stages, the group with the longest training sessions (>1 hr) showed slower learning than the other groups, suggesting overtraining. Between-session improvements were inversely correlated with performance; they were largest at the start of training and reduced as training progressed. In a second experiment we found no additional longer-term improvement in performance, retention, or transfer of learning for a group that trained over 4 sessions (∼4 hr in total) relative to a group that trained for a single session (∼1 hr). However, the mechanisms of learning differed; the single-session group continued to improve in the days following cessation of training, whereas the multi-session group showed no further improvement once training had ceased.

Conclusions/Significance

Shorter training sessions were advantageous because they allowed for more latent, between-session and post-training learning to emerge. These findings suggest that efficient regimens should use short training sessions, and optimized spacing between sessions.

Citation: Molloy K, Moore DR, Sohoglu E, Amitay S (2012) Less Is More: Latent Learning Is Maximized by Shorter Training Sessions in Auditory Perceptual Learning. PLoS ONE 7(5): e36929. https://doi.org/10.1371/journal.pone.0036929

Editor: Michael H. Herzog, Ecole Polytechnique Federale de Lausanne, Switzerland

Received: February 20, 2012; Accepted: April 17, 2012; Published: May 14, 2012

Copyright: © 2012 Molloy et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding: This study was funded by the Medical Research Council (MRC), United Kingdom, through intramural funding to the MRC Institute of Hearing Research. All authors were MRC employees at the time the research was conducted. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Perceptual learning is the process whereby practice on a perceptual task, such as discriminating between sounds, improves performance on that task. Though learning can be contingent simply on the overall amount of practice [1], it can also be affected by other aspects of the training regimen, including the amount of practice within each session [2], [3], or the length of breaks between sessions ([4] for a review). Systematically investigating the effects of varying the training regimen may provide insight into both learning mechanisms and the optimal design of applied training programs that aim to improve perceptual skills.

When designing training programs, whether for clinical or research use, it is important to use regimens which are feasible for the patient or participant, while ensuring that learning is also maximized. Training sessions that are shorter or fewer in number may increase compliance, especially in children [5]. However, learning may not occur if sessions are too short [1], [2] and may require extensive training, sometimes occurring over thousands of practice trials [6], [7]. Consequently, it is important to find a balance between brevity and efficacy of training. In the experiments described here we addressed two crucial questions: how much training is required overall to produce significant learning, and how is it best distributed across training sessions? In investigating these aspects of learning, the time course of improvements within and across training sessions, and the amount of practice required to trigger and sustain these improvements, are of fundamental importance. In addition, it is necessary to establish the amount of training beyond which no further benefit is gained.

Remarkably different time courses have been observed for perceptual learning. Improvements are often apparent while training (within-session learning [1], [8], [9]; Fig. 1, green line), but can sometimes occur during a latent period after training has finished (between-session learning [2], [7], [10], [11]; Fig. 1, red line). Within- and between-session learning probably represent two different processes, as they can be disrupted independently [12] and show differences in retention [13]. They also appear to have different electrophysiological correlates [13]–[15]. Both types of learning can also occur on the same task [16]–[18] (Fig. 1, blue line). However, neither of two previous studies that varied the number of trials within sessions, while controlling the total amount of practice, reported both forms of learning: Aberg et al. [1] showed only within-session learning in a visual experiment, while Wright & Sabin [2] showed only between-session learning in the auditory domain. Thus, the effect of varying the training regimen on a task that displays both learning types is currently undocumented. Moreover, neither study assessed how well learning was retained once practice had ceased, so the effect of training distribution on long term benefits is unclear.

Download:

Figure 1. Schema of different time courses for learning.

Lines represent different hypothetical learning curves in situations where between- and within-session changes are combined in different ways.

https://doi.org/10.1371/journal.pone.0036929.g001

Perceptual learning studies have shown that specific requirements should be met for learning to occur. For example, a sufficient number of trials (critical minimum) may be required to initiate within-session [1] as well as between-session [2] learning. Insufficient practice results in a lack of performance improvement during training, or a failure of overnight consolidation of improvements attained within a session (Fig. 1, yellow line). On the other hand, learning has been shown with as little as one trial of training on some tasks [3], [19] so minima may not always exist. Within- and between-session learning may have different critical minima, and this can only be established on a task which shows both learning types.

Overtraining on a task is also possible, with extra practice providing no added benefit. For example, no additional between-session learning was observed on a temporal interval discrimination task for a regimen with a large number of trials each day compared with fewer trials [2]. Within-session learning can plateau towards the end of a session and restart once a new session has begun [16], [17], suggesting overtraining within a session. As with the minimum requirements for learning, the maximum effective amount of training may differ for between- and within-session learning, but as of yet no study has determined whether this is the case.

Learning is usually non-linear; very early learning is typically rapid whereas later learning is slower [20], [21]. It is conceivable that other aspects of learning, such as the critical minimum, the maximum effective training per day, or the relative contributions of within- and between-session learning might also change as training progresses. However, studies that have varied the amount of training in each session have used extensive pre-testing to establish baseline performance on the training and untrained tasks, and so the characteristics of the early stages of learning were not documented [1], [2].

Since learning generally follows the characteristic time course described above, large performance improvements occur early on. Extended training may provide only marginal additional benefit compared to less exhaustive training. However, a longer training regimen may provide other benefits: tasks which are learnt over a more extended period of time are often remembered better [4], [22]. On the other hand, longer regimens may produce less generalization than shorter regimens, since as learning progresses it can become more specific to the trained stimulus [6], [23], [24].

Here we used a frequency discrimination (FD) task that was previously shown to result in both within- and between-session learning [8], [25] to investigate the characteristics of early and late within- and between-session learning. We compared their relative contributions to overall improvements, and the parameters within which they produce effective learning. We avoided the extensive pre-tests used in previous studies in order to capture the early stage of learning, and recalled participants up to several weeks after cessation of training to assess retention.

Experiment 1: Distribution of Training across Sessions

In this experiment we asked how much training per session is most effective. We varied the number of trials each day whilst keeping the overall amount of training constant (with the exception of the regimen with the shortest sessions). Overall learning and long-term retention were compared between four, multi-day training regimens (Fig. 2). We further assessed whether minimum or maximum effective amounts of daily training were achieved by comparing the speed of learning between regimens. Based on Wright and Sabin [2], who found a critical minimum of between 360 and 900 trials for between-session improvements, we tested a similar range. We expected the group(s) with fewer trials not to achieve a critical minimum and show reduced or no learning. Conversely, we expected overtraining to result in a reduced learning rate in the longer regimens relative to regimens with fewer trials.

Download:

Figure 2. Training regimens for Experiment 1.

Groups T800, T400 and T200 trained on 1600 trials of FD overall, with 800, 400 and 200 trials per day, respectively. Group T100 trained on 800 trials of FD overall, with 100 trials per day. A five-trial demo preceded the training, and 100 trials were run the day after training was completed (post-test), and 4–6 weeks after training was completed (retention test).

https://doi.org/10.1371/journal.pone.0036929.g002

We also differentiated learning seen within and between training sessions. We expected to see within-session learning in the early stages of this task, based on previous data from single-session studies in our lab (e.g. [8]). Between-session improvements were predicted to be more dominant in later stages, as typically observed by Wright and colleagues in extended FD training (e.g. [2]).

Methods

Ethics statement.

The research protocols for Experiments 1 and 2 were approved by the Nottingham University Hospitals Research Ethics Committee. Informed written consent was obtained from all participants.

Participants.

Forty eight adults aged 18–27 were recruited via posters from the University of Nottingham student population and the general public, and were paid an inconvenience allowance for their participation. All participants had normal hearing (pure-tone thresholds < = 20 dB HL across 0.5–4 kHz, measured according to BSA guidelines [26]), except one participant who had a threshold of 25 dB HL at 4 kHz in the right ear. Participants had no prior experience of psychoacoustic testing, and had initial FD thresholds between 0.4 and 15% at 1 kHz (i.e. between 4 and 150 Hz), as determined by the first block of trials (see below).

General procedure.

Participants were allocated to one of the four groups (Fig. 2) according to their FD performance on the first block, in order to match the groups for initial performance. Groups trained using 50-trial blocks of adaptive FD, and differed according to the number of blocks per session. Group T800 trained on 800 trials per day over two days, group T400 trained on 400 trials a day over 4 days, and group T200 trained on 200 trials a day over 8 days (with a weekend occurring after the 5th day) for a total of 1600 trials each. Group T100 trained on 800 trials in total, with 100 trials per day over 8 days (using the same schedule as T200). In all four groups a five trial FD demo was run at the beginning of the experiment to introduce the task concept. FD performance was assessed the day after training was completed (post-test), and four to six weeks later (retention test) using two blocks (100 trials) (Fig. 2).

Stimuli.

Stimuli consisted of 100 ms tones (including 10 ms raised cosine ramps) presented with an inter-stimulus interval of 500 ms. Stimuli were presented diotically at 60 dB SPL using Sennheiser HD-25-1 headphones. The frequencies of the tones ranged between 1 and 1.5 kHz according to the adaptive procedure described below.

Task and adaptive procedure.

All testing was conducted within a sound-attenuated booth. The FD task was administered via computer games with a visual interface that cued sound presentation and provided trial-by-trial feedback. Responses were recorded via touchscreen and there was no time limit in which to respond.

During each trial participants heard three intervals, two of which contained a standard tone of frequency f and a third, randomly determined interval, contained a higher-frequency target tone (f + Δf, where Δf is in per cent of the standard frequency f). Participants were instructed to choose the interval that was different from the other two (3-interval, 3-alternative forced choice; 3I-3AFC). The value of Δf was adaptively varied using a three-down one-up staircase procedure, targeting 79.4% correct on the psychometric function [27]. Starting with Δf = 50%, it was divided by 2 following every correct response until the first incorrect response, and then multiplied by two following each incorrect response until the first correct response. Thereafter, Δf was divided by √2 after three correct responses, and multiplied by √2 after one incorrect response. The adaptive track was terminated after 50 trials had elapsed.

A demo of five trials was administered before the first block to familiarize participants with task requirements (see Fig. 2). Three of these trials were ‘easy’ (Δf = 50%), and two were impossible (Δf = 0%). All participants correctly identified the target sounds for the Δf = 50% practice trials.

Training, post-test and retention test.

Training was administered in blocks of 50 trials of FD, each of which was a threshold assessment where the difference in frequency, Δf, was adapted as described above. Sessions containing more than 200 trials were split up with 5 minute breaks every 200 trials.

All participants completed a further 100 trials of FD (two blocks, identical to those used in training) during the post-test. Some participants (n = 9, 7, 9, 6 for groups T800, T400, T200 and T100 respectively) returned for the retention test, which consisted of a further two blocks of FD.

Non-verbal IQ.

The matrix reasoning and block design subtests of the Wechsler Abbreviated Scale of Intelligence (WASI [28]) were administered at the end of the post-test to assess non-verbal IQ (NVIQ). A one-way ANOVA confirmed that NVIQ did not differ significantly between the groups (F(3,44) = 0.11, p = 0.95). NVIQ scores were entered as covariates into all learning ANOVAs.

Data analysis.

The log-transformed Δf values for each adaptive track were fitted with a logistic psychometric function [29], and the difference limens for frequency (DLFs) were estimated as the 79.4% correct point on this function. Tracks where the psychometric function had a slope of less than 0.10 were discarded because shallow slopes render the threshold estimates unreliable – this occurred for just 0.5% of DLFs measured. One participant was excluded because of highly inconsistent DLFs. Excluding this participant did not affect the mean results, but reduced the variability in the sample considerably.

Overall learning was analyzed by comparing individual DLFs at the beginning of training (average of the first two blocks) and immediately after training (average of the two blocks from the post-test) using a mixed ANCOVA model with group as a between-subjects factor, threshold as the repeated measure, and NVIQ as a covariate. The data were then split to consider the early (first 800 trials) and later (second 800 trials) stages of training. ANCOVAs (as above) were used to compare DLFs at the beginning and immediately after each 800 trials. To compare retention of learning between groups, DLFs at the retention test were modeled using an ANCOVA as described above, but with number of days between post and retention tests as an additional covariate.

To assess whether minimum or maximum amounts of effective training per day had been reached, slopes of the learning curve for each group were compared. Multiple regression models were fitted to the mean DLFs for each block in the first 800 and second 800 trials separately. The models entered log(block) and group*log(block) as covariates, and group was entered as an additional factor for the second 800 trials model to allow for the possibility of different performance levels at the midpoint of training (groups were matched on performance initially). All p values of the slope parameters in the regression were Bonferroni corrected for multiple comparisons.

Similarly, to compare the rate of learning over days, mean daily DLFs were modeled using a multiple regression with log(day) and log(day)*group as covariates. Thresholds from block 1 (where all groups were matched on performance) were included in the regression model as the first point. Mean group daily DLFs were calculated by averaging individual thresholds within training days (note that the block 1 DLF was not reused in calculating the mean performance on day 1).

To assess within- and between-session learning, DLFs for the beginning and end of each day were calculated by averaging the DLFs from the first or last two training blocks. As Group T100 only had two blocks per day, it was not included in these statistical analyses. Within- and between-day improvements were calculated for each individual and each day/night, by finding the difference between the relevant DLFs. A multiple regression model with group, log(day) and log(day)*group was fitted to the mean learning for each group (for both within- and between-session datasets), to determine whether the amount of within- and between-session learning changed as training progressed.

Results

Overall learning.

DLFs improved significantly (Fig. 3) from the initial blocks to the post-test (F(1,42) = 130.2, p<.001), with no difference between groups (F(3,42) = 0.1, ns). Note that this is true even though the T100 group had half the training. Learning was significant over the first 800 trials in all groups (F(1,42) = 92.7, p<.001) and also over the second 800 trials in the T200, T400 and T800 groups (F(1,31) = 10.3, p = .003), with no significant differences between groups at either stage (F <0.8 for both analyses).

Download:

Figure 3. Changes in FD performance with training.

Data points show mean group DLFs for each training block of 50 trials, and the post-test. Logarithmic and power curve fits to the mean learning data were compared [43]. Learning was best fitted by a logarithmic function in all groups (power function least squares fits, mean r² = 0.78, logarithmic least squares fits, mean r² = 0.90). Logarithmic fits are indicated by solid curves in the figure. Bars along the top of the figure illustrate sessions in each group’s training regimen. Error bars were omitted for clarity as they overlap for all groups at each block.

https://doi.org/10.1371/journal.pone.0036929.g003

Retention of learning.

Performance did not deteriorate following cessation of training. There was no significant change in DLFs from the post-test to the retention test several weeks later (F(1,25) = 0.6, ns), and no difference between groups (F(3,25) = 0.6, ns; see Fig. 4), indicating all groups retained their learning equally successfully.

Download:

Figure 4. Retention of FD learning following cessation of training.

Group mean DLFs for initial performance (first two blocks), post-test (immediately after end of training) and retention test (4–6 weeks later). Groups were no longer matched because only a subset of the participants returned for the retention test, so DLFs were adjusted for individual differences in initial DLFs [44]. Error bars show ±SEM.

https://doi.org/10.1371/journal.pone.0036929.g004

Minimum and maximum effective daily training.

Learning rates during early (first 800 trials) and later learning (800–1600 trials) were investigated separately (Fig. 5A and B, respectively) by comparing the slopes of the learning curves. During the first 800 training trials the T200, T400 and T800 groups showed equivalent learning speed, but the T100 group showed significantly faster learning than the other groups (t(63) >4.9, p<.001 for all comparisons). Rather than a critical minimum requirement of daily training, these results suggest that shorter sessions result in more overall learning than longer ones, at least in the early stage of training.

Download:

Figure 5. Learning curves for early and late stages of training.

(A) Group mean DLFs for the first 800 trials for all groups. (B) Group mean DLFs for the second 800 trials for groups T800, T400 and T200. Solid lines are least squares logarithmic fits plotted on a log-log scale to appear linear. Error bars were omitted, since analyses compared slopes not individual points. Bars along the top of the figure illustrate sessions in each group’s training regimen. Note the different DLF axis scales in A and B.

https://doi.org/10.1371/journal.pone.0036929.g005

During the second 800 trials (Fig. 5B) the T800 group had a shallower slope than the other two groups, but these differences were not significant after Bonferroni correction (T400: t(47) = −2.1, p = .042; T200: t(47) = −1.9, p = .066; α = 0.025). The trend for the T800 group to show slower learning could indicate that 800 trials exceeded a maximum effective amount of daily training in the later stages of learning, with additional trials resulting in less benefit; however further data were required to confirm whether this was the case (see Experiment 2 below).

Considering improvements gained each day rather than per trial further clarifies the relative learning rate (Fig. 6). The slopes describing amount of learning per day grow progressively shallower from T800 to T200, but further reducing the number of trials per day does not decrease the learning rate. The T800 and T400 groups show significant differences in slope compared to each other, the T200 and the T100 groups (t(25) >3.7, p≤.001). However, the T200 group did not show more daily improvement than the T100 group (t(25) = 0.6, ns). This further highlights that the T100 group is improving relatively faster than the other groups, showing as much improvement each day as the T200 group, who had double the training. These results suggest that, at least for this task, 100 trials are above the critical minimum number of trials required to initiate learning, and that there is benefit to having shorter training sessions.

Download:

Figure 6. Progress of learning over training days.

Group mean DLFs for each training day. DLFs from block 1 are plotted at the far left, followed by daily DLFs for each training day (note that the block 1 DLFs were not reused in calculating the mean for Day 1). Solid lines are least squares logarithmic fits plotted on a log-log scale to appear linear. Error bars were omitted, since analyses compared slopes not individual points.

https://doi.org/10.1371/journal.pone.0036929.g006

Within- and between-session changes.

All groups showed within-session improvements on each day (Fig. 7A). Group T800 showed greater learning on Day 1 than Day 2 (t(13) = −6.8, p<.001), but groups T400 and T200 showed no change in the amount learnt in each day (t(13) <0.9 for both, ns). The T100 group did not have enough data within each day to be included in this analysis. The results for the T400 and T200 groups suggest that within-session learning is constant, with a fixed benefit per practice block regardless of the stage of training (at least up to 1600 trials). The difference seen in the T800 group’s within-day learning could thus be another indication that while 800 trials per day is an effective regimen for early training, it may lose some efficacy as training progresses.

Download:

Figure 7. Within- and between-session changes in performance.

(A) Group mean within-session changes for groups T800, T400 and T200. (B) Group mean between-session changes in all training groups. The gap between bars 5 and 8 in A and B indicate a weekend break. Error bars show ±SEM.

https://doi.org/10.1371/journal.pone.0036929.g007

Between-session changes (estimated as the difference in DLFs for the last two blocks of each training day and the first two blocks of the next) were positive at the beginning of training, decreased as training progressed, and became negative in some cases towards the end of training (Fig. 7B). This progressive loss of between session benefit was significant in T800, T400 and T200 (t(13) < −5.6, p≤.001). T100 data could not be analyzed because there were not enough blocks within each session, but they are pictured in Figure 7B for comparison. Performance at the end of each session (i.e. the average DLF from the last two blocks) was correlated with the between-session change in threshold that followed it – the better the performance, the smaller the between-session gains (r = .49, p<.001; Fig. 8).

Download:

Figure 8. Correlation between performance and between-session learning.

Amount of between-session improvement plotted as a function of mean DLFs on the last two blocks of the session. Dashed line indicates the regression fit.

https://doi.org/10.1371/journal.pone.0036929.g008

All four training regimens produced equivalent overall learning and retention. Regimens ranging from as little as 100 trials per day (about 8 minutes’ practice) to 800 trials per day (over one hour of practice) were equally effective on this task, indicating that FD learning is relatively robust to regimen changes. Further, the group with the shortest sessions reached equivalent final performance to that of the groups who trained twice as much overall. This suggests that more training is not necessarily better, and that excess training can, in fact, be inefficient. These findings bode well for applications of FD training, since they support flexibility in the training regimen to suit individual schedules.

Censor and colleagues [30], [31] have also shown improved learning with shorter sessions on a visual texture discrimination task. They attributed their results to within-session adaptation: increased stimulus exposure in longer sessions produced performance deterioration, while overnight sleep resulted in improvement. Our data do not preclude the possibility of adaptation-related deterioration. However, if adaptation did occur, it did not result in reduced within-session learning, as shown by Censor and colleagues. In fact, the greatest within-session learning was observed in the group with the longest training sessions (T800).

The amount of overnight benefit was greatest and lasted over more sessions in the T100 group, consistent with the findings of Goedert and Miller [3] on a visual motor task, where groups trained on fewer trials within a session showed greater overnight improvement than groups who trained on more trials. Our finding that the largest overnight benefit is associated with the poorest performance is also consistent with the observation that difficult tasks (poorer performance) show greater between-session improvement than easier tasks [32]. This is not surprising as performance thresholds decrease with practice. Taken together, these results suggest that using more difficult tasks coupled with short training sessions may be advantageous in maximizing the benefit of between-session learning.

The reduced and negative contribution of between-session learning in later stages of training was unexpected given a previous finding that between-session learning occurs throughout multi-day training, and in spite of extensive pre-tests [2], [33]. It is possible that even extensive pre-testing does not produce much training, and that subsequent between-session learning is still early stage. However, the between-session learning seen by Wright and colleagues persisted over several thousand trials. Here, we saw very little between-session learning after the first 1000 trials. The tones used by Wright and colleagues were very short compared to those used here, making the task more difficult (i.e. increasing the discrimination threshold [34]). Our data show that higher thresholds lead to more between-session learning. Thus, it is possible that harder tasks start with poorer performance and improve more slowly than easier tasks, yielding a later transition from a stage where between-session learning is effective to a stage where it is not. Alternatively, the two tasks may produce different learning profiles because practicing FD on very short tones may train different aspects of auditory perception compared to practice on longer tones.

Differences in the task may also affect the critical minimum number of trials required for learning. Wright and Sabin [2] observed between-session learning for a group who trained on 900 but not 360 trials each day, indicating a critical minimum within this range. The groups in our study (all of whom trained within or below this range) showed no evidence of a critical minimum. It is possible that difficult tasks require more practice within a session in order to trigger learning than easier tasks. Alternatively, as noted above, it may be that fundamentally different aspects of perception are being trained by practice with short and long tones, and that these aspects have different requirements.

While we saw no evidence for a critical minimum, the T800 group showed a marginally reduced slope compared to other regimens in later learning, which could indicate that a maximum amount of effective training was exceeded. One explanation is that within-session learning had saturated during the session, so that some of the practice was wasted. If this were the case, the finding that 800 trials per session was effective for early training would indicate that the effective maximum decreases as learning progresses. This explanation is consistent with the finding that, while within-session learning was constant throughout the study for the T200 and T400 regimens, the T800 group showed less learning within session 2 than session 1. An alternative explanation is that the T800 group (unlike the other groups) did not have any session breaks within the second 800 trials, and so could not benefit from any between-session learning. Our data suggest that this is unlikely; rather than contributing to learning, session breaks produced decrements in performance in these later stages in the other regimens. In addition to investigating the total amount of effective training for lasting performance improvement, Experiment 2 was designed to provide additional data on longer-term training with 800 trials per day.

Experiment 2: Single- and Multiple-Session Training

The second experiment addressed two questions. The first, raised in the Introduction, is whether extended, multi-day training confers any benefit over single-session training. The second, raised in Experiment 1, regards the possibility that 800 trials per day exceed a maximum effective daily training in the later stages of learning. We found in Experiment 1 that performance of the T800 group improved significantly between 800 and 1600 trials. This would suggest that multi session training should enhance learning compared to single session training. On the other hand, we found that between-session improvements become more negative as training progresses, suggesting prolonged training may be less effective. In Experiment 2 one group trained on 800 trials of FD for a single day (T800 s) and a second group on 800 trials per day over 4 days (T800 m; Fig. 9). All participants were tested at the trained and an untrained frequency before training, and several times during and after training, to determine how well the regimens compared in terms of overall learning, retention of learning, and transfer to another condition.

Download:

Figure 9. Training regimens for Experiment 2.

Two groups trained on 800 trials of FD per day. The T800 m group completed four days of training and the T800 s group completed one. Tests consisted of assessment at the trained and an untrained frequency, and were conducted at the beginning of Days 1, 2 and 5, and then one week (Day 12) and four weeks afterwards (Day 33). A five trial demo preceded the experiment.

https://doi.org/10.1371/journal.pone.0036929.g009

We expected significant additional learning in the multi-session group compared to the single-session group. Based on visual studies showing increased specificity with training [6], [10], we also expected that multi session training would produce less transfer to a different frequency than single session training. Multi-session learning studies suggest that learning is retained over long time periods [10], [25], [35]. While long-term retention can be observed after extremely short exposure to visual stimuli (for example, the “McCollough Effect” [36], [37]), there is no previous evidence that short auditory training can induce or maintain long-term retention.

If a slower learning rate for T800 regimens in later stages of training is confirmed, data from days 2–4 of the T800 m group will allow us to determine its cause. The slope beyond the first 800 trials should be shallow if a maximum of effective daily training has been exceeded. However, if the slow learning is due to the lack of overnight benefit, the slope over the three additional training days taken together should be equivalent to those of the T400 and T200 regimens in Experiment 1 (Fig. 5B).