Skip to main content
Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Method matters: Experimental evidence for shorter avian sperm in faecal compared to abdominal massage samples

  • Antje Girndt ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing;

    Affiliations Evolutionary Biology, Max Planck Institute for Ornithology, Seewiesen, Germany, Department of Life Sciences, Imperial College London, Silwood Park Campus, Ascot, United Kingdom, International Max-Planck Research School (IMPRS) for Organismal Biology, University of Konstanz, Konstanz, Germany

  • Glenn Cockburn,

    Roles Data curation, Methodology, Project administration, Writing – review & editing

    Affiliations Evolutionary Biology, Max Planck Institute for Ornithology, Seewiesen, Germany, International Max-Planck Research School (IMPRS) for Organismal Biology, University of Konstanz, Konstanz, Germany

  • Alfredo Sánchez-Tójar,

    Roles Funding acquisition, Visualization, Writing – review & editing

    Affiliations Evolutionary Biology, Max Planck Institute for Ornithology, Seewiesen, Germany, Department of Life Sciences, Imperial College London, Silwood Park Campus, Ascot, United Kingdom, International Max-Planck Research School (IMPRS) for Organismal Biology, University of Konstanz, Konstanz, Germany

  • Hanne Løvlie,

    Roles Supervision, Validation, Writing – review & editing

    Affiliation IFM Biology, Linköping University, Linköping, Sweden

  • Julia Schroeder

    Roles Conceptualization, Funding acquisition, Resources, Supervision, Validation, Writing – review & editing;

    Affiliations Evolutionary Biology, Max Planck Institute for Ornithology, Seewiesen, Germany, Department of Life Sciences, Imperial College London, Silwood Park Campus, Ascot, United Kingdom


Birds are model organisms in sperm biology. Previous work in zebra finches, suggested that sperm sampled from males' faeces and ejaculates do not differ in size. Here, we tested this assumption in a captive population of house sparrows, Passer domesticus. We compared sperm length in samples from three collection techniques: female dummy, faecal and abdominal massage samples. We found that sperm were significantly shorter in faecal than abdominal massage samples, which was explained by shorter heads and midpieces, but not flagella. This result might indicate that faecal sampled sperm could be less mature than sperm collected by abdominal massage. The female dummy method resulted in an insufficient number of experimental ejaculates because most males ignored it. In light of these results, we recommend using abdominal massage as a preferred method for avian sperm sampling. Where avian sperm cannot be collected by abdominal massage alone, we advise controlling for sperm sampling protocol statistically.


Male competition over access to females, and sperm competition over fertilisation of eggs, are two sides of the same coin − both determine male reproductive success and ultimately fitness [1,2]. In sexually reproducing species, males compete with each other for access to mates, and when a male fails to secure exclusive copulation rights, his sperm need to outcompete rivals' sperm in fertilising eggs [3]. Sperm competition is ubiquitous across taxa and an important part of sexual selection [2,4]. Thus, one eminent interest of evolutionary biologists is to understand which traits predict the competitiveness of sperm and thus the likeliness to win the sperm race.

In sperm evolutionary ecology research, sperm size and shape matters. Sperm length commonly correlates positively with sperm swimming speed [57], but see [8], comparative sperm morphometry (i.e. measured dimensions of different sperm components) is used to reveal phylogenetic relationships and predict sperm energetics [9,10], and variation in sperm morphometry can be indicative of the intensity of sperm competition within species [1113]. In birds, sperm competition is widespread because of frequent extra-pair copulations in socially monogamous species, polyandrous mating systems or rapid mate switching [14].

Avian sperm biologists have successfully adopted semen collection techniques from the poultry industry [15] and thus can sample sperm from birds with relative ease [16]. Techniques that minimise handling stress and are applicable in the field are desirable because non-domestic birds are often of conservation concern and cannot be kept in captivity. Avian faecal sperm sampling is advocated as a simple and non-invasive alternative to other methods of sperm collection [17]. This technique uses the pathway of passively lost sperm during defaecation to obtain sperm in reproductively active males [18]. An intial study on ten zebra finches, Taeniopygia guttata, comparing sperm from faeces and sperm ejaculated into a stuffed female dummy, showed no morphological difference between sperm from both collection techniques [17]. Consequently, sperm collection techniques, such as faecal sperm sampling, dissection of seminal glomera or testes of sacrificed or road-killed birds, female dummy techniques or abdominal massage sperm sampling [19,20] (hereafter called massage), are used interchangeably (e.g. [2126]). Furthermore, only few studies included the different methods used for sperm sampling in their statistical analysis [2729] and we are aware of only one [27] that gave detailed information on its effects. For instance, one study has used three methods of sperm collection in the past, accounted for variation in the sampling method statistically by adding collection technique as a random effect, but did not report these estimates [28]. Yet, there is reason to consider that sampling method may sample sperm at different maturational stages [30,31] because intra-testicular sperm, for instance, are less developed than extra-testicular sperm [3234]. It is currently unclear whether this affects sperm morphometry of sperm sampled with different methods, and if so how one should account for it.

Here, we tested the hypothesis that sperm morphometry does not differ between three sampling methods: samples from males ejaculating into a stuffed female dummy, faecal collection, and massage technique. We used a captive population of house sparrows to repeatedly sample individual males with a randomised design. We measured the length of a sperm's main components: head, i.e. nucleus and acrosome, midpiece and flagellum, and predicted that sampling method does not influence sperm length. In contrast to our prediction, we demonstrate differences in sperm length between the faecal and the abdominal massage method, so mixing sperm sampling methods should be avoided or controlled for statistically to reduce uncertainty in statistical analyses [35,36]. The female dummy did not result in sufficient experimental ejaculates thus statistical analyses of sperm length differences were only conducted between faecal and abdominal massage samples.

Material and methods

Study population

Male house sparrows (n = 52) were kept at the Max Planck Institute for Ornithology in Seewiesen, Germany, in June 2015. The males were housed in four single-sex semi-outdoor aviaries, single aviary dimensions: 1.2 m x 4.0 m x 2.2 m high, and each aviary contained 13 males. Adjacent walls were covered with hessian fabric to prevent visual contact. The population consists of wild-caught birds in 2005 and 2006 [37] and their offspring. Males were in acoustic, but not visual or physical contact with females for a period of two months before sampling sperm and thus could not copulate with females. Mating can affect sperm depletion and post-meiotic sperm senescence [38,39], which, in our case, can be considered standardised. All individuals were fitted with a numbered metal-ring and a unique combination of three coloured plastic rings for individual identification. Because aviaries had meshed outside walls, light, temperature, humidity and ventilation were close to natural conditions. Additionally, an artificial light-dark cycle was set from 05:30 to 18:00 with light intensity gradually increasing in the morning and dimming in the evening over a period of one hour. Birds were provided with ad libitum water and food (wild seed mixture, fresh salad, sunflower seeds, crushed corn and wheat, oats) and mineral mix at all times, as well as sand and water baths. The Government of Upper Bavaria approved the care, handling and husbandry of all birds in this study (Nr 311.5–5682.1/1-2014-024).

Sperm collection

Sperm were obtained with three different methods: (a) collected with a stuffed female dummy, (b) through faecal collection, and (c) from massage.

a) Stuffed female dummy.

The body of one adult female house sparrow that in 2014 had died of a natural cause in our population was skinned, moulded, set-up in copulatory position and fitted with a false cloaca (Fig 1). Cloacae were handmade using medical silicone tubing (inner diameter 1.98 mm) and metal wire (strength 0.7 mm) and filled with 4μl phosphate-buffered saline (PBS) following the design and procedure described in [40] (Fig 1). The false cloaca was then inserted into the female dummy, which was attached to a perch inside the males' aviary and left for a trial period of ten minutes. When a copulation occurred (see S1 Movie in the supporting information), the copulating male was identified by his colour rings, and the experimental ejaculate pipetted from the false cloaca into 200μl of 5% formalin (following [17,40]) for subsequent sperm measurements. Afterwards, the female dummy was equipped with a fresh false cloaca and introduced into the aviary allowing for new copulations.

Fig 1. False cloaca and female dummy used for experimental ejaculate collection in male house sparrows.

(a) Example of a false cloaca prototype, which was filled at the larger opening with 4μl PBS before being inserted up until the wire into the female dummy. Rear (b), and side (c, d) view of the single female dummy used in sperm collection trials. Pictures courtesy of: Elena Beirer.

b) Faeces collection.

Sperm are continuously released during defaecation in reproductively active males [18] and can be obtained by pipetting any fluid part from fresh faeces as described in [17]. To sample sperm from males' faeces, individual males were removed from their aviary and placed individually inside a cage measuring 60 cm x 40 cm x 45 cm with non-absorbent flooring for a period of ten minutes. Once defaecation occurred, the fluid parts of a male's faeces were pipetted into 200μl of 5% formalin [17].

c) Massage.

In reproductively active male passerines, growth of the seminal glomera results in a swelling called cloacal protuberance, whose main function seems to be sperm storage and maturation [41,42]. Sperm were collected by gently squeezing the cloacal protuberance (Fig 2), which resulted in immediate ejaculation. Individual samples were collected with a 5μl ring-marked capillary (Fig 2) and stored in 200μl of 5% formalin [16,17].

Fig 2. Massage technique used for male house sparrows.

(a) A male was positioned on his back and his cloacal protuberance exposed. (b) Pressure was gently applied at the base of the cloacal protuberance using three fingers. (c) Experimental ejaculates were collected using a ring-marked capillary. Pictures a) and b) courtesy of Elena Beirer and c) Julia Schroeder.

Experimental protocol for sperm collection

Sperm sampling took place over four days in mid June, which represented the middle of the house sparrow breeding season [43]. The female dummy collection technique was always performed before faecal and massage sampling because we anticipated that males would be less interested in the female dummy after handling stress from catching. For this purpose, the female dummy was placed inside an aviary for males to copulate with for a period of ten minutes. We proceeded with sperm sampling using the other two methods only after the female dummy test was finished. Therefore, we tossed a coin to randomise whether a male was first sampled by massage, followed by a defaecation trial, or first received a defaecation trial followed by a massage. The whole procedure − first offering the female dummy and then randomising faecal and massage sampling − was repeated for each male after a day's rest. In other words, males were caught again two days after their first trial and depending on their treatment during the first trial, received the reversed sampling order during their second trial. Hence, we aimed to obtain a total of six samples per male: one female dummy, one faecal and one massage sample per day, per male. This protocol randomised massage and faecal sampling in time and sequence to experimentally control for potential order effects on sperm morphometry between methods within sampling days. A schematic overview of the experimental procedure is presented in Fig 3.

Fig 3. Schematic overview of experimental protocol.

Whereas the time and sequence of massage and faecal sampling was randomised, female dummy trials always took place before the faecal and massage sampling.

Sperm morphometrics

We prepared and measured sperm, as it is common practice in studies of avian sperm morphometry by using unstained sperm, bright field microscopy and formalin as a fixative (e.g. [17,21,23,25,42]). Specifically, we prepared 10μl aliquots onto microscope slides from the formalin-fixed samples. We only used slides that held a minimum of 100 sperm to account for potential sperm abnormalities [28] and photographed the first ten sperm that were intact and normal, i.e. that did not deviate in form from typical passerine sperm [44] such as e.g. deformations or loss of sperm components (see S1 Appendix in the supporting information). We only used sperm that were not covered by other sperm or detritus and always started in the upper left corner of a slide to avoid observer bias and to ensure that no sperm was mistakenly measured twice. Digital images of single sperm were taken with a Leica DFC450-C camera, mounted on a Zeiss Axioplan-2 microscope at x400 magnification using bright field settings. From these digital pictures, we then took three consecutive measurements to the nearest 0.01μm of the following three sperm traits: head (i.e. nucleus including acrosome), midpiece, and flagellum. We used the mean of the three consecutive measurements for statistical analyses. All measurements were taken by one observer only (GC) with the Leica Application Suite (LAS) software v4.2 using the LAS segment tool and centring the line within sperm and segmenting where necessary to follow the helical twists and natural curvature (Fig 4).

Fig 4. Length measurements of house sparrow sperm.

Example of measurements of single sperm components: head: nucleus (red) and acrosome (pink), midpiece (green), and flagellum: tail (cyan) and midpiece (green). Total length was calculated by using the sum of flagellum and head length. Note that whereas the transition from midpiece to head and the end of the acrosome are visible in light microscopy; the acrosome and nucleus cannot be precisely differentiated and are portrayed here only to demonstrate the composite nature of the head measurement.

Total sperm length was calculated as the sum of the length of the flagellum and head because the calculated measurement correlated strongly with measured total sperm length (Pearson r = 0.92, df = 98, p<0.0001, n = 100 randomly chosen sperm using the function ‘runif()’ in R version 3.3.1 [45]).

Whereas massage and female dummy samples were indistinguishable under the microscope, on occasion, we could discriminate a faecal from a massage sample, if detritus was present in faecal samples. Throughout the measuring process, however, the observer was blind in respect to the question at test. To test how many sperm were needed to get a sufficiently precise estimate of individual sperm length, we initially measured 20 sperm per male, considering individual mean trait estimates using R2 of linear regression. Using the built-in function ‘sample()’ in R version 3.3.1 [45], we selected one of the three sperm components (i.e. midpiece) measured from 45 males and regressed means from measuring 5, 10, 15 midpieces against the "full" mean trait estimate when measuring 20 midpieces. The adjusted R2 can then be used to interpret how much of the variance, when measuring 20 sperm, is explained by each single predictor [46]. The adjusted R2 was 0.71, when using the mean length of 5 sperm per male, compared to 0.91 and 0.97 for 10 and 15 sperm, respectively. We thus concluded that measuring 10 sperm per male was sufficient to estimate individual sperm length. To establish observer repeatability, we randomly selected one microscope slide using the built-in function ‘sample()’ in R version 3.3.1 [45] and measured 20 sperm twice, leaving 48 hours between measurements to ensure independence of measurements.

Statistical analyses

We fitted linear mixed models with Gaussian errors using the function ‘lmer’ from the package ‘lme4’ [47] in R version 3.3.1 [45] with the total length of single sperm components as respective response variables. We used the raw data from all sperm measured per male for linear mixed models (range 10 − 20 sperm per male) instead of using the mean or median of all sperm measured per male. Collection technique was fitted as a predictor variable (two levels: faecal and massage sampling). Because we did not obtain all samples from all males as anticipated, we excluded female dummy samples from analyses and added the relative order of the faecal to massage collection technique (i.e. first, second) as a fixed effect to the model. This choice of order is sensible, because sperm supplies are replenished in house sparrows over night [39]. Inbreeding depression can affect sperm morphology [48], and individual standardised multilocus heterozygosity (i.e. sMLH, calculated with the R package ‘inbreedR’ [49]) was therefore used as a proxy for inbreeding depression from marker data [49]. However, sMLH was not associated with the length of sperm traits (results not shown), which means that our findings were not affected by inbreeding depression, if at all present in our population, and we therefore did not keep this variable in the model. We added male ID, sample ID and cohort (i.e. year of birth) as random effects on the intercept to account for repeated measurements of individuals, non-independence of sperm measurements within experimental ejaculates, and potential cohort effects. Model fit and assumptions were validated by visual inspection of residuals [46]. Observer repeatability and individual male repeatability for length measurements were calculated with the R package ‘rptR’ [50] suitable for Gaussian data using 1000 bootstrap and 1000 permutations. We used the function ‘sim’ from the package ‘arm’ to calculate posterior distributions (n = 1000 draws) with flat priors from our linear mixed models [51] and report posterior means and 95% Bayesian Credible Intervals (CrI). CrI not overlapping zero are interpreted as a Frequentist p-value of < 0.05, and thus as a statistically significant result [46]. Hence, CrIs can be used to test null-hypotheses but doing so should not exclude acknowledging that CrIs provide more valuable information than p-values (e.g. uncertainty in the parameter estimate, how close the model estimate is to zero) [46,52]. The data and the R script are available at the Open Science Framework (DOI 10.17605/OSF.IO/RYCMN).


Efficacy of sperm collection techniques

We collected sperm using three common methods, which allowed us to compare their efficacy in obtaining samples and sperm. Of 52 males tested on two days, only three males copulated with the female dummy, and for only two of these males could we collect experimental ejaculates. In total, we had five experimental ejaculates from two males available from female dummy sampling (one male copulated twice with the female dummy during trial 1 and both males copulated with the female dummy during trial 1 and trial 2). Because of the limited sample size, the sperm length measures from the female dummy samples were thus only used for descriptive summary statistics (Table 1) but omitted from further statistical analyses. Faecal sampling proved to be more successful and samples were obtained in 99 of 104 trials. Five defaecation trials were unsuccessful, because the males did not defaecate within ten minutes. Similarly successful as with faeces collection, massage failed only in five out of 104 trials and in all these cases, failure coincided with males exhibiting small cloacal protuberances [41], indicating low breeding condition [53] compared to the other experimental males. All five female dummy samples, 67 out of 99 faecal samples, and 71 out of 99 massage samples (6% more compared to faeces) could be used for sperm length measurements.

Table 1. Descriptive summary statistics of length measurements of house sparrow sperm.

Sperm morphometrics

All sperm traits showed high observer measurement repeatability (>80%, Table 2) and repeatability within-males across days and methods (Table 2). Descriptive summary statistics (Table 2) demonstrate that our length measurements are representative of the species because they are similar to published records (e.g. [24,44,54]).

Table 2. (a) Observer repeatability, and (b) individual male repeatability for house sparrow sperm length measurements.

Our analysis of collection method affecting sperm length controlling for relative order of sampling showed that, on average, the total length of faecal sampled sperm across males was 0.41 μm less than of sperm collected by massage (Table 3). This difference, albeit small, was statistically significant, and explained by shorter sperm heads and midpieces in faecal compared to massage samples (Table 3, Fig 5). Sperm flagella did not differ in length between methods (Table 3). Furthermore, because we ultimately compared only two methods for sperm length differences, due to the female dummy experiment resulting in small sample sizes, a paired t-test could be seen as a post-hoc statistical alternative to linear mixed models. In contrast to the linear mixed models, only males with massage and faecal samples obtained during both trials were used for the paired t-test (n = 16 instead of 47 males), so relative order of methods, inbreeding etc. did not need to be controlled for. Using this simpler test confirmed our result of shorter sperm heads (paired t-test t 31 = 5.26, P < 0.001) and midpieces (paired t-test t 31 = 2.55, P = 0.02), but not flagella (paired t-test t 31 = 0.27, P = 0.79) in faeces compared to massage samples (Fig 6). Hence, despite small effect sizes, the results remained robust when using a smaller dataset.

Fig 5. Sperm length (μm) differences in relation to sperm sampling method in house sparrows controlling for relative order of massage to faecal sampling.

Sperm heads (a), and midpieces (b) were significantly shorter sampled from faeces [17] (823 sperm from 41 males) compared to abdominal massage samples [19,20] (822 sperm from 45 males). Filled dots represent means and vertical lines represent 95% Bayesian Credible Intervals (CrI).

Fig 6. Individual males' sperm length (μm) differences in relation to sperm sampling method in house sparrows.

We present individual raw data of males (n = 16 males) for which we had a total of two faecal and two massage samples. Thin grey lines connect the measurements for massage and faecal samples of individual males per trial. Pink lines connect the average measurements of massage and faecal sampled sperm across trials.

Table 3. Length (μm) of house sparrow sperm in relation to sperm sampling method controlling for relative order of faecal [17] to massage samples [19,20].

A follow-up analysis of a subset of 13 males also showed no quantitative difference of deformed sperm in faecal compared to massage sampled sperm (mean number deformed sperm among 100 sperm ± SD: faecal: 8.9 ± 8.04; massaged: 13.46 ± 10.34, paired Wilcoxon signed rank test, V = 20, P = 0.15, see also S1 Appendix in the supporting information for more details).


In contrast to the assumption that sperm morphometry does not differ between collection methods (e.g. [55]), we found that sperm were shorter when collected from faecal instead of massage samples. Specifically, we found heads and midpieces in faecal samples to be shorter compared to massage samples, while no difference was found in flagella length.

Sperm length differences can have methodological or biological origins. For instance, sperm are sensitive to handling and storage [56] and sperm deformation can occur because of sample preparation [57,58]. However, sperm deformation cannot explain our results, because we only measured intact, non-deformed sperm. Also, handling and storage cannot explain our finding either, as we collected samples within a short period of time, sample type was randomised within sampling event, and samples were prepared and measured as one batch. Additionally, we adopted published procedures for all three methods, used fixatives of identical concentrations and neither PBS nor formalin are known to affect avian sperm morphology [59]. Instead, biological factors such as the intensity of sperm competition [60], genome size [28] or sperm maturation [61] can affect sperm size. Here, we had repeated measures of individual males in identical environments, which remove the possibility for intrinsic effects to account for our results. Specifically, our additional analysis of using a paired t-test confirmed our results from linear mixed models and was restricted to males for which we collected faecal and massage samples during both events, so here males served as their own control. Therefore, the observed sperm size differences in heads and midpieces between methods might be explained by sperm in faeces resembling a different subpopulation of sperm within males and could be indicative of differences in the degree of post-meiotic sperm maturation [62,63].

Highlighting sperm maturation can help interpreting our result of differences in sperm length between methods. Sperm production takes place in the seminiferous tubules of the testis where primordial germ cells transform into spermatids [64]. Spermatids then undergo further development to mature into fully functioning spermatozoa. One feature of this maturation process called spermiogenesis is the elongation of sperm heads and flagella [61,64]. In other words, fully matured sperm are longer compared to less mature sperm. Furthermore, passerine spermiogenesis can be categorised according to work done at the ultrastructural level on house sparrow sperm [61]. Importantly, sperm at the last two stages of the maturation process, which are called stage 5 and 6 [61], resemble morphologically typical passerine sperm [44], so sperm from stage 5 onwards would have qualified to be measured in our study. However, in regard to sperm length, stage 5 sperm differ from stage 6 sperm in that the heads and midpieces are not fully elongated [44,61]. It is thus possible that the observed difference in sperm length is explained by immature sperm, (i.e. stage 5 ≤ sperm < stage 6 [61]), being defaecated rather than stored for copulation. An alternative explanation is that both faecal and massage sampled sperm have finished spermiogenesis, and thus the elongation of sperm components, but it is more senescent sperm sensu [31] that are defaecated instead of being stored. Indeed, the continuous release of sperm in reproductively active males has been speculated to remove excessive or senescent sperm [65], which would ensure that high-quality sperm are available for insemination. The length difference that we found could thus be explained, in principle, by senescent sperm in faeces versus fresh sperm in the seminal glomera. However, the literature does not suggest changes in sperm length as an accompanying feature of post-meiotic sperm senescence [31], which makes this explanation less likely. Moreover, it is unclear which mechanisms might account for either the idea of selective sperm loss of senescent [18] or less mature sperm with defaecation. Under both scenarios, however, our results could be regarded as indirect evidence to support an adaptive explanation of sperm defaecation [65], assuming that sperm of lower fertilisation efficiency are defaecated. Another factor that might play a role is that sperm sampled from faeces might experience higher osmotic stress compared to sperm sampled from abdominal massage. Uric acid attributes little to osmotic pressure [15], but the osmotic concentration in faeces might still be higher than the osmotic concentration of epididymal secretions, which might lead to shrinkage of faecal sampled sperm. Whereas we can only speculate about the biological causes of our results, the important implication from our findings is that when collection techniques are mixed in avian sperm studies, the method of sperm sampling should be controlled for statistically to reduce uncertainty in statistical models [35,36].

To explain aspects of avian evolutionary biology through sperm biology it is desirable to collect natural ejaculates. Unarguably, the method that comes closest to sampling natural ejaculates is the stuffed female dummy technique [40], especially with its adaptation in fowl where a harness is fitted to live females [66]. The stuffed female dummy technique might work well in some species [67], but despite a peculiar anecdote that house sparrow males might require little stimuli to initiate copulation [68], it did not work well in our populations (we also tried the female dummy collection technique in the field on a wild population with no success (Girndt personal observation). Commonly, only a subset of males copulate with a female dummy [67]. In our experiment this subset was very small (6% of 52 males) and the behavioural difference between the three males that did copulate with the female dummy and the majority of males that did not, differed markedly: rapid and repeated copulation versus complete ignorance. Also, a pilot study on 45 reproductively active males in our population in 2014 (Beirer et al unpublished) tested the female dummy three times using a single-male set-up. There, a total of two males copulated with the female dummy, and the number of males that approached it within a vicinity of 20 centimetres decreased from 11 males during trial 1 to two males during the final, third trial. Thus, repeated exposure to the female dummy seemed to counteract the little initial interest she had sparked. Therefore, we doubt that house sparrows could be trained to copulate with the female dummy, but we cannot exclude that a more sophisticated female dummy, e.g. a copulation robot mimicking female solicitation [43] might be more successful as a sperm collection device in house sparrows. The massage method yielded more samples than the female dummy method, and both methods have additional advantages and disadvantages that are worth highlighting. Whereas little to no training will be required to collect the wet part of a male's faeces, massaging passerines demands practice and training. Also, mostly clean samples are obtained from massage, whereas faecal samples hold detritus that can obstruct viewing sperm under the microscope. Lastly, faecal sperm sampling has been advertised as less invasive [17]. Under the original medical interpretation of the word: no introduction of instruments into the body [69], neither the faecal, nor the female dummy or massage technique are invasive. Using a more applied interpretation of "non-invasive" as minimised handling stress, we argue that faecal sperm sampling is only non-invasive in the unlikely scenario that a researcher finds a fresh [17], pipettable bird's faeces in the field, and can assign it to a species/individual without restraining it. This scenario immediately limits the questions that can be answered by it (e.g. describing gross sperm morphology in a new species). Instead, a literature search of studies citing [17] suggested that faecal sperm sampling was commonly applied by following the sampling method described in [17], which involved handling and constraining males. Handling males is also needed for sperm collection via massage but from our experience massaging males requires less time than faecal sperm sampling.

To the best of our knowledge, our study is the first that experimentally tested whether the faecal method can be used interchangeably with massage or the female dummy technique, using a randomised experimental design with repeated measures from individual male house sparrows. We found a statistically significant, albeit small, difference between sperm length of massage and faecal sampled sperm that could resemble length differences between sperm at potentially different maturational stages. Importantly, our effect size of mean head differences for instance, is similar to effect sizes described in bird, mammal or fish sperm literature (e.g. [65,7073]). Consequently, if sperm length varies within males according to method (see also [29,74] highlighting qualitative sperm differences), earlier results that did not take methodological differences into account might have made an interpretation of results more difficult. We encourage other scientists working in avian sperm biology to replicate our approach to test its generality in other species. In addition, where collection of natural ejaculates is improbable, we recommend the abdominal massage over the faecal sampling technique due to our findings, its advantage of giving cleaner samples and no difference in invasiveness.

Supporting information

S1 Movie. House sparrow male copulating with a female dummy.

This video gives an example of a male house sparrow, Passer domesticus, copulating with a stuffed female dummy house sparrow. The female dummy had lived in the population for seven years before she died of a natural cause in 2014. The video was taken to illustrate an experimental copulation with the female dummy. It was not part of the experiment described in the manuscript.


S1 Appendix. Sperm abnormality procedures.

Description of methods.



Various people supported data collection and we are grateful to E. Beirer, K.R. Heckel-Merz, B. Kempenaers, E. Koch and K. Teltscher. We thank M. Bulla and F. Helfenstein for comments on the manuscript and four anonymous reviewers.


  1. 1. Dunbar RIM. Life history tactics and alternative strategies of reproduction. In: Bateson P, editor. Mate choice. Cambridge: Cambridge University Press; 1983. pp. 423–447.
  2. 2. Birkhead T. Promiscuity. An evolutionary history of sperm competition. Cambridge, Massachusetts: Harvard University Press; 2000. pp. 1–272.
  3. 3. Parker GA. Sperm competition and its evolutionary consequences in the insects. Biol Rev. 1970;45: 525–567.
  4. 4. Møller AP, Ninni P. Sperm competition and sexual selection: a meta analysis of paternity studies of birds. Behav Ecol Sociobiol. 1998;43: 345–358.
  5. 5. Simpson JL, Humphries S, Evans JP, Simmons LW, Fitzpatrick JL. Relationships between sperm length and speed differ among three internally and three externally fertilizing species. Evolution. 2014;68: 92–104. pmid:24224469
  6. 6. Fitzpatrick JL, Garcia-Gonzalez F, Evans JP. Linking sperm length and velocity: the importance of intramale variation. Biol Lett. 2010;6: 797–799. pmid:20484233
  7. 7. Firman RC, Simmons LW. Sperm midpiece length predicts sperm swimming velocity in house mice. Biol Lett. 2010;6: 513–516. pmid:20147311
  8. 8. Fitzpatrick JL, Lüpold S. Sexual selection and the evolution of sperm quality. Molecular Human Reproduction. 2014. pp. 1180–1189. pmid:25323970
  9. 9. Cummins J. Sperm motility and energetics. In: Birkhead TR, Hosken DJ, Pitnick S, editors. Sperm Biology An evolutionary perspective. 1st ed. Academic Press; 2009. pp. 186–206.
  10. 10. Howard DJ, Palumbi SR, Birge LM, Manier MK. Sperm and speciation. In: Birkhead TR, Hosken DJ, Pitnick S, editors. Sperm Biology An evolutionary perspective. 1st ed. Academic Press; 2009. pp. 367–403.
  11. 11. Kleven O, Laskemoen T, Fossøy F, Robertson RJ, Lifjeld JT. Intraspecific variation in sperm length is negatively related to sperm competition in passerine birds. Evolution. 2008;62: 494–499. pmid:18070085
  12. 12. Johnson DP, Briskie J V. Sperm competition and sperm length in shorebirds. Condor. 1999;101: 848–854.
  13. 13. Lifjeld JT, Laskemoen T, Kleven O, Albrecht T, Robertson RJ. Sperm length variation as a predictor of extrapair paternity in passerine birds. PLoS One. 2010;5. pmid:20976147
  14. 14. Birkhead TR, Moller AP. Sperm competition in birds. Evolutionary causes and consequences. London: Academic Press; 1992.
  15. 15. Gee GF, Bertschinger H, Donoghue AM, Blanco J, Soley J. Reproduction in nondomestic birds: physiology, semen collection, artificial insemination and cryopreservation. Avian Poult Biol Rev. 2004;15: 47–101.
  16. 16. Briskie J V, Montgomerie R. Testis size, sperm size and sperm competition. In: Jamieson B, editor. Reproductive Biology and Phylogeny of Birds. Science Publishers; pp. 513–553.
  17. 17. Immler S, Birkhead TR. A non-invasive method for obtaining spermatozoa from birds. Ibis. 2005;147: 827–830.
  18. 18. Quay WB. Spontaneous continuous release of spermatozoa and its predawn surge in male passerine birds. Gamete Res. 1987;16: 83–92. pmid:3506902
  19. 19. Wolfson A. The cloacal protuberance: a means for determining breeding condition in live male passerines. Bird-Banding. 1952;23: 159–165.
  20. 20. Burrows WH, Quinn JP. The collection of spermatozoa from the domestic fowl and turkey. Poult Sci. 1937;16: 19–24.
  21. 21. Calhim S, Immler S, Birkhead TR. Postcopulatory sexual selection is associated with reduced variation in sperm morphology. PLoS One. 2007;2. pmid:17476335
  22. 22. Immler S, Pitnick S, Parker GA, Durrant KL, Lüpold S, Calhim S, et al. Resolving variation in the reproductive tradeoff between sperm size and number. Proc Natl Acad Sci U S A. 2011;108: 5325–30. pmid:21402912
  23. 23. Lüpold S, Linz GM, Rivers JW, Westneat DF, Birkhead TR. Sperm competition selects beyond relative testes size in birds. Evolution. 2009;63: 391–402. pmid:19215291
  24. 24. Birkhead TR, Immler S, Pellatt EJ, Freckleton RP. Unusual sperm morphology in the Eurasian Bullfinch (Pyrrhula pyrrhula). Auk. 2006;123: 383–392.
  25. 25. Laskemoen T, Albrecht T, Bonisoli-Alquati A, Cepak J, de Lope F, Hermosell IG, et al. Variation in sperm morphometry and sperm competition among barn swallow (Hirundo rustica) populations. Behav Ecol Sociobiol. 2013;67: 301–309.
  26. 26. Bennison C, Hemmings N, Slate J, Birkhead T, Bennison C. Long sperm fertilize more eggs in a bird. Proc R Soc B. 2015;282: 20141897. 0.1098/rspb.2014.1897
  27. 27. Immler S, Calhim S, Birkhead TR. Increased postcopulatory sexual selection reduces the intramale variation in sperm design. Evolution. 2008;62: 1538–1543. pmid:18384656
  28. 28. Girndt A, Knief U, Forstmeier W, Kempenaers B. Triploid ZZZ Zebra Finches Taeniopygia guttata exhibit abnormal sperm heads and poor reproductive performance. Ibis. 2014;156: 472–477.
  29. 29. Lüpold S, Calhim S, Immler S, Birkhead TR. Sperm morphology and sperm velocity in passerine birds. Proc Biol Sci. 2009;276: 1175–1181. pmid:19129098
  30. 30. Humphreys P. Brief obervations on the semen and spermatozoa of certain passerine and non-passerine birds. J Reprod Fertil. 1972;29: 327–336. pmid:4113685
  31. 31. Pizzari T, Dean R, Pacey A, Moore H, Bonsall MB. The evolutionary ecology of pre- and post-meiotic sperm senescence. Trends in Ecology and Evolution. 2008. pp. 131–140. pmid:18280006
  32. 32. Deviche P, Hurley LL, Fokidis HB. Avian testicular structure, function, and regulation. Hormones and Reproduction of Vertebrates—Volume 4. 2011. pp. 27–70.
  33. 33. Esponda P. Spermatozoon maturation in vertebrates with internal fertilization. Microsc Electron Biol Cel. 1991;15: 1–23.
  34. 34. Ashizawa K, Sano R. Effects of temperature on the immobilization and the initiation of motility of spermatozoa in the male reproductive tract of the domestic fowl, Gallus domesticus. Comp Biochem Physiol—Part A Physiol. 1990;96: 297–301.
  35. 35. Forstmeier W, Wagenmakers E, Parker TH. Detecting and avoiding likely false-positive findings–a practical guide. Biol Rev. 2016; pmid:27879038
  36. 36. Houslay TM, Wilson AJ. Avoiding the misuse of BLUP in behavioural ecology. Behav Ecol. 2017;
  37. 37. Laucht S, Kempenaers B, Dale J. Bill color, not badge size, indicates testosterone-related information in house sparrows. Behav Ecol Sociobiol. 2010;64: 1461–1471. pmid:20730125
  38. 38. Pizzari T, Parker G. Sperm competition and sperm phenotype. In: Birkhead TR, Hosken DJ, Pitnick S, editors. Sperm Biology An evolutionary perspective. 1st ed. Academic Press; 2009. pp. 207–245.
  39. 39. Birkhead TR, Veiga JP, Moller AP. Male sperm reserves and copulation behaviour in the House sparrow, Passer domesticus. Proc R Soc B Biol Sci. 1994;256: 247–251.
  40. 40. Pellatt EJ, Birkhead TR. Ejaculate size in Zebra finches Taeniopygia guttata and a method for obtaining ejaculates from passerine birds. Ibis. 1994;136: 97–101.
  41. 41. Quay WB. Cloacal protuberance and cloacal sperm in passerine birds: comparative study of quantitative relations. Condor. 1986;88: 160–168.
  42. 42. Sax A, Hoi H. Individual and temporal variation in cloacal protuberance size of male bearded tits (Panurus biarmicus). Auk. 1998;115: 964–969.
  43. 43. Anderson TR. Biology of the ubiquitous house sparrow. From genes to populations. New York: Oxford University Press; 2006. chapter 4.
  44. 44. Jamieson B. Avian spermatozoa: structure and phylogeny. In: Jamieson B, editor. Reproductive Biology and Phylogeny of Birds. Jersey: Science Publishers; 2007. pp. 349–398.
  45. 45. R Development Core Team. R: A language and environment for statistical computing. 2013.
  46. 46. Korner-Nievergelt, Franzi von Felten S, Roth T, Almasi B, Guélat J, Korner-Nievergelt P. Bayesian data analysis in ecology using linear models with R, BUGS, and Stan. 1st ed. Academic Press; 2015.
  47. 47. Bates D, Mächler M, Bolker BM, Walker SC. Fitting linear mixed-effects models using lme4. arXiv:14065823v1[statCO]23. 2014; 1–51. 10.1177/009286150103500418
  48. 48. Opatová P, Ihle M, Albrechtová J, Tomášek O, Kempenaers B, Forstmeier W, et al. Inbreeding depression of sperm traits in the zebra finch Taeniopygia guttata. Ecol Evol. 2016;6: 295–304. pmid:26811793
  49. 49. Stoffel MA, Esser M, Kardos M, Humble E, Nichols H, David P, et al. inbreedR: an R package for the analysis of inbreeding based on genetic markers. Methods in Ecology and Evolution. 2016. pp. 1331–1339.
  50. 50. Nakagawa S, Schielzeth H. Repeatability for Gaussian and non-Gaussian data: a practical guide for biologists. Biol Rev Camb Philos Soc. 2010;85: 935–56. pmid:20569253
  51. 51. Gelman A, Hill J. Data analysis using regression and multilevel/hierarchical models. New York: Cambridge University Press; 2007.
  52. 52. Nakagawa S, Cuthill IC. Effect size, confidence interval and statistical significance: a practical guide for biologists. Biol Rev. 2007;82: 591–605. pmid:17944619
  53. 53. Schut E, Magrath MJL, Oers K van, Komdeur J. Volume of the cloacal protuberance as an indication of reproductive state in male blue tits Cyanistes caeruleus. Ardea. 2012;100: 202–205.
  54. 54. Helfenstein F, Podevin M, Richner H. Sperm morphology, swimming velocity, and longevity in the house sparrow Passer domesticus. Behav Ecol Sociobiol. 2010;64: 557–565.
  55. 55. Immler S, Birkhead TR. Sperm competition and sperm midpiece size: no consistent pattern in passerine birds. Proc R Soc B-Biological Sci. 2007;274: 561–568.
  56. 56. Leahy T, Gadella BM. Sperm surface changes and physiological consequences induced by sperm handling and storage. Reproduction. 2011. pp. 759–778. pmid:21964828
  57. 57. du Plessis L, Soley JT. Head-base bending and disjointed spermatozoa in the emu (Dromaius novaehollandiae): A morphological comparison of two closely related defects. Theriogenology. 2011;76: 1275–1283. pmid:21752445
  58. 58. Kamar GAR, Badreldin AL. Sperm morphology and viability. Cells Tissues Organs. 1959;39: 81–83.
  59. 59. Schmoll T, Sanciprian R, Kleven O. No evidence for effects of formalin storage duration or solvent medium exposure on avian sperm morphology. J Ornithol. 2016;157: 647–652.
  60. 60. Immler S, Birkhead TR. Sperm competition and sperm midpiece size: no consistent pattern in passerine birds. Proc Biol Sci. 2007;274: 561–8. pmid:17476777
  61. 61. Góes RM, Dolder H. Cytological steps during spermiogenesis in the house sparrow (Passer domesticus, Linnaeus). Tissue Cell. 2002;34: 273–282. pmid:12176310
  62. 62. García-Herreros M. Sperm subpopulations in avian species: a comparative study between the rooster (Gallus domesticus) and Guinea fowl (Numida meleagris). Asian J Androl. 2016;18: 889–894. pmid:27751988
  63. 63. Santiago-Moreno J, Esteso MC, Villaverde-Morcillo S, Toledano-Díaz A, Castaño C, Velázquez R, et al. Recent advances in bird sperm morphometric analysis and its role in male gamete characterization and reproduction technologies. Asian J Androl. 2016;18: 882–888. pmid:27678467
  64. 64. Aire TA. Spermatogenesis and testicular cycles. In: Jamieson B, editor. Reproductive Biology and Phylogeny of Birds. 6th ed. Science Publishers; 2007.
  65. 65. Immler S, Pryke SR, Birkhead TR, Griffith SC. Pronounced within-individual plasticity in sperm morphometry across social environments. Evolution. 2010;64: 1634–1643. pmid:20015235
  66. 66. Pizzari T, Cornwallis CK, Levlie H, Jakobsson S, Birkhead TR. Sophisticated sperm allocation in male fowl. Nature. 2003;426: 70–74. pmid:14603319
  67. 67. Pizzari T. Post-insemination sexual selection in birds. In: Roldan E, Gomendio M, editors. Spermatology. Nottingham; 2007. pp. 137–155.
  68. 68. Simmons KEL. Bizarre behaviour and death of male house sparrow. Br birds an Illus Mag devoted to birds Br List. 1985;78.
  69. 69. Oxford English Dictionary. Illustrated Oxford Dictionary. Revised ed. Paperback Oxford English Dictionary. Oxford: Oxford University Press, Dorling Kindersley.
  70. 70. Burness G, Casselman SJ, Schulte-Hostedde AI, Moyes CD, Montgomerie R. Sperm swimming speed and energetics vary with sperm competition risk in bluegill (Lepomis macrochirus). Behav Ecol Sociobiol. 2004;56: 65–70.
  71. 71. Bennison C, Hemmings N, Brookes L, Slate J, Birkhead T, Birkhead T, et al. Sperm morphology, adenosine triphosphate (ATP) concentration and swimming velocity: unexpected relationships in a passerine bird. Proc Biol Sci. 2016;283: 69–149. pmid:27559067
  72. 72. Esteso MC, Fernández-Santos MR, Soler AJ, Montoro V, Martínez-Pastor F, Garde JJ. Identification of sperm-head morphometric subpopulations in Iberian red deer epididymal sperm samples. Reprod Domest Anim. 2009;44: 206–211. pmid:18992078
  73. 73. Elgee KE, Evans JP, Ramnarine IW, Rush SA, Pitcher TE. Geographic variation in sperm traits reflects predation risk and natural rates of multiple paternity in the guppy. J Evol Biol. 2010;23: 1331–1338. pmid:20456562
  74. 74. Łukaszewicz ET, Kowalczyk AM, Rzońca Z. Comparative examination of capercaillie (Tetrao urogallus L.) behaviour responses and semen quality to two methods of semen collection. PLoS One. 2015;10. pmid:26397704