Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Species-specific dynamics may cause deviations from general biogeographical predictions – evidence from a population genomics study of a New Guinean endemic passerine bird family (Melampittidae)

  • Ingo A. Müller ,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    ingo.mueller94@gmail.com

    Affiliations Department of Zoology, Division of Systematics and Evolution, Stockholm University, Stockholm, Sweden, Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden, Leibniz Institut für Evolutions- und Biodiversitätsforschung, Museum für Naturkunde, Berlin, Germany

  • Filip Thörn,

    Roles Formal analysis, Investigation, Methodology, Software, Visualization, Writing – review & editing

    Affiliations Department of Zoology, Division of Systematics and Evolution, Stockholm University, Stockholm, Sweden, Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden, Leibniz Institut für Evolutions- und Biodiversitätsforschung, Museum für Naturkunde, Berlin, Germany

  • Samyuktha Rajan,

    Roles Data curation, Formal analysis, Investigation, Methodology, Software, Visualization, Writing – review & editing

    Affiliation Department of Zoology, Division of Ethology, Stockholm University, Stockholm, Sweden

  • Per G. P. Ericson,

    Roles Formal analysis, Investigation, Methodology, Resources, Software, Validation, Visualization, Writing – review & editing

    Affiliation Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden

  • John P. Dumbacher,

    Roles Investigation, Resources, Writing – review & editing

    Affiliation Department of Ornithology and Mammalogy, California Academy of Sciences, San Francisco, CA, United States of America

  • Gibson Maiah,

    Roles Resources, Writing – review & editing

    Affiliation New Guinea Binatang Research Center, Madang, Papua New Guinea

  • Mozes P. K. Blom,

    Roles Conceptualization, Data curation, Methodology, Project administration, Resources, Software, Supervision, Validation, Writing – review & editing

    Affiliation Leibniz Institut für Evolutions- und Biodiversitätsforschung, Museum für Naturkunde, Berlin, Germany

  • Knud A. Jønsson,

    Roles Conceptualization, Data curation, Funding acquisition, Project administration, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing

    Affiliation Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden

  • Martin Irestedt

    Roles Conceptualization, Data curation, Funding acquisition, Methodology, Project administration, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing

    Affiliation Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden

Abstract

The family Melampittidae is endemic to New Guinea and consists of two monotypic genera: Melampitta lugubris (Lesser Melampitta) and Megalampitta gigantea (Greater Melampitta). Both Melampitta species have scattered and disconnected distributions across New Guinea in the central mountain range and in some of the outlying ranges. While M. lugubris is common and found in most montane regions of the island, M. gigantaea is elusive and known from only six localities in isolated pockets on New Guinea with very specific habitats of limestone and sinkholes. In this project, we apply museomics to determine the population structure and demographic history of these two species. We re-sequenced the genomes of all seven known M. gigantaea samples housed in museum collections as well as 24 M. lugubris samples from across its distribution. By comparing population structure between the two species, we investigate to what extent habitat dependence, such as in M. gigantaea, may affect population connectivity. Phylogenetic and population genomic analyses, as well as acoustic variation revealed that M. gigantaea consists of a single population in contrast to M. lugubris that shows much stronger population structure across the island. We suggest a recent collapse of M. gigantaea into its fragmented habitats as an explanation to its unexpected low diversity and lack of population structure. The deep genetic divergences between the M. lugubris populations on the Vogelkop region, in the western central range and the eastern central range, respectively, suggests that these three populations should be elevated to full species level. This work sheds new light on the mechanisms that have shaped the intriguing distribution of the two species within this family and is a prime example of the importance of museum collections for genomic studies of poorly known and rare species.

Introduction

What determines the build-up of biodiversity in space and through time is a long-standing question within biology. The accumulation of phenotypic and genetic differences between populations can only be generated through reproductive isolation that impedes genetic exchange between populations [1,2]. More explicitly, gene flow between diverging populations must be sufficiently limited so that genetic exchange does not exceed the accumulation of differentiation. Barriers underlying reproductive isolation, may differ markedly. They may be postzygotic and arise from genetic incompatibilities, which produce hybrid offspring that have either reduced fitness or are infertile [35]. Alternatively, barriers may be prezygotic and decrease the probability of mating events between populations, due to mating preferences, or through geographical (allopatric) or habitat barriers that separate different populations [46].

Mountains represent a classic example of geographical barriers both as physical barriers for populations but also because they harbour highly differentiated environments at different elevations. For sedentary lowland populations, mountains may represent unsurpassable barriers, which may over time lead to isolation and differentiation of separate lowland populations. Evidence for such montane barriers restricting gene flow between lowland populations are known from various organismal groups such as amphibians, spiders and coniferous trees [79]. Alternatively, extensive lowland valleys can also act as barriers to geneflow between populations adapted to high elevations. Lowland environments may be unsuitable for such mountain-adapted individuals, which over time become isolated on a series of mountaintops or “sky islands” [10,11] as known from some groups of birds, lizards and plants [1216].

Related to this is the observation that within island systems such as on New Guinea older taxa are often found at higher elevations, while young lineages that are generally widespread, good dispersers and show little differentiation inhabit the lowlands (e.g. [1619]. Such observations (mostly from island systems) have led to the formulation of the concept of taxon cycles, in which taxa pass through phases of expansions and contractions. The concept predicts that over time, taxa move into high elevation habitats either because they are outcompeted by new young generalist taxa in their original (lowland) habitats or because they specialise and adapt to new environments at higher elevations [17,18,2022].

Recent work on the New Guinean avifauna has provided empirical evidence in favour of species originating in the lowlands from which they move into the highlands over time and become relictual specialists [16,22,23], although some colonisation from mountaintop to mountaintop has also been shown to occur [15]. In addition, recent Pleistocene speciation events on New Guinea are mainly the result of changes in habitat distributions due to climate fluctuations, as this has caused species with continuous distributions to become geographically fragmented [2426]. Pliocene speciation events, on the other hand, are driven mainly by geological processes such as montane uplift, which is known to have caused barriers to gene flow [2730].

The Melampittidae represents an example of an old passerine family with only two deeply diverged species each placed in monotypic genera. Their taxonomic affinities have been difficult to establish, but recent genetic results have placed the family as sister to the clade containing crows (Corvidae), white-crowned shrikes (Eurocephalidae), crested jayshrikes (Platylophidae) and shrikes (Laniidae) with an estimated divergence time from these at ca. 16.1 Mya [31,32]. One of the species, Melampitta lugubris (Lesser Melampitta) is relatively common at high elevations (1150–3500 m asl.), in accordance with the notion that older species tend to occupy higher elevations [21,22,33]. The other species, Megalampitta gigantea (Greater Melampitta) is only known from six localities at mid-elevations (650–1400 m asl.) scattered across New Guinea. Based on few field observations, it is considered to be sedentary and to have limited flight capabilities [34,35]. Within its range, M. gigantea is associated with very specific karstic habitats where it has been observed to spend considerable time nesting in narrow limestone sinkholes in which the birds have to climb in and out [34]. In contrast to M. lugubris the distribution of M. gigantea does not fit the general pattern that old taxa tend to occupy high elevation.

In this genomic study we determine the population structure and demographic patterns within the species M. lugubris and M. gigantea that each exhibit different levels of habitat connectivity. To answer this, we have re-sequenced 31 individuals at a median depth-of-coverage of 10.8 x and performed various population genetic (PCA, Admixture, diversity estimates) and phylogenetic analyses (mitochondrial and nuclear phylogenies). Based on the contemporary distributions of the two species we hypothesize that:

  1. The individuals of M. gigantea represent several distinct evolutionary entities/populations, as the species is a poor disperser and has a fragmented distribution across New Guinea where it is associated with specific karst limestone habitat with sinkholes.
  2. Individuals of M. lugubris represent a relatively cohesive group, yet with some population structure as deep lowland valleys may prevent gene flow between the various montane populations in the Central Range, the Huon mountains in the northeast and the Arfak mountains in the northwest.

Material & methods

DNA sampling, sequencing and read processing

In this study, we follow the taxonomy of the IOC World Bird List [36]. We sampled 24 individuals of Melampitta lugubris of which 22 were footpads from museum specimens and two were fresh tissue samples. Additionally, we sampled 7 individuals of Megalampitta gigantaea, which represent all known samples present in museum collections. One of these samples was the only available fresh blood sample for this species. The rest were footpads from historical samples (for a detailed list of samples and the museum collections in which they are stored see S1 Table). The work is mainly based on old museum specimens and therefore no ethical approval for animal research was required for this type of study. The few fresh tissue samples included in the study are from already preserved samples at natural history museums. Required permits were obtained through the Conservation and Environment Protection Authority (CEPA) of Papua New Guinea for research permits (99902749307 to K.A.J.) and export permits (017179 and 19069), which complied with all relevant regulations. The low number of modern tissue samples was due to the difficulty of acquiring fresh samples as both species are rather elusive.

DNA from fresh blood/tissue samples was extracted using Qiagen’s DNeasy Blood and Tissue kits. For DNA extraction and sequencing library preparation of historical samples, we followed a modified version of Meyer and Kircher [37] that proved suitable for avian museum samples [38]. In short, we extract DNA from toepad tissue mainly following the instructions from Qiagen for animal tissue with the addition of Dithiothreitol (DTT) to improve the ligation yield. During library preparation, we treat our samples with USER enzyme to reduce deamination patterns that are typical for fragmented DNA from historical or ancient samples [39]. For a detailed protocol see [38]. Whole genome re-sequencing was performed on Illumina NovaSeq 6000 machines on S4 flow cells going through 200 cycles with a read length of 2 x 100 bp at the National Genomics Infrastructure (NGI) in Stockholm. We consider this read length adequate, as our previous work has shown an average length of historical DNA fragments around 90–130 bp [38]. Up to 24 samples with four libraries each were multiplexed on a single flow cell lane yielding on average 108 reads per sample at a targeted depth-of-coverage of 10 x per individual (expected genome size: ∼ 1 Gb).

Sequenced reads were then polished using the reproducible Nextflow workflow nf-polish (https://github.com/MozesBlom/nf-polish) [40,41] (see S2 Table for specific github commits used). Besides providing a quality report through FastQC (v0.11.8, [42]), this pipeline performs multiple polishing steps, including deduplication (SuperDeduper, as part of HTStream v1.3.3, [43]), adapter- and quality-based trimming (Trimmomatic v0.39, [44]), read merging (PEAR v v0.9.11, [45]) and the removal of low-complexity reads, and calculates processing statistics after each step using seqkit (v0.16, [46]). See S2 File for the applied flags of each tool. Polished reads were then mapped onto a reference genome using nf-μmap (https://github.com/IngoMue/nf-umap [47]) applying bwa-mem2 (v2.2.1, [48]) as mapping algorithm (default settings, only adding read group information), merging and converting into bam format through samtools (v1.13, [49]) with defaults and evaluating the quality control after mapping with qualimap (v2.2.2d, [50]) and investigating damage patterns that are typical for historical DNA through DamageProfiler (v0.4.9, [51]). We used the hooded crow (Corvus cornix, Refseq GCF_000738735.5, [52]) as our reference genome as it represents the most closely related species with a high-quality chromosome level assembly [31].

Our evaluation of mapped reads against the Corvus cornix genome showed a median depth-of-coverage (DoC) for the nuclear genome of ∼ 10.822 x (min: 0.031 x, max: 31.324 x, SD: 7.513) and a median percentage of mapped reads at 89.5% (min 0.2%:, max: 95.8%, SD: 23.693). Detailed values for each individual and the mitochondria are listed in S1 Table.

Phylogenetic analyses on mitochondrial and nuclear DNA

To assemble the full mitogenome from our polished reads we used nf_mito-mania with default settings (https://github.com/FilipThorn/nf_mito-mania) [53]. In a first step, this workflow uses a random subset of 5 * 106 polished reads to assemble a de novo mitochondrial backbone using MITObim (v1.9.1, [54]). As a starting seed for MITObim we have used the Corvus cornix mitochondrial scaffold (Accession MT371428.1). Variant calling implemented in this pipeline filters sites with a depth-of-coverage below 20 or above three times the average depth-of-coverage across the whole mitogenome of each individual. The resulting consensus sequences of every individual were aligned using MAFFT (v7.407, [55], see S2 File for specific flags). An occasional artefact of MITObim where mitochondrial assemblies become longer than they are supposed to be resulted in overhanging sequences in some individuals. These overhangs were then cut out of the alignment after visual inspection using Geneious Prime 2023.0.4 so that the final alignment consisted only of overlapping reads (total length 17 112 bp including gaps), which were then used as input for RAxML-NG (v1.1.0, [56] using the GTR+G substitution model, 100 bootstrap replicates and ten parsimony-based randomised stepwise addition starting trees to generate a mitochondrial maximum likelihood phylogenetic tree. Mitochondrial assemblies were also forced into diploid variant calls to check for contamination in our samples. As mitochondria are haploid, heterozygote sites are not expected and could therefore be indicative of cross-contamination.

For the nuclear phylogenetic tree, we used the previously mapped .bam files excluding individuals with very low mean depth-of-coverage (n = 3, DoC < 4 x) to call variants for each individual using freebayes (v1.3.1-dirty, [57]). We used freebayes as variant caller in this step as it is suitable to use on non-model organism genomes and is more efficient when using samples with a low to intermediate depth-of-coverage [58]. Polymorphic sites were filtered based on their quality score (> 20), allelic balance (≥ 0.2), and minimum and maximum depth-of-coverage (3 x / 100 x). We also decomposed multiple nucleotide polymorphisms (MNPs) into single nucleotide polymorphisms (SNPs) and removed heterozygous positions and indels. These filtered .vcf files were then used as input files for the reproducible Nextflow workflow nf-phylo with default settings (https://github.com/MozesBlom/nf-phylo) [59]. This pipeline first creates consensus sequences for each individual and chromosome using samtools and bcftools (both v1.12, [49]) which are then combined into chromosomal alignments. Alignments were then divided into windows of different sizes (2000, 5000, 10000, 20000 base pairs, with 100 kbp between windows). Each window would undergo filtering and was included in a filtered concatenated alignment if at least half of the individuals were represented in more than 50% of an alignment column and if 80% or more of the individuals had missing data below 40%. The resulting filtered alignments were then used to generate phylogenetic trees for each window size using IQ-TREE (v2.0.3, [60]) while applying a GTR+I+G model. Using the same windows and model, a concatenated alignment of all autosomes and alignments for each chromosome were also generated and used to infer phylogenies with IQ-TREE. Additionally, summary coalescent phylogenies were generated based on all autosomal windows using ASTRAL3 (v5.7.8, [61,62]). Lastly, site and window concordance factors (sCF and gCF) were being calculated for all inferred phylogenies using IQ-TREE to complement bootstrapping. All implemented flags and filters are listed in S2 File.

Population structure and genetic diversity

To quantify population structure and to estimate genetic diversity between samples, we used a genotype likelihood approach as implemented in ANGSD (v0.938) as this is better suited for low coverage data and would allow us to include all of our samples [63]. Specific commands and the filters used are explained in the S2 File. The filters we used for admixture and principal component analyses (PCAs) were slightly different from those used to calculate nucleotide diversity, heterozygosity and Tajima’s D. As we did not have an ancestral genome available, we used the Corvus cornix reference genome as ancestral sequence and folded the site frequency spectra (SFS). PCAs were performed through PCAngsd [64] and plotted with custom R scripts through RStudio (v 2023.03.0 build 386, R version 4.1.1, [65,66]). Admixture analyses to determine population structure were run through NgsAdmix [67] running up to K = 10 with ten replicates for each K and visualised with custom R scripts. Individual heterozygosity was estimated by generating a site frequency spectrum for each individual and dividing the number of sites with one derived allele by the total number of sites as performed by e.g. Hansen et al. [68]. Using SFS for each species, nucleotide diversity and Tajima’s D were both estimated for each chromosome as well as in 20 kb windows sliding in steps of 10 kb using the thetaStat command. We divided the pairwise theta estimator (tP) by the total number of Sites (nSites) of each chromosome/window to calculate nucleotide diversity. Statistical significance of differences in heterozygosity and nucleotide diversity between the two species was checked using Welch’s t-test after verifying normal distributions and inequal variances within the data. Lastly, we also calculated absolute divergences (Dxy) between different population pairs by using allele frequency estimates (maf files) from ANGSD as input. First, we calculated allele frequencies within a dataset containing all individuals to obtain a list of polymorphic sites to be analysed even if a site may be fixed in one of the populations. Next, we calculated allele frequencies for each population (see S6 Table for each included individual). Different population pairs were then compared and Dxy was calculated using the R script calcDxy (available from https://github.com/mfumagalli/ngsPopGen/blob/master/scripts/calcDxy.R). Compared to other methods, this tool may provide underestimated values of Dxy which should therefore only be compared between the groups within this study.

Estimation of effective population sizes through time and divergence times

To estimate effective population sizes through time, we used Pairwise Sequentially Markovian coalescent (PSMC) [69] (for details on the method see S1 File). As an estimate of the neutral genomic mutation rate per generation we used 4.6*10−9 as obtained in a study of the collared flycatcher Ficedula albicollis [70]. We set the estimated generation time for M. lugubris to 3.90 years and for M. gigantea to 4.58 years [71]. The parameters for the PSMC analysis were set to “-N30 -t5 -r5 -p 4 + 30*2 + 4 + 6 + 10” following Nadachowska-Brzyska et al. [72]. The authors observed no significant change in curve shape when modifying the atomic vectors parameter (-p) and applied the same settings to several different avian species. We only ran PSMC for the two samples of M. gigantea with the highest depth-of-coverage and for each of the five identified clusters within M. lugubris (West, Central, East, Huon, Southeast) with 100 bootstrap replicates per individual. False negative rates (FNRs) were adjusted based on depth-of-coverage. If depth-of-coverage was higher than 15 x, FNR was kept at 0. However, if individual A had higher depth-of-coverage than individual B, then individual B would have an FNR of 0.1 * x, where x is the depth-of-coverage of individual A divided by depth-of-coverage of individual B.

To estimate divergence times between the two species, but also between the different subpopulations of M. lugubris, we first ran F1-hybrid PSMC (hPSMC, [73] using the same parameters as for the previous PSMC analyses and implementing 100 bootstraps replicates. Additionally, we estimated mitochondrial divergences within and between the two species and subpopulations of M. lugubris using the previously generated mitochondrial alignment and its nucleotide diversity matrix as obtained through Geneious Prime 2023.0.4. We applied a simple average divergence rate between two avian lineages for the whole mitochondria at 1.8% per million years as estimated by Lerner et al. [74] which refines the “2% rule” by estimating divergence rates for several mitochondrial regions and genes rather than just cytochrome b [75]. We compared these divergence time estimates with those obtained from the hPSMC analyses.

Acoustic recordings and analysis

Acoustic recordings of 10 M. gigantea individuals and 28 M. lugubris individuals were obtained from an online repository of avian vocalizations (https://xeno-canto.org/), which covered different locations across New Guinea. We included all types of vocalisations ‐ songs, calls and vocalisations of an unknown type in the analysis, unless the function of the vocalisation was specified by the recordist (eg: alarm). This is due to the high uncertainty in estimating the type of vocalisation in M. lugubris, and visual comparison between vocalisations classified as ‘songs’ versus ‘calls’ between individuals recorded in the same location, often showed that they were the same. The vocalisations of each individual (median = 9 vocalisations/individual) were measured by a single author (SR) using the Luscinia sound analysis program (version 2.17.11.22.01, [76]. Each vocalisation was visualised using a Gaussian windowing function with the following spectrogram settings: 13 kHz maximum frequency, 5 ms frame length, 221 spectrograph points, 80% spectrograph overlap, 80 dB dynamic range, 30% dereverberation, and 50 ms of dereverberation range. Elements, which are the smallest unit of a vocalisation, were measured as continuous sound traces and then grouped into syllables within each vocalisation (each vocalisation contained only one syllable) (see S14 Fig)

The structures of the acoustic vocalisations were then compared to each other using the dynamic time warping algorithm (DTW) in Luscinia. The DTW algorithm calculates the optimal alignment between all the vocalisations i.e., syllables in the dataset, based on multiple acoustic features, and then provides a final output of a syllable dissimilarity matrix [76]. We followed the same settings used in Wheatcroft et al. [77] that has provided reliable grouping outputs for other songbird species: compression factor = 0.0001, time SD weighting = 1, maximum warp = 25%, minimum element length = 25 samples; with the following weightings for time (5), mean frequency (1), mean frequency change (1), normalised mean frequency (1). All other acoustic features were left at the standard values. In order to examine acoustic variation in syllables, we converted the syllable dissimilarity matrix obtained from the DTW process, into 10 principal components using non-metric multidimensional scaling. These 10 components preserved the overall dissimilarity between the syllables well (kruskal stress value = 0.002, values < 0.1 are usually considered good [76]) and were used to infer acoustic variation among the two species.

Results

During contamination control using mitochondrial assemblies, we observed an increased amount of heterozygous sites across the libraries in 6 individuals (S3 Table). Upon manual inspection using Geneious Prime we found that these heterozygote positions mostly appear in blocks and often within the same regions. This suggests that they were in fact nuclear mitochondrial sequences (NUMTs) that were wrongly mapped onto the mitochondrial genome instead of being a result of contamination. We also observed that non-reference alleles often appeared at a lower frequency (98.093% of heterozygote sites had a reference allele frequency > 0.5, median reference allele frequency across all heterozygote sites at 0.874) and therefore disappeared during consensus calling, as the more frequent allele gets chosen during this step. Nonetheless, we manually excluded two regions from all samples with blocks (in total 5 700 bp out of the entire alignment’s 17 112 bp) of heterozygote sites shared across the majority of individuals. The remaining 11 412 bp were used to generate the mitochondrial phylogenies.

Phylogenetic analyses on mitochondrial and nuclear DNA

We found high congruence between phylogenies built from mitochondrial and nuclear genomes (Figs 1A and S1). Different window sizes and summary coalescent vs concatenated nuclear phylogenies also had little effect on the topology. We recovered three main clusters within M. lugubris that correspond to the geographic location of the samples on an east to west axis (Fig 1). These clusters also align with previously described subspecies of M. lugubris [35]. The first cluster within M. lugubris consists of individuals inhabiting the Birds-Head of north-western New Guinea as well as an individual in the westernmost part of the Central Range. The next cluster inhabits the western and central parts of the Central Range of New Guinea, and the third cluster inhabits the eastern and south-eastern section of the Central Range as well as the isolated outlying Huon mountains. M. gigantaea, on the other hand, shows very short branch lengths between individuals compared to M. lugubris indicating less diversity within this species. Relationships within M. gigantaea are also in accordance with the geographical locality of the samples.

thumbnail
Fig 1.

A Distribution map and sampling sites of M. gigantaea (orange distribution) and M. lugubris (blue distribution), subclusters of M. lugubris are also coloured differently. Shapefiles for administrative boundaries were obtained from geoBoundaries [78] under a CC BY license, with permission from Dan Runfola, original copyright CC BY 4.0 (2020), the map was created using QGIS [79]. B Nuclear phylogeny of Melampittidae highlighting the subdivisions within M. lugubris (West, Central, East, Huon, Southeast), the tree was constructed using IQ-TREE on concatenated 5 kbp window alignments with a GTR+I+G substitution model, support values next to the main branches show bootstraps/site concordance factors (sCF)/window concordance factors (wCF).

https://doi.org/10.1371/journal.pone.0293715.g001

Population structure and genetic diversity

As observed in the phylogenies, we recover a similar pattern in the PCAs (Fig 2A), heterozygosity (Figs 2B and S13), nucleotide diversity (π, S4 Table and S2 Fig), absolute divergence (Dxy, S6 Table) and admixture (Fig 2C) where M. gigantaea exhibits lower genetic diversity in compared to M. lugubris. For M. lugubris Tajima’s D was consistently negative with a mean value of -0.902 (SD: 0.126, median: -0.897, S5 Table and S3 Fig) which is indicative of either population expansion or a selective sweep. In M. gigantaea values for Tajima’s D were slightly above zero in the range of 0–0.2 (mean 0.103, SD: 0.040, median: 0.112, S5 Table and S3 Fig). Positive values of Tajima’s D could indicate a reduction in population size or balancing selection acting, however as the values are so close to zero the population may just evolve neutrally. In the PCA (Fig 2A), PC1 separates the two species, afterwards M. gigantaea remains closely clustered up to PC4, while subgroups corresponding to geographic localities make up clusters within M. lugubris (S4 Fig). The two distinct clusters of M. lugubris on PC2 (Fig 2A) separate eastern New Guinean populations and northwestern New Guinean populations as also observed in the phylogenetic tree (Fig 1B). Both heterozygosity and nucleotide diversity were significantly lower in M. gigantaea than in M. lugubris. Although we observed a clear trend of increasing heterozygosity with higher depth-of-coverage, the slopes for each population were similar and consistently higher in all but one population of M. lugubris (S5 Fig). Admixture analysis revealed no substructure within M. gigantaea from K = 2 to K = 7. For M. lugubris the observed clusters between K = 2–6 align with the clusters observed in the phylogenetic trees and in the PCAs. Further subdivisions within the main clusters of M. lugubris at higher values of K are also corresponding to the populations’ geographical location. Estimates of mean absolute divergence (Dxy) per-site across the entire genome ranged from a minimum of 0.0005 between the geographically most distant M. gigantaea individuals to a maximum of 0.012 between the two species. Dxy between different pairs of (sub)populations within M. lugubris (min: 0.002, max: 0.005) was higher than within M. gigantaea and about half as much as the divergence between M. gigantaea and M. lugubris in the most divergent pair of M. lugubris (West vs. East). An overview of all pairwise comparisons is given in S6 Table.

thumbnail
Fig 2. Genetic diversity and population structure in Melampittidae.

A) PCA showing the first two principal components for both species (M. gigantaea in orange, subclusters of M. lugubris in shades of blue/purple). B) Admixture analysis from K = 2 to K = 6 C) Heterozygosity for all individuals between both species.

https://doi.org/10.1371/journal.pone.0293715.g002

Estimation of effective population size in time and divergence times

PSMC curves (Fig 3) for samples from the same populations had similar shapes, but not entirely overlapping as depth-of-coverage varied between samples. Within M. lugubris the shape of the curves varied, but most of this variation could be ascribed to population specific events. In M. gigantaea, we observe an effective population size peak at around 200 Kya followed by a steady decline in effective population size up until around 40 Kya.

thumbnail
Fig 3.

PSMC plots for two representative individuals of A) M. gigantaea (solid line: AMNH 590764, dashed line: PNGNM 26694) B) M. Lugubris western population (solid line: AMNH 293751, dashed line: AMNH 293748) C) M. lugubris central population (solid line: AMNH 340368, dashed line: Bishop 98503) D) M. lugubris eastern population (solid line: Bishop 100613, dashed line: Bishop 55612), E) M. lugubris Huon population (solid line: NHMD 615247, dashed line: NHMD 616019) F) M. lugubris southeastern population (solid line: Bishop 54821, dashed line: AMNH 590750) Thinner lines depict the curves of bootstrap runs.

https://doi.org/10.1371/journal.pone.0293715.g003

The divergence time obtained from hPSMCs curves for the split between M. gigantaea and M. lugubris was estimated to about 10 mya (S6 Fig). Splits between subgroups within M. lugubris were estimated more recently with the split between Western+Vogelkop and Eastern populations at around 4–5 mya (S7 Fig) and between Western and Vogelkop populations at about 3–4 mya (S8 Fig). The next divisions within Eastern M. lugubris populations (East, Huon and Southeast) happened at similar times around 1 mya (S8 and S9 Figs).

Mitochondrial divergences were, as expected, highest for comparisons between M. gigantaea and M. lugubris and its subpopulations at a range of 9.8–13.3%. Divergence within M. gigantaea was also lower (mean 0.912%) than within M. lugubris (mean 4.979%) or even in some of its subpopulations. (For an extensive table with all comparisons of mitochondrial divergence see S11 and S12 Figs). Divergence times obtained by assuming an average rate of mitochondrial divergence of 1.8% were 5.5–7.4 mya for the split between M. gigantaea and M. lugubris, 4.1–6.2 mya for splits between Western+Vogelkop populations from Eastern populations of M. lugubris and Western populations from Vogelkop populations at 4–4.2 mya. Subdivisions within the Eastern populations were estimated at 0.2–2.9 mya.

Acoustic recordings and analysis

The first ten principal components collectively explained 97% of the variation in vocalisations across the two Melampitta species. PC1, which explained 44.5% of the variation in all vocalisations, was more varied for M. lugubris (standard deviation (SD) = 0.086) compared to M. gigantaea (SD = 0.036). The same was true for PC2, where the standard deviation was once again higher for M. lugubris (SD = 0.11) compared to M. gigantaea (SD = 0.017). These results show that M. lugubris has greater acoustic diversity than M. gigantaea (Fig 4), highlighting the need for a more in-depth examination of life history trait variation between the two species. In addition to obtaining higher quality recordings, future work should examine whether this preliminary observation of acoustic variation corresponds to different populations.

thumbnail
Fig 4. Acoustic variation across species.

Principal component space (PC1-2) of vocalisations from M. gigantaea and M. lugubris. PC1 and PC2 scores are averaged within individuals and triangles represent species centroids. Ellipses contain 95% of vocalisations of each species. The significant outlier within M. lugubris may represent an odd vocalisation that is not directly comparable with the other vocalisations included here. Note that vocalisations for M. gigantaea were only available from three localities (the Fakfak mountains in the Bird’s Neck, a locality in the southern Bird’s Neck and Tabubil in the central highlands) and vocalisations for M. lugubris were only available from two of the three distinct clades (samples were available from the western and central but no vocalisation data was available from the eastern and Huon populations).

https://doi.org/10.1371/journal.pone.0293715.g004

Discussion

The formation of the avifauna on New Guinea largely follows the predictions of taxon cycles [15,16] whereby new species form in or colonise through the lowlands and over time move upwards and become relictual at high elevations. The family Melampittidae is a species-poor old endemic lineage of New Guinea [31]. The family includes two extant species of which one (Melampitta lugubris) follows the general taxon-cycle expectation in that it is an old lineage that inhabits montane forests of New Guinea. The other species, Megalampitta gigantea, however, has a distribution associated with specific karst habitats at lower elevations and in foothills [35].

Our divergence time estimates suggest that M. gigantea and M. lugubris diverged from each other in the Miocene (at approximately 10 Mya based on hPSMC results), which is slightly younger than the divergence time estimated by Jønsson et al. [80] and slightly older than the divergence time estimated by McCullough et al. [81]. The three main populations of M. lugubris (Fig 1B) diverged from each other in the early Pliocene (at approximately 4–5 Mya based on hPSMC curves). A Pliocene divergence of M. lugubris populations coincides with major uplift of various mountain regions on New Guinea [8284], which may have shaped the present population structure of M. lugubris. The distributional pattern of populations of M. lugubris, with one distinct Vogelkop population and a division of an eastern and a western population along the central mountain range, is also a pattern similar to that found in other New Guinean mountain birds with Pliocene divergences [30,85,86].

The PSMC curves of the three main populations of M. lugubris differ (Fig 3), yet with a general trend of increasing population sizes towards the present. The exception to this is the population of the Huon mountains, which shows a continuous decrease in population size since approximately 100 Kya. Our interpretation is that eastern and south-eastern populations of the Central Range have maintained continuous gene flow, while the connectivity with the Huon population was broken or at least severely reduced as this population became isolated in the outlying Huon Mountain range.

Given a presumed poor dispersal capacity [34] and a patchy distribution at mid-elevations, we initially hypothesised that M. gigantea would exhibit a clear population structure. However, contrary to expectations, all our samples of M. gigantea, from localities scattered across New Guinea, cluster tightly together genetically (Figs 1 and 2). Analysis of vocalisations also shows a similar pattern, as M. gigantea exhibits less acoustic diversity compared to M. lugubris (Fig 4). This is fascinating and difficult to explain. Below, we discuss three scenarios that may provide possible explanations for these patterns. First, it is possible that continuous migration (or high rates of juvenile dispersal) of M. gigantea individuals maintains contact and gene flow between populations. However, an exclusively ground-dwelling lifestyle and the lack of long-distance flight capabilities, suggested by its morphology and field observations contradict this scenario [34,35]. Second, it is possible that their presently known fragmented distribution does not properly reflect their actual distribution, which may be more extensive [35,87]. Karst regions are generally species-poor in comparison to the species-rich tropical forests of New Guinea and such localised karstic areas dispersed throughout New Guinea may therefore have commanded less attention by ornithological surveys. Finally, it is possible that M. gigantea once had a wider more continuous distribution and that a recent decline has left scattered populations in small pockets of Karst habitat. The PSMC analyses support this scenario by showing that the population size of M. gigantea has dropped dramatically within the last 200 Ky (Fig 3). The fact that M. gigantea is highly adapted to a very specific habitat type (nesting in deep holes in karst limestone that they have to climb out of [33] is, however, difficult to reconcile with this scenario. However, one may speculate that M. gigantea in the past had broader habitat preferences not only restricted to the present karst limestone habitats. Perhaps during the last 200 Ky, increased competition from other species forced M. gigantea to retract to a particular low-diversity habitat type, leaving behind the scattered distribution that we see today. Overall, we find it most plausible, that M. gigantea had a larger and more continuous distribution in the past, yet we acknowledge that the present distribution may be underestimated. Additional ornithological surveys to suitable habitats may, thus, reveal further M. gigantea populations.

Conclusions

In this study, the rather surprising population structure of the two species of an old New Guinean avian family have been elucidated by genomic data largely obtained from historical museum collections. While the population structure of Melampitta lugubris is similar to those found in other mountain birds of New Guinea with similar age, the population structure of Megalampitta gigantea is intriguing. The study is an example of how intrinsic properties, such as demographic history as exhibited by M. gigantea, may cause their population dynamics to deviate from general biogeographical predictions. The study is also an example of how important museum collections are for increasing the knowledge of rare taxa that occur in remote regions. The levels of divergence between the three major populations of M. lugubris are well above those at which ornithologists would normally assign species rank. Consequently, we tentatively propose that these three populations should be elevated to species rank, Melampitta lugubris (Schlegel, 1871) in the Vogelkop region, Melampitta rostrata (Ogilvie-Grant, 1913) in the western central range and Melampitta longicauda (Mayr & Gilliard, 1952) in the eastern central range.

Supporting information

S1 Fig. Mitochondrial phylogeny for all individuals based on an alignment of mitochondrial consensus sequences.

The tree was constructed using RaxML-NG applying a GTR+G substitution model.

https://doi.org/10.1371/journal.pone.0293715.s001

(TIF)

S2 Fig. Individual nucleotide diversity (π) is significantly lower in M. gigantaea (orange) than in M. lugubris (blue).

The applied statistical test was a Welch’s two sample t-test for unequal variances.

https://doi.org/10.1371/journal.pone.0293715.s002

(TIF)

S3 Fig. Tajima’s D across all chromosomes.

Chromosomes are divided into macrochromosomes (> = 40 Mbp), intermediate chromosomes (> = 20 Mbp, < 40 Mbp) and microchromosomes (< 20 Mbp). Values are consistently negative across most of each chromosome in M. lugubris (blue) and slightly positive in M. gigantaea (orange).

https://doi.org/10.1371/journal.pone.0293715.s003

(TIF)

S4 Fig. Principal components 1 to 4 describing divisions within subpopulations of M. lugubris (shades of blue/purple) while M. gigantaea (orange) remains a tight cluster.

Weyland represents one individual (AMNH 301956) that was collected between our Western and Central populations and is assigned to the Western population in most other analyses.

https://doi.org/10.1371/journal.pone.0293715.s004

(TIF)

S5 Fig. Heterozygosity shows a correlation with increasing depth of coverage (DoC).

Slopes are similar between populations/species and M. gigantaea still shows lower heterozygosity than most M. lugubris populations when comparing individuals with similar DoC. To fit regression lines, we applied Kendall’s rank correlation coefficient as it is recommended for smaller sample sizes containing outliers [Kendall MG. A New Measure of Rank Correlation. Biometrika. 1938;30(1/2):81–93.].

https://doi.org/10.1371/journal.pone.0293715.s005

(TIF)

S6 Fig. PSMC for M. lugubris (blue) and M. gigantaea (red) and their hybrid PSMC curve (purple) to show the divergence time between the two species.

https://doi.org/10.1371/journal.pone.0293715.s006

(TIF)

S7 Fig. PSMC for M. lugubris NHMD616019 from Huon (red) and M. lugubris B98503 from Central New Guinea (blue) and their hybrid PSMC curve (purple) to show the divergence time between Eastern populations and Western + Central populations of M. lugubris.

https://doi.org/10.1371/journal.pone.0293715.s007

(TIF)

S8 Fig. PSMC for M. lugubris B98503 from Central New Guinea (red) and M. lugubris AMNH293751 from Western New Guinea (blue) and their hybrid PSMC curve (purple) to show the divergence time between Central populations and Western populations of M. lugubris.

https://doi.org/10.1371/journal.pone.0293715.s008

(TIF)

S9 Fig. PSMC for M. lugubris NHMD616019 from Huon (red) and M. lugubris AMNH590750 from the Southeast (blue) and their hybrid PSMC curve (purple) to show the divergence time between Huon populations and Southeastern populations of M. lugubris.

https://doi.org/10.1371/journal.pone.0293715.s009

(TIF)

S10 Fig. PSMC for M. lugubris NHMD616019 from Huon (red) and M. lugubris B100613 from the East (blue) and their hybrid PSMC curve (purple) to show the divergence time between Huon populations and East populations of M. lugubris.

https://doi.org/10.1371/journal.pone.0293715.s010

(TIF)

S11 Fig. Mitochondrial divergence matrix showing the minimum (first value) and maximum (second value) for each comparison of populations and species, M. gigantaea (Meg), M. lugubris (Mel), Eastern populations (EPops) include the subpopulations East, Southeast and Huon.

https://doi.org/10.1371/journal.pone.0293715.s011

(TIF)

S12 Fig. Mitochondrial divergence matrix showing the mean divergence for each comparison of populations and species, M. gigantaea (Meg), M. lugubris (Mel), Eastern populations (EPops) include the subpopulations East, Southeast and Huon.

https://doi.org/10.1371/journal.pone.0293715.s012

(TIF)

S13 Fig. Heterozygosity between M. gigantaea and populations of M. lugubris. M. gigantaea is shown in orange, populations of M. lugubris in shades of blue/purple. Statistical significance is only shown for significant differences between pairwise comparisons of M. gigantaea and each population of M. lugubris applying Welch’s t-test.

https://doi.org/10.1371/journal.pone.0293715.s013

(TIF)

S14 Fig. Representative acoustic vocalisations of M. gigantaea and M. lugubris.

For each species, vocalisations from four different individuals are depicted. The top panel illustrates how vocalisations were measured in the acoustic software Luscinia: The user manually traces out the elements, the smallest unit within each vocalisation (in green), after which they are grouped into syllables (in red). Each vocalisation typically contains only 1 syllable for both species. The dynamic time warping algorithm in Luscinia creates a matrix of syllable dissimilarities using multiple frequency and time measurements that are extracted from these measured syllables.

https://doi.org/10.1371/journal.pone.0293715.s014

(TIF)

S1 Table. List of samples.

Additional information such as sample locality, museum voucher, tissue type, etc. are included. The table also shows mapping statistics (e.g. mapping percentage and depth-of-coverage) for each individual.

https://doi.org/10.1371/journal.pone.0293715.s015

(XLSX)

S2 Table. Used github commits when running Nextflow workflows.

https://doi.org/10.1371/journal.pone.0293715.s016

(XLSX)

S3 Table. Filtered individuals with heterozygous blocks in mtDNA.

https://doi.org/10.1371/journal.pone.0293715.s017

(XLSX)

S4 Table. Individual nucleotide diversity (π).

Sheet 1 (Individual) contains statistics calculated for each species using i) all chromosomes and ii) only autosomes (aut.). Sheet 2 (Species-wide) contains estimates averaged across i) all chromosomes and ii) only autosomes (aut.).

https://doi.org/10.1371/journal.pone.0293715.s018

(XLSX)

S5 Table. Species-wide Tajima’s D.

Values averaged across i) all chromosomes and ii) only autosomes (aut.).

https://doi.org/10.1371/journal.pone.0293715.s019

(XLSX)

S6 Table. Mean per-site Dxy etimates.

Sheet 1 (Dxy) shows pairwise comparisons between all major population splits as well as an estimate within M. gigantaea. Sheet 2 (Populations) lists the individuals that were included in each population.

https://doi.org/10.1371/journal.pone.0293715.s020

(XLSX)

S3 File. List of adapters that were removed during trimming.

https://doi.org/10.1371/journal.pone.0293715.s023

(DOCX)

Acknowledgments

We thank all the staff and field assistants that facilitated fieldwork in Papua New Guinea. Notably, the Binatang Research Center and local communities in the YUS conservation area from the villages of Towet and Yawan. We are also grateful for the assistance provided by the PNG National Museum and Art Gallery and the Conservation and Environment Protection Authority (CEPA) of Papua New Guinea for research permits (99902749307 to K.A.J.) and export permits (017179 and 19069). We would like to acknowledge the following museum collections and their managers that have generously provided us with tissue samples for this study: American Museum of Natural History, New York, USA (Paul Sweet, Pete Capainolo, Tom Trombone and Brian T. Smith); Bernice Pauahi Bishop Museum, Honolulu, USA (Molly Hagemann); Natural History Museum, London, UK (Robert Prys-Jones, Hein van Grouw and Mark Adams); Papua New Guinea National Museum and Art Gallery, Port Moresby, Papua New Guinea; Swedish Museum of Natural History, Stockholm, Sweden (Ulf Johansson) and the Natural history museum of Denmark, Copenhagen, Denmark (Peter Hosner). Lastly, we would like to thank Nikolay Oskolkov for assistance with bioinformatic analyses as part of the drop-in services and the Swedish Bioinformatics Advisory Program of the National Bioinformatics Infrastructure Sweden (NBIS).

References

  1. 1. Dobzhansky TG. Genetics of the evolutionary process. Vol. 139. Columbia University Press; 1970.
  2. 2. Westram AM, Stankowski S, Surendranadh P, Barton N. What is reproductive isolation? J Evol Biol. 2022;35(9):1143–64. pmid:36063156
  3. 3. Dobzhansky TG. Studies on Hybrid Sterility. II. Localization of Sterility Factors in Drosophila Pseudoobscura Hybrids. Genetics. 1936 Mar;21(2):113–35. pmid:17246786
  4. 4. Coyne JA. Genetics and speciation. Nature. 1992 Feb;355(6360):511–5. pmid:1741030
  5. 5. Turelli M, Barton NH, Coyne JA. Theory and speciation. Trends Ecol Evol. 2001 Jul;16(7):330–43. pmid:11403865
  6. 6. North HL, Caminade P, Severac D, Belkhir K, Smadja CM. The role of copy-number variation in the reinforcement of sexual isolation between the two European subspecies of the house mouse. Philos Trans R Soc B Biol Sci. 2020 Aug 31;375(1806):20190540. pmid:32654648
  7. 7. Sánchez-Montes G, Wang J, Ariño AH, Martínez-Solano Í. Mountains as barriers to gene flow in amphibians: Quantifying the differential effect of a major mountain ridge on the genetic structure of four sympatric species with different life history traits. J Biogeogr. 2018;45(2):318–31.
  8. 8. Salgado-Roa FC, Pardo-Diaz C, Lasso E, Arias CF, Solferini VN, Salazar C. Gene flow and Andean uplift shape the diversification of Gasteracantha cancriformis (Araneae: Araneidae) in Northern South America. Ecol Evol. 2018;8(14):7131–42. pmid:30073072
  9. 9. Li YS, Shih KM, Chang CT, Chung JD, Hwang SY. Testing the Effect of Mountain Ranges as a Physical Barrier to Current Gene Flow and Environmentally Dependent Adaptive Divergence in Cunninghamia konishii (Cupressaceae). Front Genet [Internet]. 2019 [cited 2023 Jan 19];10. Available from: https://www.frontiersin.org/articles/10.3389/fgene.2019.00742. pmid:31447888
  10. 10. Heald WF. Sky Island [Internet]. Van Nostrand; 1967. Available from: https://books.google.se/books?id=jPWeuQAACAAJ.
  11. 11. McCormack JE, Huang H, Knowles LL, Gillespie R, Clague D. Sky islands. Encycl Isl. 2009;4:841–3.
  12. 12. Robin VV, Sinha A, Ramakrishnan U. Ancient Geographical Gaps and Paleo-Climate Shape the Phylogeography of an Endemic Bird in the Sky Islands of Southern India. PLOS ONE. 2010 Oct 13;5(10):e13321. pmid:20967202
  13. 13. Wagner P, Greenbaum E, Malonza P, Branch W. Resolving sky island speciation in populations of East African Adolfus alleni (Sauria, Lacertidae). Salamandra. 2014;50(1):1–17.
  14. 14. Zhang D, Hao G, Guo X, Hu Q, Liu J. Genomic insight into “sky island” species diversification in a mountainous biodiversity hotspot. J Syst Evol. 2019 Nov;57(6):633–45.
  15. 15. Pujolar JM, Blom MPK, Reeve AH, Kennedy JD, Marki PZ, Korneliussen TS, et al. The formation of avian montane diversity across barriers and along elevational gradients. Nat Commun. 2022 Dec;13(1):268. pmid:35022441
  16. 16. Kennedy JD, Marki PZ, Reeve AH, Blom MPK, Prawiradilaga DM, Haryoko T, et al. Diversification and community assembly of the world’s largest tropical island. Glob Ecol Biogeogr. 2022;31(6):1078–89.
  17. 17. Wilson EO. Adaptive Shift and Dispersal in a Tropical Ant Fauna. Evolution. 1959;13(1):122–44.
  18. 18. Wilson EO. The Nature of the Taxon Cycle in the Melanesian Ant Fauna. Am Nat. 1961;95(882):169–93.
  19. 19. Ricklefs RE, Bermingham E. The concept of the taxon cycle in biogeography. Glob Ecol Biogeogr. 2002;11(5):353–61.
  20. 20. MacArthur RH, Wilson EO. The Theory of Island Biogeography. REV-Revised. Princeton University Press; 1967.
  21. 21. Ricklefs RE, Cox GW. Taxon Cycles in the West Indian Avifauna. Am Nat. 1972 Mar;106(948):195–219.
  22. 22. Pepke ML, Irestedt M, Fjeldså J, Rahbek C, Jønsson KA. Reconciling supertramps, great speciators and relict species with the taxon cycle stages of a large island radiation (Aves: Campephagidae). J Biogeogr. 2019 Jun;46(6):1214–25.
  23. 23. Reeve AH, Kennedy JD, Pujolar JM, Petersen B, Blom MPK, Alström P, et al. The formation of the Indo-Pacific montane avifauna. Nat Commun. 2023 Dec 11;14(1):8215. pmid:38081809
  24. 24. Williams DJ, O’Shea M, Daguerre RL, Pook CE, Wüster W, Hayden CJ, et al. Origin of the eastern brownsnake, Pseudonaja textilis (Duméril, Bibron and Duméril) (Serpentes: Elapidae: Hydrophiinae) in New Guinea: evidence of multiple dispersals from Australia, and comments on the status of Pseudonaja textilis pughi Hoser 2003. Zootaxa. 2008 Feb 13;(1703):47–61.
  25. 25. Kearns AM, Joseph L, Omland KE, Cook LG. Testing the effect of transient Plio-Pleistocene barriers in monsoonal Australo-Papua: did mangrove habitats maintain genetic connectivity in the Black Butcherbird? Mol Ecol. 2011;20(23):5042–59. pmid:22060632
  26. 26. Ericson PGP, Irestedt M, She H, Qu Y. Genomic signatures of rapid adaptive divergence in a tropical montane species. Biol Lett. 2021 Jul;17(7):20210089. pmid:34314643
  27. 27. Rawlings LH, Donnellan SC. Phylogeographic analysis of the green python, Morelia viridis, reveals cryptic diversity. Mol Phylogenet Evol. 2003 Apr 1;27(1):36–44. pmid:12679069
  28. 28. Dumbacher JP, Fleischer RC. Phylogenetic evidence for colour pattern convergence in toxic pitohuis: Müllerian mimicry in birds? Proc R Soc Lond B Biol Sci. 2001 Oct 7;268(1480):1971–6.
  29. 29. Deiner K, Lemmon AR, Mack AL, Fleischer RC, Dumbacher JP. A Passerine Bird’s Evolution Corroborates the Geologic History of the Island of New Guinea. PLOS ONE. 2011 May 6;6(5):e19479. pmid:21573115
  30. 30. Irestedt M, Batalha-Filho H, Roselaar CS, Christidis L, Ericson PGP. Contrasting phylogeographic signatures in two Australo-Papuan bowerbird species complexes (Aves: Ailuroedus). Zool Scr. 2016;45(4):365–79.
  31. 31. Oliveros CH, Field DJ, Ksepka DT, Barker FK, Aleixo A, Andersen MJ, et al. Earth history and the passerine superradiation. Proc Natl Acad Sci. 2019 Apr 16;116(16):7916–25. pmid:30936315
  32. 32. McCullough JM, Hruska JP, Oliveros CH, Moyle RG, Andersen MJ. Ultraconserved elements support the elevation of a new avian family, Eurocephalidae, the white-crowned shrikes. Ornithology. 2023 Jul 11;140(3):ukad025.
  33. 33. Price TD, Hooper DM, Buchanan CD, Johansson US, Tietze DT, Alström P, et al. Niche filling slows the diversification of Himalayan songbirds. Nature. 2014 May 8;509(7499):222–5. pmid:24776798
  34. 34. Diamond JM. Melampitta gigantea: Possible Relation between Feather Structure and Underground Roosting Habits. The Condor. 1983 Feb;85(1):89–91.
  35. 35. Del Hoyo J, Elliot A, Christie D. Handbook of the Birds of the World, Picathartes to Tits and Chickadees, vol. 12. Lynx Edicions Barc. 2007.
  36. 36. Gill F, Donsker DB, Rasmussen PC. IOC World Bird List 13.1 [Internet]. World Bird Names; 2023 [cited 2023 Mar 30]. Available from: https://www.worldbirdnames.org/ioc-lists/crossref.
  37. 37. Meyer M, Kircher M. Illumina Sequencing Library Preparation for Highly Multiplexed Target Capture and Sequencing. Cold Spring Harb Protoc. 2010 Jun 1;2010(6):pdb.prot5448-pdb.prot5448. pmid:20516186
  38. 38. Irestedt M, Thörn F, Müller IA, Jønsson KA, Ericson PGP, Blom MPK. A guide to avian museomics: Insights gained from resequencing hundreds of avian study skins. Mol Ecol Resour. 2022;22(7):2672–84. pmid:35661418
  39. 39. Briggs AW, Stenzel U, Meyer M, Krause J, Kircher M, Pääbo S. Removal of deaminated cytosines and detection of in vivo methylation in ancient DNA. Nucleic Acids Res. 2010 Apr;38(6):e87–e87. pmid:20028723
  40. 40. Di Tommaso P, Chatzou M, Floden EW, Barja PP, Palumbo E, Notredame C. Nextflow enables reproducible computational workflows. Nat Biotechnol. 2017 Apr;35(4):316–9. pmid:28398311
  41. 41. Blom MPK. nf-polish [Internet]. 2021. Available from: https://github.com/MozesBlom/nf-polish.
  42. 42. Andrews S. Babraham Bioinformatics ‐ FastQC A Quality Control tool for High Throughput Sequence Data [Internet]. 2010. Available from: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/.
  43. 43. s4hts/HTStream [Internet]. Software (for) High Throughput Sequencing; 2023 [cited 2024 Jan 15]. Available from: https://github.com/s4hts/HTStream.
  44. 44. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014 Aug 1;30(15):2114–20. pmid:24695404
  45. 45. Zhang J, Kobert K, Flouri T, Stamatakis A. PEAR: a fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics. 2014 Mar 1;30(5):614–20. pmid:24142950
  46. 46. Shen W, Le S, Li Y, Hu F. SeqKit: A Cross-Platform and Ultrafast Toolkit for FASTA/Q File Manipulation. PLOS ONE. 2016 Oct 5;11(10):e0163962. pmid:27706213
  47. 47. Müller IA. nf-μmap [Internet]. 2022. Available from: https://github.com/IngoMue/nf-umap.
  48. 48. Vasimuddin Md, Misra S, Li H, Aluru S. Efficient Architecture-Aware Acceleration of BWA-MEM for Multicore Systems. In: 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS). 2019. p. 314–24.
  49. 49. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, et al. Twelve years of SAMtools and BCFtools. GigaScience [Internet]. 2021 Feb;10(2). Available from: pmid:33590861
  50. 50. Okonechnikov K, Conesa A, García-Alcalde F. Qualimap 2: advanced multi-sample quality control for high-throughput sequencing data. Bioinformatics. 2016 Jan 15;32(2):292–4. pmid:26428292
  51. 51. Neukamm J, Peltzer A, Nieselt K. DamageProfiler: fast damage pattern calculation for ancient DNA. Bioinformatics. 2021 Oct 25;37(20):3652–3. pmid:33890614
  52. 52. Poelstra JW, Vijay N, Bossu CM, Lantz H, Ryll B, Müller I, et al. The genomic landscape underlying phenotypic integrity in the face of gene flow in crows. Science. 2014 Jun 20;344(6190):1410–4. pmid:24948738
  53. 53. Thörn F. nf_mito-mania [Internet]. 2022 [cited 2023 Mar 30]. Available from: https://github.com/FilipThorn/nf_mito-mania.
  54. 54. Hahn C, Bachmann L, Chevreux B. Reconstructing mitochondrial genomes directly from genomic next-generation sequencing reads—a baiting and iterative mapping approach. Nucleic Acids Res. 2013 Jul 1;41(13):e129. pmid:23661685
  55. 55. Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80. pmid:23329690
  56. 56. Kozlov AM, Darriba D, Flouri T, Morel B, Stamatakis A. RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference. Bioinformatics. 2019 Nov 1;35(21):4453–5. pmid:31070718
  57. 57. Garrison E, Marth G. Haplotype-based variant detection from short-read sequencing [Internet]. arXiv; 2012 [cited 2023 Sep 8]. Available from: http://arxiv.org/abs/1207.3907.
  58. 58. Stegemiller MR, Redden RR, Notter DR, Taylor T, Taylor JB, Cockett NE, et al. Using whole genome sequence to compare variant callers and breed differences of US sheep. Front Genet [Internet]. 2023 Jan 4 [cited 2024 Mar 22];13. Available from: https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2022.1060882/full. pmid:36685812
  59. 59. Blom MPK. nf-phylo [Internet]. 2023 [cited 2023 Mar 30]. Available from: https://github.com/MozesBlom/nf-phylo.
  60. 60. Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies. Mol Biol Evol. 2015 Jan 1;32(1):268–74. pmid:25371430
  61. 61. Zhang C, Rabiee M, Sayyari E, Mirarab S. ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees. BMC Bioinformatics. 2018 May 8;19(6):153. pmid:29745866
  62. 62. Rabiee M, Sayyari E, Mirarab S. Multi-allele species reconstruction using ASTRAL. Mol Phylogenet Evol. 2019 Jan 1;130:286–96. pmid:30393186
  63. 63. Korneliussen TS, Albrechtsen A, Nielsen R. ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinformatics. 2014 Nov 25;15(1):356. pmid:25420514
  64. 64. Meisner J, Albrechtsen A. Inferring Population Structure and Admixture Proportions in Low-Depth NGS Data. Genetics. 2018 Oct 1;210(2):719–31. pmid:30131346
  65. 65. R Core Team. R: A Language and Environment for Statistical Computing [Internet]. Vienna, Austria: R Foundation for Statistical Computing; 2021. Available from: https://www.R-project.org/.
  66. 66. Posit team. RStudio: Integrated Development Environment for R [Internet]. Boston, MA: Posit Software, PBC; 2023. Available from: http://www.posit.co/.
  67. 67. Skotte L, Korneliussen TS, Albrechtsen A. Estimating Individual Admixture Proportions from Next Generation Sequencing Data. Genetics. 2013 Sep. pmid:24026093
  68. 68. Hansen CCR, Hvilsom C, Schmidt NM, Aastrup P, Van Coeverden De Groot PJ, Siegismund HR, et al. The Muskox Lost a Substantial Part of Its Genetic Diversity on Its Long Road to Greenland. Curr Biol. 2018 Dec;28(24):4022-4028.e5. pmid:30528581
  69. 69. Li H, Durbin R. Inference of human population history from individual whole-genome sequences. Nature. 2011 Jul;475(7357):493–6. pmid:21753753
  70. 70. Smeds L, Qvarnström A, Ellegren H. Direct estimate of the rate of germline mutation in a bird. Genome Res. 2016 Jan 9;26(9):1211–8. pmid:27412854
  71. 71. Bird JP, Martin R, Akçakaya HR, Gilroy J, Burfield IJ, Garnett ST, et al. Generation lengths of the world’s birds and their implications for extinction risk. Conserv Biol. 2020;34(5):1252–61. pmid:32058610
  72. 72. Nadachowska-Brzyska K, Li C, Smeds L, Zhang G, Ellegren H. Temporal Dynamics of Avian Populations during Pleistocene Revealed by Whole-Genome Sequences. Curr Biol. 2015 May 18;25(10):1375–80. pmid:25891404
  73. 73. Cahill JA, Soares AER, Green RE, Shapiro B. Inferring species divergence times using pairwise sequential Markovian coalescent modelling and low-coverage genomic data. Philos Trans R Soc B Biol Sci. 2016 Jul 19;371(1699):20150138. pmid:27325835
  74. 74. Lerner HRL, Meyer M, James HF, Hofreiter M, Fleischer RC. Multilocus Resolution of Phylogeny and Timescale in the Extant Adaptive Radiation of Hawaiian Honeycreepers. Curr Biol. 2011 Nov;21(21):1838–44. pmid:22018543
  75. 75. Weir JT, Schluter D. Calibrating the avian molecular clock. Mol Ecol. 2008;17(10):2321–8. pmid:18422932
  76. 76. Lachlan R. rflachlan/Luscinia [Internet]. 2014. Available from: https://github.com/rflachlan/Luscinia.
  77. 77. Wheatcroft D, Bliard L, El Harouchi M, López-Idiáquez D, Kärkkäinen T, Kraft FLH, et al. Species-specific song responses emerge as a by-product of tuning to the local dialect. Curr Biol. 2022 Dec 5;32(23):5153–5158.e5. pmid:36288731
  78. 78. Runfola D, Anderson A, Baier H, Crittenden M, Dowker E, Fuhrig S, et al. geoBoundaries: A global database of political administrative boundaries. PLOS ONE. 2020 Apr 24;15(4):e0231866. pmid:32330167
  79. 79. QGIS Development Team. QGIS Geographic Information System [Internet]. QGIS Association; 2023. Available from: https://www.qgis.org
  80. 80. Jønsson KA, Fabre PH, Kennedy JD, Holt BG, Borregaard MK, Rahbek C, et al. A supermatrix phylogeny of corvoid passerine birds (Aves: Corvides). Mol Phylogenet Evol. 2016 Jan 1;94:87–94. pmid:26327328
  81. 81. McCullough JM, Oliveros CH, Benz BW, Zenil-Ferguson R, Cracraft J, Moyle RG, et al. Wallacean and Melanesian Islands Promote Higher Rates of Diversification within the Global Passerine Radiation Corvides. Ho S, editor. Syst Biol. 2022 Oct 12;71(6):1423–39.
  82. 82. Pigram C, Davies H. Terranes and the accretion history of the Papua New Guinea orogeny. BMR J Aust Geol Geophys. 1987;10:193–212.
  83. 83. Hall R. Cenozoic geological and plate tectonic evolution of SE Asia and the SW Pacific: computer-based reconstructions, model and animations. J Asian Earth Sci. 2002 Apr 1;20(4):353–431.
  84. 84. Davies HL. The geology of New Guinea-the cordilleran margin of the Australian continent. Episodes J Int Geosci. 2012;35(1):87–102.
  85. 85. Irestedt M, Batalha-Filho H, Ericson PGP, Christidis L, Schodde R. Phylogeny, biogeography and taxonomic consequences in a bird-of-paradise species complex, Lophorina–Ptiloris (Aves: Paradisaeidae). Zool J Linn Soc. 2017 Oct 1;181(2):439–70.
  86. 86. Jønsson KA, Reeve AH, Blom MP, Irestedt M, Marki PZ. Unrecognised (species) diversity in New Guinean passerine birds. Emu-Austral Ornithol. 2019;119(3):233–41.
  87. 87. Diamond J, Bishop KD. What’s so special about New Guinea birds? Bull Br Ornithol Club. 2023 Jun;143(2):212–36.