Figures
Abstract
In the Falconidae, the genus Falco comprises species of large birds of prey with wide distribution worldwide. However, the European lanner falcon Falco biarmicus feldeggii is rapidly heading for global extinction following a dramatic decline caused by anthropogenic interference. Conservation projects are currently underway with the main purpose of increasing its population size in the Mediterranean basin through captive breeding and release of birds into the wild. To support the projects, and strengthen the legitimacy of conservation efforts consistently with the Evolutionary Significant Unit concept, we explored the possibility of characterising the gene pool of the European lanner and reliably distinguishing it from other falcon taxa inhabiting the Mediterranean area, which show morphological and genetic similarities. To address the issue, we examined genetic variability at the nuclear level through the analysis of 12 neutral Short Tandem Repeat loci, and, for the first time in these taxa, two single-copy functional genes, coding for the brain-derived neurotrophic factor precursor and the oocyte maturation factor, respectively. The second exon of the major histocompatibility complex class II B gene was also investigated. Additionally, to frame our data with previously published data, we assess variation at the mitochondrial level by sequencing portions of the cytochrome b, 12S rRNA gene, and the control region. Our results showed that the European lanner is highly distinct from other falcon taxa, as revealed by nuclear, but not by mitochondrial DNA. We discuss our findings focusing on their implications for the preservation of this highly endangered European bird, and highlighted the critical role of genetic information in planning and monitoring concrete interventions.
Citation: Attili L, Garofalo L, Puddu G, Tirone G, Pizzarelli A, Barbara N, et al. (2023) Genetic distinctiveness of an endangered falcon: Implications for conservation in Europe. PLoS ONE 18(12): e0295424. https://doi.org/10.1371/journal.pone.0295424
Editor: Sebastian D. Fugmann, Chang Gung University, TAIWAN
Received: September 14, 2023; Accepted: November 20, 2023; Published: December 20, 2023
Copyright: © 2023 Attili et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and in its Supporting Information files. New sequences were deposited in the GenBank public database (https://www.ncbi.nlm.nih.gov/genbank/) with Accession numbers reported in S1 Table.
Funding: This work was supported by the European Community through a LIFE Project 18NAT/IT/000720 “LANNER”. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
In the hierofalcon complex (Aves: Falconidae), the genus Falco comprises five species of large birds of prey (saker falcon Falco cherrug, lanner falcon Falco biarmicus, gyrfalcon Falco rusticolus and laggar falcon Falco jugger) with wide distribution in the Palearctic, Nearctic and Afrotropical regions. However, many falcon populations are experiencing intense decline in different areas of the range due to direct and accidental human interference ([1] and references therein). While inhabiting very diverse habitats, these raptors show high morphological similarity and surprisingly poor genetic differentiation at the mitochondrial level, possibly as a consequence of their recent radiation and interspecific gene flow through hybridisation [1–3].
The polytypic lanner falcon F. biarmicus is widespread in the Mediterranean area with three putative subspecies: F. biarmicus feldeggii dwelling in Italy, the Balkans, Greece, Turkey and the southern Caucasus areas, F. biarmicus erlangeri in north western Africa, and F. biarmicus tanypterus occurring from northeastern Africa eastward into Israel (Fig 1). Whether tanypterus and erlangeri are different taxonomic entities, or rather are the same subspecies, is a matter of debate [4, 5]. The saker falcon F. cherrug, which reaches as far as the eastern Alps as the westernmost limit of its range, is the hierofalcon species that is geographically closest to F. biarmicus in the Mediterranean area, where it probably meets the European lanner falcon F. b. feldeggii. The latter is highly threatened with extinction over the entire range, with no more than 200 breeding pairs worldwide [6]. In peninsular Italy and Sicily, in particular, only 140–172 pairs still remained a few years ago [7], while in central Italy the population decreased from 10–15 pairs in the 1980s, to no breeding pairs at all in 2016 [6]. The European lanner is currently listed in the Birds Directive 2009/147/CE, Appendix II of the Bonn Convention, Annex II of the Bern Convention, and Appendix II of the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES). Furthermore, it is categorised as “near threatened” by the International Union for Conservation of Nature’s (IUCN) Red List of Threatened Species (https://www.iucn.org), and classified as a European Species of Conservation Concern (SPEC level 3).
Only areas where the species occur as resident and breeding native are indicated (redrawn from BirdLife International 2022, public domain, web site http://datazone.birdlife.org/species/requestdis). The individual ranges for subspecies of F. biarmicus are not indicated because it is not clear exactly which areas they cover and where they meet. Map layout was freely available online from Natural Earth at https://www.naturalearthdata.com/.
Falco b. feldeggii has been poorly investigated genetically, both in terms of number of individuals, populations and genome coverage [1, 3, 8, 9]. Most of previous molecular studies in the hierofalcon complex were based on variation at the mitochondrial (mt) DNA. Partial mt sequences of the cytochrome b gene (cytB) and the non-coding control region (CR) showed that the extant mitochondrial lineages did not identify any Falco species or subspecies (F. b. feldeggii included) as a monophyletic group. Therefore, these molecular markers proved to be of little diagnostic utility for (sub)species identification, nor were they in accordance with current taxonomy [3]. The occurrence of interbreeding between hierofalcon species, both natural and human-mediated through the practice of falconry, further complicates their differentiation [1, 3].
At the genomic level, some species of Falconidae have been investigated through the analysis of the major histocompatibility complex, MHC [8, 10–12]. MHC genes have recently gained much attention in evolutionary studies and conservation biology in birds of prey due to their exceptionally high heterozygosity compared to neutral markers [13]. However, only the saker falcon and the kestrels (Falco naumanni and Falco tinnunculus) have been extensively examined across their Eurasian ranges, actually revealing high levels of polymorphism [11, 12]. On the contrary, few individuals have been analysed for other falcons, including F. biarmicus. These preliminary results, however, revealed surprisingly low ancestral intra- and interspecific MHC variability [8].
In order to strengthen the legitimacy of conservation efforts (sensu Moritz [14]), and to proactively support ongoing projects for the reintroduction/restocking of the European lanner (LIFE ConRaSI: https://www.lifeconrasi.eu/; LIFE LANNER: https://www.lifelanner.eu/it/), we explored the possibility of characterising the gene pool of F. b. feldeggii and reliably distinguishing it from the other subspecies inhabiting the Mediterranean area (erlangeri and tanypterus) and their possible crossbreds. It is worth mentioning, however, that this goal is far from trivial, since earlier genetic studies indicated very little differentiation for the Palearctic taxa of F. biarmicus, so as to rekindle the debate on their subspecific designation ([9] and references therein).
Additionally, we aimed at differentiating F. b. feldeggii from the saker falcon F. cherrug, which is geographically the closest species, and therefore potential candidate for hybridisation. To address the issue, we first examined genetic variability at the nuclear level in F. biarmicus ssp. and F. cherrug through the analysis of 12 Short Tandem Repeat (STR) loci, selected as neutral genetic markers. Secondly, we analysed variation at portions of two single-copy functional genes, previously used for phylogeny studies in birds [15], coding for the brain-derived neurotrophic factor precursor (BDNF) and the oocyte maturation factor (Cmos). They are studied here for the first time in these taxa. Furthermore, we investigated variability at the MHC class II B gene, including more falcon taxa and a higher number of individuals than in previous studies [8, 10, 12]. The study of neutral loci and functional genes under selective pressure provides a more comprehensive picture of genetic variability [16].
Genetic variation was also assessed at the mitochondrial level by sequencing portions of the cytB gene, the 12S rRNA gene (12S), and the CR, primarily to frame our data in a context of greater variability, through comparisons with the available published results from other hierofalcon taxa and multiple geographic areas.
Finally, we discuss our genetic findings focusing on their implications for the conservation of the highly endangered European lanner, highlighting the critical role of genetic information in planning and monitoring concrete interventions.
Materials and methods
Sample collection and DNA extraction
A total of 115 samples of lanner F. biarmicus ssp. and saker falcon F. cherrug were gathered from different Mediterranean countries, museums and collections (Table 1, S1 Table). Samples consisted of plucked feathers and blood from live lanners held in captivity because sick, injured in the wild, seized by the authorities, or because involved in LIFE LANNER Project. Skin and feathers of taxidermied falcons were obtained from European collections. Most stuffed specimens were recent (N = 38), dating from no earlier than 1950, while some of them (N = 7) dated from 1847 to 1953. In addition, fresh muscle tissues were opportunistically collected from individuals that died naturally, accidentally or killed illegally. No birds were wild-caught, nor were any live birds handled for the purpose of this study, rather only as part of conservation actions and health interventions under regular permissions from the countries of origin. All necessary permits were obtained for the described study, which complied with all relevant CITES regulations.
Total DNA was isolated from approximately 15 mg of muscle or 200 μl of whole blood in EDTA preserved at -20°C, using the QIAamp DNA Mini Kit (QIAGEN, Hilden, Germany). The DNA from feather calamus and skin of museum specimens was extracted with the Maxwell16 Instrument (Promega, Madison, USA) for automated genomic DNA isolation following the user handbook, with an overnight incubation in two-fold lysis buffer volume (containing 1M dithiothreitol) as minor adjustment. Extracted DNAs were diluted in 100 μl RNAse-free molecular grade water and quantified with the Quantus Fluorometer (Promega) according to the manufacturer’s instructions. One mock tube with reagents and no sample DNA was included in each extraction round. Museum samples were processed separately from fresh tissues using dedicated equipment in different extraction/amplification sessions.
STR genotyping
We selected and amplified 18 STR loci that were isolated originally in Falco peregrinus (FP01, FP89, FP92-1, FP46-1, FP86-2, FP5, FP13, FP54, FP31, FP347, FP82-2, Fp79-1, FP79-4, FP107; [1, 17]), Falco vespertinus (FALVES-03, FALVES-31, FALVES-13; [18]), and Accipiter gentilis (AGE-5; [19]). Two loci (FP01, FP79-4) failed to amplify, and four (FP92-1, FP 46–1, FP86-2, FALVES-13) proved monomorphic in a pilot subset of twenty lanners. Consequently, six loci were excluded from further analyses. Amplifications of the remaining 12 loci were optimised in two multiplexes (Table 2). Each multiplex contained 2 μl of genomic DNA, 0.8 μM of each primer, 3.6 μl of master mix (QIAGEN Multiplex PCR Kit) and PCR grade H2O in 10.4 μl total reaction volume. Multiplex amplifications were performed in a Veriti thermal cycler (Applied Biosystems), and consisted of an initial 15 min activation step at 95°C, followed by 32 cycles of denaturation at 94°C for 30 sec, annealing at 55°C for 90 sec, extension at 72°C for 60 sec, and a final extension step at 60°C for 30 min. Extraction negative controls and PCR mock reactions were included in each amplification session. A volume of 2 μl of five-fold diluted amplified reaction products was denatured with 14 μl HiDiTM formamide (Applied Biosystems), combined with 0.2 μl GeneScanTM 500 LIZTM size standard (Applied Biosystems) and loaded on the ABI PrismTM 3130 Genetic Analyzer. GeneMapper Software 5.0 was used for fragment sizing and allele calling.
Amplification and sequencing of nuclear coding genes and mitochondrial markers
Portions of the mitochondrial cytB, 12S, CR, as well as fragments of the nuclear protein-coding genes BDNF and Cmos, and the second exon of the MHC class II B gene were amplified and sequenced using published primers and primers developed de novo for this study (Table 3). The algorithm in Primer3 (https://bioinfo.ut.ee/primer3-0.4.0/) was employed to design the new primers based on the reference mitogenome of Falco cherrug (deposited in GenBank https://www.ncbi.nlm.nih.gov/genbank/ with Accession number NC026715).
The same PCR reactions were used for all fragments and contained 2.5 μl of 10X Gold buffer (Applied Biosystems, USA), 200 μM of each dNTP, 2.5 mM MgCl2, 1.5 μM of each primer (Table 3), 1U of AmpliTaqGold polymerase (Applied Biosystems) and 3μl template DNA (approximately 20 ng) in a final volume of 25 μl. Amplifications consisted of an initial denaturation step at 94°C for 3 min, 38 cycles of 30 s at 94°C, 30 s at the annealing temperature (Table 3), 30 s extension step at 72°C for 30 s, followed by 5 min at 72°C. Extraction and PCR negative controls were included in each amplification round to monitor contaminations. Amplified products were visualised under UV light after electrophoresis on 1.5% agarose gel and staining with Gel RedTM (Biotium, Inc., USA). A 50–2000 bp DNA ladder (Sigma-Aldrich Chemicals, Italy) was used to verify the molecular size of amplicons. PCR products were cleaned up with the QIAquick PCR purification kit (QIAGEN, Hilden, Germany) and sequenced bidirectionally using the amplification primers and the BigDye Terminator kit v3.1 (Applied Biosystems) according to the manufacturer’s protocol. Unincorporated dyes and other contaminants were removed with the Agencourt® CleanSEQ solution (Beckman Coulter, USA), then sequences were loaded on the ABI PrismTM 3130 Genetic Analyzer and analysed using the Sequencing Analysis Software v5.3.1.
Data analysis
Nuclear coding genes and STRs.
After amplification and sequencing of MHC, BDNF and Cmos genes, the chromatograms were inspected and sequences without double peaks at any site (indicating homozygotes) were recorded as real alleles (or haplotypes) when their occurrence in the homozygous state was verified in at least three different individuals. When sequences exhibited double peaks, each polymorphic position was carefully scanned by eye to check for accuracy and consistency of double peaks across all heterozygous individuals. As an additional evidence for true polymorphic positions, we verified that double peaks in the chromatograms were lower than adjacent single peaks, as compared to homozygous individuals at that site. Haplotypes of heterozygotes at multiple sites were reconstructed via computation based on a Bayesian algorithm [24]. The computational approach was used as an efficient and cost-effective alternative method to standard cloning for the separation of the constituent alleles [25]. For this purpose, we employed the package PHASE v2.1.1, (https://stephenslab.uchicago.edu/phase/download.html) to phase the observed genotype data under the recombination model using the default settings [24]. Allelic states from unphased haplotypes were statistically inferred and the phase probability value at each variable site was derived. If the probability for the reconstructed alleles was less than 0.90, due to the possibility of multiple allele combinations or the presence of polymorphisms occurring only once in the data set, then the unresolved “orphan” alleles [25] were recorded as missing values. To check for the misamplification of duplicated or pseudogenes, the sequences of alleles from homozygous individuals and reconstructed alleles from heterozygotes were verified against F. cherrug reference genome (Accession number GCF_000337975.1). Additionally, the sequences were matched and aligned with those available in GenBank for the Falco (sub)species using the BLAST algorithm [26] (https://blast.ncbi.nlm.nih.gov).
Single genotypes were obtained at 12 STR loci and overall individual multilocus profiles based on a 15-loci complete panel, consisting of both neutral STRs and coding genes, were obtained for all samples. Departures from linkage equilibrium (LE) between loci were assessed in GENEPOP v4.7.5 [27] (https://genepop.curtin.edu.au) using 104 steps of dememorisation, 103 batches and 104 iterations per batch. The software MICRO-CHECKER 2.2.3 [28] was used to evaluate potential genotyping errors and frequencies of null alleles at single loci.
A Discriminant Analysis of Principal Components, DAPC [29] based on multilocus profiles was performed using the Adegenet package 2.1.6 [30] for the R software v.4.2.1 on the web interface. This multivariate approach identifies discrete clusters of individuals, maximising variation among groups while minimising variation within groups, without relying on any underlying genetic model. The analysis allowed us to visualise and qualitatively describe genetic variation at the population level based on our entire falcon dataset. Clusters of samples were initially pre-defined according to their collection location/taxonomic status. The optimal number of principal components that were retained was determined by cross validation.
To complement the multivariate analysis of nuclear data, a Bayesian assignment procedure as implemented in the software STRUCTURE v.2.3.4 [31] was used to identify clusters (K) of genetically close individuals and admixed genotypes based on a Markov Chain of Monte Carlo with 5 x 106 iterations and 5 x 105 steps of burn-in to achieve convergence. Ten repetitions for each K ranging from 2 to 8 were performed under an admixture model using independent allele frequencies, without assuming any prior nongenetic information. The Δk [32] and Puechmaille’s [33] methods, as performed in StructureSelector [34] (http://Imme.ac.cn7StructureSelector/), were used to assess the most likely number of genetic clusters. Individual (q) and average (Q) proportions of membership in each of the inferred clusters were eventually derived from the multilocus genotype data of single falcon samples.
Mitochondrial DNA
In order to investigate the genetic relationships of the sampled hierofalcons through variation in their mtDNAs, single sequences of partial CR (294 bp), 12S (263 bp) and cytB (428 bp) were amplified in all samples. Subsequently, sequences were concatenated in haplotypes of 985 bp in total length, edited and aligned using Geneious Prime (https://www.geneious.com). To compare broader geographic ranges for our falcon taxa, as well as to assess their relationships with unsampled Falco (sub)species, 47 already published CR sequences were downloaded from GenBank (S2 Table) and added to our dataset of single CR sequences in a second, shorter mtDNA alignment of 295 bp in length. To examine the relationships and distribution of our concatenated haplotypes, a median-joining network [35] was constructed using the software PopART [36]. Separately, a network was also derived using a shorter alignment consisting of our and online CR-based haplotypes. Nucleotide diversity (π) for concatenated haplotypes and single CR sequences was computed using DnaSP 6.0 [37].
Results
Coding genes and STRs
Of the 115 samples collected from F. biarmicus and F. cherrug, 3 museum samples actually showed mitochondrial haplotypes from peregrine falcon (F. peregrinus, see below) and were not analysed. Furthermore, 11 additional taxidermied specimens yielded unusable partial profiles and were discarded. Finally, genotypes at nuclear genes were obtained for a total of 101 hierofalcon samples (F. b. feldeggii, n = 55; F. b. tanypterus, n = 14, F. b. erlangeri, n = 10; F. cherrug, n = 22).
The reconstruction of haplotypes performed by PHASE was used to derive allelic states in the coding genes. Results from computations revealed five alleles for BDNF (4 segregating sites), 10 alleles for Cmos (6 segregating sites) and, surprisingly, only 4 alleles for MHC (5 segregating sites) across all genotyped samples. The polymorphic sites were all biallelic, and no positions with three or four segregating nucleotides were detected. A maximum of four polymorphic positions were found per individual in each gene. Only one unphased haplotype in the gene Cmos, found as unique polymorphism in F. b. erlangeri, was ambiguously reconstructed (phase probabilities < 0.90), and remained unresolved as missing allele. When sequences of real and reconstructed alleles were checked against the reference genome of F. cherrug, no evidence for pseudogenes was detected, nor was there any inconsistency with online homologous MHC sequences from other falcons [8, 12]. Allele 4, however, was found as a new highly frequent variant in F. b. erlangeri and F. b. tanypterus (S3 Table). Overall, allele variation was largely shared in all falcons, and only a few private alleles were found at low frequencies (ranging from 0.0094 at Cmos in F. b. feldeggii to 0.1875 at MHC in F. b. erlangeri) in each taxon (S3 Table).
All 12 STR loci were polymorphic in our hierofalcon taxa, showing from three (in FP89 and FP5) to eleven (in FP82-2) alleles per locus (S3 Table). Significant genotyping errors were not found by MICRO-CHECKER, nor was evidence for null alleles provided at any locus. Private alleles with frequency higher than 10% were rare in all taxa, two being found in F. cherrug, one each in F. b. erlangeri and F. b. feldeggii, and none in F. b. tanypterus. Significant linkage disequilibrium (p < 0.001) involved five locus-by-locus comparisons (FP13 x FP89, AGE-5 x FP107, FALVES-03 x FP107, AGE-5 x FP82-2, FP82-2 x FP347) and was found exclusively in F. b. feldeggii. Five is equal to the number of significant comparisons expected by chance based on 105 possible pairwise comparisons. Moreover, if LE is needed for powerful downstream analyses, minor violations do not prevent from identifying discrete clusters of close genotypes [38]. Consequently, for our purposes, no locus was excluded from subsequent computations. Finally, downstream data analyses were performed on a total of 101 falcon samples based on ultimate 15-locus genotypes that encompassed 12 STR loci and 3 coding genes, with only a minor fraction of missing values (4% of unsampled or unresolved alleles).
The overall genetic variability in the dataset was qualitatively described at the population level using DAPC (Fig 2). This multivariate approach uses principal components to summarise the original genetic data into uncorrelated discrete groups, and subsequently applies discriminant analysis to maximise the among-group variation. We entered four pre-defined populations into the computations, which corresponded to their specific (F. cherrug) and subspecific status (F. b. feldeggii, F. b. erlangeri, F. b. tanypterus) in accordance with taxonomy. Cross-validation procedure suggested that ten principal components (PC) were retained in the analysis as a proxy for optimal description of the original data and highest predictive capacity. The first two PCs best summarised most of the allelic diversity, and defined four clusters of genetically close individuals. The first PC identified the three F. biarmicus subspecies as separate clusters, with feldeggii being highly distinct. F. b. erlangeri and F. b. tanypterus, however, were not completely distinct clusters, showing some degree of overlapping and pointing to possible admixture. In particular, according to the individual membership probabilities provided by DAPC for our a-priori groups, one sample of F. b. tanypterus was actually assigned to the F. b. erlangeri cluster. The second PC clearly separated samples of saker falcon F. cherrug as a further discrete and distant cluster.
The graph features single individuals as dots and groups as inertia ellipses.
Single multilocus genotypes were used to assess the probability of their ancestry in one (or more) genetic clusters with the STRUCTURE Bayesian analysis for K ranging from 2 to 8. Both the Evanno Δk and Puechmaille’s method identified K = 5 as the most probable value for the number of inferred clusters. All lanners F. b. feldeggii were probabilistically assigned to two clusters (III and V) with a total mean Q = 0.988 (Table 4). Falcons from F. b. tanypterus and F. cherrug fell into two alternative clusters (II and I) showing Q values of 0.920 and 0.933, respectively. On the contrary, significant admixture was found in F. b. erlangeri (Fig 3, Table 4), where all samples, while falling into their own cluster (IV) with Q = 0.507, shared a non-negligible amount of ancestry with F. b. tanypterus (individual q values ranging from 0.143 to 0.726). Additionally, four of them showed also some admixture with F. b. feldeggii (q from 0.092 to 0.154), and two additional individuals with F. cherrug (q = 0.104 and 0.285). One sample of F. b. tanypterus fell into its own cluster (II) with only a small q-value (0.195), while most of its ancestry was shared with F. b. erlangeri (q = 0.788), as already suggested by DAPC. Conversely, one F. b. erlangeri shared ancestry in the tanypterus cluster (II) with high membership value (q = 0.722, Fig 3). The majority of saker falcons F. cherrug fell into cluster I; few individuals, however, showed from low to moderate membership to the F. b. feldeggii clusters (q values from 0.086 to 0.279). On the contrary, no shared ancestry with other falcon taxa was found in the feldeggii individuals, which showed values of membership to their clusters (III and V) all higher than or equal to 0.975.
Each vertical bar represents one individual and the length of the different coloured sections is proportional to membership (q-values) in the inferred genetic clusters. Membership of F. b. feldeggii is distributed into two private clusters (see also Table 4). The star indicates one specimen of tanypterus that fell into the erlangeri group, as also revealed by DAPC. The bar plot was drawn using STRUCTURE PLOT [39] (http://omicsspeaks.com/strplot2/).
Mitochondrial DNA
Amplifications of the mitochondrial marker sequences were performed in all collected samples. Seven taxidermied specimens, however, yielded no amplicons (one F. b. feldeggii from Dalmatia, dated 1847, two putative F. b. erlangeri from the museum in Malta, dating back to the 1940s, two feldeggii from the museum in Chieti, collected around the mid-1970s, and two F. cherrug stuffed at the Vienna Museum in 1975 and 2004). Furthermore, three specimens from the museums in Vienna and Chieti were misclassified as one cherrug and two F. b. erlangeri, respectively, while being actually peregrine falcons (F. peregrinus) based on their mtDNA. Finally, single sequences of 12S, cytB and the CR were obtained for a total of 105 falcon samples (F. b. feldeggii, N = 58; F. b. tanypterus, N = 14, F. b. erlangeri, N = 10; F. cherrug, N = 23).
Two haplotypes were found at 12S (263 bp) and four at cytB (428 bp), while seven haplotypes were identified at the CR (294 bp). Sequences have been deposited in GenBank, with accession numbers listed in S1 Table. The three fragments were concatenated to obtain sequences of 985 bp in total length. Eight concatenated haplotypes were identified in the falcon taxa (Table 1, S1 Table), based on 12 polymorphic sites, 5 of which were found in the CR, 5 in cytB and 2 in 12S. Except for one transversion in the 12S sequence, all point mutations were transitions. Haplotypes showed moderate differentiation, with nucleotide diversity (π) ranging from 0.00102 to 0.01218. Due to small sample sizes for most of the taxa studied, no inference was derived based on the frequency distribution of haplotypes. In 58 feldeggii individuals, however, only the concatenated haplotypes H3 and H6 were present, with frequencies of 0.273 and 0.727, respectively.
The genealogical relationships of concatenated haplotypes were inferred by a median-joining network, which clearly depicted two mixed mitochondrial haplogroups (I and II, Fig 4A), separated by a π-value of 0.00968. In comparison, nucleotide diversity within either haplogroup was smaller than nearly one order of magnitude (0.00203, sd 0.00047 and 0.00203, sd 0.00071 in haplogroups I and II, respectively). It is noteworthy that sample-by-sample inspection in F. b. feldeggii did not reveal any relationship between the distribution of specimens in the mitochondrial haplogroups and the membership clusters inferred by STRUCTURE (Fig 3, Table 4).
Networks obtained using: a) a 985-bp alignment of concatenated sequences from our samples of F. biarmicus ssp. and F. cherrug; b) a trimmed 295-bp CR alignment including our and published sequences from several falcon (sub)species. Circles represent single haplotypes; the size of the circles is proportional to the haplotype frequency; colours represent the proportion of individuals from each (sub)species; ticks on connecting lines indicate mutational steps between haplotypes; small black nodes mark inferred (unsampled) sequences. Hatched lines separate two major haplogroups (I and II).
Sequences of the CR are the most represented online for the mitochondrial DNA in the genus Falco, therefore we aligned our seven single CR haplotypes with published sequences from eight hierofalcon (sub)species (S3 Table) in order to reconstruct their genetic relationships by including more genetic variability and a higher number of taxa. The network derived using a trimmed 295-bp alignment (Fig 4B) confirmed both our findings from concatenated haplotypes and the results from the literature, i.e. the existence of two distinct non-monophyletic mtDNA lineages, that were both present in different taxa and the same taxon from different geographic locations. Compared to our previous network based on concatenated haplotypes, the inclusion of greater variability led to an increase in nucleotide diversity within haplogroups (0.00939, sd 0.00191 and 0.00841, sd 0.00122 in I and II, respectively) and, proportionally, shorter distance between haplogroups (0.01882).
Discussion
Efforts to protect wild taxa are highly endorsed, and conservation projects gain priority and funding attention, when their (phylo)genetic distinctiveness is assessed and taxonomic recognition is supported by the occurrence of unique hereditary characters. In the case of hierofalcons, this holds particularly true, and genetic studies face a twofold challenge: some of the extant (sub)species are rare, up to being to the brink of extinction, and many of them are genetically closely related, due to their evolutionary recent common ancestor [3]. This can make their diagnosability rather tricky. That said, however, the joint use of multiple neutral STR markers and coding genes under selection, as adopted in this study, proved a successful approach to differentiate our falcon taxa through non-trivial genetic differences [40] combined with statistical assigning procedures. In this framework, based on the allele number and frequency, coding genes proved less variable than neutral loci, as expected from lack of functional constrains for the latter and different mutational and evolutionary mechanisms. The second exon of the MHC class II B gene, in particular, showed unusually low variability, a scenario of genetic impoverishment already described in large falcons and confirmed in this study by an increased number of individuals and taxa than previously investigated [8, 12]. Natural selection mediated by particularly abundant and/or harmful pathogens could have driven the depletion of MHC variation via directional selection in the common ancestor of extant falcons. Compensatory mechanisms by other immune genes cannot be ruled out either [8, 12].
Partly disagreeing with earlier studies that indicated “strong genetic identity” for the three Palearctic subspecies of F. biarmicus [9], our results revealed that F. b. feldeggii, in particular, exhibited a nuclear gene pool that was highly distinct from the other subspecies of F. biarmicus, as well as from the saker falcon F. cherrug, revealing strong geographic signature and supporting the current taxonomy. Noteworthy, this result was obtained when allele frequency data were analysed either using an underlying genetic model, as in STRUCTURE, or without any model, as in DAPC. At the individual and population level, respectively, both approaches showed that no single F. b. feldeggii was admixed, i.e. their ancestry was not shared with any other falcon studied. Lanners from Sicily and the Balkans presumably share the same gene pool as the peninsular population, although we could not verify this directly, as individuals from those areas are underrepresented in our sampling. However, Sarà et al. [9] reported no significant genetic differences between continental Italy, Sicily and the Balkan area, and suggested the existence of a metapopulation with natural gene flow that prevents their isolation.
Differently from the European lanner F. b. feldeggii, the subspecies F. b. tanypterus from Israel and F. b. erlangeri from North Africa shared part of their nuclear gene pools, suggesting that they may be geographic populations of the same subspecies, as previously proposed [4, 5]. At the individual level, one tanypterus shared most of its ancestry in the erlangeri gene pool, and, vice versa, one erlangeri showed highest membership to the gene pool of tanypterus, indicating either morphological misidentification or interbreeding for both birds. Overall, falcons from North Africa appear as the most genetically admixed: this area could be as a meeting point and occasional interbreeding between resident and dispersing lanner falcons in the Mediterranean basin.
Nuclear DNA allowed a clear distinction also of the saker falcon F. cherrug. However, few individuals (5 out of 23) showed from low to moderate membership to the gene pool of F. b. feldeggii. This slight admixture can be explained by the fact that most of our saker samples were collected in Austria and Slovakia, i.e. the westernmost part of the distribution and the area closest to the lanner’s range. In these areas, the saker falcon is reported as a non-resident and non-breeding, but rather as a vagrant visitor species (http://datazone.birdlife.org/). In Bulgaria, the Balkans and Israel the saker can also overwinter [1]. Rare and rather recent hybridisation [41] may explain the signature of limited gene flow that we have observed. It is noteworthy, however, that introgression is asymmetrical, in that it occurs from F. feldeggii into F. cherrug, and not vice versa. This finding, as well as the high homogeneity of the feldeggii gene pool, can be explained by the loss of genetic diversity as a consequence of the severe bottleneck that it experienced in the late 19th and early 20th century across the entire range [6, 7]. Hybridisation events with the saker falcon may have predated this dramatic population decline. Bottleneck effects, however, may have removed traces of its ancestry from the feldeggii’s gene pool. In conclusion, while accounting for some uncertainty regarding F. b. tanypterus and F. b. erlangeri, the grouping of samples based on taxonomy was substantially confirmed by our nuclear genetic data, which indicate, in particular, F. b. feldeggii as the most divergent subspecies of F. biarmicus in the Mediterranean area.
While nuclear genetic data basically supports current taxonomy, however this does not hold true with mtDNA. The network, based on our and online sequences, identified two different lineages, but clearly showed that no falcon taxon formed a monophyletic group, nor were any relationships found with their respective distribution areas: the haplotypes found in F. cherrug and subspecies of F. biarmicus from different Mediterranean sites were dispersed in both lineages, suggesting the lack of diagnostic power for mitochondrial markers in these birds. Overall, the haplotypes were poorly differentiated from each other, with a few haplotypes being largely widespread across taxa, while others occurring only rarely. These results fully agree with earlier findings in highlighting paraphyly for the Falco species and the mismatch between haplogroups and taxonomic entities [1, 3]. Furthermore, they point to the real difficulty of finding solid explanations for the current phylogeographic pattern obtained in these birds [1, 3], a picture that remains largely speculative.
The lanner is the most critically endangered hierofalcon in Europe. The species is worthy of high conservation priority especially in Italy [42], that hosts the most important population and a considerable portion of the world genetic stock of the feldeggii subspecies [43, 44]. Thus, a crucial role and great responsibility is assumed by this country for the conservation of the taxon at a global level [7]. To address the issue, with the purpose of increasing population size, or at least halting its decline, five-year conservation LIFE projects with EU funds were launched in 2010 in Sicily (LIFE ConRaSI) and in 2020 in central Italy (LIFE LANNER), with the latter including captive breeding and release of birds into the wild. One of the goals of the projects was to highlight the genetic distinctiveness of the lanner, in order to endorse and encourage, if ever needed, the efforts for its conservation. Our results showed that selected nuclear markers were capable of genetically distinguishing the European lanner both at the population and individual level. Although not supported by mitochondrial data (not surprisingly, however, [1, 3]), this genetic uniqueness, combined with the existence of distinctive non-genetic traits in morphology, distribution and ecology, strongly claims for the conservation argument and emphasises the need for urgent interventions. Without timely and effective actions, this bird risks disappearing forever in the wild, adding to the already long list of species that witness the loss of biodiversity.
The rarity of lanners and the conservation policy of “no impact” on the wild population, has led to rely on captive-bred instead of wild-sourced falcons for the recruitment of individuals in projects involving the release of birds into the wild (cfr. LIFE LANNER). In such a scenario, monitoring captive falcons is crucial. In this regard, molecular identification of single birds, individual membership to a specific gene pool (i.e. subspecies), assessment of parental pairs, and reconstruction of genetic relationships among generations are essential to select the individuals to be released in nature (Lorenzini et al. in prep.). In addition to assess genealogy and ancestry, our panel of nuclear markers proved a valuable tool to identify genetically admixed individuals. This application is much desired when occasional intra- or interspecific hybrids, either natural or anthropogenic, are to be identified for management, scientific and forensic purposes.
Supporting information
S1 Table. List of samples, origin, year of collection, single and concatenated mitochondrial haplotypes.
The order of samples corresponds to the order of samples in the barplot (Fig 3 in the main text). GenBank accession numbers are indicated in parentheses.
https://doi.org/10.1371/journal.pone.0295424.s001
(PDF)
S2 Table. List of 47 mitochondrial CR sequences downloaded from GenBank and used to construct a median-joining network.
H1-H8 indicate the haplotypes obtained in this study, corresponding to sequences previously published elsewhere with the accession numbers below.
https://doi.org/10.1371/journal.pone.0295424.s002
(PDF)
S3 Table. Allele frequencies at 12 STR loci and 3 nuclear coding genes in four hierofalcon taxa.
Sample size in parenthesis.
https://doi.org/10.1371/journal.pone.0295424.s003
(PDF)
Acknowledgments
We thank Ohad Hatzofe for providing specimens from Israel and Erika Ciarrocca for collecting historical samples from the Museum of the University "G. D’Annunzio" (Chieti, Italy). We are also grateful to Carmagnola Natural History Museum (Torino), LIPU Wildlife Recovery Center (CRUMA) of Livorno, Group for the Rehabilitation of Native Fauna and its Habitat (GREFA) of Madrid for providing some live and museum specimens. We are in debt to Stefano Picchi for his continuous support and suggestions, and to Lara Maurel and Evgeniya Titarenko for their contribution to the administrative management of the study. We thank Rita Fanelli, Rosario Fico and Alessia Mariacher for support in the early stages of the Project setup, Riccardo Castiglia for helpful comments and suggestions, and the Italian network of Centri di Recupero Animali Selvatici (CRAS) for collaboration.
References
- 1. Nittinger F, Gamauf A, Pinsker W, Wink M, Haring E. Phylogeography and population structure of the saker falcon (Falco cherrug) and the influence of hybridization: mitochondrial and microsatellite data. Mol Ecol. 2007;16: 1497–1517. pmid:17391272
- 2. Wink M, Sauer-Gürth H, Ellis D, Kenward R. Phylogenetic relationships in the Hierofalco complex (saker-, gyr-, lanner-, laggar falcon). Raptors Worldw. 2004;499504.
- 3. Nittinger F, Haring E, Pinsker W, Wink M, Gamauf A. Out of Africa? Phylogenetic relationships between Falco biarmicus and the other hierofalcons (Aves: Falconidae). J Zool Syst Evol Res. 2005;43: 321–331.
- 4.
Clark WS. A field guide to the raptors of Europe, the Middle East, and North Africa. Oxford University Press, USA; 1999.
- 5.
Forsman D. Flight identification of raptors of Europe, North Africa and the Middle East. Bloomsbury Publishing; 2016.
- 6. Corso A. Updated status of European Lanner Falcon, Falco biarmicus feldeggii (Schlegel, 1843)(Aves Falconiformes): a taxon on the verge of extinction, with brief comments on the North African Lanner, F. biarmicus erlangeri (Kleinschimdt, 1901). Biodivers J. 2018;9: 35–44.
- 7. Andreotti A, Leonardi G, Sara M, Brunelli M, De Lisio L, De Sanctis A, et al. Landscape-scale spatial distribution of the Lanner Falcon (Falco biarmicus feldeggii) breeding population in Italy. AMBIO J Hum Environ. 2008;37: 440–444.
- 8. Gangoso L, Alcaide M, Grande JM, Muñoz J, Talbot SL, Sonsthagen SA, et al. Colonizing the world in spite of reduced MHC variation: Ancestrally low MHC diversity in falcons. J Evol Biol. 2012;25: 1438–1447. pmid:22686489
- 9. Sarà M, Mengoni C, Mucci N, Guzzo E, Ruzic M, Amato M, et al. Genetic variability and population structuring in the European Lanner Falcon Falco biarmicus feldeggii. Avocetta. 2019; 43(1): 55–65
- 10. Alcaide M, Edwards SV, Negro JJ. Characterization, Polymorphism, and Evolution of MHC Class II B Genes in Birds of Prey. J Mol Evol. 2007;65: 541–554. pmid:17925996
- 11. Alcaide M, Edwards SV, Negro JJ, Serrano D, Tella JL. Extensive polymorphism and geographical variation at a positively selected MHC class II B gene of the lesser kestrel (Falco naumanni). Mol Ecol. 2008;17: 2652–2665. pmid:18489548
- 12. Zhan X, Dixon A, Batbayar N, Bragin E, Ayas Z, Deutschova L, et al. Exonic versus intronic SNPs: contrasting roles in revealing the population genetic differentiation of a widespread bird species. Heredity. 2015;114: 1–9. pmid:25074575
- 13. Minias P, Pikus E, Whittingham LA, Dunn PO. Evolution of copy number at the MHC varies across the avian tree of life. Genome Biol Evol. 2019;11: 17–28. pmid:30476037
- 14. Moritz C. Defining ‘evolutionarily significant units’ for conservation. Trends Ecol Evol. 1994;9: 373–375. pmid:21236896
- 15. Aggerbeck M, Fjeldså J, Christidis L, Fabre P-H, Jønsson KA. Resolving deep lineage divergences in core corvoid passerine birds supports a proto-Papuan island origin. Mol Phylogenet Evol. 2014;70: 272–285. pmid:24125832
- 16. Zinevich L, Prommer M, Laczkó L, Rozhkova D, Sorokin A, Karyakin I, et al. Phylogenomic insights into the polyphyletic nature of Altai falcons within eastern sakers (Falco cherrug) and the origins of gyrfalcons (Falco rusticolus). Sci Rep. 2023;13: 17800.
- 17. Nesje M, Roed KH, Lifjeld JT, Lindberg P, Steen OF. Genetic relationships in the peregrine falcon (Falco peregrinus) analysed by microsatellite DNA markers. Mol Ecol. 2000;9: 53–60. pmid:10652075
- 18. Magonyi NM, Mátics R, Szabó K, Fehérvári P, Solt S, Palatitz P, et al. Characterization of novel microsatellite markers for the red-footed falcon (Falco vespertinus) and cross-species amplification in other Falco species. Eur J Wildl Res. 2019;65: 62.
- 19. Topinka JR, May B. Development of polymorphic microsatellite loci in the northern goshawk (Accipiter gentilis) and cross-amplification in other raptor species. Conserv Genet. 2004;5: 861–864.
- 20. Garofalo L, Fanelli R, Brucoli A, Cannarella S, Lorenzini R. European Roller thief nailed by DNA found in a bird carrier. Avocetta. 2022;46: 41–47.
- 21. Kocher TD, Thomas WK, Meyer A, Edwards SV, Pääbo S, Villablanca FX, et al. Dynamics of mitochondrial DNA evolution in animals: amplification and sequencing with conserved primers. Proc Natl Acad Sci. 1989;86: 6196–6200. pmid:2762322
- 22. Noonan BP, Chippindale PT. Dispersal and vicariance: The complex evolutionary history of boid snakes. Mol Phylogenet Evol. 2006;40: 347–358. pmid:16624591
- 23. Le M, Raxworthy CJ, McCord WP, Mertz L. A molecular phylogeny of tortoises (Testudines: Testudinidae) based on mitochondrial and nuclear genes. Mol Phylogenet Evol. 2006;40: 517–531. pmid:16678445
- 24. Stephens M, Donnelly P. A comparison of bayesian methods for haplotype reconstruction from population genotype data. Am J Hum Genet. 2003;73: 1162–1169. pmid:14574645
- 25. Harrigan RJ, Mazza ME, Sorenson MD. Computation vs. cloning: evaluation of two methods for haplotype determination. Mol Ecol Resour. 2008;8: 1239–1248. pmid:21586011
- 26. Johnson M, Zaretskaya I, Raytselis Y, Merezhuk Y, McGinnis S, Madden TL. NCBI BLAST: a better web interface. Nucleic Acids Res. 2008;36: W5–W9. pmid:18440982
- 27. Rousset F. genepop’007: a complete re-implementation of the genepop software for Windows and Linux. Mol Ecol Resour. 2008;8: 103–106. pmid:21585727
- 28. Van Oosterhout C, Hutchinson WF, Wills DP, Shipley P. MICRO-CHECKER: software for identifying and correcting genotyping errors in microsatellite data. Mol Ecol Notes. 2004;4: 535–538.
- 29. Jombart T, Devillard S, Balloux F. Discriminant analysis of principal components: a new method for the analysis of genetically structured populations. BMC Genet. 2010;11: 1–15.
- 30. Jombart T. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics. 2008;24: 1403–1405. pmid:18397895
- 31. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155: 945–959. pmid:10835412
- 32. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software structure: a simulation study. Mol Ecol. 2005;14: 2611–2620. pmid:15969739
- 33. Puechmaille SJ. The program structure does not reliably recover the correct population structure when sampling is uneven: subsampling and new estimators alleviate the problem. Mol Ecol Resour. 2016;16: 608–627. pmid:26856252
- 34. Li Y-L, Liu J-X. Structure Selector: A web-based software to select and visualize the optimal number of clusters using multiple methods. Mol Ecol Resour. 2018;18: 176–177. pmid:28921901
- 35. Bandelt H-J, Forster P, Röhl A. Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol. 1999;16: 37–48. pmid:10331250
- 36. Leigh JW, Bryant D. POPART: full-feature software for haplotype network construction. Methods Ecol Evol. 2015;6: 1110–1116.
- 37. Rozas J, Ferrer-Mata A, Sánchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, et al. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol Biol Evol. 2017;34: 3299–3302. pmid:29029172
- 38. Wang J. Individual identification from genetic marker data: developments and accuracy comparisons of methods. Mol Ecol Resour. 2016;16: 163–175. pmid:26230747
- 39. Ramasamy RK, Ramasamy S, Bindroo BB, Naik VG. STRUCTURE PLOT: a program for drawing elegant STRUCTURE bar plots in user friendly interface. SpringerPlus. 2014;3: 1–3.
- 40. Zachos FE. (New) Species concepts, species delimitation and the inherent limitations of taxonomy. J Genet. 2018;97: 811–815. pmid:30262692
- 41.
Brown L, Urban EK, Newman K, Woodcock M, Hayman P. The birds of Africa. Academic Press London; 1982.
- 42. Rondinini C, Battistoni A, Teofili C. Lista Rossa IUCN dei Vertebrati Italiani 2022. Com Ital IUCN E Minist Dell’Ambiente E Della Tutela Territ E Mare Roma Italy. 2022.
- 43. Brunelli M. Lanario Falco biarmicus. Libro Rosso degli Animali d’Italia Vertebrati. 1998.
- 44. Leonardi G. Falco biarmicus lanner falcon. BWP Update. 2001;3: 157–174.