Recently, an extensive study of 2,748 polar bears (Ursus maritimus) from across their circumpolar range was published in PLOS ONE, which used microsatellites and mitochondrial haplotypes to apparently show altered population structure and a dramatic change in directional gene flow towards the Canadian Archipelago—an area believed to be a future refugium for polar bears as their southernmost habitats decline under climate change. Although this study represents a major international collaborative effort and promised to be a baseline for future genetics work, methodological shortcomings and errors of interpretation undermine some of the study’s main conclusions. Here, we present a reanalysis of this data in which we address some of these issues, including: (1) highly unbalanced sample sizes and large amounts of systematically missing data; (2) incorrect calculation of FST and of significance levels; (3) misleading estimates of recent gene flow resulting from non-convergence of the program BayesAss. In contrast to the original findings, in our reanalysis we find six genetic clusters of polar bears worldwide: the Hudson Bay Complex, the Western and Eastern Canadian Arctic Archipelago, the Western and Eastern Polar Basin, and—importantly—we reconfirm the presence of a unique and possibly endangered cluster of bears in Norwegian Bay near Canada’s expected last sea-ice refugium. Although polar bears’ abundance, distribution, and population structure will certainly be negatively affected by ongoing—and increasingly rapid—loss of Arctic sea ice, these genetic data provide no evidence of strong directional gene flow in response to recent climate change.
Citation: Malenfant RM, Davis CS, Cullingham CI, Coltman DW (2016) Circumpolar Genetic Structure and Recent Gene Flow of Polar Bears: A Reanalysis. PLoS ONE 11(3): e0148967. https://doi.org/10.1371/journal.pone.0148967
Editor: Axel Janke, BiK-F Biodiversity and Climate Research Center, GERMANY
Received: July 14, 2015; Accepted: January 21, 2016; Published: March 14, 2016
Copyright: © 2016 Malenfant et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Input, output, and (where relevant) scripts to perform analyses are available from the Open Science Framework database (https://osf.io/kqcr4/).
Funding: RMM has been funded by scholarships from Alberta Innovates Technology Futures, the University of Alberta, and the Province of Alberta. Part of this work was funded by a Natural Sciences and Engineering Research Council of Canada, http://www.nserc-crsng.gc.ca/ (Grant ID 312207-2011). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: All four coauthors of this reanalysis have past or ongoing collaborations with at least one coauthor of the original study. Three of the coauthors (RMM, CSD, DWC) are also undertaking other polar bear population genetics projects using medium-density SNP arrays independently from USGS, and the authors recognize that this may be perceived as a competing interest. None of the possible perceived competing interests above has influenced the decision to submit this reanalysis.
Polar bears (Ursus maritimus) are Holarctic marine mammals that are dependent on sea ice as a platform for mating, reproduction, and locomotion. The southern boundary of their distribution is limited by the extent of the sea ice, which forms the habitat for their primary prey, pagophilic seals such as ringed seals (Pusa hispida) and bearded seals (Erignathus barbatus). Though long-distance swimming  and overland migration  are possible, open water, land, and multiyear ice—which is too thick for seals to create breathing holes—generally form barriers to movement and gene flow [3, 4]. Although polar bears have large home ranges  and are capable of travelling vast distances , gene flow among subpopulations appears to be limited . Currently, 19 management units (MUs) of polar bears are recognized globally, including the Arctic Basin, which is believed to be poor-quality habitat that hinders movement of bears across the area of the North Pole . MUs have been delineated based on radio-telemetry data (primarily of females), hunter tag returns (primarily of males), and genetic data [3, 4, 8–11].
The genetic structure of polar bears has been well characterized in a number of previous studies. The most important of these studies used 16 microsatellites and assignment tests to detect four moderately differentiated genetic clusters across the Arctic, corresponding to the Hudson Bay Complex, the Canadian Arctic Archipelago, the Polar Basin, and Norwegian Bay . Each of these clusters is represented in Canada, and all were recently re-detected in a population genetics study of Canadian polar bears using newly collected samples and thousands of single-nucleotide polymorphisms (SNPs) . Of particular interest is the small, isolated Norwegian Bay MU of the Canadian High Arctic, which is separated from surrounding MUs by thick ice, land, and polynyas [3, 13], and which has been reported as genetically divergent [4, 12] and perhaps phenotypically distinct . Other key population genetics findings include differentiation of Akimiski Island from the rest of Hudson Bay [14–16], east–west differentiation in the Polar Basin  (however, cf. ), and differentiation in the Canadian Archipelago in the area of the Gulf of Boothia and M’Clintock Channel MUs .
In a recent study published in PLOS ONE, Peacock et al., 2015  present an analysis based on an impressive dataset of up to 21 nuclear microsatellites and the mitochondrial control region (plus tRNAPro, tRNAThr, and partial cytb) obtained from 2748 and 411 polar bears respectively. Individuals were included from 18 of 19 global MUs (omitting the largely uninhabitable Arctic Basin). Key findings from this study include: (1) a revision of global population genetic structure for polar bears, with three–four major genetic clusters differing somewhat from those that have previously been reported, i.e. [4, 12]: the Canadian Arctic Archipelago, Southern Canada, and the Polar Basin (further subdivided into eastern and western sub-clusters); (2) highly directional recent gene flow into the Canadian Arctic Archipelago from Southern Canada and the Eastern Polar Basin, perhaps due to altered sea-ice conditions caused by climate change, (3) male-biased gene flow, (4) a possible role for the Canadian Arctic Archipelago (and other scattered areas such as the Barents Sea) as interglacial refugia. Most striking among their results, however, is the disappearance of the Norwegian Bay genetic cluster—an important change that is never discussed in Peacock et al., 2015 .
Upon examination of the article’s methods and supplementary material, we discovered a number of serious errors that call into question the population grouping used in the paper and other conclusions. These include the following (all references to tables and figures are those from Peacock et al., 2015 ):
- Large amounts of systematically missing data (i.e., genotypes for 5/21 microsatellite loci are missing in at least 6/18 MUs) and differences in sample sizes among MUs that are of two orders of magnitude (S1 Table in ).
- Miscalculation of FST and other measures of genetic differentiation because of the retention of loci with missing data, such that average pairwise FST between all MUs globally is actually negative for microsatellites (–0.03), with values ranging as low as –0.26 (S5 Table; S4 Fig in ).
- Bonferroni correction of significance thresholds that incorrectly account for the number of loci rather than the number of tests (S4, S5, S6, S7 Tables in ).
- Probable non-convergence of the program BayesAss (S8 Table in ; see below for explanation).
- Retention of the M’Clintock Channel, Norwegian Bay, and Viscount Melville MUs in population-level analyses of mitochondrial DNA (mtDNA) despite small sample sizes (i.e., N ≤ 3) that are inadequate for estimating haplotype frequencies, and treatment of the Laptev Sea MU as a single subpopulation despite huge geographical discontinuity in sampling and significant deviation from Hardy–Weinberg equilibrium (S1 Table in ).
Non-convergence of BayesAss
The most important conclusion in Peacock et al., 2015  is that there has been a recent influx of polar bears into the Canadian Arctic Archipelago from Southern Canada (and the Eastern Polar Basin) in response to recent climate change. They also report a surprising 29-fold difference in directional gene flow from the Eastern Polar Basin to the Western Polar Basin. However, the results given in the Supplementary Material (S8 Table of Peacock et al., 2015 ) strongly suggest that their BayesAss analysis of recent gene flow failed to converge. Non-convergence is a common problem for BayesAss [21, 22], and non-converged runs often show a bimodal distribution of proportions of non-migrants (Propnon-mig) with some populations having Propnon-mig < 0.73 and the remainder having Propnon-mig > 0.9 . Non-convergence is particularly likely when FST values are low, and immigration rates may be particularly untrustworthy if they have narrow confidence intervals near one of the prior bounds (i.e., 0 or 1/3) . This is because BayesAss bounds Propnon-mig between 0.667 and 1 [22, 23]. In S8 Table of Peacock et al., 2015 , Propnon-mig (and 95% CIs) are reported for the four genetic clusters as: Eastern Polar Basin = 0.941 (0.888–0.993), Western Polar Basin = 0.678 (0.657–0.699), Canadian Archipelago = 0.699 (0.621–0.777), and Southern Canada = 0.952 (0.912–0.991). These results follow the bimodal distribution described above, and all 95% CIs either overlap the lower bound or are <0.01 from the upper bound. Although it is stated in Peacock et al., 2015  that 3–4 runs resulted in similar estimates, this does not indicate that runs converged or that results are accurate, because multiple runs often get trapped near the program’s bounds .
To address some of these issues, we reanalysed the original dataset. Because the analyses in Peacock et al., 2015  were numerous and wide-ranging, we focused primarily on estimates of contemporary population structure, noting that the generation of contemporary genetic clusters was an important first step for some additional downstream analyses in the original paper, since they formed the groupings between which to test migration, etc. Therefore, this reanalysis may also have implications for some other findings in Peacock et al., 2015 . In our opinion, it represents our best estimate to date of the contemporary worldwide population structure of polar bears.
Materials and Methods
Nuclear microsatellite data
We downloaded the microsatellite genotypes used in Peacock et al., 2015  from datadryad.org (doi:10.5061/dryad.v2j1r). Individual-specific information, such as lat–long coordinates, year of sampling, population of sampling, sex, age, etc. are available in Table S11 of Peacock et al., 2015 . Methods of DNA extraction, microsatellite genotyping, and genotype quality control are provided in S1 File of Peacock et al., 2015 . Microsatellite genotypes were heavily biased towards the Davis Strait (N = 1050) and the Barents Sea (N = 454), Chukchi Sea (N = 266), and Southern Beaufort Sea (N = 233) MUs. Genetic data were compiled from disparate sources (each having been genotyped at different sets of loci), and therefore there are systematic patterns of missing data (Fig 1). Notably, missing data exceeds 80% for marker MU26 and 30% in the Barents Sea MU. Because various programs treat missing data differently (e.g., Structure ignores missing genotypes, GenoDive assigns missing genotypes random values based on allele frequencies, and BayesAss imputes missing genotypes), we reduced the dataset to include only the 14 loci reliably genotyped in all 18 MUs (Fig 1). After an initial analysis, we noticed that Gulf of Boothia was unexpectedly quite divergent from other MUs when using the stepwise-mutation model. We then discovered that the Peacock et al., 2015  genotypes for the locus CXX20 were duplicated from the locus CXX110 for many Gulf of Boothia individuals. We replaced these errant CXX20 genotypes with the original genotypes from Paetkau et al., 1999 .
The size of the square at each management unit–locus intersection is proportional to the amount of missing data at that locus in that management unit. Management unit abbreviations are as in Table 1. Asterisks denote loci that were retained for the reanalysis presented in this paper.
In Peacock et al., 2015 , first-degree relatives were excluded based on field data; however, their microsatellite dataset for Davis Strait includes 1050 individuals sampled mostly between 2005 and 2007 (out of an estimated population size of 2158 individuals ), and therefore likely still includes many unknown first- and second-degree relatives, the presence of which can cause inaccurate Structure results [25, 26]. Structure also struggles with unbalanced sample sizes  and typical pairwise FST calculations can be biased by unequal sample sizes as well . Therefore, for MUs having more than 30 microsatellite-genotyped individuals, we used the sample() function in R 3.2.0  to randomly select 30 fully genotyped individuals for inclusion in the reduced dataset used in this paper. We used 30 individuals as a cutoff because this was the number used in the last global analysis of population structure , and because this number has been shown to be adequate for estimating allele frequencies and FST from microsatellite data  (however, cf. ). One individual from the Laptev Sea had missing data at all loci and was discarded. Our final dataset contained 495 individuals (Table A of S1 File). After filtering all loci and individuals, we checked for Hardy–Weinberg equilibrium in each subpopulation in GenoDive 2.0b27  using Nei’s GIS statistic  (1000 permutations) and for linkage disequilibrium (LD) between loci using Fisher’s method across MUs in GenePop 4.3  (default settings). Unless otherwise indicated, a significance level of α = 0.05 was used for all tests, with a Holm correction  in the p.adjust() function of R to account for multiple tests where appropriate.
Mitochondrial sequence data
We obtained haplotypes from GenBank according to accession numbers and haplotype counts specified in S2 Table of Peacock et al., 2015 . The haplotypes UMACR17 and UMACR87 were identical, so we combined these haplotype counts in our dataset. We aligned sequences using MAFFT 7.221 (1PAM/κ = 2 scoring matrix and default settings for auto-strategy; ) and after trimming extraneous bases from the ends of the alignment, we found that UMACR88 and UMACR3 were also identical, so we merged these counts as well. We estimated the optimal substitution model under the corrected Akaike information criterion (AICc; ) using jModelTest 2.1.7 (default settings; [38, 39]) and calculated summary statistics for mtDNA using Arlequin 18.104.22.168 .
Genetic differentiation, principal components analysis, and AMOVAs
To determine if microsatellites were likely to underestimate population differentiation because of high mutation rates or marker diversity, we tested for a correlation between GST and HS using CoDiDi 1.0 (100,000 permutations; ). We calculated pairwise FST values [42, 43] between MUs, as well as AMOVAs  using GenoDive. We also calculated pairwise RST [43, 44] using SPAGeDi 1.4b ; however, these results are not presented because an allele-size permutation test (10,000 permutations; ) suggested that microsatellite allele sizes were uninformative. To explore the data, we performed a principal components analysis (PCA) of individual genetic variation using adegenet 1.4–2 (centred and scaled, missing data set to mean; [47, 48]). To examine the robustness of our primary conclusion (i.e., the divergence of Norwegian Bay) to the 30-individual sampling process we used to generate our reduced dataset, we also plotted PCAs for 100 additional randomly generated subsamples of the full dataset. We generated a population tree using the recommendations for the infinite-allele model in Takezaki and Nei, 1996 : we estimated the topology of the tree with unweighted pair-group method with arithmetic mean (UPGMA) in POPTREEW  using Nei’s DA , then we unrooted the tree and estimated branch lengths using Nei’s standard distance (DS)  using non-negative least squares in phangorn 1.99–13 .
For mtDNA, we calculated pairwise FST and ΦST values (and their corresponding AMOVAs ) using Arlequin. For ΦST calculations, distances between haplotypes were calculated using the Tamura & Nei substitution model  with gamma-distributed rate heterogeneity (α = 0.021), which was determined as the optimal model of evolution under the AICc. Significance of all pairwise measures was assessed using 10,000 permutations. We also conducted exact tests of population differentiation in GenePop for microsatellites and in Arlequin for mtDNA (default settings). Significance of AMOVAs was not tested because of circularity in the logic of doing so for pre-clustered groups . Pairwise FST values for microsatellites and pairwise ΦST values for mtDNA were compared with the expectation of , as was used in Peacock et al., 2015 , to determine whether polar bears exhibit male-biased gene flow.
Clustering methods and isolation by distance
The settings used for Structure analysis (e.g., number of repetitions, length of burn-in, priors) were not given in Peacock et al., 2015 . We followed the recommendations of Gilbert et al., 2012 : 20 independent runs of 200,000 iterations (incl. 100,000 burn-in iterations) using the correlated allele frequencies model with no location prior using Structure 2.3.4 [58, 59]. Runs were clustered and averaged using CLUMPAK 1.1 (default settings; ), and support for K-values was generated in CLUMPAK’s “Best K” feature using the Evanno method  and the Pritchard method . As has been recommended in the case of low genetic differentiation , we compared the output from Structure with output from BAPS 6.0 using its non-spatial admixture mode (Kmax = 20; Nmin = 5; Nit = 100; Nref_ind = 200; Nref_it = 10; [64, 65]). To infer genetic clusters for individuals used in the original study but not included in our reduced dataset, we used trained clustering in BAPS [65–67], using non-admixed individuals from each genetic cluster as the training set. We also grouped MUs using AMOVA-based K-means clustering in GenoDive for K = 6, which was found to be the optimal K-value in Structure analyses. Finally, to confirm the hierarchical structure (i.e., east–west differentiation) that we detected within the Canadian Arctic Archipelago and the Polar Basin, we ran Structure on the full set of samples collected within each of these MUs using LOCPRIOR = 1 to improve the power to detect weak differentiation .
Isolation by distance (IBD) can confound clustering analyses . Because the optimal Structure results for K = 6 showed an east–west cline in Q-values across the Polar Basin, and because there was a large sampling discontinuity in the middle of this cline (i.e., in the Laptev Sea MU), we suspected that one of these two clusters may have been spuriously generated by IBD. To test for IBD across this region, we performed a Mantel test between genetic distances  and geographical distances (calculated in SPAGeDi) for all individuals that were highly assigned (i.e., CLUMPAK-averaged Q ≥ 0.7) to either the Eastern or Western Basin clusters (N = 62). To determine if IBD alone might be responsible for observed east–west genetic clustering in the Basin, we performed a partial Mantel test of association between a matrix of genetic distances and a model matrix denoting whether each pair of individuals belonged to the same genetic cluster (= 0) or not (= 1), while conditioning on geographical distances (cf. ). Both tests were conducted in vegan 2.2–1 , using 10,000 permutations to test for significance.
Using BayesAss 3.0.3 , we attempted to re-estimate rates of gene flow between five of our six regions (Eastern & Western Polar Basin, Eastern & Western Archipelago, Hudson Complex) and—for comparison—between three of the four major regions identified as optimal by Paetkau et al., 1999  and in our K = 4 Structure results (i.e., Polar Basin, Archipelago, Hudson Complex). We omitted Norwegian Bay from both of these runs because its small sample size might result in non-converged estimates , and before running BayesAss, we used assignment tests in GenoDive to remove any significant (default settings, 1000 permutations) immigrants from Norwegian Bay found in other MUs. Because our dataset of ~30 samples per MU does not accurately reflect differences in MU population size that would affect gene-flow estimates when MUs were merged into regions, we generated balanced subsets using the sampling regimes shown in Table D of S1 File. Individuals were selected for inclusion in these subsets blindly (i.e., without viewing their genetic cluster membership) while attempting to obtain geographical balance and high sample sizes of 100–150 individuals per region, which have been shown to be correlated with the probability of convergence . For direct comparison with Peacock et al., 2015 , we also generated a balanced subset corresponding to their four regions (Eastern & Western Polar Basin, Archipelago, Hudson Complex). For all BayesAss runs, we followed the recommendations of Faubet et al., 2007  (i.e., ten runs with different random seeds, Nit = 21,000,000, Nburn-in = 2,000,000, sampling interval = 2,000), and we used the Bayesian deviance (as calculated in the calculateDeviance.R script from Meirmans, 2014 ) to select the best run. Convergence of parameter estimates in these best runs was also assessed by manual examination of trace files, and by using the Heidelberger-and-Welch diagnostics  in boa 1.1.8–1 . Significant differences in proportions of immigrant ancestry were assessed using non-overlapping 95% CIs. To ensure that we were not unintentionally broadening CIs by using only 14 loci, we also performed runs for all datasets including all 21 loci. Finally, to test whether the placement of the Laptev Sea MU (which straddles the apparent boundary between the Western and Eastern Polar Basin clusters) affected our results, we considered a run that excluded this MU entirely.
Nuclear microsatellite and mitochondrial DNA statistics
We found that one MU, the Laptev Sea, exhibits significant heterozygote deficiency (GIS = 0.15, P < 0.001; Table 1), likely because of a Wahlund effect  caused by discontinuous sampling in this MU: there is a >1,400 km gap between western and eastern Laptev Sea samples. Because subpopulation deviation from Hardy–Weinberg equilibrium affects F-statistics, we followed Paetkau et al., 1999  in excluding the Laptev Sea from all MU-based analyses such as LD and pairwise FST. For AMOVAs and BayesAss analyses of gene flow among major genetic clusters, we apportioned the Laptev Sea MU’s eastern and western samples into the eastern and western Polar Basin clusters, respectively.
For microsatellite data, a maximum of 30 individuals have been retained from each management unit from the original dataset of 2,748 individuals, and only the 14 loci indicated in Fig 1 have been used. Molecular diversity indices for mitochondrial DNA were calculated in Arlequin using pairwise differences with no gamma correction.
No locus deviated significantly from Hardy–Weinberg equilibrium. Two pairs of loci were in significant LD (G10B–G10J, G10B–G10X); however, both had P = 0 in one MU, which causes problems for Fisher’s method , and neither pair is located on the same genomic scaffold . Even if the scaffolds were contiguous within a chromosome, these markers would be separated by >5 Mb, and at these distances, LD is negligible in polar bears . Therefore, we assumed these were false positives, and we elected to keep all 14 microsatellite markers for subsequent analysis. HS and GST were not significantly negatively correlated (r = –0.008, P = 0.467), indicating that microsatellites were unlikely to underestimate population differentiation because of high mutation rates or marker diversity. Three MUs had inadequate sampling (i.e., N ≤ 3, k = 1) to accurately determine mitochondrial haplotype frequencies: namely, M’Clintock Channel, Norwegian Bay, and Viscount Melville Sound. Therefore, we excluded these MUs in pairwise population comparisons and AMOVAs of mtDNA.
Clustering of individuals and management units
CLUMPAK-averaged admixture plots for K = 2–7 are shown in Fig 2. Progressively, they show the addition of clusters that largely correspond to the following, with some apparent admixture and migration: K = 2: the Polar Basin, K = 3: the Canadian Arctic Archipelago, K = 4: Norwegian Bay, K = 5: west–east differentiation in the Polar Basin, K = 6: west–east differentiation in the Canadian Arctic Archipelago, K = 7: apparent noise. Although the Evanno ΔK method selected K = 2 (Fig 3b), likelihood was maximized at K = 6 (Fig 3a)—this was the number of clusters preferred using the Pritchard method (Fig 3c), and there was also a small peak in ΔK at this value.
Numbers under each K-value indicate the proportion of runs that converged to the solution presented. Minority modes supported by at least two runs are provided in Fig A in S1 File. Management unit abbreviations are as in Table 1.
(a) Structure output of mean likelihood ± SD calculated from 20 independent runs for each value of K from 1 to 20. (b) ΔK calculated using the Evanno method in CLUMPAK. (c) Probability of K calculated using the Pritchard method in CLUMPAK.
The six genetic clusters we detected correspond roughly to: the Hudson Complex (incl. Labrador), the Eastern Archipelago, the Western Archipelago, Norwegian Bay, the Eastern Polar Basin, and the Western Polar Basin. Because these results were geographically defensible and corresponded roughly with previously discovered genetic structure in the Archipelago  and across the Polar Basin , we accepted K = 6 for our Structure analysis. Regional Structure analyses using the full dataset and LOCPRIOR = 1 also detected east–west differentiation within the Archipelago and within the Basin, with possible additional clusters present in the Gulf of Boothia and in the Chukchi Sea (Fig B in S1 File). GenoDive clustering of MUs for K = 6 reached similar conclusions as Structure (cf. shaded areas in Table 2), splitting the Archipelago into Eastern (KB, BB, northern DS) and Western (VM, GB, MC, LS) clusters and splitting the Polar Basin into Eastern (EG, KS, BS, eastern LP) and Western (SB, NB, CS, western LP) clusters.
MU abbreviations are as in Table 1. Solid lines demarcate the four major clusters discovered by Paetkau et al., 1999 , which correspond to our Structure results for K = 4. From left to right, these are: the Hudson Complex, the Canadian Arctic Archipelago, Norwegian Bay, and the Polar Basin. Dotted lines denote the west–east clusters within the Basin and the Archipelago detected by K-means clustering in GenoDive. These six clusters include additional east–west substructure within the Archipelago and within the Polar Basin. DS is an admixture zone showing affinity for both Hudson Complex and the Archipelago, with southern samples tending to belong to the Hudson Complex cluster and northern samples tending to belong to the Eastern Archipelago cluster. LP has been excluded from all comparisons because it deviates significantly from Hardy–Weinberg equilibrium. For mitochondrial DNA, MC, VM, and NW were omitted because sample sizes were too small (i.e., N ≤ 3, k = 1) to accurately estimate haplotype frequencies.
There is significant IBD across the Polar Basin (Mantel test: r = 0.2354, P < 0.0001), though genetic clustering remained marginally significant after accounting for IBD (partial Mantel test: r = 0.06391, P = 0.039). Therefore, we decided to retain both the Eastern and the Western Basin clusters, though we note that traversable distances between individuals in this region will be underestimated by SPAGeDi if it calculates distances over the poor-quality Arctic Basin MU. Results are mapped in Fig 4.
Individuals are colour-coded by genetic cluster similarly to the colour scheme for K = 6 in Fig 2. Black samples are unassigned (i.e., Qmax < 0.5). Uncoloured individuals are those that were used in the original study but were not included in our random subset of 30 individuals per management unit; their predicted cluster memberships based on BAPS trained clustering are shown in Fig C of S1 File. Management unit abbreviations are as in Table 1. Approximate sea ice extent during the breeding season is shown using measurements for April 15, 2008 , though there is great spatial heterogeneity in sea ice thickness and concentration, as well as great intra-seasonal and inter-annual variability. Note that this map (and the data) does not reflect a 2014 boundary change between NB and SB made by the territorial governments and the co-management boards with management authority for these two subpopulations, because it has not yet been evaluated by the IUCN Polar Bear Specialist Group.
Mixture analysis in BAPS found K = 6 as being optimal; however, one of these clusters contained only a single individual with missing data at four loci, perhaps indicative of the unexpected effects that missing data can have upon such methods. This single-individual cluster was discarded prior to admixture analysis. The remaining clusters were: the Hudson Complex, the Eastern and Western Canadian Arctic Archipelago, the Polar Basin, and Norwegian Bay. Results of the BAPS admixture analysis based on these five clusters is found in Fig C of S1 File; they differ from the optimal Structure results for K = 6 in that there is less admixture and no distinction of the Eastern/Western Polar Basin. Trained clustering in BAPS using K = 5 gave sensible estimates of genetic-cluster membership for all individuals not included in our main study (Fig D of S1 File).
Our PCA and population tree reveal four broad groupings of MUs that correspond to the clusters identified by our Structure results for K = 4 and by Paetkau et al., 1999  (Fig 5): the Polar Basin (CS, SB, BS, NB, EG, KS), the Archipelago (MC, VM, LS, BB, KB, GB), Norwegian Bay (NW), and the Hudson Complex (DS, FB, WH, SH). These four groupings were also seen in most of our other 100 randomly resampled subsets of individuals (Figs E and F of S1 File). The six genetic clusters selected by GenoDive explain ~3.9% of the nuclear genetic variation and ~9.4% of genetic variation in mtDNA. MU designations within clusters explain 0.8% and 6.0% for microsatellites and mtDNA respectively (Tables 3, 4). Overall, MUs were slightly to moderately differentiated (average pairwise microsatellite FST = 0.04). Tests of pairwise population differentiation revealed many significant differences between major genetic clusters, but few significant differences within clusters. In total, 121/136 (≈89%) of population pairs were significantly differentiated after a Holm correction based on nuclear FST, genic, or genotypic differentiation (compared to only 20% in Peacock et al., 2015 , who also included tests for RST). Importantly, all tests of genetic differentiation show that Norwegian Bay is significantly differentiated from all other MUs (Table 2, Table B of S1 File). Though Gulf of Boothia differed significantly from most nearby MUs in tests of genotypic and genic differentiation of nuclear markers (Table B of S1 File), it was not as well differentiated from other members of the Western Archipelago using pairwise FST or ΦST (Table 2) or tests of haplotypic differentiation for mtDNA (Table C of S1 File).
(a) Principal component analysis of genetic variation. Each point represents an individual; each individual is connected to a label indicating the centroid of the management unit in which it was sampled. The inertia ellipse for each management unit contains approximately two-thirds of all individuals sampled there. (b) Population tree. The scale bar indicates Nei’s standard distance; branch lengths were estimated using non-negative least squares, and the tree has an R2  of 0.903. Samples from the Laptev Sea (LP) have been excluded in (b) because a large spatial discontinuity in sampling in this management unit resulted in it being significantly out of Hardy–Weinberg equilibrium. Management unit abbreviations are as in Table 1 and are coloured as in Fig 4.
For this analysis, we followed Peacock et al., 2015  by including northern Davis Strait in the Eastern Archipelago cluster and southern Davis Strait in the Hudson cluster. Western Laptev was included in the Western Basin cluster and Eastern Laptev was included in the Eastern Basin cluster. However, results did not differ significantly when the Laptev Sea and Davis Strait MUs were excluded entirely.
Note that many management units (incl. the entire Norwegian Bay cluster) were excluded entirely from this AMOVA because of inadequate sampling. Because we lacked sample location information for downloaded haplotypes, we were unable to split Davis Strait or the Laptev Sea into northern/southern or eastern/western samples; therefore, these MUs were removed for this calculation in addition to the MUs that were removed for low sample sizes in Table 2.
Recent gene flow and sex-biased dispersal
All “best” BayesAss runs for each population grouping (selected based on the deviance) were at stationarity after burn-in, according to Heidelberger-and-Welch diagnostics. Estimates were surprisingly robust to large amounts of systematically missing data, as results for 14 loci and 21 loci were nearly identical in terms of means and confidence intervals (Fig 6); however, because runs for 14 loci had larger effective sample sizes, we discuss these results below. All runs gave highly similar estimates of gene flow among the Polar Basin, the Hudson Complex (incl. Labrador), and the Canadian Arctic Archipelago. In no case was there a significant difference in the proportion of migrants into any of these populations. Within these major clusters, BayesAss suggested highly directional gene flow from the Western Polar Basin into the Eastern Polar Basin, and from the Western Archipelago into the Eastern Archipelago; however, these highly directional estimates are likely untrustworthy, as discussed below. Immigration rates and proportions of non-migrant ancestry are given in Tables E–G of S1 File. Exclusion of the Laptev Sea MU did not change the estimates of migration (Table H of S1 File).
Segment colours are as in Fig 4 and are sized proportionally to the population size estimates in Table D of S1 File, though polar bear population sizes are estimated with very broad confidence intervals, particularly in the Polar Basin, where reliable estimates are not available for most MUs. The width of each ribbon where it meets a segment on the circumference indicates the proportion of migrants into (but not out of) each region. Black ribbons are significantly directional based on non-overlapping 95% CIs of immigration rates; grey ribbons are not significantly directional.
To test for sex-biased dispersal across MU boundaries, we plotted pairwise FST for microsatellites against pairwise ΦST for mtDNA, as in Peacock et al., 2015  (Fig 7b). In contrast to estimates from Peacock et al., 2015 , more points lie on or above the line of expectation (i.e., the line at which microsatellites differentiate populations as well as mitochondrial haplotypes), and the extreme values most supportive of strong male-biased dispersal, such as zero-estimates for microsatellites FST and one-estimates for mtDNA ΦST are absent. Inferences of sex-biased dispersal can also be made from the R-ratios of mitochondrial and nuclear F-statistics from AMOVAs, where R≪1 suggests female-biased dispersal and R≫4 suggests male-biased dispersal . In our AMOVAs, R-ratios were ΦSC:FSC = 8.3:1 for genetic variance among MUs within clusters and ΦCT:FCT = 2.4:1 for genetic variance among clusters.
(a) S4 Fig from Peacock et al., 2015 , reproduced here under the terms of its Creative Commons CC0 license. (b) A recreated version of this graph, generated using our recalculated FST and ΦST values. In (b), M’Clintock Channel, Norwegian Bay, and Viscount Melville Sound were excluded because of inadequate sample sizes for mitochondrial DNA (N ≤ 3) and the Laptev Sea was excluded because this MU was significantly out of Hardy–Weinberg equilibrium. Points for brown bears were not recalculated and are not shown. Coloured points indicate intra-cluster MU pairs (coloured as in Fig 4); grey points indicate inter-cluster MU pairs.
Worldwide population structure of polar bears
In contrast to Peacock et al., 2015 , but similarly to Paetkau et al., 1999 , we detected four major genetic clusters of polar bears worldwide, additionally finding east–west sub-clusters in the Polar Basin and in the Canadian Archipelago. These findings corroborate previous studies of polar bear genetic structure [4, 12, 17, 19]. We note that we failed to detect a unique genetic cluster of bears on Akimiski Island in James Bay (Southern Hudson Bay) [10, 14, 20], which were not considered separately in this range-wide analysis because of low sample size. Our pairwise FST values between MUs were very similar to those calculated by Paetkau et al. 1999 , and differ tremendously from those in Peacock et al., 2015 , which appear to have been incorrectly calculated: most values in Peacock et al., 2015  are negative, and they range as low as –0.26. Although negative values from the Weir–and–Cockerham estimator of FST  are possible (especially when sample sizes and sample variance in allele frequencies are low), they are typically not this extreme. We were unable to reproduce Peacock et al., 2015 ’s FST values using GenoDive, FSTAT, or Genepop on the full dataset; in all cases, the calculated values were similar to our own and those of Paetkau et al. 1999 . Only when Arlequin was used under certain conditions were we able to reproduce these values. Specifically, the errant values in Peacock et al., 2015  are an artefact caused by large amounts of missing data; they result only when one fails to enforce any missing-data cutoff in Arlequin (Table I of S1 File). When a reasonable missing data cutoff (e.g., 5%) is used, then sensible FST values consistent our own and those of Paetkau et al., 1999  are produced (Table I of S1 File).
Under our grouping of MUs, the variance explained by genetic clusters (~4% for nuclear, ~9% for mitochondrial) was maximized through K-means clustering, and suggests moderate divergence among clusters. The four major genetic clusters are mostly separated by landmasses and multiyear ice that form barriers to gene flow for polar bears. The Hudson Complex and the Canadian Archipelago are separated by Baffin Island, Labrador, and the Melville Peninsula [3, 82]; the Archipelago and the Polar Basin are separated by Greenland in the east and by Banks and Victoria Islands in the west [3, 83]; and Norwegian Bay and the Archipelago are separated by thick multiyear ice, islands, and polynyas . Genetic structure within the four major clusters is likely driven by broad-scale site fidelity to breeding and denning areas [3, 84, 85] and annual reuse of geographically predictable hunting grounds, such as tide cracks and lead systems [3, 86].
Based on our reanalysis of the original data from Peacock et al., 2015 , we have re-established Norwegian Bay as a distinct genetic cluster of polar bears near the northernmost reaches of Canada. Norwegian Bay is currently estimated to comprise 203 individuals (95% CI: 115–291; ), and—together with the neighbouring Queen Elizabeth Islands—it has previously been proposed as a separate designatable unit of polar bears based on ecological and genetic factors . The status of this cluster is particularly relevant for polar bear conservation, as it is expected to coincide with Canada’s last sea-ice refugium . This subpopulation has anecdotally been reported to be phenotypically unique , and we are currently conducting additional genetic analyses on this cluster, including genome scans on more recently collected samples. The Norwegian Bay cluster was likely not revealed in the analyses of Peacock et al., 2015  because of highly unequal sample sizes, and perhaps also by the presence of many related individuals in Davis Strait, which can confound Structure analyses [25–27]. In addition, genetic clusters in Peacock et al., 2015  were selected partially based on comparison of AMOVAs, and the existence of Norwegian Bay as a separate genetic cluster was not among the hypotheses tested (S7 Table of Peacock et al., 2015 ). In addition, all AMOVAs for microsatellites in Peacock et al., 2015  have negative ΘSC values or purportedly explain negative percentages of variance. We were unable to reproduce these unusual AMOVA results using Arlequin on the full dataset (e.g., Table J of S1 File).
Although an analysis of sex-biased dispersal was presented in Peacock et al., 2015 , it gave erroneous results because of incorrectly calculated FST values and the inclusion of populations with inadequate mtDNA sampling (Fig 7a). After correcting for these issues, we find there is little evidence that gene flow is strongly male-biased using the method in Peacock et al., 2015 . In contrast, in AMOVAs, mitochondrial ΦSC values were 8.3× nuclear FSC values (whereas ΦCT is only 2.4× FCT), which may suggest male-biased dispersal within—but not among—genetic clusters (however, cf. Fig 7b). Unfortunately, this comparison is hindered by different sampling regimes for mtDNA vs. nuclear DNA, including low within-cluster sampling of mtDNA (Table 2). In addition, any direct comparison of differentiation between uniparentally and diparentally inherited markers must be interpreted with caution, as such methods generally assume an effective-population-size ratio of 4:1, which is often untrue . Though it would be better to perform sex-specific comparisons using nuclear markers, these methods may be underpowered unless bias in gene flow is extreme (i.e., 80:20), and they may also suffer from pseudoreplication [91, 92]. Therefore, the true extent of sex-biased dispersal in polar bears remains undetermined. Previous genetic studies have also reported conflicting findings of male-biased dispersal [19, 93], as have radio-telemetry studies of home-range size [94, 95]. Based on distances between recaptures, male polar bears appear to have only slightly larger home ranges than females, and this is perhaps because females move less when accompanied by cubs .
Are polar bears migrating en masse into the Canadian Archipelago?
Polar bears rely on sea ice as a platform for locomotion , hunting , mating , and—in some areas—denning . If climate change continues to reduce the extent and duration of Arctic sea ice, polar bears are likely to respond with altered movement patterns, resulting in increased mixing and gene flow between adjacent subpopulations . To determine if changes in movement were already occurring, Peacock et al., 2015  compared recent gene flow (i.e., over the past two generations) calculated using BayesAss with historical gene flow (i.e., time since the most recent common ancestor) calculated using Migrate . They found an apparent reversal of gene flow over time, suggesting a recent influx of polar bears into the Canadian Archipelago from Southern Canada. However, the sampling regime for their BayesAss analysis was not described in the manuscript, and their results show known signs of non-convergence [21, 22]. A correction to the Supplementary Material of Peacock et al., 2015   published while our manuscript was in review states that individuals were randomly sampled from within the four populations used, with sample sizes of 26, 34, 60, and 60, for the Western Basin, Eastern Basin, Canadian Archipelago and Southern Canada, respectively. Unfortunately, BayesAss typically works best when sample sizes are much larger than this , and we were unable to reproduce these results using our own geographically balanced sampling regime with >100 samples per region.
In fact, within the Polar Basin, our BayesAss results detected exactly the opposite pattern of Peacock et al., 2015 : namely, ~60-fold directional gene flow into the Eastern Polar Basin from the Western Polar Basin. This pattern held true in all 40 runs that included an Eastern–Western Polar Basin split. Similarly, our BayesAss results showed ~30-fold directional gene flow from the Western Archipelago into the Eastern Archipelago, though this pattern only held true in 8/20 runs; the remaining 12/20 runs suggested ~30-fold directional gene flow from the Eastern Archipelago into the Western Archipelago. Estimates of these immigration rates were close to the upper bound of 1/3 and—taken together—this suggests that all estimates of gene flow within the Archipelago and the Polar Basin in both this paper and in Peacock et al., 2015  are untrustworthy, probably because of low genetic differentiation (FST ≈ 0.01) between these regions [21–23]. We similarly failed to confirm directional gene flow from the (Eastern) Polar Basin into the Canadian Archipelago; in all of our reanalyses, migration rates between these regions are not significantly directional. Although not significantly different, we did find that immigration into the Canadian Archipelago from Southern Canada (~4.9%) was slightly higher than in the reverse direction (~2.1%). The robustness of this finding across our different sampling regimes and the sampling regime of Peacock et al., 2015  suggests that there may be slight northward gene flow into the Archipelago. Finally, we note that even our preferred BayesAss run (i.e., the 3-cluster run in Fig 6) may be interpreted as having not reached convergence, since proportions of non-migration have been estimated with small variance near the upper bound (Table E of S1 File). However, we believe that these estimates of low gene flow are realistic because the regions are largely separated by land and multiyear ice.
Among the first analyses conducted in Peacock et al., 2015  were decadal comparisons of population structure to determine if it was safe to pool samples collected between the 1980s and the 2010s (S3 Table of Peacock et al., 2015 ). Their results showed that population composition had not changed significantly over this period in any of the regions examined. If polar bears had experienced substantial directional gene flow in response to recent climate change, it seems unlikely that this would not have resulted in detectable changes to population structure over this period, especially since Peacock et al., 2015 ’s high immigration rates of ~15% from both the Eastern Polar Basin and Southern Canada suggest that the Canadian Archipelago would likely not be demographically independent . Although Arctic sea ice has been declining in thickness and extent in some regions since at least the 1950s [104, 105], the rapid loss of sea ice since the mid-1990s has been unprecedented over the last 1,450 years . Therefore, we would expect to see changes in composition from the 1980s to the contemporary samples; however, no such changes were observed. Though our Structure plots suggest a substantial amount of migration and admixture among clusters, there is no clear pattern of directional gene flow. Further, these results might overestimate the amount of mating between genetic clusters, since Structure may be sensitive but not specific with respect to admixture , and cluster membership is estimated with extremely broad credible intervals when using a small number of markers [16, 108]. Therefore, we find the suggestion of mass gene flow into the Archipelago from Southern Canada and the Polar Basin uncompelling, and we strongly caution against managing Arctic Archipelago MUs as if they were being replenished by immigration.
The three–four major genetic clusters selected in Peacock et al., 2015  were selected based on faulty analyses, including miscalculated FST values, AMOVAs, and significance levels. The study was also compromised by highly unbalanced sample sizes and possibly by the inclusion of first- and second-degree relatives, as well as retention of large amounts of systematically missing data. One consequence of these data and analysis issues was the failure to detect a distinct subpopulation of polar bears in Norwegian Bay near Canada’s expected last sea-ice refugium. BayesAss results suggesting a recent influx of bears into the Archipelago and the Western Polar Basin showed known signs of non-convergence, and they were not supported in our own runs of the program. We therefore find the suggestion of strong recent directional gene flow into the Archipelago uncompelling. Many of these problems became obvious only upon examining the paper’s supplementary material; the original authors of Peacock et al., 2015  should be commended for the well-documented results they made available, which allowed us to detect the issues in their study. Recently, supplementary material has been accused of being poorly peer-reviewed, thereby threatening the integrity of the scientific literature . We hope that this example will serve as a reminder to both authors and reviewers to scrutinize this supplementary material more closely in the future. In the interest of even greater openness, we have deposited inputs, outputs, and scripts used to perform our analyses at Open Science Framework (http://osf.io/kqcr4). We encourage both reviewers and readers to further explore this invaluable dataset.
Peacock et al., 2015  was discussed in depth at a Coltman–Davis lab meeting, and the authors benefitted from discussion with all present, including Zijian Sim, Joshua Miller, Curtis Strobeck, Jamie Gorrell, Michelle Viengkone, Christi Bubac, Ian Burns, and Jessica Haines. Nick Lunn and Evan Richardson provided helpful comments on a draft version of this manuscript. Two anonymous referees also provided helpful comments. RMM has been funded by scholarships from Alberta Innovates Technology Futures, the University of Alberta, and the Province of Alberta. Part of this work was funded by an NSERC Discovery Grant to DWC (Grant ID 312207–2011).
Conceived and designed the experiments: RMM CIC. Performed the experiments: RMM. Analyzed the data: RMM. Contributed reagents/materials/analysis tools: RMM CSD DWC. Wrote the paper: RMM. Identified problems in the original analysis: RMM CSD CIC DWC.
- 1. Pagano AM, Durner GM, Amstrup SC, Simac KS, York GS. Long-distance swimming by polar bears (Ursus maritimus) of the southern Beaufort Sea during years of extensive open water. Can J Zool. 2012;90(5):663–76.
- 2. Sahanatien V, Derocher AE. Foxe Basin Polar Bear Project 2010 Interim Report—Part 1: Movements, Habitat, Population Delineation & Inuit Qaujimajatugangnit. Iqaluit, Canada: Nunavut Department of Environment; 2010. p. 29.
- 3. Taylor MK, Akeeagok S, Andriashek D, Barbour W, Born EW, Calvert W, et al. Delineating Canadian and Greenland polar bear (Ursus maritimus) populations by cluster analysis of movements. Can J Zool. 2001;79(4):690–709. ISI:000168172400018.
- 4. Paetkau D, Amstrup SC, Born EW, Calvert W, Derocher AE, Garner GW, et al. Genetic structure of the world's polar bear populations. Mol Ecol. 1999;8(10):1571–84. ISI:000083466800002. pmid:10583821
- 5. Ferguson SH, Taylor MK, Born EW, Rosing-Asvid A, Messier F. Determinants of home range size for polar bears (Ursus maritimus). Ecol Lett. 1999;2(5):311–8.
- 6. Durner GM, Amstrup SC. Movements of a polar bear from northern Alaska to northern Greenland. Arctic. 1995;48(4):338–41.
- 7. Paetkau D, Calvert W, Stirling I, Strobeck C. Microsatellite analysis of population structure in Canadian polar bears. Mol Ecol. 1995;4(3):347–54. ISI:A1995RF43700007. pmid:7663752
- 8. Obbard ME, Thiemann GW, Peacock E, DeBruyn TD, editors. Polar Bears: Proceedings of the 15th Working Meeting of the IUCN/SSC Polar Bear Specialist Group. 15th Working Meeting of the IUCN/SSC Polar Bear Specialist Group; 2009 29 June–3 July, 2009; Copenhagen, Denmark. Gland, Switzerland Cambridge, UK: IUCN; 2010.
- 9. Amstrup SC, Durner GM, Stirling I, McDonald TL. Allocating harvests among polar bear stocks in the Beaufort Sea. Arctic. 2010;58(3):247–59.
- 10. Obbard ME, Middel KR. Bounding the Southern Hudson Bay polar bear subpopulation. Ursus. 2012;23(2):134–44.
- 11. Taylor M, Lee J. Distribution and abundance of Canadian polar bear populations: a management perspective. Arctic. 1995;48(2):147–54. ISI:A1995RF80000005.
- 12. Malenfant RM, Coltman DW, Davis CS. Design of a 9K Illumina BeadChip for polar bears (Ursus maritimus) from RAD and transcriptome sequencing. Mol Ecol Resour. 2015;15(3):587–600. pmid:25187336
- 13. Taylor MA, Laake J, McLoughlin PD, Cluff HD, Messier F. Mark–recapture and stochastic population models for polar bears of the High Arctic. Arctic. 2009;61(2):143–52.
- 14. Crompton AE, Obbard ME, Petersen SD, Wilson PJ. Corrigendum to “Population genetic structure in polar bears (Ursus maritimus) from Hudson Bay, Canada: Implications of future climate change” [Biol. Conserv. 141(10) (2008) 2528–2539]. Biol Conserv. 2014;179:152.
- 15. Crompton AE, Obbard ME, Petersen SD, Wilson PJ. Population genetic structure in polar bears (Ursus maritimus) from Hudson Bay, Canada: implications of future climate change. Biol Conserv. 2008;141(10):2528–39. ISI:000260568300010.
- 16. Viengkone M. Population Structure and Space-use of Polar Bears (Ursus maritimus) in Hudson Bay [M.Sc. Thesis]. Edmonton, AB, Canada: University of Alberta; 2015.
- 17. Miller W, Schuster SC, Welch AJ, Ratan A, Bedoya-Reina OC, Zhao F, et al. Polar and brown bear genomes reveal ancient admixture and demographic footprints of past climate change. Proceedings of the National Academy of Sciences. 2012;109(36):E2382–E90.
- 18. Cronin MA, Macneil MD. Genetic relationships of extant brown bears (Ursus arctos) and polar bears (Ursus maritimus). J Hered. 2012;103(6):873–81. pmid:23125409.
- 19. Campagna L, Van Coeverden de Groot PJ, Saunders BL, Atkinson SN, Weber DS, Dyck MG, et al. Extensive sampling of polar bears (Ursus maritimus) in the Northwest Passage (Canadian Arctic Archipelago) reveals population differentiation across multiple spatial and temporal scales. Ecology and Evolution. 2013;3(9):3152–65. pmid:24102001
- 20. Peacock E, Sonsthagen SA, Obbard ME, Boltunov A, Regehr EV, Ovsyanikov N, et al. Implications of the Circumpolar Genetic Structure of Polar Bears for Their Conservation in a Rapidly Warming Arctic. PLoS ONE. 2015;10(1):e112021. pmid:25562525
- 21. Meirmans PG. Nonconvergence in Bayesian estimation of migration rates. Mol Ecol Resour. 2014;14(4):726–33. pmid:24373147
- 22. Faubet P, Waples RS, Gaggiotti OE. Evaluating the performance of a multilocus Bayesian method for the estimation of migration rates. Mol Ecol. 2007;16(6):1149–66. pmid:17391403
- 23. Wilson GW, Rannala B. Documentation for BayesAss 1.3. Berkeley, CA: University of California Berkeley; 2005.
- 24. Peacock E, Taylor MK, Laake J, Stirling I. Population ecology of polar bears in Davis Strait, Canada and Greenland. J Wildl Manage. 2013;77(3):463–76.
- 25. Rodriguez-Ramilo ST, Wang JL. The effect of close relatives on unsupervised Bayesian clustering algorithms in population genetic structure analysis. Mol Ecol Resour. 2012;12(5):873–84. WOS:000307927700011. pmid:22639868
- 26. Anderson EC, Dunham KK. The influence of family groups on inferences made with the program Structure. Mol Ecol Resour. 2008;8(6):1219–29. pmid:21586009
- 27. Kalinowski ST. The computer program STRUCTURE does not reliably identify the main genetic clusters within species: simulations and implications for human population structure. Heredity. 2011;106(4):625–32. pmid:20683484
- 28. Bhatia G, Patterson N, Sankararaman S, Price AL. Estimating and interpreting FST: the impact of rare variants. Genome Res. 2013;23(9):1514–21. pmid:23861382
- 29. R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2015.
- 30. Hale ML, Burg TM, Steeves TE. Sampling for microsatellite-based population genetic studies: 25 to 30 individuals per population is enough to accurately estimate allele frequencies. PLoS ONE. 2012;7(9):e45170. ARTN e45170 WOS:000308738500125. pmid:22984627
- 31. Fung T, Keenan K. Confidence intervals for population allele frequencies: the general case of sampling from a finite diploid population of any size. PLoS ONE. 2014;9(1):e85925. pmid:24465792
- 32. Meirmans PG, Van Tienderen PH. GENOTYPE and GENODIVE: two programs for the analysis of genetic diversity of asexual organisms. Mol Ecol Notes. 2004;4(4):792–4.
- 33. Nei M. Molecular Evolutionary Genetics. New York: Columbia University Press; 1987.
- 34. Rousset F. GENEPOP'007: a complete re-implementation of the GENEPOP software for Windows and Linux. Mol Ecol Resour. 2008;8(1):103–6. ISI:000253827100016. pmid:21585727
- 35. Holm S. A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics. 1979;6:65–70.
- 36. Katoh K, Misawa K, Kuma Ki, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30(14):3059–66. pmid:12136088
- 37. Hurvich CM, Tsai C-L. Regression and time series model selection in small samples. Biometrika. 1989;76(2):297–307.
- 38. Guindon S, Gascuel O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003;52(5):696–704. pmid:14530136
- 39. Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nat Methods. 2012;9(8):772.
- 40. Excoffier L, Lischer HEL. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10(3):564–7. ISI:000276407300020. pmid:21565059
- 41. Wang J. Does GST underestimate genetic differentiation from marker data? Mol Ecol. In press.
- 42. Excoffier L, Smouse PE, Quattro JM. Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics. 1992;131(2):479–91. pmid:1644282
- 43. Michalakis Y, Excoffier L. A generic estimation of population subdivision using distances between alleles with special reference for microsatellite loci. Genetics. 1996;142(3):1061–4. pmid:8849912
- 44. Slatkin M. A measure of population subdivision based on microsatellite allele frequencies. Genetics. 1995;139(1):457–62. pmid:7705646
- 45. Hardy OJ, Vekemans X. SPAGeDi: a versatile computer program to analyse spatial genetic structure at the individual or population levels. Mol Ecol Notes. 2002;2(4):618–20. WOS:000179678400078.
- 46. Hardy OJ, Charbonnel N, Fréville H, Heuertz M. Microsatellite allele sizes: a simple test to assess their significance on genetic differentiation. Genetics. 2003;163(4):1467–82. pmid:12702690
- 47. Jombart T. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics. 2008;24(11):1403–5. WOS:000256169300015. pmid:18397895
- 48. Jombart T, Ahmed I. adegenet 1.3–1: new tools for the analysis of genome-wide SNP data. Bioinformatics. 2011;27(21):3070–1. WOS:000296099300022. pmid:21926124
- 49. Takezaki N, Nei M. Genetic distances and reconstruction of phylogenetic trees from microsatellite DNA. Genetics. 1996;144(1):389–99. pmid:8878702
- 50. Takezaki N, Nei M, Tamura K. POPTREEW: web version of POPTREE for constructing population trees from allele frequency data and computing some other quantities. Mol Biol Evol. 2014;31(6):1622–4. pmid:24603277
- 51. Nei M, Tajima F, Tateno Y. Accuracy of estimated phylogenetic trees from molecular data. J Mol Evol. 1983;19(2):153–70. pmid:6571220
- 52. Nei M. Estimation of average heterozygosity and genetic distance from a small number of individuals. Genetics. 1978;89(3):583–90. pmid:17248844
- 53. Schliep KP. phangorn: phylogenetic analysis in R. Bioinformatics. 2011;27(4):592–3. pmid:21169378
- 54. Tamura K, Nei M. Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol. 1993;10(3):512–26. pmid:8336541
- 55. Meirmans PG. Seven common mistakes in population genetics and how to avoid them. Mol Ecol. In press.
- 56. Zink RM, Barrowclough GF. Mitochondrial DNA under siege in avian phylogeography. Mol Ecol. 2008;17(9):2107–21. pmid:18397219
- 57. Gilbert KJ, Andrew RL, Bock DG, Franklin MT, Kane NC, Moore JS, et al. Recommendations for utilizing and reporting population genetic analyses: the reproducibility of genetic clustering using the program STRUCTURE. Mol Ecol. 2012;21(20):4925–30. WOS:000309888200004. pmid:22998190
- 58. Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: Linked loci and correlated allele frequencies. Genetics. 2003;164(4):1567–87. WOS:000185248000029. pmid:12930761
- 59. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155(2):945–59. ISI:000087475100039. pmid:10835412
- 60. Kopelman NM, Mayzel J, Jakobsson M, Rosenberg NA, Mayrose I. Clumpak: a program for identifying clustering modes and packaging population structure inferences across K. Mol Ecol Resour. 2015:n/a–n/a.
- 61. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005;14(8):2611–20. WOS:000229961500029. pmid:15969739
- 62. Pritchard J, Wen X, Falush D. Documentation for structure software: Version 2.3. Chicago, USA: University of Chicago; 2010.
- 63. Latch EK, Dharmarajan G, Glaubitz JC, Rhodes OE. Relative performance of Bayesian clustering software for inferring population substructure and individual assignment at low levels of population differentiation. Conserv Genet. 2006;7(2):295–302. ISI:000238026300012.
- 64. Corander J, Marttinen P. Bayesian identification of admixture events using multilocus molecular markers. Mol Ecol. 2006;15(10):2833–43. pmid:16911204
- 65. Corander J, Marttinen P, Siren J, Tang J. Enhanced Bayesian modelling in BAPS software for learning genetic structures of populations. BMC Bioinformatics. 2008;9(1):539.
- 66. Corrander J, Marttinen P, Mäntyniemi S. A Bayesian method for identification of stock mixtures from molecular marker data. Fish Bull. 2006;104(4):550–8.
- 67. Cheng L, Connor T, Aanensen D, Spratt B, Corander J. Bayesian semi-supervised classification of bacterial samples using MLST databases. BMC Bioinformatics. 2011;12(1):302.
- 68. Hubisz MJ, Falush D, Stephens M, Pritchard JK. Inferring weak population structure with the assistance of sample group information. Mol Ecol Resour. 2009;9(5):1322–32. pmid:21564903
- 69. Meirmans PG. The trouble with isolation by distance. Mol Ecol. 2012;21(12):2839–46. pmid:22574758
- 70. Rousset F. Genetic differentiation between individuals. J Evol Biol. 2000;13(1):58–62.
- 71. Drummond CS, Hamilton MB. Hierarchical components of genetic variation at a species boundary: population structure in two sympatric varieties of Lupinus microcarpus (Leguminosae). Mol Ecol. 2007;16(4):753–69. pmid:17284209
- 72. Oksanen J, Blanchet FG, Kindt R, Legendre P, Minchin PR, O'Hara RB, et al. vegan: Community Ecology Package. 2.2–1 ed2015.
- 73. Wilson GA, Rannala B. Bayesian inference of recent migration rates using multilocus genotypes. Genetics. 2003;163(3):1177–91. pmid:12663554
- 74. Heidelberger P, Welch PD. Simulation run length control in the presence of an initial transient. Operations Research. 1983;31(6):1109–44.
- 75. Smith BJ. boa: An R Package for MCMC Output Convergence Assessment and Posterior Inference Journal of Statistical Software. J Stat Softw. 2007;21(11):1–37.
- 76. Wahlund S. Zusammensetzung von Populationen und Korrelationserscheinungen vom Standpunkt der Vererbungslehre aus betrachtet. Hereditas. 1928;11(1):65–106.
- 77. Waples RS. Testing for Hardy–Weinberg proportions: have we lost the plot? J Hered. 2015;106(1):1–19. pmid:25425676
- 78. Fetterer F, Savoie M, Helfrich S, Clemente-Colón P. masie_ice_r00_v01_2008106_4km In: Center NSaID, editor. Multisensor Analyzed Sea Ice Extent—Northern Hemisphere. Boulder, Colorado USA: National Ice Center (NIC) and NSIDC; 2010, updated daily.
- 79. Kalinowski ST. How well do evolutionary trees describe genetic relationships among populations? Heredity. 2009;102(5):506–13. pmid:19174839
- 80. Karl SA, Toonen RJ, Grant WS, Bowen BW. Common misconceptions in molecular ecology: echoes of the modern synthesis. Mol Ecol. 2012;21(17):4171–89. WOS:000308047100005. pmid:22574714
- 81. Weir BS, Cockerham CC. Estimating F-statistics for the analysis of population structure. Evolution. 1984;38(6):1358–70. WOS:A1984TY40400017.
- 82. Peacock E, Derocher AE, Lunn NJ, Obbard ME. Polar Bear Ecology and Management in Hudson Bay in the Face of Climate Change. In: Ferguson SH, Loseto LL, Mallory ML, editors. A Little Less Arctic: Top Predators in the World’s Largest Northern Inland Sea, Hudson Bay. New York: Springer; 2010. p. 93–115.
- 83. Bethke R, Taylor M, Amstrup S, Messier F. Population delineation of polar bears using satellite collar data. Ecol Appl. 1996;6(1):311–7. ISI:A1996TU56700038.
- 84. Lone K, Aars J, Ims R. Site fidelity of Svalbard polar bears revealed by mark-recapture positions. Polar Biol. 2013;36(1):27–39.
- 85. Cherry SG, Derocher AE, Thiemann GW, Lunn NJ. Migration phenology and seasonal fidelity of an Arctic marine predator in relation to sea ice dynamics. J Anim Ecol. 2013;82(4):912–21. pmid:23510081
- 86. Mauritzen M, Derocher AE, Wiig Ø. Space-use strategies of female polar bears in a dynamic sea ice habitat. Canadian Journal of Zoology. 2001;79(9):1704–13.
- 87. IUCN/SSC Polar Bear Specialist Group. 2014 Polar Bear Status Table 2015 [updated Jan. 24, 2015; cited 2015 May 11, 2015]. Available from: http://pbsg.npolar.no/en/status/status-table.html.
- 88. Thiemann GW, Derocher AE, Stirling I. Polar bear Ursus maritimus conservation in Canada: an ecological basis for identifying designatable units. Oryx. 2008;42(4):504–15. ISI:000260810100016.
- 89. Hamilton SG, Castro de la Guardia L, Derocher AE, Sahanatien V, Tremblay B, Huard D. Projected polar bear sea ice habitat in the Canadian Arctic Archipelago. PLoS ONE. 2014;9(11):e113746. pmid:25426720
- 90. Chesser RK, Baker RJ. Effective sizes and dynamics of uniparentally and diparentally inherited genes. Genetics. 1996;144(3):1225–35. pmid:8913763
- 91. Prugnolle F, de Meeus T. Inferring sex-biased dispersal from population genetic tools: a review. Heredity. 2002;88:161–5. WOS:000176215400001. pmid:11920116
- 92. Goudet J, Perrin N, Waser P. Tests for sex-biased dispersal using bi-parentally inherited genetic markers. Mol Ecol. 2002;11(6):1103–14. pmid:12030985
- 93. Zeyl E, Aars J, Ehrich D, Wiig Ø. Families in space: relatedness in the Barents Sea population of polar bears (Ursus maritimus). Mol Ecol. 2009;18(4):735–49. ISI:000262782400013. pmid:19175504
- 94. Amstrup SC, Durner GM, McDonald TL, Mulcahy DM, Garner GW. Comparing movement patterns of satellite-tagged male and female polar bears. Can J Zool. 2001;79(12):2147–58. WOS:000173702200003.
- 95. Laidre KL, Born EW, Gurarie E, Wiig Ø, Dietz R, Stern H. Females roam while males patrol: divergence in breeding season movements of pack-ice polar bears (Ursus maritimus). P R Soc B. 2013;280(1752):20122371. Artn 20122371 WOS:000312591600009.
- 96. Mauritzen M, Derocher AE, Pavlova O, Wiig Ø. Female polar bears, Ursus maritimus, on the Barents Sea drift ice: walking the treadmill. Anim Behav. 2003;66(1):107–13.
- 97. Rode KD, Robbins CT, Nelson L, Amstrup SC. Can polar bears use terrestrial foods to offset lost ice-based hunting opportunities? Front Ecol Environ. 2015;13(3):138–45.
- 98. Ramsay MA, Stirling I. On the mating system of polar bears. Can J Zool. 1986;64(10):2142–51.
- 99. Amstrup S, Gardner C. Polar bear maternity denning in the Beaufort Sea. J Wildl Manage. 1994;58(1):1–10.
- 100. Derocher AE, Lunn NJ, Stirling I. Polar bears in a warming climate. Integr Comp Biol. 2004;44(2):163–76. WOS:000222909700009. pmid:21680496
- 101. Beerli P. How to use Migrate or why are Markov chain Monte Carlo programs difficult to use? In: Bertorelle G, Bruford MW, Hauffe HC, Rizzoli A, Vernesi C, editors. Population Genetics for Animal Conservation. Cambridge, UK: Cambridge University Press; 2009. p. 42–79.
- 102. Peacock E, Sonsthagen SA, Obbard ME, Boltunov A, Regehr EV, Ovsyanikov N, et al. Correction: Implications of the circumpolar genetic structure of polar bears for their conservation in a rapidly warming Arctic. PLoS ONE. 2015;10(8):e0136126. pmid:26273835
- 103. Waples RS, Gaggiotti O. What is a population? An empirical evaluation of some genetic methods for identifying the number of gene pools and their degree of connectivity. Mol Ecol. 2006;15(6):1419–39. ISI:000236798200001. pmid:16629801
- 104. Kwok R, Rothrock DA. Decline in Arctic sea ice thickness from submarine and ICESat records: 1958–2008. Geophys Res Lett. 2009;36(15):L15501.
- 105. Polyak L, Alley RB, Andrews JT, Brigham-Grette J, Cronin TM, Darby DA, et al. History of sea ice in the Arctic. Quaternary Science Reviews. 2010;29(15–16):1757–78.
- 106. Kinnard C, Zdanowicz CM, Fisher DA, Isaksson E, de Vernal A, Thompson LG. Reconstructed changes in Arctic sea ice over the past 1,450 years. Nature. 2011;479(7374):509–12. pmid:22113692
- 107. Bohling JH, Adams JR, Waits LP. Evaluating the ability of Bayesian clustering methods to detect hybridization and introgression using an empirical red wolf data set. Mol Ecol. 2013;22(1):74–86. pmid:23163531
- 108. Putman AI, Carbone I. Challenges in analysis and interpretation of microsatellite data for population genetic studies. Ecology and Evolution. 2014;4(22):4399–428. pmid:25540699
- 109. Pop M, Salzberg SL. Use and mis-use of supplementary material in science publications. BMC Bioinformatics. 2015;16:237. pmid:26525146; PubMed Central PMCID: PMCPMC4630891.