Ten microsatellite loci were characterized for 34 locations from roundtail chub (Gila robusta complex) to better resolve patterns of genetic variation among local populations in the lower Colorado River basin. This group has had a complex taxonomic history and previous molecular analyses failed to identify species diagnostic molecular markers. Our results supported previous molecular studies based on allozymes and DNA sequences, which found that most genetic variance was explained by differences among local populations. Samples from most localities were so divergent species-level diagnostic markers were not found. Some geographic samples were discordant with current taxonomy due to admixture or misidentification; therefore, additional morphological studies are necessary. Differences in spatial genetic structure were consistent with differences in connectivity of stream habitats, with the typically mainstem species, G. robusta, exhibiting greater genetic connectedness within the Gila River drainage. No species exhibited strong isolation by distance over the entire stream network, but the two species typically found in headwaters, G. nigra and G. intermedia, exhibited greater than expected genetic similarity between geographically proximate populations, and usually clustered with individuals from the same geographic location and/or sub-basin. These results highlight the significance of microevolutionary processes and importance of maintaining local populations to maximize evolutionary potential for this complex. Augmentation stocking as a conservation management strategy should only occur under extreme circumstances, and potential source populations should be geographically proximate stocks of the same species, especially for the headwater forms.
Citation: Dowling TE, Anderson CD, Marsh PC, Rosenberg MS (2015) Population Structure in the Roundtail Chub (Gila robusta Complex) of the Gila River Basin as Determined by Microsatellites: Evolutionary and Conservation Implications. PLoS ONE 10(10): e0139832. https://doi.org/10.1371/journal.pone.0139832
Editor: William J. Etges, University of Arkansas, UNITED STATES
Received: June 29, 2015; Accepted: September 17, 2015; Published: October 16, 2015
Copyright: © 2015 Dowling et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This study was supported by the Arizona Game and Fish Department, AGR 4/21/04, https://azgfdportal.az.gov/; US Bureau of Reclamation, 02FG320070, http://www.usbr.gov/; U.S. Forest Service, 43-8399-2-1069, http://www.fs.fed.us/. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
In deserts of western North America, long periods of aridity have been punctuated by occasional wet interludes , leading to fluctuating levels of habitat connectivity within terrestrial and aquatic environments over the last 2–3 million years. Glacial cycles (at approximately 100 kyr intervals) are correlated at middle latitudes with pluvial cycles, causing relatively regular patterns of isolation and connectedness . For some taxa, fluctuating levels of isolation combined with ecological opportunity in arid environments are thought to have resulted in elevated rates of lineage diversification [3–6]. However, in North American freshwater fishes, species richness is lower west of the continental divide, as only about 150 of 750 species reside there . This pattern has been influenced, in part, by tectonic activity and severity of the environment, leading to elevated extinction rates . More recently, human actions have exacerbated this situation, and freshwater and diadromous fishes of North America are declining at an alarming rate . Approximately 39% of described taxa are considered imperiled, representing a 92% increase since 1989. The situation is especially dire as 89% of imperiled taxa listed in 1989 exhibit the same status or worse, indicating that little has been achieved in the past quarter century to improve the status of most endangered fishes.
An interesting case involves the roundtail chub: a complex of closely related species of the cyprinid genus Gila (G. robusta, G. intermedia, and G. nigra [the last formerly “grahami”]) that are endemic to the Gila River basin in the southwestern United States. Like most other fishes of the region, the geographic distribution of populations has been reduced dramatically by human impacts; numbers are dwindling throughout their ranges, and remaining populations face myriad threats to their persistence [10, 11]. This has resulted in listing G. intermedia as endangered , petitions to similarly list G. nigra and G. robusta [13, 14], and inclusion of one or all species in regional conservation plans [15, 16].
The three species of the G. robusta complex provided an excellent opportunity for examining the distribution of genetic variation in threatened and endangered species. These species have had a complex and confused taxonomic history, and several detailed studies of variation of meristic and morphological traits have been completed [17–19]. In addition, DeMarais  examined genetic variation at 25 presumptive allozyme loci within and among populations of this complex. Analysis of the distribution of genetic variation identified significant differences among locations (FST = 0.410), however, this analysis did not identify significant structure associated with hydrogeography or species. Given observed distributional patterns and levels of genetic variation, DeMarais  hypothesized that the form “grahami” arose through past introgression between G. intermedia and G. robusta.
Minckley and DeMarais  summarized available distributional, morphological, and molecular data and examined the taxonomic status of all three species of the complex. Because each morphologically discrete form was consistently collected at the same locations and was always allopatric, they concluded that G. intermedia, G. robusta, and “grahami” represented three distinct taxonomic species. They also noted that some type specimens of “grahami” were actually G. robusta, invalidating this nomen; the earliest available replacement name was Gila nigra. Minckley and DeMarais  further discuss origins of G. nigra, hypothesizing that it may have multiple, independent origins through discrete hybridization events between G. intermedia and G. robusta.
Schwemm  found similar results to those of DeMarais  when he characterized sequence variation of mtDNA and two nuclear loci (introns of S7 and TPI). He found limited divergence among alleles/haplotypes; however, many locations exhibited unique variants, and were frequently monomorphic for these private alleles/haplotypes. Hierarchical analysis failed to associate patterns of sequence variation with species or hydrogeographic connection, and patterns of variation were best explained by fragmentation and independent evolution of local subpopulations.
The present study extends the existing population genetic literature on the G. robusta complex by providing an analysis of microsatellite DNA loci for the same samples used by Schwemm . Our results demonstrate differences in spatial genetic structure between the mainstem form (G. robusta) and the headwater forms (G. intermedia and G. nigra) that have important implications for managing local populations associated with different types of desert stream habitats. The patterns we observed are consistent with patterns of gene flow reported for many desert fishes, where genetic connectivity is a function of hydrogeographic connectivity, and often varies between populations occupying different desert stream environments . Our findings have important implications for conservation management the G. robusta complex and other aquatic organisms that inhabit desert stream environments.
Materials and Methods
Permission to undertake field work and collect specimens was obtained under permits from the states of Arizona and New Mexico, and the U. S. Fish and Wildlife Service (FWS Federal Fish and Wildlife Service Native Endangered Species Recovery Permit Number TE0-39716-1). Specimens were obtained under Arizona State University Institutional Animal Care and Use Committee (IACUC) approval 05-768R.
The three species studied here were once common inhabitants of lower Colorado River basin streams and rivers. Gila robusta historically was found in moderate size and larger mainstem waters including Bill Williams, Gila, Salt, San Pedro and Verde rivers in the lower Colorado River basin and in Colorado, Gunnison, San Juan, and Green rivers in the upper basin where it occupied the largest, deepest pools and attained more than 45 cm total length. Smaller individuals inhabited smaller habitats, as with many western fishes , where they prefer shaded, deeper pools with cover such as undercut banks, boulders, or debris. Best described as a creek fish, Gila intermedia is a Gila River basin endemic that occupies well-developed pools with abundant cover in small to middle-size headwater creeks ; it is most common in marshy areas and ciénegas. Gila intermedia and G. robusta have never been taken syntopically despite spatial proximity of some populations. Gila nigra, also a Gila River basin endemic, inhabits smaller, first-order to middle reaches of medium sized streams, where it is strongly associated with cover such as undercut banks, boulders, and debris. It does not occur in ciénegas and does not co-occur with either of its congeners.
Sampling and DNA extraction
Sampling of Gila and Bill Williams river drainages encompassed 34 sites in seven sub-basins in Arizona and New Mexico (Bill Williams, Agua Fria, Verde, Salt, Santa Cruz, San Pedro, and Gila River mainstem–Fig 1), representing most known extant and some extirpated populations of these taxa. Minckley and DeMarais  summarized information meristic, morphometric and pigmentation characters of samples from locations examined here and identified them to species; therefore, we follow their taxonomic designations.
Approximate locations are identified by symbols with shape and color indicating species and drainage unit, respectively (see legends for detailed information). Locality data are provided in Table 1. Reprinted from the Fish Division drainage map, University of Michigan Museum of Zoology, under a CC BY license, with permission from University of Michigan Museum of Zoology, original copyright 1972.
Efforts were made to sample up to 25–30 individuals/locality; however, the rarity of these species sometimes made it difficult to achieve this goal. Fourteen to 30 individuals were collected from each locality (Table 1). Some of these locations were represented by frozen tissues from whole specimens collected in the 1980s for an allozyme study by DeMarais . Specimens obtained for DNA studies (Schwemm  and here) were captured using standard fisheries methods (i.e., electrofishing, seining, or trapping). Tissues were obtained by removing a piece of right pectoral fin (< 3 mm square) from larger individuals with ethanol sanitized surgical scissors; after which fish were immediately released unharmed. This process was fast (a few seconds), required minimal handling, and caused no harm to the fish so no anesthesia was applied. For larvae/young-of-year, specimens were euthanized in 500 mg/L MS-222. All material was transferred immediately after acquisition to 95% ethanol for storage. Genomic DNA was extracted from tissues by standard proteinase K/phenol/chloroform protocol as modified by Tibbets and Dowling .
All sub-basins are within the Gila River basin except for the Bill Williams River, which is a direct tributary to the lower Colorado River. Taxonomic identity of sampled individuals follows Minckley and DeMarais . Coordinates are provided in UTM, elevation is in meters, and the column “N” provides number of individuals analyzed, with the superscript identifying the original source of material.
Primers for ten microsatellite loci used here were derived from several sources. Six loci (36, 222, 223, 225, 227, 300) were developed by Keeler-Foster et al.  from a G. elegans library. One locus (G294) was developed by Meredith and May  from a G. bicolor obesa library. The remaining three loci, C2 (repeat unit, (GACA)4GTCA(GACA)3(GATA)3; primers 5'-GACAAAGCGGTAGACAAAACCA-3' and 5'-AATCTGAACTGGCTAACCTT-3'), D17 (repeat unit, (GT)13; primers 5'-TGGGCAGGAAAAGAGAAACT-3' and 5'-ATAAAGAGACGGTAAAGAACTC-3'), and D42 (repeat unit, (TCTA)5, primers 5'-TTGCCTGTATAGGGTTGA-3' and 5'-GTTGCTCATTGTTAGTTTGT-3'), were obtained from a library generated from G. robusta using enrichment methods provided by Glenn and Schable . Amplifications used GoTaq (Promega) and the buffer supplied, dNTPs (200 mM final concentration of each dNTP), and IRD labeled primers (0.5 μM final concentration). Reactions were started with a long denature step (95°C, 5 min) followed by a series of touchdown steps where annealing temperature was decreased 1 C each cycle (94°C, 30 sec; 65–50°C, 30 sec; 72°C, 30 sec) to a final temp of 50°C. These same steps were repeated for additional cycles until 25 or 30 total cycles were completed, and the run finished with a long extension step (72°C, 7 min). Products were separated by electrophoresis through 6.5% denaturing gels (KBPlus, LI-COR Biotechnology) for 90–105 mins at 40 W with a minimum of four ladder lanes (50 bp—350 bp size standard, LI-COR Biotechnology) included on each gel. Fragments were visualized on a LI-COR 4300 DNA Analysis system and analyzed using SAGA GT (version 3.3, LI-COR Biotechnology).
Deviations from Hardy-Weinberg equilibrium (FIS) and multilocus equilibrium were examined using FSTAT version 126.96.36.199 . Significance level (0.05) for single and multilocus tests was adjusted using the B-Y correction ; adjusted critical values of 0.00797 and 0.00653, respectively). The unbiased estimate of gene diversity  and allelic richness (AR—corrected for sample size by rarefaction) were calculated using FSTAT and HP-Rare , respectively. Basic statistical analyses (e.g., ANOVA, Kruskal-Wallis) were performed using PASW Statistics (formerly SPSS), release 18.
To examine distribution of variation among sample populations we also used FSTAT to generate Weir and Cockerham  F-statistics. Significance of values for f (≈ FIS), Θ (≈ FST), and F (≈ FIT) were obtained by jackknifing (over individuals and all loci) and bootstrapping (loci only). Comparison of the levels of Θ among the three species was obtained by bootstrapping across samples (2500 permutations) using the comparison among groups of samples function in FSTAT. FST was further partitioned by species or drainage using AMOVA with Arlequin version 3.11 . Sample populations were clustered by neighbor-joining using POPTREE2, using corrected FST as the estimate of genetic distance with confidence of nodes assessed by bootstrapping (1000 replicates) .
Stream distance were estimated from stream data from the Digital Chart of the World  using the network extension in the GIS software ARC/INFO. To test for isolation by (stream) distance, PASSaGE 2  was used to perform Mantel tests and build Mantel correlograms [39–41]. The standardized Mantel statistic (rM) was used to measure the correlation between pairwise FST and stream distance over all sample populations. Mantel correlograms were constructed with five distance classes with an approximately equal number of pairs in each class. For each distance class, sites belonging to the same distance class received a value of 1 and the other pairs received value of 0 and design matrices were compared to a resemblance matrix based on pairwise FST. In this context, the standardized Mantel coefficient (rM) was used as a measure of spatial autocorrelation for distance data, and can be used in the same way in a correlogram. For both the Mantel test and Mantel correlograms, the statistical significance of the standardized Mantel statistic (rM) was tested by randomly permuting the rows and columns of one matrix in tandem (= 9999 permutations) and then counting the number of cases that yielded a Mantel coefficient greater than or equal to the observed value.
We also used Bayesian Assignment Tests to determine whether individuals (or groups of individuals) could be sorted into discrete gene pools. Assignment of individuals to gene pools was generated using STRUCTURE version 2.2 [42, 43] and assignment of groups of individuals to demes was examined using BAPS 5.1 . For BAPS 5.1 analyses, separate runs were completed for each species, treating each sample location as an "informed prior." We then combined all sample locations (over all species) and repeated the analysis. For all runs, we entered a vector of replicate K values (10 replicates per K, from K = 2 to K = n, where n is the number of sample populations); BAPS 5.1 reports the set of estimates with the “best” partition and probability associated with different a priori assumptions.
For STRUCTURE, the default assumption (admixture among samples, correlated allele frequencies across loci) was employed. For each a priori assumed number of populations (K), 10 independent runs of 110,000 replicates each (burn-in = 10,000) were performed. Optimal number of groups (K) was determined using the method of Evanno et al.  as implemented by the web-based program STRUCTURE HARVESTER .The distribution of Q values across runs for each K was summarized using CLUMPP  and the statistic H’ calculated to provide assessment of similarity across replicates; results were visualized using DISTRUCT .
Variation within populations
Genetic variation in Gila was characterized using 744 individuals from 34 locations and 10 microsatellite loci. Genotypes for each individual are provided in S1 Table. Most samples had complete data, with an average amplification failure rate of 5.0 individuals/locus or 0.5% of all samples. Locus 300 had the highest failure rate where 15 individuals (or 2% of all individuals) failed to amplify. Failed amplifications were scattered across populations, reducing concerns over potential impact of null alleles.
Average allelic richness per locus was variable across loci (S2 Table), ranging from 1.3 to 7.4 (for loci C2 and 227, respectively). Average allelic richness per sample ranged from 1.7 (SAB) to 8.9 (NMFKS), with the majority of lower values reported for G. intermedia and G. nigra (Table 2). Populations of G. robusta exhibited higher levels of variation (AR = 6.0) than those of G. intermedia and G. nigra (AR of 4.7 and 5.0, respectively), with these values significantly different among species (Kruskal-Wallis, P = 0.006).
“Species” follows designations in Minckley and DeMarais , “N” is sample size, “AR” is allelic richness averaged across loci, “#M” is the number of monomorphic loci, and “HWE” provides the number of significant deficiencies/excesses of heterozygotes per locus for each sample.
Average gene diversity per locus ranged from 0.073 to 0.752 (for loci C2 and 300, respectively) while average gene diversity per sample ranged from 0.221 to 0.754 (FOS and UEG, respectively) (S3 Table). Fit to Hardy-Weinberg expectations (as indicated by variation in average FIS values across loci and populations) did not vary significantly among species as indicated by variation in average FIS values (across loci and populations) for G. robusta, G. nigra, and G. intermedia (FIS = -0.008, -0.025, and 0.055, respectively; Kruskal-Wallis, P = 0.077). Of 340 individual tests conducted (10 loci, 34 locations), 13 showed deviations from Hardy-Weinberg equilibrium after B-Y correction, with more significant tests identifying heterozygote deficiency than excess (9 and 4, respectively, Table 2). Given the rarity of deviations (< 4% of comparisons) and their scatter across loci (seven loci exhibit deviations), impact from null alleles would be minimal, so all samples and loci were included in remaining analyses.
Remaining deviant samples exhibited significant heterozygote deficiencies. Most locations exhibited only a slight deficiency of hetezygotes, while deviations for G. intermedia from SAB and WAK were larger (overall FIS = 0.268, P = 0.0096 and overall FIS = 0.140, P = 0.0015, respectively). Samples from these locations yielded smaller numbers of alleles (AR = 1.7 and 3.5, respectively). At SAB, five loci were monomorphic, while four of five polymorphic loci exhibited a deficiency of heterozygotes that was not statistically significant.
All pairs of polymorphic loci were tested for genotypic linkage disequilibrium within each population, with 21 of 1184 pairwise tests (1.8%) significant after B-Y correction, with nearly half of the significant tests coming from two sample populations: ROC and TURNM (4 and 6 significant pairs, respectively). TURNM was unusual in that eight of the nine polymorphic loci exhibited an excess of heterozygotes, with two of those values significant (overall FIS = -0.274, P < 0.0001), potentially indicating close relatedness among these individuals. Locus pairs exhibiting significant disequilibrium were not consistent from sample population to sample population, indicating that loci are assorting independently.
Variation among populations
Partitioning of genetic variation into within and among population components identified significant population structure. Jackknife estimates of total genetic variation (F ≈ FIT) for each locus ranged from 0.211–0.407 (loci 222 and 36, respectively), with a jackknife average F across loci of 0.293 (95% confidence interval 0.249–0.342). The within population component (f ≈ FIS) was small and not significantly different from 0 (range -0.083 [locus C2] to 0.075 [locus 36]), consistent with Hardy-Weinberg equilibrium results discussed above (jackknife average f = 0.02, 95% confidence interval -0.01 to 0.048). Therefore, the majority of variation was partitioned among populations: Θ (≈ FST) ranged from 0.227 (locus 300) to 0.384 (locus C2) with a significant jackknife average of 0.278 (95% confidence interval 0.249–0.314).
To further examine the role of historical factors and geography, among population variation (FST) was partitioned by either taxonomy (three species) or river drainage (seven drainages, Fig 1) to see how these factors explain the distribution of genetic variation (calculated as weighted average across loci). When taxonomy was used to define partitions, the majority of the variation was found among local population within species (FSC = 0.271) instead of among species (FCT = 0.016). A similar result was obtained when samples were partitioned by drainage, with considerably more variation attributable to samples within drainages (FSC = 0.245) than among drainages (FCT = 0.052).
When all sample populations from the Gila robusta complex were pooled, the spatial correlation between pairwise FST and stream distance was weak and not statistically significant (rM = 0.165, P = 0.109; Fig 2A). However, when sample pairs were binned into distance classes (Fig 3A), the Mantel correlogram indicated statistically significant standardized Mantel coefficient in the first distance class (rM = 0.174; P = 0.007).
(A) all species, (B) Gila robusta, (C) Gila intermedia, (D) Gila nigra.
(A) all species, (B) G. robusta (C) G. intermedia, and (D) G. nigra. Distance classes with statistically significant (α = 0.05) standardized Mantel coefficients are indicated by a filled circle.
Analysis of population structure independently for each species provides a different picture. Estimates of FST for G. robusta, G. nigra, and G. intermedia were comparable, and not significantly different among species (FST = 0.191, 0.338, and 0.287, respectively; P = 0.263). However, when samples from the Bill Williams River drainage (BOL, TRT) were excluded, the average for G. robusta dropped dramatically (FST = 0.071) and there were significant differences among the three species (P = 0.009). This is reflected in the neighbor joining network of pairwise FST values (Fig 4), where many samples of G. nigra and G. intermedia exhibited long terminal branches while samples of G. robusta (except for those from the Bill Williams drainage) were shorter. Most nodes were not supported by bootstrap analysis with the exception of some pairs of samples in relatively close proximity.
Location acronyms are provided in Table 1. Red, blue, and black labels and symbols identify samples from G. intermedia, G. nigra, and G. robusta, respectively. Numbers on branches reflect the proportion of 1000 bootstrap replicates in which the defined node was found.
For G. robusta, pairwise FST ranged from 0.03–0.42. When pairwise FST was plotted against stream distance, there were three main clusters of points and one outlier in the scatter diagram (Fig 2B). The main cluster of points corresponded to comparisons between sample locations within the Gila River drainage. The remaining two clusters of points (766 to 1094 km) corresponded to comparisons between Bill Williams River samples (i.e., BOL and TRT) and Gila River samples. The outlier point was the pairwise estimate for BOL and TRT. Over all sample locations of G. robusta, we found a moderate correlation (rM = 0.61) between pairwise FST and stream distance that was statistically significant (P = 0.027). When BOL and TRT (from the Bill Williams River drainage) were excluded, the correlation became negative and was not statistically significant (rM = -0.24, P = 0.237). Differences in connectivity between major drainages are also supported by the Mantel correlogram (Fig 3b), where the standardized Mantel coefficient decreased precipitously for comparisons between sample populations in the Gila and Bill Williams drainages.
For G. intermedia, pairwise FST ranged from 0.034–0.638. Over all sample locations, the correlation between pairwise FST and stream distance (rM = 0.20) was weak and not statistically significant. The Mantel correlogram (Fig 3C) displayed a decreasing trend in the value of rM with increasing stream distance. For G. nigra, pairwise FST ranged from 0.066–0.702. Although sparse, scatterplot shape (Fig 2D) resembled the pattern for G. intermedia (Fig 2C). The correlation between pairwise FST and stream distance was weak (rM = 0.20) over all sample populations and not statistically significant. In the Mantel correlogram, the standardized Mantel statistic was significant only for the first distance class (Fig 3D; rM = 0.680, P = 0.001).
BAPS and STRUCTURE were used to estimate the number of groups encompassed by the 34 samples. BAPS determined that K = 28 with each identified group represented by single samples except for two, one containing samples EFE and UEG from G. intermedia and the other comprised of most G. robusta samples (ARA, BLK, LEG, WCL, and VDP) and NMFKS from G. nigra. STRUCTURE was used to characterize assignment probability for all K from 2–34. There was inconsistency across replicates for more divergent samples (e.g., BOL and TRT), as indicated by their consistent assignment to different groups for each value of K and reduced h’ values for these replicates (Fig 5). The method of Evanno et al.  indicated K = 20 (ΔK = 16.0), and ln likelihood values also reached a plateau at K = 20, supporting that conclusion .
“K” represents the number of informed priors for that specific group of replicates and “H΄” is the statistic that measures consistency across replicate runs.
Evaluation of assignment probability plots from STRUCTURE is difficult due to variation among replicates and the large number of distinct samples. Analyses from K = 20 and K = 28 (as predicted by STRUCTURE and BAPS, respectively, Fig 5) yielded similar results, with an increase in number of samples that were distinct for the latter K. Even at higher K certain sets of geographically proximate locations are consistently grouped together: G. robusta from the Verde River (WCL and VDP); G. nigra from Tonto Creek (ROC and SPRSA); and three separate groups of G. intermedia samples (EFE-UEG from Eagle Creek, ODN-TURAZ from the San Pedro River, and CC-SHY from the Santa Cruz River).
While it is difficult to obtain much information from examination of assignment plots for each K (Fig 5), several samples are notable. Individuals from BLK (G. robusta, Salt River), BON and SPRVE (G. intermedia, Gila and Verde rivers, respectively), and NMFKS and TON (G. nigra, Tonto Creek drainage) are routinely difficult to assign to specific groups, especially at low values of K (≤ 10). In addition, individuals from three of these locations (BON, SPRVE, TON) exhibit signs of admixture at lower levels of K (≤ 10) as there is considerable probability of assignment to a group that includes most samples of G. robusta. Similar perspective of two sites assigned to G. nigra (NMFKS, TURNM) indicates that individuals from these locations may actually belong to G. robusta.
Results of the present study were consistent with previous molecular genetic studies of the G. robusta complex [20, 22]. Most of the genetic variation was attributable to differences among local populations within species, with minimal differentiation due to the presence of multiple drainages or species in the analysis. However, our results provided new insight into spatial genetic structure among local populations associated with different stream habitats, with evidence of widespread gene flow among local populations of the mainstem form within the Gila River basin, as well as evidence of more recent, maybe even ongoing, gene flow between proximate populations than distant populations within each headwater form.
Characterization of microsatellite variation for 10 loci failed to group samples by recognized species, a result consistent with the allozyme study of DeMarais  and Schwemm’s  characterization of mtDNA and introns from two single copy nuclear genes. Levels of divergence among populations in this complex were high, and clustering of pairwise FSTs yields a topology with short internodal and long terminal branches, with limited support for grouping of pairs of most populations, let alone those in the same taxonomic group (Fig 4). High divergence also affects Bayesian group assignment, with assignment of sites to a particular group inconsistent across replicates, yielding a distinctive stacked bar pattern for divergent samples, especially at lower values of K (Fig 5). These studies illustrate the importance of local isolation in the evolution of this complex, producing a large number of diagnosably distinct, local populations (K = 20 and 28 for STRUCTURE and BAPS, respectively). It is important to emphasize here that these patterns are probably not an artifact of Bayesian methods  as indicated by high FST values among populations; nor are they an artifact of microsatellite markers, as Schwemm  also noted that many local populations are distinct enough to be diagnosable with unique mtDNA haplotypes and/or nuclear alleles.
Despite the high level of divergence among populations for mtDNA and nuclear sequence data and microsatellites, diagnostic markers were not identified for species. There are a few potential explanations for our inability to identify diagnostic molecular markers. When genetic divergence among subpopulations is large, it is possible that differences among individual subpopulations can obscure differences at deeper hierarchical levels (e.g., species, drainages), reducing the effectiveness of such markers for identifying these higher categories. For example, Hedrick  noted that hypervariable markers like microsatellites are less effective at estimating FST due to high levels of variation within populations. Extending this logic further, high levels of divergence among local subpopulations would further reduce the amount of variation available to discriminate among higher order groups (e.g., species). Resolving hierarchy may require a substantially larger data set, with additional markers capable of resolving deeper evolutionary events.
The observed pattern could also reflect how different evolutionary forces have shaped this complex. Genetic structure within the Gila robusta complex may represent insularization of a historically panmictic population due to natural and artificial habitat fragmentation, with observed patterns of morphological variation due to convergent selection for a common habitat-based morphotype (e.g., ciénegas, headwater reaches), analogous to convergent selection observed in sticklebacks . Assessment of this hypothesis would require identification of specific genes that are the unit of selection for different habitats as well as additional morphological analyses.
Past introgressive hybridization could also be partly responsible for the observed pattern. These species have never been reported to be sympatric naturally despite occurring in close proximity (e.g., Eagle Creek) . DeMarais  and Minckley and DeMarais  hypothesized that Gila nigra was a taxon of hybrid origin, resulting from introgression between G. intermedia and G. robusta. During dry periods, G. intermedia and G. robusta are expected to be geographically isolated in headwater and mainstem reaches, respectively; however, during wetter times these species could co-occur and interbreed, producing local hybrid swarms. As streams again became desiccated, these hybrid populations would become isolated in headwater reaches, allowing for divergence through local adaptation. It is such populations that Minckley and DeMarais  hypothesized might be recognized as Gila nigra, which is morphologically intermediate to G. intermedia and G. robusta. Because the present analysis of microsatellite DNA loci did not identify diagnostic markers for each species, it is impossible to specifically test the introgression hypothesis here. Regardless of the reason, the lack of diagnostic molecular characters to date does not inform the status of G. intermedia, G. nigra, and G. robusta relative to their recognition as distinct species. Instead these results highlight the role that local evolution has played in shaping patterns of variation in these taxa and the importance of accounting for this variation when managing the complex.
The high level of genetic subdivision detected in the present study indicates that forces acting on location populations (e.g., mutation, drift, selection) are driving patterns of genetic variation within the Gila robusta complex. Similar patterns were identified with allozymes  and nuclear and mtDNA sequences , indicating that this result does not solely reflect the rapid rate of microsatellite evolution. These patterns more likely reflect varying levels of hydrogeographic connectivity among stream habitats over the last 2–3 million years , with samples of G. robusta (the mainstem species) from the Gila River basin exhibiting increased variability and lower levels of divergence and hierarchical structure than G. nigra and G. intermedia, which are typically found in smaller, more isolated streams.
Although the results of the present study support substantial divergence among local populations [20, 22], spatial genetic and clustering analyses performed here indicate that relative impact of evolutionary processes on genetic variation depends on distance between localities, as well as potential barriers to dispersal. For example, F-statistic analyses of G. robusta identified considerable variation in allele frequencies among samples (FST = 0.191); however, removal of samples from the Bill Williams River reduced structure considerably (FST = 0.071). This inference was also supported by scatterplots of pairwise FST versus stream distance (Fig 2B) and a Mantel correlogram (Fig 3B), demonstrating high rates of gene flow relative to drift within the Gila River basin and differentiation between populations from Gila and Bill Williams basins (as well as between sample locations within the Bill Williams basin). While long stream distances separate populations from the Gila and Bill Williams basins, observed differentiation may be best explained by inhospitable habitat in the lowermost Colorado and Gila rivers, which may be acting as an isolating mechanism.
Gila intermedia is found in headwater reaches throughout the Gila River drainage and exhibits lower levels of variation within and more differentiation among populations than G. robusta, as expected. Scatterplots of pairwise FST versus stream distance (Fig 2C) and the Mantel correlogram (Fig 3C) indicate isolation by distance up to some threshold level, beyond which effects of drift predominate . Presence of significant divergence among distant locations likely reflects historical processes attributable to the strongly fluctuating environment . Frequent dry periods would have led to divergence among locations due to drift and selection; gene flow would have been possible during pluvial times. Divergence may have been exacerbated by recent anthropogenic modifications to the stream network that formed barriers to dispersal . While the scatter plot, Mantel correlogram, and clustering analyses indicate that gene flow is more effective than drift at shorter distances (with the effects of drift predominating at longer distances), ongoing hydrogeographic isolation is likely to intensify the strength of drift and erode any signal of localized isolation by distance.
Gila nigra also occupies headwater reaches and was also expected to show substantial differentiation among populations, and results based on various types of genetic markers corroborate this inference [20, 22]. While gene flow dynamics generally mirrored those of G. intermedia, drift induced divergence at long distances was less extreme for G. nigra, consistent with the observation that genetic variability was uniformly lower for G. intermedia relative to G. nigra. There are, however, caveats associated with this interpretation. Sampling of G. nigra was limited, with half the samples (MAR, SPRSA, ROC, TON) coming from the same relatively small tributary network of the Salt River (Fig 1). Also, the outcome of hierarchical analysis of assignment probabilities indicated that some samples may not be discrete; the samples NMFKS and TURNM were especially problematic, as they may actually be G. robusta or hybrids (Fig 5).
In general, the results of the spatial genetic analyses indicate that gene flow/drift dynamics depend on stream distance and differ between the mainstem form and the headwater forms. These results support key differences in microevolutionary processes among ecological variants within the G. robusta complex and should be informative for conservation genetic management of local populations irrespective of species designations.
Comparison to other fishes from western North America
Results from our study of the Gila robusta complex indicate considerable evolution at the local population level but also are consistent with the “Stream Hierarchy” model of gene flow , which predicts varying levels of genetic connectivity (and hierarchical structure) within a stream network depending on degree of hydrogeographic connectivity among local populations. Many studies of fishes from desert regions of western North America yield valuable perspective on the role of geographic connectedness and life history on distribution of genetic variation in arid environments. Tibbets and Dowling  contrasted levels of divergence among three species of stream-dwelling desert cyprinids (Agosia chrysogaster, Meda fulgida, and Tiaroga cobitis) from the Gila River basin, noting that patterns of genetic variation reflected expectations derived from consideration of life history and predicted levels of movement among locations. Studies of genetic variation in other cyprinids yielded variable results. In their study of mtDNA variation in Richardsonius, Houston et al.  found most variation distributed among, but not within, major regions, indicating high levels of gene exchange within but not among regions. This contrasts with studies of other minnows (e.g., Rhinichthys osculus–[57, 58] and Lepidomeda–), where there was considerable divergence among localities within drainages as well as among drainages. Johnson  also identified considerable divergence within and among drainage groups in the cyprinid Gila atraria but also noted additional divergence associated with evolved life history differences within this species.
Diversity of pattern is not restricted to cyprinids. Whiteley et al.  quantified allozyme and microsatellite variation within and among populations of mountain whitefish (Prosopium williamsoni) where variation was hierarchically arrayed into five distinct assemblages corresponding to major drainage basins, but with no differentiation within major drainage basins. This contrasts to other salmonids from the same region, which exhibit more divergence among locations than within drainages . Hopken et al.  also examined the importance of geographic structure on patterns of genetic variation in bluehead sucker (Catostomus discobolus), an endemic to the Colorado River basin. They identified three evolutionarily significant units and seven management units within this species, with each group defined by a geomorphological barrier and/or isolation due to aridity. Together, these studies show that levels of genetic connectivity within drainages can vary among taxa based on hydrogeographical patterns and the life history of a species.
Molecular and morphological variation provides critical information for management of this complex. In such situations, it is critical to understand evolutionary processes that generated the underlying genetic diversity, allowing for preservation of the evolutionary legacy and adaptive potential of the complex. To maximize preservation of evolutionary potential, we advocate an approach that preserves available genetic diversity as identified by morphological and molecular analyses. Conservation units should be defined in a hierarchical manner, with genetically distinct units identified within each morphologically recognized species and each subbasin. Importance of local adaptation, drift, and gene flow makes it advisable to consider hydrogeography as well as divergence when developing conservation plans. Note, however, that we found some discrepancies between assignments based on microsatellite data and putative taxonomic status based on morphological traits as defined by Minckley and DeMarais . Because morphological identifications are based upon museum records, such conflicts could represent change in species composition. Given the significance of morphological as well as genetic variation, it is critical that remaining populations of these taxa are characterized to allow for fully informed management of this group.
In addition to maintaining discreteness associated with geographic isolation and evolutionary independence, it is possible that G. nigra may result from admixture of G. robusta and G. intermedia. Connectedness among populations is difficult to envision in today’s environment that includes both physical and biological barriers to exchange; however, there is no obvious resolution of those issues. Instead, we must overcome the general need of placing specific populations into categories and acknowledge that conservation should focus on preserving processes that generate observed patterns as well as the patterns themselves, thus requiring preservation of the entire complex and not just individual species.
Among members of the Gila robusta complex only G. intermedia is listed under the Endangered Species Act, and it thus is the only one to receive protection. There currently is no accommodation for integrated conservation of the complex and little likelihood this will change. Given this restriction, we advocate managing each species independently by sub-drainage, with efforts to avoid mixing stocks from different sub-basins to avert negative consequences associated with outbreeding depression. This requires genetic characterization to match donor and recipient populations prior to translocation. Because of high levels of local differentiation, augmentation should only occur under extreme circumstances (i.e., population collapse, physical evidence of inbreeding depression), with special care to preserve local stocks. Efforts to establish new populations should utilize the nearest geographic population as a source, while avoiding transfer across different subdrainages; this is especially important for headwater forms.
S1 Table. Genotypes for each locus and individual examined in this study of the Gila robusta complex, Arizona—New Mexico.
S2 Table. Allelic richness (AR) for each locus and sample, including averages, for each sample of the Gila robusta complex, Arizona—New Mexico.
M. Haberstich, A. Kelsen, B. Kesner, J. Lee, D. Propst, M. Schwemm, D. Thornbrugh, and P. Unmack assisted with sampling and/or lab work. P. Unmack provided stream distances. R. Clarkson, H. Gante, and M. Schwemm provided helpful discussion and comments. Work was under appropriate state and federal permits and animal care protocols (IACUC protocol 05-768R).
Conceived and designed the experiments: TED PCM. Performed the experiments: TED. Analyzed the data: TED CDA MSR. Contributed reagents/materials/analysis tools: TED CDA PCM MSR. Wrote the paper: TED CDA PCM MSR.
- 1. Axelrod DI. Age and origin of Sonoran desert vegetation. Occ Pap CA Acad Sci 1979; 132:1–74.
- 2. Morrison RB. Quaternary stratigraphic, hydrologic, and climatic history of the Great Basin, with emphasis on Lakes Lahontan, Bonneville, and Tecopa. In: Morrison RB, editor. Quaternary nonglacial geology: Conterminous US. GSA DNAG Vol. K-2; 1991. pp 283–321.
- 3. Douglas ME, Douglas MR, Schuett GW, Porras LW. Evolution of rattlesnakes (Viperidae; Crotalus) in the warm deserts of western North America shaped by Neogene vicariance and Quaternary climate change. Molecular Ecology 2006;15:3353–3374. pmid:16968275
- 4. Neiswenter SA, Riddle BR. Landscape and climatic effects on the evolutionary diversification of the Perognathus fasciatus species group. Journal of Mammology 2011;92:982–993.
- 5. Bryson RW Jr, Riddle BR. Tracing the origins of widespread highland species: a case of Neogene diversification across the Mexican sierras in an endemic lizard. Biological Journal of the Linnaen Society 2012;105:382–394.
- 6. Bryson RW Jr, Riddle BR, Graham MR, Smith BT, Prendini L. As old as the hills: montane scorpions in southwestern North America reveal ancient associations between biotic diversification and landscape history. PLOS One 2013;8:e5282.
- 7. Minckley WL, Douglas ME. Discovery and extinction of western fishes: A blink of the eye in geologic time. In: Minckley WL, Deacon JE, editors. Battle against extinction: Native fish management in the American West. Tucson: University of Arizona Press; 1991 pp. 7–18.
- 8. Smith GR, Badgley C, Eiting TP, Larson PS. Species diversity gradients in relation to geological history in North American freshwater fishes. Evolutionary Ecology Research 2010;12:693–726.
- 9. Jelks HL, Walsh SJ, Burkhead NM, Contreras-Balderas S, Dìaz-Pardo E, Hendrickson DA, et al. Conservation status of imperiled North American freshwater and diadromous fishes. Fisheries 2008;33:372–407.Minckley WL. Fishes of Arizona. Phoenix: Arizona Game and Fish Department; 1973.
- 10. Weedman DA, Girmendonk AL, Young KL. Status review of Gila chub, Gila intermedia, in the United States and Mexico. Nongame technical report 91. Phoenix: Arizona Game and Fish Department; 1996. 120 pp.
- 11. Voeltz JB. Roundtail chub (Gila robusta) status survey of the lower Colorado River basin. Nongame technical report 186. Phoenix: Arizona Game and Fish Department; 2002. 221 pp.
- 12. U.S. Fish and Wildlife Service. Endangered and threatened wildlife and plants; listing the Gila chub as endangered with critical habitat; proposed rule. Federal Register 2002;67(154), 51948–51985.
- 13. U. S. Fish and Wildlife Service. Endangered and threatened species. Review of plant and animal taxa: proposed rule. Federal Register 2006;71(89):26007–260017.
- 14. U.S. Fish and Wildlife Service. Endangered and threatened wildlife and plants; 12-month finding on a petition to list a distinct population segment of the roundtail chub (Gila robusta) in the lower Colorado River basin; proposed rule. Federal Register 2009;74(128):32352–32387.
- 15. Arizona Game and Fish Department. Arizona statewide conservation agreement for roundtail chub (Gila robusta), headwater chub (Gila nigra), flannelmouth sucker (Catostomus latipinnis), Little Colorado River sucker (Catostomus spp. [sic.]), bluehead sucker (Catostomus discobolus) and Zuni bluehead sucker (Catostomus discobolus yarrowi). Phoenix: Arizona Game and Fish Department; 2006. 63 pp.
- 16. Utah Department of Natural Resources. Rangewide conservation agreement and strategy for roundtail chub (Gila robusta), bluehead sucker (Catostomus discobolus), and flannelmouth sucker (Catostomus latipinnis). Publication Number 06–18. Salt Lake City: Utah Department of Natural Resources; 2006.
- 17. Rinne JN. Cyprinid fishes of the Gila from the lower Colorado River. Wassman Journal of Biology 1976;34:65–107.
- 18. DeMarais BD. Morphological variation in Gila (Pisces: Cyprinidae) and geologic history: lower Colorado River basin. Thesis, Arizona State University. 1986.
- 19. Douglas ME, Minckley WL, DeMarais BD. Did vicariance mold phenotypes of western North American Fishes? Evidence from Gila River cyprinids. Evolution 1999;53:238–246
- 20. DeMarais BD. Genetic relationships among fishes allied to the genus Gila (Teleostei: Cyprinidae) from the American Southwest. Dissertation, Arizona State University. 1992.
- 21. Minckley WL, DeMarais BD.Taxonomy of chubs (Teleostei, Cyprinidae, genus Gila) in the American Southwest with comments on conservation. Copeia 2000;2000:251–256.
- 22. Schwemm MR. Genetic variation in the Gila robusta complex (Teleostei: Cyprinidae) in the lower Colorado River. Thesis, Arizona State University. 2006.
- 23. Meffe GK, Vrijenhoek RC. Conservation genetics in the management of desert fishes. Conservation Biology 1988;2:157–169.
- 24. Smith GR. Effects of habitat size on species richness and adult body sizes of desert fishes. In Naiman RJ and Soltz DL, editors. Fishes in North American deserts. New York: John Wiley and Sons; 1981. pp 125–171.
- 25. Minckley WL. Fishes of Arizona. Phoenix: Arizona Game and Fish Department; 1973.
- 26. Tibbets CA, Dowling TE. Effects of intrinsic and extrinsic factors on population fragmentation in three North American minnows (Teleostei: Cyprinidae). Evolution 1996;50:1280–1292.
- 27. Keeler-Foster CL, Spies IB, Bondu-Hawkins V, Bentzen P. Development of microsatellite markers in bonytail (Gila elegans) with cross-species amplification in humpback chub (Gila cypha). Molecular Ecology Notes 2004;4:23–25.
- 28. Meredith EP, May B. Microsatellite loci in the Lahontan tui chub, Gila bicolor obesa, and their utilization in other chub species. Molecular Ecology Notes 2002;2:156–158.
- 29. Glenn TC, Schable NA. Isolating microsatellite DNA loci. Methods in Enzymology 2005;395:202–222. pmid:15865969
- 30. Goudet J. FSTAT, a program to estimate and test gene diversities and fixation indices (version 2.9.3). 2001. Available from http://www2.unil.ch/popgen/softwares/fstat.htm. Accessed 3 August 2009.
- 31. Narum SR. Beyond Bonferroni: Less conservative analyses for conservation genetics. Conservation Genetics 2006;7:783–787.
- 32. Nei M. Molecular Evolutionary Genetics. New York: Columbia University Press;1987,
- 33. Kalinowski ST (2005) HP-Rare: a computer program for performing rarefaction on measures of allelic diversity. Molecular Ecology Notes 5:187–189.
- 34. Weir BS, Cockerham CC (1984) Estimating F-statistics for the analysis of population structure. Evolution 38:1358–1370.
- 35. Excoffier L, Laval G, Schneider S. Arlequin ver. 3.0: An integrated software package for population genetics data analysis. Evolutionary Bioinformatics Online 2005;1:47–50.
- 36. Takezaki N, Nei M, Tamura K. POPTREE2: Software for constructing population trees from allele frequency data and computing other population statistics with Windows-interface. Molecular Biology and Evolution 2010;27:747–752. pmid:20022889
- 37. ESRI [Environmental Systems Research Institute]. Digital chart of the world. Redlands: Environmental Systems Research Institute; 1993.
- 38. Rosenberg MS, Anderson CD. PASSaGE: Pattern Analysis, Spatial Statistics and Geographic Exegesis. Version 2. Methods in Ecology and Evolution 2011;2:229–232.
- 39. Bocard D, Legendre P. Is the Mantel correlogram powerful enough to be useful in ecological analysis? A simulation study. Ecology 2012;93:1473–1481. pmid:22834387
- 40. Meirmans PG. The trouble with isolation by distance. Molecular Ecology 2012;21:2839–2846. pmid:22574758
- 41. Diniz-Filho JAF, Soares TN, Lima JS, Dobrovolski R, Landeiro VL, de Campos Telles MP, Rangel TF, Bini LM. Mantel test in population genetics. Genetics and Molecular Biology 2013;36:475–485. pmid:24385847
- 42. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics 2000;155:945–59. pmid:10835412
- 43. Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: Linked loci and correlated allele frequencies. Genetics 2003;164:1567–1587. pmid:12930761
- 44. Corander J, Walmann P, Marttinen P, Sillanpaa MJ. BAPS2: enhanced possibilities for the analysis of genetic population structure. Bioinformatics 2004;20:2363–2369. pmid:15073024
- 45. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Molecular Ecology 2005;14:2611–2620. pmid:15969739
- 46. Earl DA, vonHoldt BM. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conservation Genetics Resources 2011;4:359–361.
- 47. Jakobsson M, Rosenberg NA. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 2007;23:1801–1806. pmid:17485429
- 48. Rosenberg NA. DISTRUCT: a program for the graphical display of population structure. Molecular Ecology Notes 2004;4:137–138.
- 49. Frantz AC, Cellina S, Krier A, Schley L, Burke T. Using spatial Bayesian methods to determine the genetic structure of a continuously distributed population: clusters or isolation by distance? Journal of Applied Ecology 2009;46:493–505.
- 50. Hedrick PW. Highly variable loci and their interpretation in evolution and conservation. Evolution 1999;53:313–318.
- 51. Colosimo PF, Hosemeann KE, Balabhadra S, Villerreal G Jr., Dickson M, Grimwood J, Schmutz J, Myers RM, Schluter D, Kingsley DM. Widespread parallel evolution in sticklebacks by repeated fixation of ectodysplasin alleles. Science 2005;307:1928–1933. pmid:15790847
- 52. Minckley WL, Marsh PC. Inland fishes of the greater southwest: chronicle of a vanishing biota. Tucson: Univ. Ariz. Press, Tucson; 2009. 426 pp.
- 53. Hutchison DW, Templeton AR. Correlation of pairwise genetic and geographic distance measures: inferring the relative influences of gene flow and drift on the distribution of genetic variability. Evolution 1999;53:1898–1914.
- 54. Polyak VJ, Asmerom Y. Late Holocene climate and cultural changes in the southwestern United States. Science 2001;294:148–151. pmid:11588259
- 55. Clarkson RW, Marsh PC, Dowling TE. Population prioritization for conservation of imperiled warmwater fishes in an arid-region drainage. Aquatic Conservation: Marine and Freshwater Ecosystems 2012;22:498–510.
- 56. Houston DD, Shiozawa DK, Riddle BR. The roles of Neogene geology and late Pleistocene lake levels in shaping the genetic structure of the Lahontan redside shiner Richardsonius egregius (Teleostei: Cyprinidae). Biological Journal of the Linnaean Society 2011;104:163–176.
- 57. Ardren WR, Baumsteiger J, Allen CS. Genetic analysis and uncertain taxonomic status of threatened Foskett Spring speckled dace. Conservation Genetics 2010;11:1299–1315.
- 58. Billman EJ, Lee JB, Young DO, McKell MD, Evans RP, Shiozawa DK. Phylogenetic divergence in a desert fish: Differentiation of speckled dace within the Bonneville, Lahontan, and upper Snake River basins. Western North American Naturalist 2010;70:39–47.
- 59. Johnson JB, Dowling TE, Belk MC. Neglected taxonomy of rare desert fishes: Congruent evidence for two species of leatherside chub. Systematic Biology 2004;53:841–855. pmid:15764555
- 60. Johnson JB. Evolution after the flood: phylogeography of the desert fish Utah chub. Evolution 2002;56:948–960. pmid:12093030
- 61. Whiteley AR, Spruell P, Allendorf FW. Can a common species provide valuable information for conservation? Molecular Ecology 2006;15, 2767–2786. pmid:16911199
- 62. Whiteley AR, Spruell P, Allendorf FW. Ecological and life history characteristics predict population genetic divergence of two salmonids in the same landscape. Molecular Ecology 2004;13:3675–3688. pmid:15548282
- 63. Hopken MW, Douglas MR, Douglas ME. Stream hierarchy defines riverscape genetics of a North American desert fish. Molecular Ecology 2012;22:956–971. pmid:23279045