Extremely Low Microsatellite Diversity but Distinct Population Structure in a Long-Lived Threatened Species, the Australian Lungfish Neoceratodus forsteri (Dipnoi)

The Australian lungfish is a unique living representative of an ancient dipnoan lineage, listed as ‘vulnerable’ to extinction under Australia’s Environment Protection and Biodiversity Conservation Act 1999. Historical accounts indicate this species occurred naturally in two adjacent river systems in Australia, the Burnett and Mary. Current day populations in other rivers are thought to have arisen by translocation from these source populations. Early genetic work detected very little variation and so had limited power to answer questions relevant for management including how genetic variation is partitioned within and among sub-populations. In this study, we use newly developed microsatellite markers to examine samples from the Burnett and Mary Rivers, as well as from two populations thought to be of translocated origin, Brisbane and North Pine. We test whether there is significant genetic structure among and within river drainages; assign putatively translocated populations to potential source populations; and estimate effective population sizes. Eleven polymorphic microsatellite loci genotyped in 218 individuals gave an average within-population heterozygosity of 0.39 which is low relative to other threatened taxa and for freshwater fishes in general. Based on F ST values (average over loci = 0.11) and STRUCTURE analyses, we identify three distinct populations in the natural range, one in the Burnett and two distinct populations in the Mary. These analyses also support the hypothesis that the Mary River is the likely source of translocated populations in the Brisbane and North Pine rivers, which agrees with historical published records of a translocation event giving rise to these populations. We were unable to obtain bounded estimates of effective population size, as we have too few genotype combinations, although point estimates were low, ranging from 29 - 129. We recommend that, in order to preserve any local adaptation in the three distinct populations that they be managed separately.

The Australian lungfish is a unique living representative of an ancient dipnoan lineage, listed as 'vulnerable' to extinction under Australia's Environment Protection and Biodiversity Conservation Act 1999. Historical accounts indicate this species occurred naturally in two adjacent river systems in Australia, the Burnett and Mary. Current day populations in other rivers are thought to have arisen by translocation from these source populations. Early genetic work detected very little variation and so had limited power to answer questions relevant for management including how genetic variation is partitioned within and among sub-populations. In this study, we use newly developed microsatellite markers to examine samples from the Burnett and Mary Rivers, as well as from two populations thought to be of translocated origin, Brisbane and North Pine. We test whether there is significant genetic structure among and within river drainages; assign putatively translocated populations to potential source populations; and estimate effective population sizes. Eleven polymorphic microsatellite loci genotyped in 218 individuals gave an average within-population heterozygosity of 0.39 which is low relative to other threatened taxa and for freshwater fishes in general. Based on F ST values (average over loci = 0.11) and STRUCTURE analyses, we identify three distinct populations in the natural range, one in the Burnett and two distinct populations in the Mary. These analyses also support the hypothesis that the Mary River is the likely source of translocated populations in the Brisbane and North Pine rivers, which agrees with historical published records of a translocation event giving rise to these populations. We were unable to obtain bounded estimates of effective population size, as we have too few genotype combinations, although point estimates were low, ranging from 29 -129. We recommend that, in order to preserve any local adaptation in the three distinct populations that they be managed separately.

Introduction
For species of conservation concern (i.e. endangered or vulnerable), increasing or maintaining effective population size (N e ) and thus maintaining levels of genetic diversity has been considered critically important [1]. This is important because species with low levels of genetic diversity may be less resilient to environmental stressors such as pathogens [2] and climate change [3]. Various management options have been suggested for populations with low genetic diversity, including 'genetic rescue' through the addition of individuals from other populations in an attempt to maintain or increase genetic diversity in threatened populations [4]. In some cases, the introduction of a very small number of individuals has produced significant fitness benefits [5]. However, there is also the possibility of reduced fitness if the introduced individuals are genetically differentiated from the resident population [4]. Thus, management of threatened species requires information on both the genetic diversity within populations and levels of genetic divergence between populations.
Managing long-lived species requires an understanding of the factors affecting population dynamics which can be difficult to determine from current day adult populations as the historical drivers for the present day structure may have been different when they were influencing key recruitment processes in the past [6,7]. However, an analysis of the genetic structure of present day populations can help understand past patterns of historical demographic change, such as changes in population size and also evidence of inbreeding or non-random mating patterns [1]. A number of recent studies of long-lived species have suggested that long generation times can ameliorate the loss of genetic variation due to small effective population sizes [7,8].
The Australian lungfish Neoceratodus forsteri is the most primitive member of the subclass Dipnoi, and the only extant representative of the group in Australia [9]. Australian lungfish reach large sizes (up to 150 cm and 20 kg) and are very long-lived, with some individuals known to live for over 80 years [9,10]. The species also has a very long evolutionary history, having existed with little change in morphology for around 100 million years since the early Cretaceous [9,11]. The Australian lungfish was historically widespread in drainages of eastern Australia before its range contracted to a small area of southeastern Queenlsand during the Pleistocene [12]. The species is currently only found in a few rivers in southeast Queensland, although it is thought to occur naturally in only two river systems, the Mary River and the Burnett River [9,13] see Fig 1A. Populations in other rivers (Brisbane, North Pine, Condamine, Coomera, Albert, see Fig 1A), are thought to be the result of translocations, some of which were documented in the late nineteenth century [14]. Recruitment of juveniles into the population has rarely been observed since the discovery of the lungfish in 1870, leading to the belief that the population may be at risk of extinction [15] and that translocation was an option to increase the distribution and spread the risk across multiple river systems [16]. Lungfish are quite abundant in parts of the Brisbane River (David Roberts, pers comm.), with fewer in the North Pine River and fewer individuals in the other rivers to which they were translocated. Populations are all dominated by adults, with little evidence of recent or regular recruitment in most of these populations [17]. It has been declared a nationally vulnerable species under the Environment Protection and Biodiversity Conservation Act 1999 and is listed as a no-take species under Schedule 2 of the Fisheries Regulation 2008 under the Fisheries Act 1999 (Queensland). For the purposes of managing lungfish populations, Fisheries Queensland has always assumed that Burnett and Mary populations are natural and that the other populations have resulted from translocation (Peter Kind, pers comm.).
Genetic differences between sub-populations can occur when they become geographically isolated by barriers which can include physical barriers such as water falls or disconnected rivers, or chemical barriers such as salinity gradients. N. forsteri are not tolerant of elevated salinities [9] and therefore it is likely that populations in different coastal drainages are effectively isolated from one another by salinity barriers. Depending on how long the populations have been isolated, this is likely to have resulted in genetic differentiation, although the rate that this differentiation will have accrued is likely to be much slower than for other freshwater fish species, due to the very long generation time of lungfish, not reaching maturity until about 20 years of age [9].
A previous genetic study examined 24 allozyme loci and a large portion of mitochondrial DNA and reported extremely low levels of genetic variation [13]. Only two allozyme loci were polymorphic, each with only two alleles, and most individuals from the Mary, Burnett and Brisbane Rivers shared a single mitochondrial haplotype [13]. The authors concluded that the very low genetic variation was probably the result of past bottlenecks resulting from periodic or extended dry periods during the Pleistocene, but they were unable to draw definite conclusions about the levels of genetic structure between river systems, as the very low levels of genetic variation meant that there was low statistical power. The authors also suggested the possibility of inbreeding in the natural populations, detecting significant heterozygote deficits in three of four comparisons. Microsatellites, due to their high mutation rate and the fact that most are non-coding and therefore not subject to selection [18] are likely to provide more power to test these questions. Huey et al. [19] recently developed a panel of novel polymorphic microsatellite loci for the lungfish. In this study, we use these microsatellites to test the following hypotheses: There will be significant genetic structure between the two 'natural populations', the Burnett River and the Mary River, because they are separate drainage basins, isolated by the ocean and the lungfish is an obligate freshwater fish; there will be low levels of genetic differentiation between putatively translocated populations in the Brisbane River and North Pine River and at least one of the natural populations (the assumed source) because the translocations occurred within the last 150 years and the generation time for the lungfish is very long, thus slowing the possibility of genetic divergence since translocation. We also use the genetic data to attempt to estimate effective population sizes, as previous work has suggested that these may be very low and thus represent a risk for the future of the species.

Sampling Strategy
Tissue and scale samples were collected from 218 individuals from seven localities in the Burnett, Mary, Brisbane and North Pine River catchments ( Fig 1A). Two localities were sampled from the Burnett (Upper Burnett and Lower Burnett) and three from the Mary (Upper Mary, Lower Mary and Tinana Creek). Lungfish collected from multiple positions within the same tributary were pooled together for each of these sites. A single locality was sampled from each of the Brisbane (below Wivenhoe Dam) and North Pine Rivers (below North Pine Dam). The number of individuals from each location is shown in Table 1.

Ethics Statement
All field and experimental protocols carried out in this study were approved by the Griffith University Animal Ethics Committee. All procedures were carried out according to Australian Ethics Committee protocol numbers CA2011/10/551 (Seqwater) and ENV/17/14/AEC (Griffith University). inbreeding coefficient (F IS ) averaged across loci with asterisk denoting populations with significant heterozygote deficiency. Effective population size (N E ) estimates with 95% parametric confidence intervals in parentheses were calculated using the linkage disequilibrium method excluding alleles with frequency <0.05. Allelic richness (AR) and private allelic richness (AR priv ) rarefied to minimum sample size of 16 alleles. Bottleneck results correspond to one-tail Wilcoxon test for heterozygote excess, NS = not significant. M-ratio is the Garza-Williamson index following [28].

Laboratory Methods
A standard phenol-chloroform extraction method [20] was used to extract DNA from scales or fin clip tissue samples. Of 336 microsatellite primers developed and tested on N. forsteri, 115 successfully amplified a product of expected size and of these, 94 were monomorphic and 21 were variable [19]. Further testing of the 21 variable loci reported by Huey et al. [19], showed nine had inconsistent amplification results and were excluded from this study, while the remaining 11 loci were screened across all individuals. The 11 loci include: 1LF041, 2LF034, 2LF026, 1LF005, 2LF049, 2LF069, 2LF041, 1LF044, 2LF031, 2LF032 and 1LF117 [19]. PCR amplification methods can be found in Huey et al. [19]. Microsatellites were fluorescently labeled using the multi-tailed primer tagging method of Real et al. [21] and run on an ABI3130 Fragment Analyser using the G5 dye set. Alleles were scored using GeneMapper V3.1 (Applied Biosystems).

Data Analyses
Genetic diversity and effective population size. Genetic diversity averaged across eleven loci within each of the seven population samples was quantified from observed (H O ) and expected (H E ) heterozygosity using ARLEQUIN v3.5.1.2 [22]. Measures of genetic diversity standardized for sample size including allelic richness (AR) and private allelic richness (AR priv ) were calculated using HP-Rare [23]. Polymorphism information content (PIC) was evaluated for each locus using CERVUS v3.0.7 [24], where PIC > 0.5 is considered highly informative; 0.5 > PIC > 0.25 reasonably informative; and PIC<0.25 slightly informative [25]. Tests for deviation from Hardy-Weinberg Equilibrium (HWE) for each locus-population combination were carried out using exact tests implemented in ARLEQUIN. The inbreeding coefficient (F IS ) was estimated over all loci for each population using the Analysis of Molecular Variance (AMOVA) framework in ARLEQUIN and tested for deviation from the random expectation by 10,000 permutations of alleles among individuals within populations.
Genetic evidence for a recent reduction in local population size was tested using the software BOTTLENECK v1.2.02 [26] which compares observed and expected heterozygosity where the expected value is the equilibrium expectation conditioned on the observed number of alleles [27]. The Two-Phased (TPM) mutation model was used, incorporating 95% single-step changes with variance of multiple-step changes set to 12%. Statistical significance was evaluated from 5000 simulations using the one-tailed Wilcoxon sign-rank test as recommended from the power analysis of Piry et al. [26]. We also calculated Garza and Williamson's [28] M-ratio for each population sample using ARLEQUIN. The M-ratio is the mean ratio of the number of alleles to the range of allele size. It is sensitive to population bottlenecks because it measures the proportion of unoccupied allelic states given the range in allele size, and this ratio is reduced as alleles are randomly lost due to drift [28].
An estimate of effective population size (N e ) was made for each of the seven population samples using the single-sample linkage disequilibrium method [29], implemented in the software NeEstimator v2.01 [30]. Low frequency alleles (<0.05) were excluded from analysis and parametric estimates of 95% confidence intervals were reported.
Population genetic structure. Genetic structure among the seven population samples was quantified by estimating pair-wise and global F ST values as a weighted average over eleven loci in ARLEQUIN. These were tested for significant deviation from panmictic expectations by 10,000 permutations of individuals among populations. The critical value (α) was corrected for multiple tests using the BY False Discovery Rate method (BY-FDR) which controls experiment-wide Type I error without the loss of power associated with the Bonferroni adjustment [31]. A hierarchical AMOVA was also run, with catchment as the highest level of the hierarchy.
The grouping was Burnett (upper and lower), Mary (upper, lower and Tinana) Pine and Brisbane. The AMOVA was performed in ARLEQUIN as a weighted average over the 11 loci with significance of F-statistics determined by 1,000 permutations.
We tested for the existence of distinct genetic groups within the pooled set of 218 individual multi-locus genotypes using model-based Bayesian clustering. The probability of an admixture model was tested for up to eight clusters (K) using STRUCTURE 2.3.1 [32]. The location of individuals in sampling groups was incorporated as a prior in the analysis (LOCPRIOR model). This model is useful in cases where genetic structure is weak and the model does not tend to over specify structure where none exists [33]. Models were tested using 20 independent MCMC simulations, each consisting of 2×10 6 iterations after a burn-in of 2×10 5 iterations. The most likely number of clusters was assessed first by plotting the mean probability of data for successive values of K to find a plateau in probability values and then choosing the smallest of these values [34]. Secondly, we calculated Delta K or the second-order rate of change L˝(K) following Evanno et al. [35] using the online application Structure Harvester [36]. We also used STRUCTURE to assign individuals from putatively translocated population samples (Brisbane and North Pine) to the major genetic clusters formed among the known natural/source populations (Burnett, Mary, Tinana). This analysis utilized the USEPOPINFO model, setting the natural samples to POPFLAG = 1, and translocated samples to POPFLAG = 0. Allele frequencies were updated using only individuals with POPFLAG = 1 so that individuals from translocated samples were forced to cluster with one or more of the natural population groups. Twenty replicate STRUCTURE runs were collated using the software CLUMPP [37] and visualized with DISTRUCT [38].

Results
Levels of diversity were extremely low whether viewed as an average over loci within population samples (Table 1), or as individual loci across samples (Table 2). Expected heterozygosity of the 11 polymorphic loci averaged 0.39 across the seven sample sites and allelic richness per site averaged just over 2 when sample sizes were standardized across populations at 16 individuals (Table 1). In terms of individual loci, the number of alleles per locus averaged 3.55 and the average PIC was 0.35 with only two loci (1LF005 and 1LF044) achieving PIC >0.5 which is considered a highly informative level of variation ( Table 2). Four of the five sample sites from natural populations had significant deficits of heterozygotes, with only the site with a small sample size (Mary lower) not showing a significant deficit (Table 1). In contrast, both sample sites from assumed translocated populations (North Pine and Brisbane) had F IS values very close to zero. The estimates of N E were mostly quite small, but confidence levels were broad, indicating that the magnitude of sampling error was comparable to the signal of among locus genetic linkage in the data. This was probably because of the low levels of diversity overall, limiting the power to detect linkage disequilibrium. All bottleneck tests were non-significant and M-ratio values ranged from 0.5-0.6 for each population sample ( Table 2).
The overall F ST value across 11 loci was 0.111 and was highly significant (P<0.001). Each of the 11 locus-by-locus F ST values were also significant at P<0.05 and ranged from 0.019 for locus 2LF032 to 0.243 for locus 1LF005 ( Table 2). Most of the pairwise F ST values between the seven population samples were significant (Table 3) and ranged from 0.015 to 0.197. Only three comparisons involving the lower Mary, which had a much smaller sample size than other sites, were non-significant. In the hierarchical AMOVA, the F CT value was 0.033 (P>0.05), the F SC value was 0.085 (P<0.001), indicating significant differences within at least one of the catchments. This result was possibly caused by the highly significant differences between Tinana Creek and the other two Mary River sites.
Examination of likelihood scores produced by STRUCTURE from 20 replicate runs across K values from one to eight showed the mean likelihood score (Ln(K)) plateaued at K = 4 while the Delta K method indicated K = 2, although the latter method did not show relatively strong support for any value (Fig 1B and 1C). We infer that K = 4 captures the majority of structure in our sample and the admixture bar plot for K = 4 ( Fig 1D) shows a strong relationship between the population samples and four genetic groups. Clustering the data using values of K > 4 did not produce biologically sensible solutions but merely forced some populations to accommodate more admixture (see Burnett and Brisbane for K = 5, Fig 1E). We had hypothesized that the populations in each of the river systems would constitute separate populations. This was partly supported, in that the upper Mary sample was distinct from the upper and lower Burnett samples (Fig 1D). The lower Mary was distinct from the upper Burnett, but not from the lower Burnett. Furthermore, unexpectedly, the sample from Tinana Creek, part of the Mary River catchment, was very distinct from any of the samples from other natural populations (Fig 1D). Also, unexpectedly, the sample from the North Pine River was distinct from all natural populations (Fig 1D). The Brisbane River sample was most similar to the Mary upper and lower samples (Fig 1D). In summary, there is evidence for three genetically distinct populations within the native range of Australian lungfish, two in the Mary catchment and one in the Burnett. A fourth distinct group was found in a putatively translocated population, North Pine River. Assignment of putatively translocated samples (North Pine and Brisbane) to potential source populations in the native range of lungfish (Burnett, Mary) using STRUCTURE was complicated by strong genetic structure within the Mary catchment where the sample from Tinana Creek comprised a distinct genetic group. For this reason we set potential source populations in the native range of lungfish to K = 3 groups using the USEPOPINFO model in STRUCTURE. These three groups are geographically distinct and correspond with 1) Burnett River (pooled Burnett upper and Burnett lower samples); 2) Mary river (pooled Mary upper and Mary lower samples); 3) Tinana Creek. The data from putatively translocated samples (North Pine and Brisbane) were treated as having unknown affinity and assigned to the three genetic clusters from the native range. Twenty replicate runs of STRUCTURE at K = 3, showed all individuals sampled from putatively translocated populations (North Pine and Brisbane) had a close affinity to the Mary River genetic cluster (Fig 1F).

Genetic Diversity
The level of genetic variation in this long-lived vulnerable species is indeed extremely low. This had already been reported for allozymes by Frentiu et al. [13], who found only 2 polymorphic allozyme loci out of 24 sampled. They also found very little variation in mitochondrial DNA, spanning both the control region and part of the ATPase 6 and 8 genes. During screening of microsatellite markers for this project, only 19% of 115 clearly interpretable loci were polymorphic. Recent reviews of microsatellite development using comparable next-generation sequencing methods suggest an average return of around 60% polymorphic loci is typical across a broad taxonomic range of species [39,40]. While the level of microsatellite polymorphism in the Australian lungfish is therefore unusually low, it is higher than reported for the endangered Hawaiian monk Seal Monachus schauinslandi of 5% from a survey of 154 loci [1]. The monk seal is considered depauperate in genetic diversity (mean heterozygosity of polymorphic loci = 0.49) owing to a severe population bottleneck due to human exploitation [1]. Our results for the Australian lungfish suggest an even lower average heterozygosity of 0.39 for polymorphic microsatellite loci. This value is also well below the mean of 0.68 for a survey of freshwater fishes by McCusker and Bentzen [41]. This value is also well below the expected heterozygosity reported for other large-bodied, long-lived and threatened riverine fishes including the arapaima, Arapaima gigas [42], the beluga sturgeon, Huso huso [43] and Murray cod, Maccullochella peelii [44]. However, heterozygosity values lower than reported here for lungfish are known in freshwater fishes. For example, an average H E of 0.18 was recorded in marble trout populations subject to periodic catastrophic floods causing genetic bottlenecks [45].
We surveyed a range of populations from the species' native range as well as putatively translocated populations but found very similar levels of diversity in terms of heterozygosity and allelic richness across all sites. Therefore low genetic diversity does not appear to be a consequence of recent human activities as in the case of the monk seal. The Australian lungfish has either experienced a number of bottlenecks in the past, possibly due to Pleistocene range contraction as suggested by Frentiu et al. [13] and Kemp [12], or has had a naturally very low effective population size. The low levels of allozyme and mitochondrial DNA diversity [13] and microsatellite diversity reported here, even with the extremely long life-time, suggest that effective population sizes have been low for a significant period.
There is also some genetic evidence of inbreeding in the natural populations, as all but one of them had significant heterozygote deficits. This was also observed in the two allozyme loci analysed by Frentiu et al. [13], so it is likely to be an accurate reflection of mating patterns. It is very interesting that this heterozygote deficit was not observed in either of the translocated populations. This is particularly interesting, as Kemp [46] reported deformities in lungfish embryos sampled from the Brisbane River population, which could potentially be evidence of inbreeding depression, but our genetic data do not support a significant deviation from random mating as a likely explanation for this observation.
While our microsatellite markers were sufficiently variable for an analysis of population structure, they were not sufficiently variable for good estimates of N e . The linkage disequilibrium method works on the number of allelic combinations and so when variability is low, precision is severely affected. Waples et al. [29] showed that when the number of alleles per locus is around 2 (as is the case here), you would need to screen 180 loci to obtain the same precision as for a typical microsatellite data set with 20 microsatellite loci and 10 alleles per locus. Given the low levels of microsatellite variation in this species, it will be necessary to develop a SNP analysis, where it should be possible to identify many more loci, which will not be any less variable than our current microsatellite loci.

Genetic Differentiation
We had hypothesized that samples taken from populations believed to be natural would be differentiated between the two river systems, because the species is restricted to freshwater and highly intolerant of elevated salinities. This prediction was partly supported in that samples from the Burnett and the main stem of the Mary River were differentiated and appeared as distinct populations in the STRUCTURE analysis. However, we had not expected that the sample from Tinana Creek, a tributary of the Mary River, would also be distinct. Clearly there is a barrier separating Tinana Creek from the rest of the Mary River system despite their close proximity to one another (Fig 1A). Tinana Creek flows into the Mary River not far from the mouth, with both localities having tidal estuarine reaches in the lower sections. Salinities in these lower sections will be higher than tolerable limits during normal river flow conditions, thus restricting migration between the two systems. However, these localities flood periodically, reducing salinities throughout the system and therefore it might be expected that fish could disperse between the two systems at such times. It appears however that this does not happen or occurs very rarely. Earlier genetic studies [13] did not uncover this divergent population because Tinana Creek was not included in their analysis.
Interestingly, a number of other freshwater species show genetic differentiation between Tinana Creek and the rest of the Mary River. These include the threatened Mary River cod Maccullochella mariensis [47], the rainbowfish Melanotaenia dublayi [48] and a freshwater crayfish Cherax dispar [49]. The fact that a number of species show a similar pattern suggests that this divergence is caused by the same process, probably historical isolation.
The other unexpected finding was that the sample from the North Pine River was differentiated from all the samples taken from natural populations and from the Brisbane River population, particularly in the initial STRUCTURE analysis (Fig 1D). This difference could have two explanations. First, there could have been a natural population in the North Pine River historically. If this population had been in the river over many generations then the differentiation would be expected. However, this seems unlikely, as there have been no reports of lungfish in this river system before the translocations [50]. Given the proximity to Brisbane it is likely that a natural population would have been discovered by early settlers and explorers pre1896. An alternative explanation is that the differentiation reflects a founder effect resulting from the very small number of individuals introduced. Although five fish were initially taken to the North Pine river, Kemp [50] reports that O'Connor, who was responsible for the translocation, later reported that only three survived the journey. Artificial introductions of small numbers of individuals can result in significant genetic differentiation between the source population and the translocated one [51]. If this was the case, then we might have expected to see a significant result from the bottleneck tests for this sample. However, as the diversity was so low in the source population, possibly the BOTTLENECK test, which relies on an elevated heterozygosity relative to the number of alleles expected under mutation drift equilibrium [27], had limited power. A recent simulation study by Peery et al. [52] showed that overall, the power of this bottleneck test to detect even quite large population declines can be quite low unless sample sizes are very large (around 100) and there are a large number of loci (around 20). In contrast, Mratio results were indicative of a reduction in lungfish population size in all our samples, as values were below the critical value of 0.68, which is derived from putatively stable wild populations by Garza and Williamson [28]. However Peery et al. [52] also found power to detect bottlenecks using this method was low when genetic diversity is low, which is the case with lungfish. Deciding among possible explanations for the unexpected genetic differentiation of the North Pine River lungfish sample is not possible based on our microsatellite data, and we defer a final conclusion until a more extensive genomic data set is available.
Our results for the Brisbane River sample support the hypothesis that this population has come from the Mary River proper. It clearly has not come from Tinana Creek, or from the Burnett. This finding is consistent with historical records that the source population of various translocations made by O'Connor in the late nineteenth century was the Mary River [14]. Some literature suggests that the Brisbane River population may in fact not be the result of a translocation [50,53]. In fact Kemp and Hynen [53] identified jaw bones collected from the Brisbane River, dating back to 3850 ybp as belonging to lungfish. However, results from Frentiu's study, while not totally convincing due to the limited variability of the markers used, found no evidence for the Brisbane River population being distinct [13]. Our results also find no evidence to support this notion, as individuals sampled from the Brisbane River do not form a unique genetic cluster but group together with individuals from the Mary River (upper and lower). When the Brisbane sample was treated as unknown and classified against the three known natural population groups (Burnett, Mary, Tinana), it was most similar to the Mary group. Furthermore comparisons between the Brisbane River and Mary River samples based on F ST values were lower than with other samples.
Evidence for genetic distinction between Mary and Brisbane river lungfish populations was found using randomly amplified DNA fingerprinting (RAF) markers in an unpublished report by Lissone [54]. This data showed a single band (out of~1,400 total bands) at high frequency in the Brisbane sample that was absent in other samples. This result could indicate that Brisbane is a distinct natural population but it is also consistent with a founder event associated with the documented historical translocation. In addition, the well-known repeatability issues associated with PCR-based DNA fingerprinting dictates cautious interpretation of these results [55]. Lissone's [54] data therefore does not confirm that lungfish present in the Brisbane river today represent a natural and distinct population as asserted by Kemp and Huynen [53]. However it is plausible that a natural population did occur in the Brisbane historically as supported by the recent finding of ancient jawbones [53]. Again, to finally resolve this issue will require a more extensive analysis of a wide portion of the lungfish genome.
We conclude that there are at least three distinct natural populations of the Australian lungfish, the Mary River proper, Tinana Creek and the Burnett River. Our evidence suggests that the North Pine River and Brisbane River populations have been sourced from the Mary River proper. While numbers of lungfish may be quite high in some of these sites, the extremely low levels of genetic diversity at neutral loci might be cause for concern, as they may also reflect low diversity in coding genes that are responsible for the long-term survival of the populations. If the effective population sizes are indeed very low, then this diversity could be further eroded over the next few generations. Given that these microsatellite loci did not have sufficient power to estimate effective population size with any precision, we propose to use a RAD-sequencing approach [56] to sample a much higher proportion of the genome and therefore make use of whatever variation we can detect.
'Genetic rescue' has sometimes been proposed as a strategy to increase genetic diversity in small and/or threatened populations [4]. However, in the present case, further movement of individuals between the three natural populations is not recommended. Furthermore, it is recommended that the three natural populations be managed as such. Our analysis indicates that each is a discrete population, so mixing them may inhibit any local adaptation [4]. The two translocated populations have similar levels of genetic diversity to the natural populations and do not show evidence of inbreeding or genetic bottlenecks, although a SNP analysis with many more loci may be more powerful to detect these effects. Nevertheless, it appears that these populations contain as much diversity as the natural populations and may provide a source of individuals and genetic diversity should any of the other populations experience extreme declines.
Supporting Information S1 Microsatellite Dataset. Microsatellite dataset used for all analyses in Excel spreadsheet format. Data for each individual in rows, data for each locus in columns. Data for each allele of each locus is recorded in a separate column with the header row giving the name of the locus. Missing data marked by "?". First column is individual sample code; second column is population code corresponding to population groupings analysed in the manuscript. (XLSX)