Figures
Abstract
The taxonomy of the mottled rabbitfish Siganus fuscescens species complex has long been challenging. In this study, we analyzed microsatellite genotypes, mitochondrial lineages, and morphometric data from 373 S. fuscescens individuals sampled from the northern Philippines and Hong Kong (South China Sea, Philippine Sea and Sulu Sea basins), to examine putative species boundaries in samples comprising three co-occurring mitochondrial lineages previously reported to characterize S. fuscescens (Clade A and Clade B) or S. canaliculatus (Clade C). We report the existence of two cryptic species within S. fuscescens in the northeast region of the South China Sea and northern Philippine Sea, supported by genetic and morphological differences. Individual-based assignment methods recovered concordant groupings of individuals into two nuclear genotype clusters (Cluster 1, Cluster 2) with (1) limited gene flow, if any, between them (FST = 0.241; P < 0.001); (2) low frequency of later-generation hybrids; (3) significant association with mitochondrial Clade A and Clade B, respectively; and (4) subtle yet significant body shape differences as inferred from geometric morphometric analysis. The divergence between mitochondrial Clade C and the two other clades was not matched by genetic differences at microsatellite marker loci. The occurrence of discordant mitonuclear combinations (20.5% of the total number of individuals) is thought to result from mitochondrial introgression, consistent with a scenario of demographic, and presumably spatial, post-Pleistocene expansion of populations from northern regions into a secondary contact zone in the South China Sea. Mitonuclear discordance due to introgression obscures phylogenetic relationships for recently-diverged lineages, and cautions against the use of mitochondrial markers alone for species identification within the mottled rabbitfish species complex in the South China Sea region.
Citation: Ravago-Gotanco R, de la Cruz TL, Pante MJ, Borsa P (2018) Cryptic genetic diversity in the mottled rabbitfish Siganus fuscescens with mitochondrial introgression at a contact zone in the South China Sea. PLoS ONE 13(2): e0193220. https://doi.org/10.1371/journal.pone.0193220
Editor: Tzen-Yuh Chiang, National Cheng Kung University, TAIWAN
Received: January 11, 2017; Accepted: February 7, 2018; Published: February 21, 2018
Copyright: © 2018 Ravago-Gotanco et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data and links to data repositories are within the paper and Supporting Information files. The microsatellite dataset has been deposited in the Dryad Digital Repository (doi:10.5061/dryad.b8649). The authors’ final draft and the appended supplementary information are available from IRD’s Hyperarchives en ligne (http://www.hal.ird.fr/).
Funding: Funding for this research was provided by the Global Environment Facility-Coral Reef Targeted Research Capability Building for Management Project (GEF-CRTR) through the Southeast Asia Center of Excellence grant to the University of the Philippines Marine Science Institute. Additional funding was provided by the Office of the Vice Chancellor for Research and Development, University of the Philippines Diliman (OVCRD Grant PhDIA # 121206 to RRG) and the University of the Philippines Marine Science Institute (DRP, RRG). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Molecular genetic approaches enhance our understanding of biodiversity patterns and mechanisms. The recognition of cryptic species has increased in frequency with the use of molecular genetic markers [1, 2]. Uncovering and characterizing spatial distribution patterns of cryptic diversity provide insight into the processes that influence diversification and speciation, particularly in the marine environment where few barriers to dispersal and gene flow are apparent [3–5]. In the Coral Triangle, the global hotspot of marine biodiversity [6, 7], contrasting phylogeographic patterns have been observed across taxa [8–11]. These results suggest that various biogeographic hypotheses proposed to explain the origins of the exceptional species richness in the region may not be mutually exclusive and may act in concert, underscoring the complexity of marine diversification mechanisms [5, 12, 13].
The family Siganidae, commonly known as rabbitfishes, consists of 29 recognized species, with greatest species richness observed in the Coral Triangle [14–16]. Although species groups are identified primarily based on body depth and coloration [14], some closely-related species remain difficult to distinguish because of a lack of sharp differences in color patterns [14]. In particular, the white-spotted rabbitfish Siganus canaliculatus (Park, 1797) may be difficult to differentiate from the mottled rabbitfish Siganus fuscescens (Houttuyn, 1782). The two species occur sympatrically across the central Indo-West Pacific [14], where they occupy similar seagrass and reef-associated habitats. They are primarily distinguished based on the size and number of white spots on the body, with S. canaliculatus having larger and fewer spots relative to S. fuscescens [14]. Genetic studies based on allozyme markers [17] and mitochondrial cytochrome b (cytb) gene sequences [18, 19] have suggested the recognition of S. fuscescens and S. canaliculatus as valid biological species because sympatric populations of the two species exhibit significant allozyme frequency differences and are characterized by divergent mitochondrial lineages. This conclusion is challenged by the report of paraphyletism in both cytb and internal transcribed spacer (ITS) sequence phylogenies of S. canaliculatus and S. fuscescens morphotypes [20], and by non-concordance between morphotype and genetic clustering based on AFLP loci [21]. Authors have concluded that S. fuscescens and S. canaliculatus interbreed, hence represent morphological variants or geographically isolated groups within a single biological species [20, 21].
Apart from the conflicting evidence regarding the species status of S. canaliculatus and S. fuscescens, additional cryptic diversity has been reported within the S. fuscescens morphotype. Allozyme data suggest two cryptic species of S. fuscescens across the broader Coral Triangle region [17]. Mitochondrial cytb and control region (CR) phylogenies meanwhile reveal the sympatric occurrence of three divergent mitochondrial lineages of S. fuscescens, designated Clade A, Clade B and Clade C, in the Philippines, at the northern apex of the Coral Triangle, and a location further north (Hong Kong) [19]. Clade A is widely-distributed across the central Indo-West Pacific region, occurring in the Philippines, Japan, Taiwan, the Indonesian archipelago, the Solomon Islands and New Caledonia [18, 22]. The distribution of Clade B appears to be restricted to the northern periphery of the S. fuscescens’ distributional range, having been reported from Japan, Taiwan, the northern Philippines, Vietnam and Thailand [21, 23–26]. Clade C is present in the northwestern Pacific [19–21, 23, 26]. Mitochondrial DNA sequences have been useful in the study of lineage diversification in Siganidae [18, 20], and of their phylogeography and population genetic structure [19, 21, 23, 26]. However, the mitochondrial marker alone is inadequate to detect reproductive isolation and species boundaries [27], and its utility to identify species is limited when there is cross-species hybridization and introgression [28]. Combining mitochondrial markers with nuclear markers is a powerful means to examine potential cryptic speciation and species boundaries within a species complex.
In this study, we use mitochondrial DNA sequences and nuclear (microsatellite) markers, to test the hypothesis of cryptic speciation among co-occurring lineages of the mottled rabbitfish species complex in the northern Philippines and Hong Kong. If the mtDNA lineages characterize reproductively isolated groups, we expect: (1) matching differences in the nuclear genome, in the form of distinct microsatellite genotypic clusters with minimal admixture [29]; and (2) mitonuclear disequilibrium, i.e. a significant association between mtDNA lineage and genotype cluster [30–32]. However, concordance between mitochondrial and nuclear patterns is not always observed among recently-diverged lineages [33, 34]. Mitonuclear discordance may be caused by incomplete lineage sorting of ancestral polymorphism, or recent gene flow or introgression between previously isolated lineages, e.g. secondary contact following allopatric divergence (reviewed in [35]). To evaluate these two competing hypotheses, we employ coalescent analysis under an isolation-with-migration (IM) model to jointly estimate mitochondrial gene flow and the timing of lineage divergence. In addition, geometric morphometric analysis enables us to characterize shape variation and examine the potential association between morphological variation and genetic differentiation.
Using an integrative approach combining genetic data from mitochondrial and nuclear markers with morphometric analysis, we demonstrate that S. fuscescens specimens from locations where three mitochondrial lineages occur in sympatry constitute two cryptic species. We also report mitonuclear discordance likely due to introgression, which cautions against the use of mitochondrial markers alone for species identification in the mottled rabbitfish species complex.
Materials and methods
Sample collection
Mottled rabbitfish Siganus fuscescens specimens were collected from eight locations in the South China Sea, the Philippine Sea, and the Sulu Sea. Specimens from the Philippines were purchased from local fishermen who collected the samples within a 2–5 km radius from the collecting locality. These locations were Bolinao, Pangasinan (BOL); Coron, Palawan (CRN); Currimao, Ilocos Norte (CUR); Brgy. Sabang, Masinloc, Zambales (MAS); Morong, Bataan (MOR); Patnanungan, Polillo island, Quezon (PAT); and San Fernando, La Union (SNF) (Fig 1). Specimens from Hong Kong (sample HKG; Fig 1) were purchased from fishermen in the Sai Kung district. A small piece of muscle tissue was biopsied and preserved in 95% ethanol. DNA was extracted from tissue samples using a Chelex-proteinase K protocol [36]. A map showing the locations of the eight sampled populations is shown (Fig 1).
Circle size is proportional to sample size. (A) Genotype clusters; (B) MtDNA clades. Background map of the northern Philippines and surrounding regions was downloaded from Digital Vector Maps, San Diego (http://digital-vector-maps.com/). Abbreviations for sampling locations: BOL, Bolinao; CRN, Coron; CUR, Currimao; HKG, Hong Kong; MAS, Masinloc; MOR, Morong; PAT, Patnanungan; SNF, San Fernando. (A) Genotype clusters; (B) MtDNA clades.
Ethics statement
Siganus fuscescens is a commercially-harvested species in the Philippines and Hong Kong, and is not included in any IUCN or CITES list. All collections were performed by commercial fishers in public domains, and no permission was required for access.
Microsatellite genotyping
Individuals were genotyped at 12 microsatellite markers previously developed for S. fuscescens: Sfus6, Sfus8, Sfus9, Sfus21, Sfus22, Sfus56, Sfus76, Sfus95, Sfus97, Sfus98, Sfus113, and Sfus167 [24]. Microsatellite DNA fragments were PCR-amplified in four separate multiplex reactions. Each reaction consisted of 1x PCR Buffer, 2.0 mM MgCl2, 100 ng/μL bovine serum albumin, 200 μM of each dTNP, 0.25 units Taq polymerase (New England Biolabs, Ipswich MA), 0.25 μM each primer, and 1 μL template DNA in a final reaction volume of 10 μL. Forward primers were 5’-modified with 6-FAMTM, NEDTM, PETTM, or VIC® fluorescent dye. Amplification conditions consisted of 94°C for 90 s, 38 cycles of 94°C for 30 s, locus-specific annealing temperatures for 30 s, 72°C for 60 s and a final extension at 72°C for 25 minutes. Annealing temperatures, fluorescent label-primer combinations, and pooling of amplified microsatellite loci for fragment analysis have been described elsewhere [24]. Amplified fragments were combined with an internal size standard, GeneScanTM 500 LIZTM, Hi-DiTM formamide (Applied Biosystems Inc., Carlsbad CA) and deionized water to a final volume of 10 μL. Fragments were run on an ABI 3730xl DNA Analyzer (Cornell University Life Sciences Core Laboratories Center, Ithaca NY). Allele calling was done in Geneious R6 (https://www.geneious.com) [37] using the Microsatellite plug-in. MICROCHECKER (http://www.microchecker.hull.ac.uk) [38] was used to check the microsatellite data for genotyping errors such as large allele drop-out, and to detect the possible presence of null alleles.
Genotype cluster identification
Two approaches were used to identify genotype clusters from the microsatellite data. Correspondence analysis (CA), as implemented in the software GENETIX v. 4.05 (http://www.genetix.univ-montp2.fr/genetix/genetix.htm) [39] was run to visualize the distribution of individual microsatellite genotypes in multidimensional space, and identify clusters of individuals with similar genotypes. We also did Bayesian structure analysis using STRUCTURE v.2.3.4 (http://pritchardlab.stanford.edu/structure.html) [40], to assign individuals to clusters which minimize departures from Hardy-Weinberg equilibrium and linkage disequilibrium. Bayesian structure analysis also allows the estimation of the individual ancestry coefficient q, which is the proportion of an individual genotype having ancestry from each of K defined clusters. Ten replicate MCMC simulations were run for each value of K (K = 2 to 8). Each run comprised one million iterations, with an initial burn-in of 100,000 steps. An admixture model with correlated allele frequencies was chosen, using sampling location as a prior [41]. The most likely number of clusters was inferred based on the ΔK method [42], as implemented in the Structure Harvester tool (http://taylor0.biology.ucla.edu/structureHarvester/) [43]. Ancestry coefficients for each K value were averaged across replicate runs using CLUMPP [44]. To assign an individual to a cluster, we initially used a threshold value of q > 0.9 [45], while individuals having q values > 0.10 and < 0.9 were categorized as admixed. Results were plotted using DISTRUCT v. 1.1 [46].
To complement STRUCTURE results in identifying admixed individuals, we ran additional analyses using the program NEWHYBRIDS v. 1.1 [47]. NEWHYBRIDS assumes that the sample is drawn from a mixture of parental/purebred and admixed individuals, and estimates the posterior probability (q) that an individual belongs to one of six genotypic classes. In the present case, two parental classes corresponding to a scenario of K = 2 clusters (P1, P2), and 4 hybrid categories (F1, F2, F1 x P1 and F1 x P2) were initially assumed. Ten independent runs of 300,000 sweeps following a burn-in of 100,000 sweeps were done, with convergence assessed by visual examination of P(z) values from each run. Average NEWHYBRIDS q values were calculated from independent runs, and a probability threshold of q > 0.50 was used to assign individuals into classes.
Mitochondrial lineage identification
The present paper focuses on cryptic speciation within S. fuscescens based on both mitochondrial and nuclear data. It includes mitochondrial control region (CR) sequences and subsequent lineage identification for 143 individuals derived from earlier work which examined the phylogeography and genetic connectivity of Siganus fuscescens across the Philippine archipelago [19]. To ascertain lineage membership for additional samples without DNA sequencing, a multiplex PCR approach was developed to selectively amplify clade-diagnostic CR fragments. Representative mtDNA control region sequences for mottled rabbitfish species complex Clade A, Clade B, and Clade C (S1 Table), were aligned and analysed using Geneious R6 [37] to identify clade-specific variable sites. Two clade-specific primers were designed using PRIMER3 [48]: a Clade C-specific forward primer (SigCR-141F,5’-TCAAAATAGCTTGGATTAAATACTGG-3’), and an S. fuscescens Clade B-specific reverse primer (SigCR-398R, 5’-CGATGAAAGATAGGGGAGTAA-3’). These were combined with primers 14F and 14R which specifically amplify the mt CR in S. fuscescens [49], in a multiplex PCR to amplify specific fragment sizes diagnostic for mtDNA lineages. PCR amplification was performed in 10 μL reaction volume comprising 1x PCR Buffer, 2 mM MgCl2, 200 μM of each dNTP, 0.25 units Taq polymerase (Invitrogen), 0.3 μM of each primer, and 1 μL template DNA. Cycling conditions were 94°C for 3 min, 35 cycles of denaturation at 94°C for 30 s, annealing at 65°C for 1 min, and elongation at 72°C for 2 min with a final extension at 72°C for 5 min. Amplified fragments were resolved on 2% TBE-agarose gels and detected following ethidium bromide staining, with fragment profiles diagnostic for mtDNA lineage (S1 Fig).
To test the null hypothesis of random association between S. fuscescens mitochondrial clades and the nuclear genotype clusters, we used the CNDm program [50]. This program estimates the normalized cytonuclear disequilibrium (D*), a measure of cytonuclear association, based on observed counts of mtDNA clade-microsatellite cluster combinations.
Genetic diversity and differentiation
Microsatellite genotype and allele frequencies were used to obtain estimates of genetic diversity and differentiation for groups of individuals. To estimate diversity, we calculated the mean number of alleles, observed and expected heterozygosity (HO, HE) and allele frequencies using GENEPOP [51]. Allelic richness standardized for minimum sample size was calculated using the package hierfstat v. 0.04–22 for R [52]. Exact tests of deviations from Hardy-Weinberg equilibrium, the inbreeding coefficient (FIS), and exact tests of linkage disequilibrium between microsatellite loci were assessed using GENEPOP. To estimate levels of genetic differentiation we did an analysis of molecular variance (AMOVA) as implemented in ARLEQUIN v. 3.5 [53], to calculate hierarchical fixation indices, FST for different hypothesized groupings of individuals. We used G"ST calculated using the GENALEX software package [54] as an unbiased estimator of F'ST, i.e. the value of FST standardized by the maximum obtainable value accounting for within-population heterozygosity [55, 56].
To visualize spatial patterns of genetic differentiation based on nuclear and mitochondrial markers, we used GENALEX for multidimensional clustering of populations using principal coordinates analysis (PCoA). We estimated population-pairwise FST and ΦST values calculated from microsatellite genotypes and mtDNA CR sequences, respectively. We tested for correlations between matrices of genetic distance (FST or ΦST) and geographic distance using a Mantel test as implemented in the package IBDWS v. 3.2.3 (http://ibdws.sdsu.edu/~ibdws/) [57], with 1000 random permutations to test for significance.
Estimating gene flow: Isolation-with-migration model
The Isolation-with-Migration model (IM) [58] was used to examine the relative contribution of competing demographic scenarios that explain the sharing of mitochondrial haplotypes between genotype clusters: (1) divergence under complete isolation (incomplete lineage sorting) and (2) divergence with gene flow (hybridization and introgression), by estimating the posterior probability of migration since the time of population splitting [59, 60]. We used the Bayesian Markov-Chain Monte Carlo method implemented in the program IMa2p [61], a parallel implementation of the IMa2 program [59], to jointly estimate posterior distributions of six demographic parameters, scaled by the mutation rate μ: population sizes for ancestral (q2) and descendant (q0, q1) populations, bidirectional migration rates (m1>2 and m2>1), where m is the migration rate per generation into the specified population, and divergence time (t) for pairs of populations. The IM model assumes that the populations under study are panmictic, markers are selectively neutral, and free from recombination. We tested the selective neutrality of nucleotide variation in CR sequences using Tajima's D [62], and Fu's Fs [63], estimated using ARLEQUIN. Tests for recombination were done on the CR sequence alignment using the RDP method [64] as implemented in the RDP3 package [65], and a pairwise homoplasy index (PHI) test [66]. None of these methods detected evidence of recombination in the control region sequence alignment. While IMa2p can analyse multiple populations, a large number of loci are required for better estimation, and require a well resolved, rooted phylogeny of populations as input [67]. Consequently, we performed multiple population pairwise analyses instead, with mtDNA data analysed as three separate pairwise comparisons in which individuals were assigned to three genetic groups according to the microsatellite data (see results). Preliminary runs with broad parameter intervals were performed to set the upper bounds for q, t and m priors for each pairwise analysis. Following identification of appropriate values for priors, analyses were run until stationarity was reached before genealogy sampling commenced. Three independent MCMC runs of 100 million generations (100,000 genealogies) were done for each pairwise analysis under the HKY model of evolution. Sampled genealogies were combined in an L-mode analysis to estimate demographic parameters. The mutation-scaled IMa2p parameter estimates were converted to demographic quantities: divergence time (t), effective population size (N), and population migration rates (2Nm). We used two mutation rates, 2% and 5% per million years, to cover the range of mutation rates approximated for the control region in bony fishes [68]. The mutation rates were converted to per locus rates by multiplying the fragment length in base pairs for demographic conversions [69]. Demographic quantities were estimated following the equations given in the IMa documentation (https://bio.cst.temple.edu/~hey/software/software.htm). To convert divergence time from generations to years, we estimated generation time following the formula T = (α + ω)/2 where α is the age at first reproduction and ω is the age at last reproduction [70]. Age at first reproduction for S. fuscescens is 5 to 7 months, or roughly 0.5 year [71]. Maximum age was calculated to be 5 years, using an age-at length model for S. fuscescens derived by Bellefleur [72], for a maximum reported length of 25 cm [71]. Average generation time was thus estimated to be 2.75 years.
Morphological differentiation and geometric morphometric analysis
Morphological identification of the specimens as S. fuscescens followed Woodland [14]. The left side of each fish was photographed using a mounted digital camera. The dorsal, pelvic, anal and caudal fins of each fish were spread out and pinned in place prior to photo documentation. Landmarks were chosen in accordance with the guidelines of Zelditch et al. [73]. A total of 22 landmarks were selected for their capacity to represent overall body shape (S2 Fig). The Cartesian coordinates of each landmark were digitized using tpsDig2 v. 2.05 (http://life.bio.sunysb.edu/morph/) [74]. Geometric morphometric analyses were subsequently conducted using the PAleontological STatistics (PAST) software v.1.72 [75]. To eliminate the confounding effect of size differences, the digitized raw landmarks were transformed using the generalized least squares Procrustes superimposition [76, 77], by initially scaling landmark configurations of all specimens to unit size, then aligning all landmarks as closely as possible. Size-independent landmark variables were then subjected to several ordination methods. Multivariate analyses were run using two approaches to simplify descriptions of morphological differences among groups of individuals. Principal component analysis [78] was conducted without a priori information from genetic data. Canonical variates analyses (CVA), were run based on a priori information separately for (1) mitochondrial clade; and (2) nuclear cluster assignments. Thin-plate spline (TPS) deformation grids were used to visualize the differences between landmarks over the entire body shape, based on the interpolation function of landmark differences, taking all displacements of all landmarks relative to all others into account [73, 79]. Multivariate analysis of variance (MANOVA) and pairwise comparisons were done to test the null hypothesis of no morphological difference between groups.
Results
Multi-locus microsatellite genotypes, mtDNA lineage assignment based on control region sequences (direct sequencing or multiplex PCR methods), and morphometric measurements were obtained for a total of 343 individuals identified as Siganus fuscescens from their color patterns.
Microsatellite genotype clusters
All 12 microsatellite loci analyzed were polymorphic. No identical multilocus genotypes were observed. Further, there was no evidence for large allele dropout following analysis of the genotype data using MICROCHECKER.
Correspondence analysis (CA) of genotype data from 12 microsatellite loci clustered the 343 individuals into three distinct groups (Fig 2). Factorial axis 1 accounted for 54.9% of the total variation and differentiated a genotype cluster exclusive to the Philippines (hereafter Cluster 1) from a second cluster sampled on both sides of the South China Sea (Cluster 2). Factorial axis 2, which accounted for 20.8% of the total variation, separated individuals according to geographical location. Cluster 2 was split into two sub-clusters, one consisting of samples from the Philippines exclusively (a sub-cluster designated Cluster 2A), distinct from the other one, consisting of individuals collected from Hong Kong exclusively (designated Cluster 2B).
Two-dimensional plot of correspondence analysis of individual genotypes at 12 microsatellite loci. Each point represents an individual (total N = 343). Colors indicate mtDNA control region lineages: Clade A (black), Clade B (white), Clade C (grey). Symbols indicate sampling region: Philippines (triangles) and Hong Kong (circles). Putative hybrids as revealed by NEWHYBRIDS analysis are marked (‘h’).
STRUCTURE results revealed the most likely number of clusters to be K = 2 based on the ΔK method. However, the highest posterior probability of the data was observed when K = 3, with individuals from Hong Kong exhibiting q values > 0.2 and < 0.8 (Fig 3) indicating admixture between two clusters. While the ΔK method detects the uppermost level of population structure, i.e. the lowest K value [42], it may underestimate the true number of clusters particularly when genetic differentiation is weak or complex, as in the case of isolation-by-distance [80]. To examine the possibility of further substructure, we performed additional STRUCTURE runs separately for Cluster 1 and Cluster 2 individuals. No further sub-structure was detected among Cluster 1 individuals. Further sub-structuring was detected among Cluster 2 individuals, with the HKG samples clearly emerging as a cluster distinct from its Philippine homologue (q > 0.9), consistent with the clustering uncovered using CA.
Each individual is represented by a vertical bar where the apportion of ancestry is represented by segments of different colors corresponding to different mitochondrial lineages and nuclear genotype clusters. Individuals with mixed ancestry are indicated by different colors corresponding to the inferred proportion of ancestry in a particular group. (a) MtDNA lineage membership; (b) Assignment to nuclear genotype cluster based on correspondence analysis; (c) Individual ancestry coefficient (q) inferred from STRUCTURE analysis when K = 2; (d) Individual ancestry coefficient (q) inferred from STRUCTURE analysis when K = 3. Putative hybrids are marked (‘h’).
Under a scenario of K = 2 genotype clusters, STRUCTURE results indicate admixture (0.1 < q < 0.9) and potential hybrid ancestry for 13 individuals. The majority (N = 10) of the putative hybrids were in the HKG sample, with 1 and 2 putative hybrids in SNF and MOR, respectively. The NEWHYBRIDS algorithm however identified only 7 of the 13 individuals having mixed ancestry in STRUCTURE as putative hybrids. All 7 individuals (marked in Fig 3) were categorized as F2 hybrids: 4 from sample HKG, 1 from sample SNF, and 2 from sample MOR. The 5 HKG individuals identified as putative hybrids by STRUCTURE, but not by NEWHYBRIDS, were categorized as parental genotypes by NEWHYBRIDS.
Mitochondrial lineage identification and distribution
For 143 individuals, mtDNA control region sequences (792 bp) were obtained from a previous study (S1 Table) [19]. The multiplex PCR approach was used for lineage identification of an additional 200 individuals. The majority of the haplotypes were identified as Clade A (77.8%), while Clade B constituted 20.7% of the total sample of haplotypes. Five individuals (1.5% of total) were characterized by Clade C haplotypes, previously reported to be S. canaliculatus [19]. Clade A and Clade B were sympatric in 6 of the 8 locations sampled, i.e. all locations except BOL and CRN (Table 1; Fig 1B). Clade C was sampled along with both Clades A and B from3 locations: HKG, MAS, MOR (Table 1; Fig 1B).
Discordance between mtDNA haplotype and microsatellite genotype
Cluster 1 of CA consisted almost entirely of individuals with Clade A haplotypes, with the exception of one individual possessing a Clade B haplotype, out of 197 individuals in the cluster. Clusters 2A and 2B combined consisted of a mix of individuals with different mtDNA lineages: 69 individuals with Clade A, 67 with Clade B, and 5 with Clade C. Factorial axis 2 separated individuals according to geographical location. The five individuals exhibiting Clade-C haplotypes were all in Cluster 2, and occurred in both sub-clusters 2A and 2B.
A large proportion of the total variance in microsatellite genotypes was explained by the partition of the total sample into two major genotype clusters, Cluster 1 and Cluster 2. This was evident from two clustering approaches (CA and STRUCTURE) where assignment of individuals to either cluster was consistent (Fig 3) and appeared to be broadly concordant with mtDNA variation.
Results from the test of cytonuclear equilibrium indicated a significantly positive association between mitochondrial lineage and genotype cluster. Clade-A haplotypes were generally associated with Cluster 1 (D* = 0.976, P < 0.0001), and Clade-B haplotypes were generally associated with Cluster 2 (D* = 0.953, P < 0.0001) (Table 2). These mitonuclear combinations accounted for 79.5% of the total sample size (individuals with mixed ancestry and individuals harboring mitochondrial Clade C being excluded from the total). The majority of the non-concordant ('mismatch') mitonuclear combinations were Clade A x Cluster 2. This concerned 67 out of 68 individuals (98%) (Table 3), all scored in locations where mitochondrial clades A and B were sympatric. The proportion of mismatch per location varied widely, but it was notably high in sample HKG (Table 3).
Genotype clusters: Genetic diversity and differentiation
A wide range of allelic diversity was scored at the 12 microsatellite loci, from 4 to 40 alleles per locus (Table 4 and S2 Table). Significant deviations from Hardy-Weinberg equilibrium were detected at 42 of 96 population-locus tests (43.75%, S2 Table) and 7 loci overall, all with significantly positive FIS values suggesting heterozygote deficit (S2 Table). Exact tests of linkage disequilibrium also revealed significant association for 19 locus pairs out of a total of 66 tests (66.3%). Since most samples consisted of a mixture of individuals from two genetically distinct clusters, the significant FIS and linkage disequilibrium are explained, at least partly, by the Wahlund effect.
Grouping individuals into three genotype clusters, significant departures from Hardy-Weinberg equilibrium expectations were observed for 11 out of a total of 36 tests (30.56%), and overall at 3, 6 and 4 loci in Cluster 1, Cluster 2A, and Cluster 2B, respectively, with significantly positive FIS values in all cases (Table 4). No significant association between pairs of loci was observed. Allelic richness was generally higher for Cluster 1 (10.7), compared to Cluster 2A and Cluster 2B (9.2 and 9.6, respectively). Relatively large differences in gene diversity at loci: Sfus 22, Sfus 56, Sfus97, Sfus167 were observed among the three clusters (Table 4).
Substantial genetic difference was found between Cluster 1 and Cluster 2 (overall FST = 0.241, P < 0.001; G"st = 0.749), with all population-pairwise comparisons between the two clusters reaching significant FST values (S3 Table). All 12 loci exhibited significant differentiation between clusters, with the higher levels of differentiation observed at loci Sfus56, Sfus167, Sfus22, and Sfus97 (Table 5). Nine and 11 loci exhibited private alleles for Cluster 1 and Cluster 2, respectively. The proportion of private alleles was high in 10 out of 12 loci (44% - 87% of the total number of alleles). However, there was no fixation for alternative alleles at any locus.
Genotype clusters and mitochondrial lineages exhibited contrasting patterns of genetic differentiation. Allele frequencies were not significantly different among Cluster 1 samples from five locations: BOL, CRN, MAS, MOR, and SNF, (FST = 0.002; P > 0.05). This apparent lack of genetic structure contrasts with the genetic differentiation observed for Clade A haplotype samples from BOL, CRN and MOR (ΦST = 0.090; P < 0.0001), with MOR significantly different from BOL and CRN (S3B Fig). Meanwhile, Cluster 2 samples from five sites (CUR, HKG, MAS, MOR, and PAT) exhibited genetic differentiation (FST = 0.047, P < 0.0001) and grouped into three populations concordant with geographical location: western Philippine population (including samples from CUR, MAS, MOR); the Philippine Sea population (represented by sample PAT) and the Hong Kong population (HKG) (S3A Fig). However, Clade B haplotype samples from four of these sites (CUR, HKG, MOR, PAT) did not exhibit significant differentiation in haplotype distribution (ΦST = 0.042; P = 0.123; S3B Fig). Furthermore, for Cluster 2 samples (and only these), the standardized genetic differentiation [FST/(1-FST)] was apparently correlated with geographical distance (Mantel test: Z = 3544.3, r = 0.936, P = 0.016) (S4 Fig), suggesting possible isolation by distance.
Gene flow estimates
The IMa2p analyses of mtDNA sequences suggested recent divergence of the three genotype clusters, with asymmetrical migration between clusters. Based on mutation rates of 2% and 5% substitution per million years for the control region, the divergence between pairs Cluster 1/Cluster 2A and Cluster 1/Cluster 2B was estimated to have occurred approximately 111,000–298,000 years before present (BP) (Table 6), with the 95% highest posterior density (HPD) between 33,000–820,000 years BP. The divergence between Cluster 2A and Cluster 2B was more recent, between 53,000 to 133,000 years BP (95% HPD = 21,000–290,000 years BP). All posterior probability plots for divergence time showed well-defined unimodal peaks (Fig 4), although the distribution for Cluster 1/Cluster 2A was relatively broader.
Probability densities were estimated under the isolation-with-migration model (IMa2; [63]). (a) Estimated divergence times; (b, c, d). Pairwise estimates of population migration rate (2Nm) between nuclear clusters.
Asymmetrical mitochondrial gene flow occurred during divergence of the two major genotype clusters, Cluster 1 and Cluster 2. The population migration rate (2Nm) was significantly non-zero from Cluster 1 to Cluster 2A (2Nm = 10.4; LLR P < 0.05; Table 6), where the estimated probability of zero migration was also zero (Fig 4). However, migration in the opposite direction (Cluster 2A to Cluster 1) was not significantly higher than zero (2Nm = 1.47; LLR: P > 0.05; Table 6, Fig 4). Bidirectional migration rates between Cluster 1 and Cluster 2B were likewise asymmetrical, with gene flow from Cluster 1 to Cluster 2B (2Nm = 9.44, LLR: P < 0.05; Table 6) higher than gene flow in the opposite direction (2Nm = 1.65; LLR: P > 0.05). Similar patterns of asymmetrical gene flow occurred between Cluster 2A and Cluster 2B, with migration rate into Cluster 2B significantly non-zero (2Nm = 11.21; Table 6), and higher than migration into Cluster 2A (Fig 4).
Geometric morphometric analysis
The first 3 principal components (PC) accounted for 69.5% of the total variation in shape among S. fuscescens specimens. PC1 represents 36.21% of total variance, and may be considered to be the major shape axis, while PC2 represents 23.53% of total variance. Ordination of PC1 and PC2 showed large overlaps between groups for microsatellite genotypes (Fig 5A) and mtDNA clades. This was due to a combination of highly negative loadings of landmarks on the dorsal fin (landmarks 6–13) and highly positive loadings of landmarks on the anal fin (landmarks 17–19), with moderate loading contribution from landmarks on the snout (landmark 1–5) and the pectoral–pelvic area (landmarks 20–22).
Each point represents an individual. Circle color indicates nuclear genotype cluster or mtDNA lineage. Scatter plot of individual scores based on analysis of 22 landmarks: (A) Principal plan (PC1 x PC2) of principal components analysis where individuals are distinguished according to nuclear genotype cluster; (B) Principal plan (CV1 x CV2) of canonical variates analysis where individuals are distinguished according to mtDNA lineage; (C) Principal plan of canonical variates analysis where individuals are distinguished according to nuclear genotype clusters; (D) Thin-plate spline deformation grid for the average Cluster 1 individual relative to the consensus from the total sample; pixel color ranges show compression (dark blue) to expansion (red); (E) Thin-plate spline deformation grid for the average Cluster 2A individual; (F) Thin-plate spline deformation grid for the average Cluster 2B individual.
Canonical variates analysis (CVA) based on nuclear genotype cluster membership clustered individuals into generally non-overlapping morphospaces (Fig 5C). Meanwhile, CVA based on mtDNA lineage showed broad overlaps between Clade A and Clade B morphospaces, and Clade C individuals grouped with either Clade A or Clade B individuals (Fig 5B). Multiple Analysis of Variance (MANOVA) revealed significant differences in body shape among the 22 landmarks for both the microsatellite clusters and mitochondrial clades.
Pairwise comparisons of mitochondrial clades A, B and C, revealed significant difference between Clades A and B only (Wilk’s lambda = 0.523; Pillai trace = 0.537; P < 0.0001). Meanwhile, all pairwise comparisons between microsatellite Clusters 1, 2A, and 2B were significant (Wilk’s lambda = 0.114; Pillai trace = 1.320; P < 0.0001).
Thin-plate spline transformation grids indicate that Cluster 1 individuals were characterized by an expansion of the snout region (landmarks 1–5) and a slight contraction of the overall body area, particularly the dorsal, pectoral-pelvic and caudal areas (landmarks 6–16, 21–22), relative to the average shape (Fig 5D). Cluster 2A individuals were characterized by a contraction of the snout area (landmarks 1–5) and an overall expansion of the body and the caudal region (Fig 5E). Cluster 2B individuals were characterized by an expansion of the snout area and a contraction of the pelvic-anal area (Fig 5F).
Discussion
This study revealed some incongruence in the patterns of genetic variation and population structure between mtDNA and microsatellite markers, both highlighting the complex evolutionary and demographic history of the S. fuscescens species complex and providing novel insights into it.
Cryptic speciation in S. fuscescens: Limited gene flow and associated morphological differences between sympatric genotypic clusters
Siganus fuscescens individuals grouped into two main nuclear genotype clusters, with limited gene flow between them. The sympatric occurrence of differentiated nuclear genotypic clusters indicates reproductive isolation, hence cryptic species. Grouping individuals into two clusters accounted for a large proportion of the total genetic variation, and individual assignment to genotype cluster was robust and concordant between two different clustering approaches, i.e. correspondence analysis (CA), and Bayesian assignment. The signal of three genotype clusters recovered through all three approaches was due to the genetic partition of Cluster 2 into two sub-clusters. One of these two sub-clusters (Cluster 2A) was exclusive to the Philippines, and the other one (Cluster 2B) exclusive to Hong Kong. This sub-clustering was consistent with an isolation-by-distance scenario of genetic differentiation. Therefore, there is so far no reason to suspect additional cryptic speciation within Cluster 2.
Limited gene flow between genotype clusters is indicated by significant genetic differentiation between Cluster 1 and Cluster 2 at all 12 microsatellite marker loci. Microsatellite- or intron-based estimates of genetic differentiation of FST (or equivalent) > 0.200, as is the case here between S. fuscescens Clusters 1 and 2, may well indicate separate species (see [83–85]). For example, microsatellite-based estimates of genetic differentiation were as low as FST = 0.029 between sibling species of the humbug damselfish, Dascyllus abudafur and D. aruanus [86, 87], FST = 0.041 between cryptic species of the European anchovy Engraulis encrasicolus [88] and FST = 0.060 between reproductively isolated species within the grey mullet Mugil cephalus [89]. In S. fuscescens, the low proportion of putative F2 hybrids (< 4% of the total number of samples, or ~ 2% following a more stringent criterion for the identification of hybrids), provides further support to limited contemporary interbreeding between genotype clusters. This result is unlikely to be due to a low power to detect hybrids, i.e. type II error identifying true hybrids as parental genotypes. Both the STRUCTURE and NEWHYBRIDS approaches are expected to be highly efficient, with > 95% power to detect an individual's true status given the combination of the number of loci used in the present study and the observed level of differentiation as explained in the following. Simulations indicate that 12 polymorphic microsatellite loci is an adequate number of markers to detect hybrid status for a lower threshold FST = 0.210 between species [45]. Meanwhile, the discrepancy in the number of putative hybrids identified by STRUCTURE and NEWHYBRIDS (13 and 7 individuals, respectively) can be attributed to a decreased efficiency of hybrid identification when the proportion of hybrids is low, known to be more pronounced for NEWHYBRIDS than STRUCTURE [45], or to the potential confounding effect of population structure in the identification of hybrids. Lastly, the detection of hybrids is more challenging when divergence is based primarily on allele frequency differences rather than fixed alleles [90].
Microsatellite genotypes of S. fuscescens showed a bimodal distribution pattern, characterized by a dominance of parental genotypes with few intermediates or hybrids [91]. The absence of F1 hybrids and the small proportion of hybrids identified as F2 support the hypothesis of limited contemporary interbreeding between clusters. In hybrid zones, such a bimodal distribution of nuclear genotypes has been associated with strong prezygotic barriers (reviewed in [92]). In S. fuscescens where reproduction is via external fertilization in spawning aggregations [93, 94], temporal variation in spawning period among genotype clusters may represent a prezygotic reproductive barrier, although this hypothesis remains to be tested. While mate choice in spawning aggregations is unlikely, assortative mating may be caused by demographic effects, i.e. differences in the relative abundance and effective population size of the two genotype clusters in the contact zone. Of the 8 locations sampled, six locations predominantly harbored one of the two clusters (96% - 100% of the size of a sample; Table 3). Only in two locations (MAS, MOR), were both Clusters 1 and 2 well-represented.
Geometric morphometric analysis revealed subtle, yet significant variation in body shape consistent with nuclear clusters, providing further support to the recognition of two cryptic species within S. fuscescens from the northern Philippines and Hong Kong. Concordance of morphological variation with nuclear genotypes, and not mitochondrial lineages, has been observed for recently-diverged groups with potentially high levels of hybridization and mitochondrial introgression [95–97]. Moreover, shape variation may provide initial clues to ecomorphological adaptation allowing for resource partitioning and coexistence of the cryptic S. fuscescens species in coastal and seagrass habitats. In particular, variation in snout length and body depth, has been correlated with diet differences [98] and foraging in simple versus complex microhabitats [99]. Laboratory experiments [100] as well as in situ observations in macroalgal beds [101] have shown selective feeding in S. fuscescens. Further studies on microhabitats and diet preferences in S. fuscescens are warranted. Body shape differences may be used to test the hypothesis of potential microhabitat or diet differences between the two cryptic species here uncovered within S. fuscescens.
Microsatellite markers provide no support to the hypothesis of reproductive isolation of mitochondrial lineage C
Siganus fuscescens collected from the northern part of the species' distributional range exhibited mitochondrial phylogenetic diversity with three mtDNA lineages co-occurring: Clade A, Clade B, Clade C [19, 21, 23, 26]. Despite substantial sequence divergence separating Clade C from Clades A and B (3.3% for the mitochondrial CR and 6.2% for the cytb marker), with divergence times dating back to the late Pliocene-early Pleistocene (1.8–5.2 MYa) [19, 26], no matching genetic differences were shown by microsatellite markers. While Cluster 1 and Cluster 2 were associated with, respectively, Clade A and Clade B, Clade-C individuals did not flag a distinct nuclear genotype cluster. Instead, all five Clade-C individuals were found within Cluster 2. The pattern of individuals possessing a Clade-C haplotype thought to be characteristic of S. canaliculatus [19], but with an S. fuscescens nuclear background is consistent across different nuclear marker types, including ITS sequences [20], AFLP [21], and microsatellites (this study). Thus, the present study provided no support to the hypothesis that distinct mtDNA lineage C would represent a reproductively isolated lineage, at least for the S. fuscescens populations from the Philippines and Hong Kong. However, such a conclusion cannot be extended to other locations across the broader Indo-Pacific where distinct mitochondrial lineages also occur in sympatry, pending an analysis of nuclear markers.
Mitonuclear discordance and mitochondrial introgression: Pleistocene vicariance and secondary contact in the South China Sea
Mitonuclear discordance is a common phenomenon in animal systems, particularly for recently-diverged taxa or incipient species (reviewed in [35]). For S. fuscescens, roughly a fifth of all sampled individuals exhibited mitonuclear discordance, largely due to Clade A mtDNA haplotypes occurring in both nuclear Clusters 1 and 2. Geographical differences in the spatial distribution of mtDNA lineages and nuclear clusters tend to dismiss incomplete lineage sorting as the cause for mitonuclear discordance [33, 35]. Similar geographical distributions are observed for associated mtDNA lineages and nuclear clusters, i.e. Clade A- Cluster1, Clade B-Cluster 2, but distributions are markedly different between the two groups. In particular, Cluster 1 is widely distributed across the Philippines (R. Ravago-Gotanco, unpublished data), parallel to the distribution of its associated mitochondrial Clade A, which is widespread across the Philippines [19], the Japanese archipelago, and across the Coral Triangle [21, 23]. Meanwhile, Cluster 2 has been found only in the South China Sea and north Philippine Sea basins, and not in other locations in the Philippine archipelago (R. Ravago-Gotanco, unpublished data). Its associated mitochondrial Clade B exhibits a similarly restricted distribution in the northern periphery of the distributional range of S. fuscescens, in the South China Sea and Philippine Sea basins [19, 23], and regions further north, i.e. the coast of China, the Japanese archipelago, and Taiwan [21, 23, 26], although it was also found in low frequency at two southern locations in the South China Sea and the Indian Ocean [23].
Mitochondrial introgression, and not incomplete lineage sorting, is the likely cause of the observed mitonuclear discordance, as evidenced by asymmetric mitochondrial gene flow between genotype clusters. Introgression can arise from multiple demographic, selective, and behavioural mechanisms. These include neutral drift following population or range expansion [102], asymmetry in the relative abundance of hybridizing taxa [103], sex-biased dispersal [104, 105], asymmetry in mate choice [106], and differential selection [28, 107, 108]. For S. fuscescens where reproduction occurs through external fertilization of gametes released in spawning aggregations [92, 93], mate choice, sex-biased dispersal, and other behavioural mechanisms favouring heterospecific crosses appear unlikely. A more parsimonious explanation for the mitonuclear discordance and differential geographical distribution patterns of mtDNA clades and genotype clusters is a neutral process following secondary contact of lineages diverged in allopatry. Demographic, and presumably spatial expansion of S. fuscescens Clade A and Clade B has been previously suggested [19]. In particular, Clade B is thought to have originated in areas north of the Coral Triangle, when the South China Sea and East China sea were isolated during Pleistocene glacial maxima. Clade B revealed a more pronounced genetic signature of demographic expansion estimated at ~20,000 years BP, consistent with the timing of the last Pleistocene glacial maxima, whereas Clade A underwent a more recent expansion (<10,000 years BP). The resulting demographic, and presumably southward range expansion of Clade B may have resulted in secondary contact with the ancestral Clade A which is widespread in the South China Sea and northern Philippine Sea. Asymmetric mitochondrial gene flow from Cluster 1 to Cluster 2A and from Cluster 1 to Cluster 2B, accounts for mitochondrial introgression of Clade A haplotypes into Cluster 2 individuals, resulting in a preponderance of discordant individuals with a Clade A haplotype and a Cluster 2 nuclear background. Since neutral introgression occurs primarily from the local to the invading species [101], and levels of introgression are higher for mitochondrial genes than nuclear genes [34, 109, 110], the introgressed Clade A-Cluster 2 individuals are consistent with a scenario of introgression of a local southern haplotype (Clade A) into invading Clade B-Cluster 2 individuals from expanding northern populations. The genetic signatures of Pleistocene allopatric divergence, followed by demographic expansion, secondary contact, and introgression have been similarly observed in other marine teleosts across the broader Coral Triangle region. These include the unicornfish species Naso hexacanthus and N. caesius [111], and four species of surgeonfishes Acanthurus spp. [112]. Across the northwest Pacific, two Pleistocene-diverged mitochondrial lineages of ice goby Leucopsarion petersii which have differential geographical distribution and phenotypic differences, exhibited mitochondrial introgression at admixture zones [113]. Extensive mitochondrial introgression was likewise reported for lineages of the three-spined stickleback Gasterosteus aculeatus sampled across the northern Pacific (Japan, Russia, and Alaska), with almost complete mitochondrial replacement of Japanese populations with mtDNA characteristic of Pacific lineages, despite very limited nuclear gene flow (low proportions of F1 hybrids) as estimated from allozyme markers [114]. In the South China Sea, Qiu et al. [115] reported secondary contact resulting in the occurrence of F1 hybrids between two closely-related mitochondrial lineages of the four-eyed sleeper Bostrychus sinensis, with lineages exhibiting a pronounced clinal latitudinal distribution. Extensive mitochondrial introgression concomitant with demographic expansion in the Pleistocene has also been inferred between two sergeant fishes, Abudefduf sexfasciatus and A. vaigiensis lineage A, whose geographic distributions largely overlap in the Coral Triangle [116]. These congruent results from various marine teleosts across the broader Coral Triangle, northwest Pacific and South China Sea suggest that mitochondrial introgression in recently-diverged lineages may be common, despite apparent reproductive isolation supported by limited nuclear gene flow.
Mitochondrial introgression in S. fuscescens does not appear to be recent. The low frequency of hybrids, all of which were identified as F2 and possibly even later-generation (backcrossed) hybrids, indicate that there is limited contemporary gene flow between the two clusters. Further, while putative hybrids were the most centrally located of all the 343 individuals in the correspondence analysis, they were still clearly localized within either cluster of non-overlapping individuals (2 and 5 individuals in Cluster 1 and Cluster 2, respectively). The putative hybrids were not found to be geometrically intermediate between clusters, as would be expected for F1 hybrids. This pattern of relative abundance of backcross hybrids despite the rarity or absence of F1 hybrids has been observed in several taxa [117], and is widely thought to be due to strong pre- or post-zygotic isolating mechanisms precluding the formation of F1 hybrids.
Within the contact zone, the degree of mitonuclear discordance and mitochondrial introgression varied among the 7 South China Sea populations. The HKG population had the highest level of discordance, where Clade A-Cluster 2 individuals represented 90.9% of the population and accounted for almost 30% of all discordant individuals. Four Philippine populations (CUR, MAS, MOR and PAT) exhibited moderate proportions of introgressed individuals (17.9% to 42.5%). Mitonuclear discordance was not observed in the SNF and BOL samples, which may be attributed to the relative rarity of Clade B in these areas (only 2 of 106 total individuals sampled), and the absence of Cluster 2 individuals from these locations. This interesting distribution pattern of mtDNA lineages and genotype clusters might be a consequence of the potentially restricted gene flow or isolation of the Lingayen Gulf (BOL, SNF) from the rest of northwestern Philippine coast due to hydrographic circulation patterns [19]. Habitat availability may also have influenced demographic processes and recruitment patterns leading to the relative dominance of Cluster 1 individuals in these areas [19]. However, we cannot eliminate the possibility of differential selection or adaptive divergence mechanisms accounting for the rarity of Clade-B and Cluster-2 genotypes in the Lingayen Gulf. Further studies are needed to examine these competing hypotheses of neutral versus adaptive processes shaping fine-scale genetic diversity across the western Luzon coast.
Conclusion
This study reports the existence of two cryptic species of S. fuscescens in the northeast South China Sea (Philippines and Hong Kong) and northern Philippine Sea region, supported by the recovery of two nuclear clusters characterized by limited contemporary gene flow and concordant morphological differences. Mitonuclear discordance due to hybridization, introgression, and a complex demographic history obscures phylogenetic relationships for recently-diverged taxa such as the sibling species within the mottled rabbitfish species complex. The lack of correspondence between mitochondrial lineage Clade C and nuclear variation for individuals collected from the South China Sea strongly cautions against the use of mitochondrial markers alone for distinguishing S. canaliculatus from S. fuscescens. Similarly, because of extensive mitochondrial introgression, the mitochondrial marker is mostly helpless for distinguishing the two cryptic species here uncovered within S. fuscescens. Further studies are needed to determine the generality of these observations for other regions across the distributional range of S. canaliculatus and S. fuscescens, to help resolve the taxonomic ambiguities in this species complex. Geometric morphometric analysis proves a promising tool which can be further explored for the identification of the two cryptic species under S. fuscescens. Studies on the reproductive behavior and ecology, e.g. feeding ecology and microhabitat use are warranted to elucidate ecological partitioning and reproductive isolation between the two cryptic species.
Supporting information
S1 Fig. Multiplex-PCR fragment profiles diagnostic for mitochondrial lineages in the mottled rabbitfish species complex.
Diagnostic amplicons from S. fuscescens Clade A (lanes 1–4, ~850 bp), S. fuscescens Clade B (lanes 5–8, ~450 bp and 850 bp), and Clade C (lane9, ~250, 450, 700, and 850 bp) are shown. M, 1 Kb Plus DNA Ladder (Invitrogen). Fragment sizes are indicated in basepairs (bp).
https://doi.org/10.1371/journal.pone.0193220.s001
(TIF)
S2 Fig. Landmarks and measurements for geometric morphometric analysis of external shape in Siganus fuscescens.
https://doi.org/10.1371/journal.pone.0193220.s002
(TIF)
S3 Fig. Ordination plot of principal coordinate analysis of population pairwise genetic distances for Siganus fuscescens microsatellite genotypes and mtDNA sequences.
(A) Scatterplot of population pairwise FST estimates based on microsatellite data. Red circles correspond to Cluster 1 individuals for each population, blue circles to Cluster 2 individuals for each population. (B) Scatterplot of population pairwise ΦST estimates based on mtDNA control region sequence data. Closed circles correspond to Clade A haplotypes for each population, open circles to Clade B haplotypes for each population.
https://doi.org/10.1371/journal.pone.0193220.s003
(TIF)
S4 Fig. Plots of standardised genetic distance versus geographic distance in Siganus fuscescens from the northern Philippines and Hong Kong.
Pairwise genetic distance [Fst/(1-Fst)] was plotted against the Napierian logarithm of geographic distance. SD ship distance (in km), measured using the path tool of Google Earth (http://www.google.com/earth/). Closed circles correspond to comparisons between samples from within the Philippine archipelago. Open circles correspond to comparisons across the South China Sea (i.e. Philippines vs. Hong Kong). Dotted lines indicate linear regression of genetic distance against geographic distance within the Philippine archipelago. There was no evidence of increasing genetic distance with increasing geographic distance at the scale of the Philippine archipelago, except, perhaps, for Cluster2 S. fuscescens. There was no evidence of an effect of isolation by distance at the scale of the northern South China Sea except, perhaps, for mitochondrial Clade B. Population-pairwise genetic distances were calculated separately for (A) Cluster1. (B) Cluster 2. (C) Mitochondrial Clade A. (D) Mitochondrial Clade B.
https://doi.org/10.1371/journal.pone.0193220.s004
(TIF)
S1 Table. Mottled rabbitfish species complex: Genbank accession numbers for the mtDNA control region sequences used in this study for mitochondrial lineage identification and their frequency distribution across sampling locations.
https://doi.org/10.1371/journal.pone.0193220.s005
(PDF)
S2 Table. Genetic diversity estimates of Siganus fuscescens populations and genotype clusters at 12 microsatellite loci.
Estimates are calculated by grouping individuals according to location and genotype cluster. The number of individuals (N), number of alleles (Na), observed heterozygosity (HO), expected heterozygosity (HE), inbreeding coefficient (FIS) and its associated significance value (P) are presented for each sample at each locus, and across all loci. P values significant following a sequential Bonferroni correction are highlighted.
https://doi.org/10.1371/journal.pone.0193220.s006
(PDF)
S3 Table. Matrix of pairwise FST values and associated P values for Siganus fuscescens samples grouped by genotype clusters and sampling location.
Below the diagonal: pairwise FST values. Above the diagonal: P values. Significant P values following the sequential Bonferroni correction are highlighted.
https://doi.org/10.1371/journal.pone.0193220.s007
(PDF)
Acknowledgments
This study was supported by the GEF-Coral Reef Targeted Research Center of Excellence (Philippines) Program, University of the Philippines Office of the Vice Chancellor for Research and Development (PhD IA Grant #121206 to RRG), and the UP Marine Science Institute. We thank C. Lumibao, and C. Balitaan for assistance in the field and laboratory.
References
- 1. Bickford D, Lohman DJ, Sodhi NS, Ng PK, Meier R, Winker K, et al. Cryptic species as a window on diversity and conservation. Trends Ecol Evol. 2007;22:148–155. pmid:17129636
- 2. Pfenninger M, Schwenk K. Cryptic animal species are homogeneously distributed among taxa and biogeographical regions. BMC Evol Biol. 2007;7:121. pmid:17640383
- 3. Rocha LA, Bowen BW. Speciation in coral-reef fishes. J. Fish Biol. 2008;72:1101–1121.
- 4. Hubert N, Meyer CP, Bruggemann HJ, Guérin F, Komeno RJL, Espiau B, et al. Cryptic diversity in Indo-Pacific coral-reef fishes revealed by DNA-barcoding provides new support to the centre-of-overlap hypothesis. PLoS ONE. 2012;7; e28987. pmid:22438862
- 5. Bowen BW, Rocha LA, Toonen RJ, Karl SA. The origins of tropical marine biodiversity. Trends Ecol Evol. 2013;28:359–366. pmid:23453048
- 6. Roberts CM, McClean CJ, Veron JE, Hawkins JP, Allen GR, McAllister DE, et al. Marine biodiversity hotspots and conservation priorities for tropical reefs. Science. 2002;295:1280–1284. pmid:11847338
- 7. Veron JEN, Devantier LM, Turak E, Green AL, Kininmonth S, Stafford-Smith M, et al. Delineating the Coral Triangle. Galaxea. 2009;11:91–100.
- 8. Carpenter KE, Barber PH, Crandall ED, Ablan-Lagman MCA, Ambariyanto , Mahardika GN, et al. Comparative Phylogeography of the Coral Triangle and Implications for Marine Management. J Mar Biol. 2011:1–14.
- 9. Gaither MR, Rocha LA. Origins of species richness in the Indo-Malay-Philippine biodiversity hotspot: evidence for the centre of overlap hypothesis. J Biogeogr. 2013;40:1638–1648.
- 10. Cowman PF, Bellwood DR. The historical biogeography of coral reef fishes: Global patterns of origination and dispersal. J Biogeogr. 2013;40:209–224.
- 11. Tornabene L, Valdez S, Erdmann M, Pezold F. Support for a 'Center of Origin' in the Coral Triangle: Cryptic diversity, recent speciation, and local endemism in a diverse lineage of reef fishes (Gobiidae: Eviota). Mol Phylogenet Evol. 2015;82:200–210. pmid:25300452
- 12. Barber PH. The challenge of understanding the Coral Triangle biodiversity hotspot. J Biogeogr. 2009;36:1845–1846.
- 13. Malay MCMD, Paulay G. Peripatric speciation drives diversification and distributional pattern of reef hermit crabs (Decapoda: Diogenidae: Calcinus). Evolution. 2010;64:634–662. pmid:19796150
- 14. Woodland DJ. Revision of the fish family Siganidae with descriptions of two new species and comments on distribution and biology. Indo-Pac Fishes. 1990;19:1–136.
- 15. Woodland DJ, Anderson RC. Description of a new species of rabbitfish (Perciformes: Siganidae) from southern India, Sri Lanka and the Maldives. Zootaxa. 2014;3811:129–136.
- 16. Randall JE, Kulbicki M. Siganus woodlandi, new species of rabbitfish (Siganidae) from New Caledonia. Cybium. 2005;29:185–189.
- 17.
Adams M, Woodland DJ. Molecular systematics of the rabbitfishes of the Oramin Complex: towards a resolution of the Siganus fuscescens / S. canaliculatus species problem using allozyme electrophoresis (Siganidae: Pisces). In: Keekpai, HE et. al. (eds.). Proc 4th Indo-Pac Fish Conf; 1994. 28 November—4 December 1993; Bangkok, Thailand: Kasetsart University, Bangkok, 1994, pp 373–385.
- 18. Borsa P, Lemer S, Aurelle D. Patterns of lineage diversification in rabbitfishes. Mol Phylogenet Evol. 2007;44:427–435. pmid:17383906
- 19. Ravago-Gotanco RG, Juinio-Meñez MA. Phylogeography of the mottled spinefoot Siganus fuscescens: Pleistocene divergence and limited genetic connectivity across the Philippine archipelago. Mol Ecol. 2010;19:4520–4534. pmid:20875063
- 20. Kuriiwa K, Hanzawa N, Yoshino T, Kimura S, Nishida M. Phylogenetic relationships and natural hybridization in rabbitfishes (Teleostei: Siganidae) inferred from mitochondrial and nuclear DNA analyses. Mol Phylogenet Evol. 2007;45:69–80. pmid:17590356
- 21. Hsu TH, Adiputra YT, Burridge CP, Gwo JC. Two spinefoot colour morphs: Mottled spinefoot Siganus fuscescens and white-spotted spinefoot Siganus canaliculatus are synonyms. J. Fish Biol. 2011;79:1350–1355. pmid:22026611
- 22. Lemer S, Aurelle D, Vigliola L, Durand JD, Borsa P. Cytochrome b barcoding, molecular systematics and geographic differentiation in rabbitfishes (Siganidae). C R Biol. 2007;330:86–94. pmid:17241952
- 23. Iwamoto K, Abdullah MF, Chang CW. Genetic isolation of the mottled spinefoot Siganus fuscescens Ryukyu Archipelago population: Biogeography. 2015;17:61–85.
- 24. Ravago-Gotanco R, Lumibao CY, Pante MJR. Isolation and characterization of thirteen microsatellite markers for the rabbitfish, Siganus fuscescens. Conserv Genet Resour. 2010;2:225–227.
- 25.
Lam YH. Genetic and Morphological Analyses of the Rabbitfish Siganus canaliculatus/Siganus fuscescens species complex in the South China Sea. M.Sc, Thesis, Chinese University of Hong Kong. 2010.
- 26. Yagishita N, Yamaguchi A. Genetic population structure of the mottled spinefoot Siganus fuscescens in Japan. Fish Sci. 2016; 82:289–301.
- 27. Moritz C, Cicero C. DNA barcoding: promise and pitfalls. PLoS Biol. 2004;2:e354. pmid:15486587
- 28. Ballard JW, Whitlock MC. The incomplete natural history of mitochondria. Mol Ecol. 2004;13:729–744. pmid:15012752
- 29. Mallet J. A species definition for the modern synthesis. Trends Ecol Evol. 1995;10:294–299. pmid:21237047
- 30. Zink RM, Barrowclough GF. Mitochondrial DNA under siege in avian phylogeography. Mol Ecol. 2008;17:2107–2121. pmid:18397219
- 31. Schnabel A, Asmussen MA. Definition and properties of disequilibria within nuclear-mitochondrial-chloroplast and other nuclear-dicytoplasmic systems. Genetics. 1989;123:199–215. pmid:2806883
- 32. Asmussen MA, Arnold J, Avise JC. Definition and properties of disequilibrium statistics for associations between nuclear and cytoplasmic genotypes. Genetics. 1987;115:755–768. pmid:3582962
- 33. Funk DJ, Omland KE. Species-level paraphyly and polyphyly: frequency, causes, and consequences, with insights from animal mitochondrial DNA. Annu Rev Ecol Evol Syst. 2003:397–423.
- 34. Chan KM, Levin SA. Leaky prezygotic isolation and porous genomes: rapid introgression of maternally inherited DNA. Evolution. 2005;59:720–729. pmid:15926684
- 35. Toews DPL, Brelsford A. The biogeography of mitochondrial and nuclear discordance in animals. Mol Ecol. 2012;21:3907–3930. pmid:22738314
- 36. Yue GH, Orban L. A simple and affordable method for high-throughput DNA extraction from animal tissues for polymerase chain reaction. Electrophoresis. 2005;26:3081–3083. pmid:16047311
- 37. Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28:1647–1679. pmid:22543367
- 38. van Oosterhout C, Hutchinson WF, Wills DPM, Shipley P. Microchecker: software for identifying and correcting genotyping errors in microsatellite data. Mol Ecol Resour. 2004;4:535–538.
- 39. Belkhir K, Borsa P, Chikhi L, Raufaste N, Bonhomme F. GENETIX 4.05, logiciel sous Windows TM pour la génétique des populations. Laboratoire génome, populations, interactions, CNRS UMR. 1996;5000:1996–2004.
- 40. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155:945–959. pmid:10835412
- 41. Hubisz MJ, Falush D, Stephens M, Pritchard JK. Inferring weak population structure with the assistance of sample group information. Mol Ecol Resour. 2009;9:1322–32. pmid:21564903
- 42. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005;14:2611–2620. pmid:15969739
- 43. Earl DA, von Holdt BM. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour. 2011;4:359–61.
- 44. Jakobsson M, Rosenberg NA. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics. 2007;23:1801–1806. pmid:17485429
- 45. Vaha JP, Primmer CR. Efficiency of model-based Bayesian methods for detecting hybrid individuals under different hybridization scenarios and with different numbers of loci. Mol Ecol. 2006;15:63–72. pmid:16367830
- 46. Rosenberg NA. distruct: a program for the graphical display of population structure. Mol Ecol Resour. 2004;4:137–138.
- 47. Anderson EC, Thompson EA. A model-based method for identifying species hybrids using multilocus genetic data. Genetics. 2002;160:1217–1229. pmid:11901135
- 48. Rozen S, Skaletsky H. Primer3 on the WWW for General Users and for Biologist Programmers. Bioinformatics Methods and Protocols.1999:. 365–386.
- 49. Oh DJ, Kim JY, Lee JA, Yoon WJ, Park SY, Jung YH. Complete mitochondrial genome of the rabbitfish Siganus fuscescens (Perciformes, Siganidae). DNA Seq. 2007;18:295–301. pmid:17541835
- 50. Basten CJ, Asmussen MA. The exact test for cytonuclear disequilibria. Genetics. 1997;146:1165–1171. pmid:9215917
- 51. Rousset F. genepop'007: a complete re-implementation of the genepop software for Windows and Linux. Mol Ecol Resour. 2008;8:103–106. pmid:21585727
- 52. Goudet J. hierfstat, a package for r to compute and test hierarchical F-statistics. Mol Ecol Notes. 2005;5:184–186.
- 53. Excoffier L, Lischer HE. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10:564–567. pmid:21565059
- 54. Peakall R, Smouse PE. GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research—an update. Bioinformatics. 2012;28:2537–2539. pmid:22820204
- 55. Hedrick PW. A standardized genetic differentiation measure. Evolution. 2005;59:1633–1638. pmid:16329237
- 56. Meirmans PG, Hedrick PW. Assessing population structure: FST and related measures. Mol Ecol Resour. 2011;11:5–18. pmid:21429096
- 57. Jensen JL, Bohonak AJ, Kelley ST. Isolation by distance, web service. BMC Genet. 2005;6:13. pmid:15760479
- 58. Nielsen R, Wakeley J. Distinguishing migration from isolation: a Markov chain Monte Carlo approach. Genetics. 2001;158:885–896. pmid:11404349
- 59. Hey J, Nielsen R. Multilocus methods for estimating population sizes, migration rates and divergence time, with applications to the divergence of Drosophila pseudoobscura and D. persimilis. Genetics. 2004;167:747–760. pmid:15238526
- 60. Won YJ, Hey J. Divergence population genetics of chimpanzees. Mol Biol Evol. 2005;22:297–307. pmid:15483319
- 61. Sethuraman A, Hey J. IMa2p—parallel MCMC and inference of ancient demography under the Isolation with migration (IM) model. Mol Ecol Resour. 2016;16:206–215. pmid:26059786
- 62. Tajima F. The effect of change in population size on DNA polymorphism. Genetics. 1989;123:597–601. pmid:2599369
- 63. Fu YX. Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics. 1997;147:915–925. pmid:9335623
- 64. Martin D, Rybicki E. RDP: detection of recombination amongst aligned sequences. Bioinformatics. 2000;16:562–563. pmid:10980155
- 65. Martin DP, Lemey P, Lott M, Moulton V, Posada D, Lefeuvre P. RDP3: a flexible and fast computer program for analyzing recombination. Bioinformatics. 2010;26:2462–2463. pmid:20798170
- 66. Bruen TC, Philippe H, Bryant D. A simple and robust statistical test for detecting the presence of recombination. Genetics. 2006;172:2665–2681. pmid:16489234
- 67. Hey J. Isolation with migration models for more than two populations. Mol Biol Evol. 2010;27:905–920. pmid:19955477
- 68. Bowen BW, Muss A, Rocha LA, Grant WS. Shallow mtDNA coalescence in Atlantic pygmy angelfishes (genus Centropyge) indicates a recent invasion from the Indian Ocean. J Hered. 2006;97:1–12. pmid:16394255
- 69. Hey J, Nielsen R. Integration within the Felsenstein equation for improved Markov chain Monte Carlo methods in population genetics. Proc Natl Acad Sci U S A 2007;104:2785–2790. pmid:17301231
- 70.
Pianka ER. Evolutionary Ecology. Harper and Rowe Publishers, New York. 1978.
- 71.
Del Norte AGC, Pauly D. Virtual population estimates of monthly recruitment and biomass of rabbitfish, Siganus fuscescens off Bolinao, northern Philippines: Second Asian fisheries forum. Asian Fisheries Society, Manila, Philippines. 1990: 851–854.
- 72.
Bellefleur DL. How to design a marine reserve for rabbitfish (Siganus fuscescens). M. Sc. Thesis. Dalhousie University. 1997. Available from http://www.nlc-bnc.ca/obj/s4/f2/dsk3/ftp04/mq24801.pdf.
- 73.
Zelditch M, Swiderski D, Sheets DH, Fink W. Geometric Morphometrics for Biologists: A primer: Elsevier Academic Press: Waltham. 2004.
- 74.
Rohlf FJ. tpsDig, digitize landmarks and outlines, version 2.05. Department of Ecology and Evolution, State University of New York at Stony Brook. 2005.
- 75. Hammer Ø, Harper DAT, Ryan PD. PAST: Paleontological Statistics Software Package for education and data analysis. Palaeontol Electr. 2001;4.
- 76. Rohlf FJ, Slice D. Extensions of the Procrustes method for the optimal superimposition of Landmarks. Syst Biol. 1990;39:40–59.
- 77.
Dryden IL, Mardia KV. Statistical shape analysis: J. Wiley Chichester; 1998.
- 78. Pearson K. On lines and planes of closest fit to systems of points in space. Phil Mag. 1901;2:559–572.
- 79. Slice DE. Geometric Morphometrics. Annu Rev Anthropol. 2007;36:261–281.
- 80. Jombart T, Devillard S, Balloux F. Discriminant analysis of principal components: a new method for the analysis of genetically structured populations. BMC Genet. 2010;11:94. pmid:20950446
- 81. Rice WR. Analyzing tables of statistical tests. Evolution. 1989; 43:223–225. pmid:28568501
- 82.
Nei M. Molecular evolutionary genetics: Columbia University Press; 1987.
- 83. Fauvelot C, Borsa P. Patterns of genetic isolation in a widely distributed pelagic fish, the narrow-barred Spanish mackerel (Scomberomorus commerson). Biol J Linn Soc Lond. 2011;104:886–902.
- 84. Borsa P, Arlyza IS, Laporte M, Berrebi P. Population genetic structure of blue-spotted maskray Neotrygon kuhlii and two other Indo-West Pacific stingray species (Myliobatiformes: Dasyatidae), inferred from size-polymorphic intron markers. J Exp Mar Biol Ecol. 2012;438:32–40.
- 85. Borsa P, Shen K-N, Arlyza IS, Hoareau TB. Multiple cryptic species in the blue-spotted maskray (Myliobatoidei: Dasyatidae: Neotrygon spp.): An update. C R Biol. 2016;339:417–426. pmid:27543138
- 86. Borsa P, Sembiring A, Fauvelot C, Chen W-J. Resurrection of Indian Ocean humbug damselfish, Dascyllus abudafur (Forsskål) from synonymy with its Pacific Ocean sibling, Dascyllus aruanus (L.). C R Biol. 2014;337:709–716. pmid:25433563
- 87. Liu SYV, Chang FT, Borsa P, Chen W-J, Dai C-F. Phylogeography of the humbug damselfish, Dascyllus aruanus (Linnaeus, 1758): evidence of Indo-Pacific vicariance and genetic differentiation of peripheral populations. Biol J Linn Soc Lond. 2014;113:931–942.
- 88. Karahan A, Borsa P, Gucu AC, Kandemir I, Ozkan E, Orek YA, et al. Geometric morphometrics, Fourier analysis of otolith shape, and nuclear-DNA markers distinguish two anchovy species (Engraulis spp.) in the Eastern Mediterranean Sea. Fish Res. 2014;159:45–55.
- 89. Shen KN, Jamandre BW, Hsu CC, Tzeng WN, Durand JD. Plio-Pleistocene sea level and temperature fluctuations in the northwestern Pacific promoted speciation in the globally-distributed flathead mullet Mugil cephalus. BMC Evol Biol. 2011;11. pmid:21450111
- 90. Sanz N, Araguas RM, Fernández R, Vera M, García-Marín J-L. Efficiency of markers and methods for detecting hybrids and introgression in stocked populations. Conserv Genet. 2009;10:225–236.
- 91. Harrison RG, Bogdanowicz SM. Patterns of Variation and Linkage Disequilibrium in a Field Cricket Hybrid Zone. Evolution. 1997;51:493–505. pmid:28565372
- 92. Jiggins CD, Mallet J. Bimodal hybrid zones and speciation. Trends Ecol Evol. 2000;15:250–255. pmid:10802556
- 93. De La Paz RM, Aragones NV. Abundance and seasonality of siganid fishes (Teleostei, Perciformes) in Cape Bolinao, Pangasinan, Philippines. Philippine J Sci. 1990;119:223–235.
- 94.
Rodriguez SF. The Barangen fishing concession in Bolinao: an ethnographic study of a customary marine tenure system. M.Sc. Thesis; University of the Philippines, Diliman. 1997.
- 95. Yaacov DB, Arbel-Thau K, Zilka Y, Ovadia O, Bouskila A, Mishmar D. Mitochondrial DNA Variation, but Not Nuclear DNA, Sharply Divides Morphologically Identical Chameleons along an Ancient Geographic Barrier. PLoS ONE 2012;7:e31372. pmid:22457709
- 96. Sullivan JP, Lavoué S, Arnegard ME, Hopkins CD. AFLPs resolve phylogeny and reveal mitochondrial introgression within a species flock of African electric fish (Mormyroidea: Teleostei). Evolution. 2004;58:825–841. pmid:15154558
- 97. Pagès M, Bazin E, Galan M, Chaval Y, Claude J, Herbreteau V, et al. Cytonuclear discordance among Southeast Asian black rats (Rattus rattus complex). Mol Ecol. 2012;22:1019–1034. pmid:23278980
- 98. Russo T, Pulcini D, O’Leary Á, Cataudella S, Mariani S. Relationship between body shape and trophic niche segregation in two closely related sympatric fishes. J. Fish Biol. 2008;73:809–828.
- 99. Ruehl CB, Shervette V, Dewitt TJ. Replicated shape variation between simple and complex habitats in two estuarine fishes. Biol J Linn Soc Lond. 2011;103:147–158.
- 100. Pillans RD, Franklin CE, Tibbetts IR. Food choice in Siganus fuscescens: Influence of macrophyte nutrient content and availability. J. Fish Biol. 2004;64:297–309.
- 101. Noda M, Ohara H, Urakawa K, Murase N, Yamamoto KI. Diet and prey availability of Siganus fuscescens occurring in a Sargassum bed at Futaoi Island in the Sea of Hibiki with respect to feeding on large brown macroalgae. Nippon Suisan Gakkai Shi. 2011;77:1008–1019.
- 102. Currat M, Ruedi M, Petit RJ, Excoffier L. The hidden side of invasions: massive introgression by local genes. Evolution. 2008;62:1908–1920. pmid:18452573
- 103. Cianchi R, Ungaro A, Marini M, Bullini L. Differential patterns of hybridization and introgression between the swallowtails Papilio machaon and P. hospiton from Sardinia and Corsica islands (Lepidoptera, Papilionidae). Mol Ecol. 2003;12:1461–1471. pmid:12755875
- 104. Crochet PA, Chen JZ, Pons JM, Lebreton JD, Hebert PDN, Bonhomme F. Genetic Differentiation at Nuclear and Mitochondrial Loci among Large White-Headed Gulls: Sex-Biased Interspecific Gene Flow? Evolution. 2003;57:2865–2878. pmid:14761064
- 105. Petit RJ, Bodénès C, Ducousso A, Roussel G, Kremer A. Hybridization as a mechanism of invasion in oaks. New Phytol. 2003;161:151–164.
- 106. Wirtz P. Mother species-father species: unidirectional hybridization in animals with female choice. Anim Behav. 1999;58:1–12. pmid:10413535
- 107. Whitney KD, Randell RA, Rieseberg LH. Adaptive introgression of herbivore resistance traits in the weedy sunflower Helianthus annuus.Am Nat. 2006;167:794–807. pmid:16649157
- 108. Cheviron ZA, Brumfield RT. Migration-selection balance and local adaptation of mitochondrial haplotypes in rufous-collared sparrows (Zonotrichia capensis) along an elevational gradient. Evolution. 2009;63:1593–1605. pmid:19187247
- 109. Martinsen GD, Whitham TG, Turek RJ, Keim P. Hybrid populations selectively filter gene introgression between species. Evolution. 2001;55:1325–1335. pmid:11525457
- 110. Petit RJ, Excoffier L. Gene flow and species delimitation. Trends Ecol Evol. 2009;24:386–393. pmid:19409650
- 111. Horne JB, van Herwerden L. Long-term panmixia in a cosmopolitan Indo-Pacific coral reef fish and a nebulous genetic boundary with its broadly sympatric sister species. J Evol Biol. 2013;26:783–799. pmid:23305496
- 112. DiBattista JD, Whitney J, Craig MT, Hobbs JP, Rocha LA, Feldheim KA, et al. Surgeons and suture zones: Hybridization among four surgeonfish species in the Indo-Pacific with variable evolutionary outcomes. Mol Phylogenet Evol. 2016;101:203–215. pmid:27143241
- 113. Kokita T, Nohara K. Phylogeography and historical demography of the anadromous fish Leucopsarion petersii in relation to geological history and oceanography around the Japanese Archipelago. Mol Ecol. 2011;20:143–164. pmid:21062386
- 114. Yamada M, Higuchi M, Goto A. Extensive introgression of mitochondrial DNA found between two genetically divergent forms of threespine stickleback, Gasterosteus aculeatus, around Japan. Environ Biol Fish. 2001;61:269–284.
- 115. Qiu F, Li H, Lin H, Ding S, Miyamoto MM. Phylogeography of the inshore fish, Bostrychus sinensis, along the Pacific coastline of China. Mol Phylogenet Evol. 2016;96:112–117. pmid:26732489
- 116. Bertrand JAM, Borsa P, Chen W-J. Phylogeography of the sergeants Abudefduf sexfasciatus and A. vaigiensis reveals complex introgression patterns between two widespread and sympatric Indo-West Pacific reef fishes. Molecular Ecology. 2017; 26: 2527–2542. pmid:28160340
- 117. Nydam ML, Harrison RG. Introgression despite substantial divergence in a broadcast spawning marine invertebrate. Evolution. 2011;65:429–442. pmid:21044056