Integration of Genetic and Phenotypic Data in 48 Lineages of Philippine Birds Shows Heterogeneous Divergence Processes and Numerous Cryptic Species

The Philippine Islands are one of the most biologically diverse archipelagoes in the world. Current taxonomy, however, may underestimate levels of avian diversity and endemism in these islands. Although species limits can be difficult to determine among allopatric populations, quantitative methods for comparing phenotypic and genotypic data can provide useful metrics of divergence among populations and identify those that merit consideration for elevation to full species status. Using a conceptual approach that integrates genetic and phenotypic data, we compared populations among 48 species, estimating genetic divergence (p-distance) using the mtDNA marker ND2 and comparing plumage and morphometrics of museum study skins. Using conservative speciation thresholds, pairwise comparisons of genetic and phenotypic divergence suggested possible species-level divergences in more than half of the species studied (25 out of 48). In speciation process space, divergence routes were heterogeneous among taxa. Nearly all populations that surpassed high genotypic divergence thresholds were Passeriformes, and non-Passeriformes populations surpassed high phenotypic divergence thresholds more commonly than expected by chance. Overall, there was an apparent logarithmic increase in phenotypic divergence with respect to genetic divergence, suggesting the possibility that divergence among these lineages may initially be driven by divergent selection in this allopatric system. Also, genetic endemism was high among sampled islands. Higher taxonomy affected divergence in genotype and phenotype. Although broader lineage, genetic, phenotypic, and numeric sampling is needed to further explore heterogeneity among divergence processes and to accurately assess species-level diversity in these taxa, our results support the need for substantial taxonomic revisions among Philippine birds. The conservation implications are profound.


Introduction
Despite difficulties of species delineation, species-level diversity is arguably the most important measure of biodiversity [1,2]. Most of the world's terrestrial vertebrate diversity has been described to the species level, and species recognition is important in conservation efforts and public awareness [3][4][5]. In the tropics, where species richness is highest and where much undiscovered biological diversity is believed to exist [6][7][8], cryptic diversity can be overlooked or obscured by taxonomy that often relies heavily on phenotypic characters [9][10][11]. Conventional species delimitation has relied on divergence exhibited in sympatry, wherein intrinsic barriers to gene flow provide evidence of species limits [12]. Species status for populations diverging in allopatry, such as island taxa, can be more difficult to ascertain [1,13]. In many cases, differences between populations are not deemed sufficient for species-level recognition, causing these populations to be recognized as subspecies [14]. In island systems, individual islands often host endemic subspecies of wide-ranging species [3,15]. Taxonomy of island populations is important as they are more prone to extinction than mainland populations; in birds the majority of recent extinctions have been of island endemic species [16,17]. Island endemism is important for conservation planning; these efforts often focus on island endemics in tropical island systems, as in the Philippines [18,19]. Conservation of island populations can be affected by inaccurate species delineations [10], making this an important conservation issue.
Divergence and speciation represent an inherently multidimensional process. Taxonomy based solely on phenotypic differences, or solely on stochastic genetic variation, reflects only one dimension of divergence between populations [20]. Taxonomy also uses bins to describe a continuous process. To aid us in accurately determining diversity around the taxonomic "bin" of species, we adopt an approach that integrates phenotypic and genotypic evidence. Through this integration, we can assess the need for taxonomic revision and also visualize the processes of divergence to understand how taxa might diverge in different ways towards speciation, e.g., remaining phenotypically cryptic despite deep genetic divergence, or, conversely, being dramatically different in phenotype despite shallow genetic divergence [9,20]. We compare taxa in a divergence process space in which multiple general routes to speciation exist. In this space, populations diverging equivalently along both genetic and phenotypic axes represent one general route to speciation, and populations diverging more along one axis than the other represent two other general routes, one with rapid phenotypic divergence and the third with low phenotypic divergence relative to deeper genetic divergence [20].
Standardized scoring of phenotypic characters can identify divergent populations that may merit species-level elevation [13], and this approach has supported elevation of populations to full biological species status on islands, including the Philippines [21,22]. This approach leverages data from comparisons of well-defined species pairs in sympatry, serving as a baseline for species-level divergence among allopatric populations [13]. However, divergence and speciation can also occur in the absence of obvious phenotypic divergence [12], especially in island systems where populations can evolve in isolation [9,15]. Using genetic markers such as mtDNA, we can infer rates of gene flow, evolutionary isolation, and time since common ancestry [23,24]. It may be tempting to use simple genotypic or phenotypic markers alone to determine species-level divergence, but completion of the speciation process may not be fully encapsulated (or diagnosed) this way [12,24]. Here we do not delimit species, but rather use these two different divergence measures to explore patterns of divergence among largely codistributed Philippine bird populations.
The Philippines are one of the world's "hottest" biodiversity hotspots, hosting many endemic and threatened species and subspecies [25,26]. Only 3% of original primary vegetation remains in the Philippines [25], where nearly half of all endemic species are threatened with extinction [26]. Many endemic subspecies are already extinct [27]. Thirty percent of bird species here are considered endemic, but nearly 80% of non-endemic species include multiple subspecies that are endemic to different islands [28]. Compared with mammals, amphibians, and freshwater fish, Philippine birds have significantly lower levels of species-level endemism [25]. However, evidence from a genetic study of 7 bird species suggested this endemism might be greatly underestimated [10]. Determining which endemic populations deserve full species status and which do not remains an unresolved issue [3,10,27,29,30], but our developing knowledge highlights the need for research with applications in both conservation and evolutionary biology. Here we examine genetic and phenotypic divergences within and among 48 species of Philippine birds, considering the patterns of these divergences and the cases in which lineages may have become full species.

Study Region and Sampling Design
In the Philippines, over 570 species of migrant and resident birds are distributed across a tropical archipelago stretching latitudinally along the Western Pacific Rim [28]. Over 7,000 islands make up the Philippines, and many islands host their own endemic species and subspecies [27]. Avian colonization and vicariance events in the Philippines are likely to have occurred multiple times and from multiple sources [31][32][33], and heavily restricted gene flow is common between at least some island populations [32,34]. Unlike many mainland taxa, species ranges in the Philippines are likely to have been quite stable over time, having experienced little Pleistocene fluctuation [32].
During prior research, fieldwork archived museum specimens representing avian assemblages on 9 islands, representing 4 Pleistocene Aggregate Island Complexes, or PAICs [33], to provide a substantial degree of geographic co-distribution to our broad comparisons. This sampling scheme affected sample sizes within populations. Using museum specimens, we obtained between 1 and 11 individuals per population (average n = 4) from 117 populations, including 109 described subspecies of 48 Philippine bird species representing 31 families and 12 orders (Table A in S1 File). Our taxonomic sampling includes lineages with undifferentiated populations, lineages with recognized subspecific differentiation, and lineages with now-recognized full biological species. Thus, among lineages, our samples represent differentiation that spans the divergence and speciation process. Most orders are represented by only one species, whereas the order Passeriformes is represented by 34 species and 79 subspecies. Nearly all (98%) of the subspecies sampled are endemic to the Philippines. Since our study began, several of these species have been split by some authorities [35][36][37], but we have retained these comparisons because they are complementary and useful in assessing among-lineage divergence patterns in Philippine birds.

Phenotypic Comparisons
We performed 96 pairwise phenotypic comparisons following the Tobias et al. [13] quantitative method for scoring plumage and morphometric characters. We are not using this method to delimit species. Rather, we are using it as a heuristic standard by which to gauge population divergence in phenotypic space; it provides a transparent and broadly applied methodology [36] that has some basic utility when comparing diverse lineages. Using this method, minor, moderate, major, and exceptional differences in plumage received scores of 1, 2, 3, and 4, respectively, and differences among morphometric characters were calculated as Cohen's d effect sizes (the difference between means divided by standard deviation) for wing chord, tail length, tarsus length, and bill length between populations and scored as follows: |0.2-2| = minor, |2-5| = moderate, |5-10| = major, and |> 10| = exceptional [13]. Morphometric comparisons were performed among samples of n = 2-10 with equal sex ratios for all pairwise comparisons. Whenever possible only males were compared, and immature birds were not included in plumage or morphometric analyses. In total, 728 study skins were examined for phenotypic comparison (Table A in S1 File). We were unable to follow the recommendation from [13] to include vocal scores due to insufficient data being available, and we did not include scores for the presence or absence of hybrid zones, because most of the populations we studied are strictly allopatric. Because selection acts non-independently on morphometric characters more commonly than on plumage characters, each phenotypic pairwise comparison considers the 3 highest scores for plumage and only the single greatest increase and decrease in morphometric effect sizes between populations. Tobias et al. [13] recommended that total scores of 7 or more should be regarded as sufficiently divergent to be considered as full species. Despite the exclusion of vocal scoring and geographic structure, we also consider a phenotypic score of 7, conservatively, to be a conceptual threshold for identifying what we term phenotypically highly divergent taxa.

Divergence Levels and Speciation Processes
We make a key conceptual distinction between divergence and speciation. Populations diverge before and after speciation; it is a continuous process. We therefore focus more on the process, though we consider divergence levels in both categorical and continuous ways (or semi-continuous in the case of phenotypic divergence scoring). We do not propose species limit thresholds in this study. Placing thresholds for species limits on genetic data is an inherently contentious issue, in part because no proposed divergence value clearly indicates the completion of speciation [20,43,44]. Here we have chosen a rather high threshold (5% corrected pairwise divergence in ND2) to be conservative in erecting a conceptual category that we term genetically highly divergent lineages, useful to highlight lineages for further studies on species limits. We have chosen this conservative threshold in part to include consideration of the variability of mutation rates among birds [45], especially at shallow levels of divergence [46][47][48][49]. Specifically, we set a genetic divergence threshold at 5% Jukes-Cantor corrected ND2 p-distance. The choice of 5% was done blindly with respect to the data, using a value that would be recognized as conservative among those generating the primary literature on species limits and genetic divergence. Using a generic mtDNA divergence rate of~2% per million years [45,50], this represents approximately 2.5 Myr of divergence. More than half of the 192 sister-species pairs considered by Weir and Schluter [51] had divergence levels below this threshold, and even lower threshold values have been proposed by others [52,53]. We emphasize, however, that this is simply an indication of lineage divergence and not an indication of speciation. Genetic divergence on these terms occurs through neutral processes, whereas speciation is thought to be driven largely by selection [12,54], and legitimate biological species can have lower genetic divergence values [51] while divergences of this threshold level can occur within a single biological species [55]. Indeed, there are numerous ways in which mtDNA can be misleading about species limits and relationships between populations [56][57][58][59][60][61][62][63][64].
Similarly, while we have used the methods of Tobias et al. [13] to provide a measure of phenotypic divergence, we do not apply their method to delimit species. We do, however, use their phenotypic divergence threshold of a phenotypic score of 7 or greater to denote what we call phenotypically highly divergent taxa. In terms of these genetic and phenotypic thresholds, population pairs were binned into four categories: a) those that are both phenotypically and genetically highly divergent (i.e., above these conceptual thresholds), b) those that are phenotypically highly divergent (i.e., a score of 7 or greater) but not genetically highly divergent, c) those that are highly genetically divergent (i.e., 5% genetic distance or greater) but not phenotypically highly divergent, and d) those populations whose divergence did not cross either conceptual high-divergence threshold. These bins represent different approaches to divergence and speciation (or lack thereof), and categorizing divergence among population pairs in such a way enabled us to test for heterogeneity among taxa. Specifically, we compared frequencies of Passeriformes versus non-Passeriformes species in each of the four bins, estimating chi-squared significance values assuming a null model of homogeneity. Separately from our categorical treatments, divergence data were examined as continuous (or semi-continuous) variables in divergence process space defined by phenotypic and genetic axes following [20].
With appropriate caution, we assume that populational divergences (phenotypic and genotypic) are independent between species, and we have a null expectation that these divergence processes will be independent with respect to higher taxonomy (and we emphasize that this assumption is not being applied to completion of the speciation process). In other words, we expect that divergence between populations within one species is not affected by similar processes occurring within another species, and we use a null model of phylogenetic independence for these processes (e.g., that divergence processes within species are not affected by what order the species is in). Note that this assumption or null model is independent of completed speciation; it considers divergence only. From reviews of genetic data we know this assumption to be violated in a strict sense (mutation rates vary among major avian lineages), but not to such a degree that it has prevented the utility and widespread application of this divergence metric [12]. For example, Weir and Schluter [45], in their review of another commonly used mtDNA gene cytochrome-b, found that "A molecular rate of approximately 2.1% (± 0.1%, 95% confidence interval) was maintained over a 12-million-year interval and across most of 12 taxonomic orders." Similarly, we have chosen a phenotypic divergence metric that also has wide applicability [13], even though doubts remain about its universal effectiveness within the narrower scope of species delimitation [65,66].
Without a reliable phylogeny that encompasses our sampled taxa, we cannot effectively test or correct for phylogenetic effects in our datasets, although we do examine such effects insofar as possible. Additionally, integrating genetic and phenotypic divergence data by making a bivariate plot in our divergence and speciation process space produced multiple pairwise comparisons within some species, introducing non-independence in some cases that confounds analyses. To correct for this when asking questions about possible higher-order effects, we averaged all within-species comparisons for species represented by more than two populations. We then used the corrected datasets to perform analyses of variance (ANOVA and MANOVA) to test for the effect of taxonomy on genetic and phenotypic divergence. Specifically, we tested for the effects of taxonomic order and family on divergence. Because many taxonomic orders were represented only by one species, we performed additional analyses treating all non-Passeriformes species as a single group (n = 14 species). Ordinary least squares regressions were also performed on single-lineage divergence estimates.
We do not have sufficient information to know whether all of our pairwise comparisons are between sister populations. To study the process of divergence and speciation within a lineage, it is common to focus on sister populations or taxa, and given our sampling it is likely that many inter-island comparisons are between sisters. However, when studying the divergence process among many lineages for overarching patterns of divergence, it is not necessary to include only sister lineages. In fact, sister populations are not needed to examine how genetic and phenotypic divergence generally proceed within a lineage. And, because sufficiently detailed intraspecific phylogenies do not yet exist for these 48 lineages (and this is not a phylogeographic study), our approach simply makes contrasts between and among populations. (We know that some comparisons are not between sisters, given that we have multiple populations of some species included, which we correct for when making among-lineage comparisons.) The presence of sister and non-sister lineages in the divergence process space does not materially affect the overarching understanding of how phenotype and genotype diverge-particularly among island populations such as these in which there is so little evidence of gene flow.

Results
Genetic distances from 96 pairwise comparisons ranged from 0.02% to 12.5% (ND2 Jukes-Cantor corrected p-distance), and average divergence among populations was 3.16% (Table 1,  Table B in S1 File). Phenotypic divergence scores from 96 comparisons ranged from 1 to 16 and averaged 6.0 ( Table 1, Table B in S1 File). More than half of the species studied (25 of 48) revealed divergence levels surpassing our high-divergence thresholds either genetically, phenotypically, or both. In total, 42 pairwise comparisons (out of 96) were between populations that were highly divergent in either genotype or phenotype by our conceptual thresholds, suggesting the possibility of 29 species-level splits within 25 species (Tables B and C in S1 File). All 29 of these highly divergent populations are recognized as subspecies endemic to the Philippines [27]. In addition to comparisons of divergence, genetic analyses revealed mtDNA paraphly at the subspecific level in 8 species (Figure A in S2 File). ND2 haplotype relationships indicated undescribed cryptic populations within at least 4 taxa: Accipiter virgatus confusus, Phapitreron leucotis brevirostris, Zosterops montanus vulcani, and Copsychus mindanensis mindanensis ( Figure A in S2 File). Comparisons within the remaining 40 species recovered haplotypically distinct populations, indicating a preponderance of genetic island endemism and lack of gene flow.
As expected given their recognized importance in generating Philippine biodiversity [33], genetic comparisons between PAICs were on average higher than those within PAICs. This Table 1. Genetic distances (Jukes-Cantor corrected p-distance) and phenotypic scores (based on the Tobias et al. [13] method) for 96 pairwise comparisons. Results are binned into four categories: populations diverging across both genetic and phenotypic conceptual thresholds (A1-11), high phenotypic divergence (B1-23), high genetic divergence (C1-8), and populations with divergence levels that did not surpass either conceptual divergence threshold (D1-54  Table G in S1 File) and when testing a smaller subset of 10 species corrected for non-independence (i.e., each of these species' among-or within-PAIC genetic distances are represented by a single value, the average of all respective comparisons for that taxon; t = 4.1, df = 9.8, p = 0.002; Table G in S1 File).
Integrating genotypic and phenotypic data in a bivariate divergence process space (Fig 1) enabled us to consider them together both in a continuous and discontinuous manner. For the latter, we erected bins representing our conceptual thresholds delimiting phenotypically and genetically highly divergent lineages and considered which pairwise comparisons crossed those thresholds. Many comparisons were highly divergent on both axes, crossing both genetic and phenotypic thresholds (Fig 1, bin A). These included 8 species (Table 1: A1-11), and this group represents one general divergence route toward speciation in this space (Fig 2, route a, even progression of divergence along both axes). There were 14 species (Table 1: B1-23) with populations that were highly divergent along the phenotypic axis alone (Fig 1, bin B), representing a second possible general route toward speciation (Fig 2, route b, rapid phenotypic divergence relative to genetic divergence), and 6 species (Table 1: C1-8) with populations that were highly divergent along the genetic axis alone (Fig 1, bin C), representing a third possible general divergence route toward speciation (Fig 2, route c, low phenotypic divergence relative to deeper genetic divergence). Of the 96 pairwise comparisons, 54 did not surpass either divergence threshold (though several approached these thresholds), and these were binned together in the lower regions of this divergence process space (Table 1: D1-54; Fig 1, bin D).
Relative frequencies of Passeriformes vs. non-Passeriformes populations were heterogeneous among bins. Only one non-Passeriformes species, Otus megalotis, occurred in bin A, and none occurred in bin C. However, in bin B non-Passeriformes outnumbered Passeriformes species, despite a lower frequency (27%) in the overall dataset. Phenotypically and genetically high-divergence (i.e., threshold-surpassing) levels occurred at similar frequencies among Passeriformes (19 and 18, respectively, out of 70 pairwise comparisons) but not among non-Passeriformes (15 and 1, respectively, out of 26 comparisons; Table 2). When comparing all 42 highly divergent pairwise comparisons together, frequencies of non-Passeriformes and Passeriformes were not significantly different than null expectations of homogeneity (i.e., taxonomy did not affect the overall number of comparisons considered to be highly divergent). However, when comparing either phenotypically or genetically highly divergent comparisons separately, frequencies of non-Passeriformes and Passeriformes were significantly different than null expectations (P < 0.05; Table 2, Table D in S1 File). In other words, more non-Passeriformes were phenotypically highly divergent than expected by chance, while more Passeriformes were genetically highly divergent than expected based on their frequencies in the overall dataset. Within individual bins, where sample sizes were small, frequencies of Passeriformes vs. non-Passeriformes were not significantly different than null expectations except for bin B, where the high occurrence of non-Passeriformes taxa was significant (P < 0.001;  Divergence in Philippine Birds S1 File). We note, however, that these tests were non-significant when the six species-level splits recognized by [35][36][37] were removed. Pairwise comparisons among species, corrected for within-species non-independence (Table E in S1 File), were distributed across the 3 general routes of the speciation process space (Fig 2), with the majority of species appearing to follow the general divergence routes of a and b, with little representation along the general route c (i.e., few with low phenotypic divergence relative to deeper genetic divergence). Analyses of variance (ANOVA and MANOVA) on these data indicated significant effects from taxonomy (Passeriformes vs. non-Passeriformes) on divergence (Table 3). Taxonomic order and family, treated individually, did not significantly affect either phenotypic or genetic divergence (Table 3). When treating all non-Passeriformes orders as a single group, however, taxonomy significantly affected overall divergence (MAN-OVA, P < 0.01) and phenotypic divergence alone (ANOVA, P < 0.05), but not genetic divergence (ANOVA, P = 0.17; Table 3). As above, these effects were non-significant when the six species-level splits recognized by [35][36][37] were removed.
Because these results suggesting effects at higher taxonomic levels might be affected by taxonomic philosophy and species limits concepts, we approached it in two other ways, by contrasting two taxonomic philosophies and by re-analyzing the data from a phylogenetic species concept perspective, in which we consider each reciprocally monophyletic mtDNA population to be a phylogenetic species, such that genetic independence equals an independent lineage against which to measure divergence (Table H in S1 File). The philosophical effect is arguably small, but the quantitative comparisons of phylogenetic species suggest the effect is not only real, but more extensive than the comparisons above suggest (Table H in S1 File, Table 4). Comparing only reciprocally monophyletic populations, and excluding the six species-level splits noted above, Passeriformes versus non-Passeriformes significantly affected genetic divergence (ANOVA, P < 0.024), and taxonomic order and family, treated individually, significantly affected phenotypic divergence (ANOVA, P < 0.04; Table 4). Overall divergence was affected by all three levels of higher taxonomy (MANOVA, P < 0.014; Table 4).
Ordinary (linear) least squares regressions within each group (non-Passeriformes and Passeriformes) indicated that the slope of the relationship between genetic and phenotypic divergence was significantly different from zero for Passeriformes (P < 0. 001), but not for non-Passeriformes (P = 0.64; Table D in S1 File). Overall, phenotypic divergence was correlated Table 2. Results of chi-squared tests of taxonomic heterogeneity among genetically and phenotypically highly divergent populations. For these tests all taxa were treated as either non-Passeriformes or Passeriformes. Expected frequencies were obtained from the frequencies of non-Passeriformes and Passeriformes comparisons in the overall dataset (27% and 73%, respectively; see Table F  Divergence in Philippine Birds with genetic divergence (linear regression P < 0.01), and it appeared to increase logarithmically (fitted nonlinear least squares regression R 2 = 0.19; Fig 3, Table D in S1 File).

Discussion
We found high levels of both phenotypic and genetic divergence and much cryptic diversity within the 48 species we studied, providing insights into the divergence and speciation process among these lineages and important ramifications for taxonomy and conservation.

Divergence patterns
In considering how populations have diverged in the multidimensional speciation process space, two central results emerged. First, there may be higher-order taxonomic effects in how lineages are undergoing divergence; and, second, phenotypic divergence among lineages appears to have a logarithmic relationship with genetic divergence. Support for heterogeneity among different lineages in the divergence or speciation processes occurring among them suggest that Passeriformes exhibit less phenotypic divergence relative to genetic divergence than other orders. Or, more precisely, the non-Passeriformes taxa in our study exhibited more plumage and morphometric divergence (traits likely under selection) relative to mitochondrial genetic divergence (likely to be neutral or nearly neutral) than Passeriformes taxa. These findings suggest that higher-order taxonomy probably affects divergence and speciation processes in this assemblage of lineages. For many birds, especially oscine Passeriformes, song can be more important for mate selection than plumage [67]. Because we did not measure vocalizations, phenotypic divergence in Passeriformes is probably underestimated  [45,50,68] and may be faster in Passeriformes than in other birds [46][47][48][49]. The heterogeneity in divergence patterns we observed between Passeriformes and non-Passeriformes might thus occur because of differential mtDNA mutation rates among birds and/or through underestimates of phenotypic divergence among oscine Passeriformes due to our exclusion of song. We also note that a different subset of Philippine birds (or a complete sampling), or using different high-divergence thresholds, might also affect these findings. As noted in results, heterogeneity in higher-order taxonomic effects vanished when currently recognized full species were removed from analyses (18 lineages treated here as subspecies from 8 species, four each of Passeriformes and non-Passeriformes; Table B in S1 File). However, when we adopted a phylogenetic species approach higher-order taxonomic effects remained strong. Insofar as these questions involve lineage divergence in genetics and phenotype, examinations need not be restricted to divergences below the species level (particularly in a system where there is so little evidence of gene flow). Determining how and to what extent higher-order taxonomic effects affect the divergence process among Philippine birds will require more comprehensive study. This would ideally include more lineages both above and below the species level and more extensive genotypic and phenotypic data. Overall, our data show an apparent logarithmic increase of phenotypic divergence with respect to genetic divergence (Fig 3, Table D in S1 File). This is of interest for two reasons. First, it suggests that speciation in this system might be dominated by selection promoting divergence rather than by a gradual accumulation of differences in allopatry (neutral processes  Table D in S1 File) and fitted curve showing apparent logarithmic increase of phenotypic divergence with respect to genetic divergence. In the equation P = ln (G) * p 1 + p 2 , "P" refers to phenotypic score and "G" refers to genetic distance. would more likely be linear). Secondly, it suggests limits to phenotypic divergence over time in this assemblage of lineages. This is not unexpected, as there are likely to be evolutionary constraints on phenotypic divergence that exceed any constraints on the continued accumulation over time of putatively neutral genetic divergence. In other words, on the genetic axis there is relentless ongoing mutation promoting divergence, while on the phenotypic axis such predictably directional forces are lacking. Interestingly, Delmore et al. [69] found a similar relationship among an assemblage of North American migratory bird species. Such findings require further research into how phenotypic divergence accrues within and among lineages.
In considering the generalized routes of divergence toward speciation in Figs 1 and 2, we found substantial numbers of lineages that appear to have followed the general routes a and b. Population pairs that were both genetically and phenotypically highly divergent (Fig 1 bin A, Table 1 A1-12) are the most likely candidates for elevation to full biological species. Large genetic distances suggest long periods of evolutionary isolation, and large phenotypic differences imply divergent selection, although selection is difficult to disambiguate from phenotypic plasticity [70]. This group includes taxa with large divergences in the Dicrurus hottentottus and Otus megalotis complexes, species complexes that include multiple contentious subspecies and putative species [21,35,36,71,72]. Population pairs among 14 species that were highly divergent phenotypically but not genetically (Fig 1 bin B, Table 1 B1-19) also seem likely to have undergone divergent selection (with the caveat that phenotypic plasticity may influence these results). Although the species in this group did not exceed our genetic threshold of 5%, relatively large distances (e.g., 3-4%) separated many populations (e.g., Table 1 B16-23), and many may well be full biological species [13]. Some of the highly divergent populations in this group (e.g., within the Phapitreron leucotis and Chrysocolaptes lucidus species complexes) have recently been elevated to full species [36].
The third general route to speciation (c in Fig 2) was less well represented among our 48 lineages. Population pairs that were not phenotypically divergent but were separated by greater than 5% genetic distances (Fig 1, bin C) included 6 species of Passeriformes (Table 1, C1-8). The divergences observed here likely result from extended time in evolutionary isolation, as changes in mtDNA sequence data are usually interpreted to be neutral or nearly neutral [73,74,75]. Canalization of phenotypic characters, notably plumage color and pattern, and body size and shape, may contribute to the lack of phenotypic divergence observed between these oscine Passeriformes populations [20], as occurs in some other noteworthy avian genera (e.g., Scytalopus, Empidonax). The populations in this group (Table 1, C1-8) represent deep divergence between cryptic populations (e.g., Ixos philippinus guimarasensis separated from other subspecies by more than 12% genetic distance despite very similar phenotype) and may merit elevation to full biological species as recently done by Dickinson and Christidis [37].

Initial taxonomic implications
While we are cautious about applying simplistic threshold values to determine species limits, our data suggest that cryptic species remain within many currently recognized species of Philippine birds. Using conservative conceptual speciation thresholds we found at least 29 populations, currently recognized as Philippine endemic subspecies, that may with further study warrant consideration as full biological species. Average genetic distance from 96 within-species comparisons was 3.15%, and distances less than this separate many sympatric species in well-studied mainland systems [20,52]. Our results support the suggestion of Lohman et al.
(2010) that the lower rates of endemism in birds (compared to other vertebrates) in the Philippines may be an artifact of misclassifying distinct island populations of birds as subspecies, rather than species. Lohman et al. [10] also predicted that, upon further investigation, a more accurate measure of endemism among birds in the Philippines may exceed 50%, as occurs in other terrestrial vertebrates [18]. With highly divergent populations occurring in more than half of the species we studied, our data provide empirical support for this prediction. Taxonomic revision, taking into consideration multiple types of comparison (e.g., genomic comparisons, phenotypic scoring, behavioral and ecological traits, etc.) is needed in this system. Uniting phenotypic and genetic datasets, as we have done here, will prove essential to such revisions.

Implications for conservation
Taxonomic designation can have real-world consequences on the conservation of populations [19], and in the Philippines birds and other wildlife are severely threatened by anthropogenic forces [17,18]. Our results reemphasize the urgent need for a reappraisal of Philippine avian diversity expressed by Peterson [29] and Lohman et al. [10]. Further research is also warranted on the 29 endemic populations in 25 species that our data suggest to be largely through the speciation process (e.g., estimates of gene flow and evaluation of traits promoting reproductive isolation). In addition, our results show surprisingly little haplotype sharing among populations (i.e., high rates of genetic endemism), revealing a level of diversity with important implications for management and conservation below the species level for populations that have likely not yet achieved speciation.
Supporting Information S1 File. Table A, List of all specimens compared in this study including taxon identification, locality, field catalog numbers, museum voucher numbers, and GenBank accession numbers for ND2 sequences generated in this study. Table B, Genetic distances and phenotypic scores for all 96 pairwise comparisons in this study presented in taxonomic order. Table C, Potential splits in 25 species supported by highly divergent pairwise comparisons, presented in taxonomic order. Table D, Chi-squared tests for taxonomic heterogeneity among highly divergent lineages and among bins. Table E, Genetic and phenotypic divergence scores after correcting for non-independence. Table F, Genetic distances for all 96 pairwise comparisons in this study including sample size, ND2 fragment size, and standard deviation of genetic distance both uncorrected and Jukes-Cantor corrected p-distances. Table G, Genetic divergence within and among Pleistocene Aggregate Island Complexes (PAICs) for 10 species, corrected for nonindependence. Table H, Genetic distances and phenotypic scores for all reciprocally monophyletic pairwise comparisons in this study presented in taxonomic order. Note: Tables A-H Figure B, Plot of all 48 species included in the dataset with species labels. Data points here reflect data corrected for non-independence among species (see Table E in S1 File). (PPT) project assistance. We also thank Director Pollisco, Director Mundita Lim, Carlo Custodio, Anson Tagtag, Director Augustus Momongan, Reynaldo Yray, Division chief Kho, Gloria Dawson, Roberto Puentespina, and many others who participated in this project. Special thanks also go to our collaborators at the National Museum of the Philippines, including Virgilio Palpal-latoc, Rolly Urriza, and Directors Gabriel Casal and Corazon Alvina, and to Edwin Cedella, Almeo Bontigao, and many others for their invaluable assistance in the field. We thank Herman Mays of the Cincinnati Museum of Natural History and Robert Kennedy for lending tissue samples. Finally, we thank Kevin McCracken, Naoki Takebayashi, and several anonymous reviewers for comments on earlier drafts.