Pollination Mode and Mating System Explain Patterns in Genetic Differentiation in Neotropical Plants

We studied genetic diversity and differentiation patterns in Neotropical plants to address effects of life history traits (LHT) and ecological attributes based on an exhaustive literature survey. We used generalized linear mixed models (GLMMs) to test the effects as fixed and random factors of growth form, pollination and dispersal modes, mating and breeding systems, geographical range and habitat on patterns of genetic diversity (HS, HeS, π and h), inbreeding coefficient (FIS), allelic richness (AR) and differentiation among populations (FST) for both nuclear and chloroplast genomes. In addition, we used phylogenetic generalized least squares (pGLS) to account for phylogenetic independence on predictor variables and verify the robustness of the results from significant GLMMs. In general, GLMM revealed more significant relationships among LHTs and genetic patterns than pGLS. After accounting for phylogenetic independence (i.e., using pGLS), FST for nuclear microsatellites was significantly related to pollination mode, mating system and habitat. Plants specifically with outcrossing mating system had lower FST. Moreover, AR was significantly related to pollination mode and geographical range and HeS for nuclear dominant markers was significantly related to habitat. Our findings showed that different results might be retrieved when phylogenetic non-independence is taken into account and that LHTs and ecological attributes affect substantially the genetic pattern in Neotropical plants, hence may drive key evolutionary processes in plants.


Introduction
The search for patterns in evolutionary ecology has been extensively discussed in literature as a central problem in ecology (see [1,2] for recent reviews). Lawton ([3] p 145) stated, "Without bold, regular patterns in nature, ecologists do not have anything very interesting to explain". However, the observed variables and the ecological and evolutionary mechanisms affecting the patterns operate at different scales across space, time and ecological organization, challenging the finding of such patterns and their ecological and evolutionary causes [4]. In population genetics, genetic diversity and its distribution both within and among populations may be determined by microevolutionary processes such as demographic history, selection and gene flow which in turn may operate at different scales of space, time and ecological organization (see [5] for a review). Studying this feature should then be the first step to understand the evolutionary path that a species undertake. Plant life-history traits (hereafter LHT) such as growth form, pollen and seed dispersal modes and breeding system, as well as geographical distribution and other ecological attributes may also influence genetic structure, gene flow and effective population size. As a consequence, life-history may determine the relative importance of microevolutionary processes, thus affecting population genetic structure [6].
Previous reviews in population genetics aiming to find patterns in genetic diversity and population structure in plants have shown relationship between LHT and population differentiation for isozyme/allozyme loci and dominantly inherited nuclear DNA markers such as RAPD, ISSR and AFLP [6][7][8][9][10][11][12]. These studies indicate that either long-lived woody or outcrossing species have higher genetic diversity within than among populations, contrary to annual selfing species. Long-lived outcrossing species may have larger effective population sizes preventing the loss of genetic diversity and population differentiation due to genetic drift [6]. Genetic diversity is significantly higher in larger populations, mainly in self-incompatible species, but the level of inbreeding F IS is independent of population size [13]. In addition, due to low population density, significant rates of self-fertilization and biparental inbreeding (outcross between related individuals), tropical trees have significantly higher genetic differentiation than temperate forest trees [14]. Patterns in plant genetic diversity are also related to habitat fragmentation, still LHT such as pollination and seed dispersal modes are not related to the susceptibility to the loss of genetic diversity [15].
In general, the findings from these reviews reveal important patterns and their causes. However, they suffer serious statistical flaws. In this type of reviews, data is often non-normal distributed and researchers usually transform data to achieve normality and homogeneity of variance and rely on the robustness of ANOVA or use Generalized Linear Models (GLM) [8,9,[16][17][18], (Fig 1, arrow "a"). These approaches may lead to errors since they ignore random effects from different LHT and treat all of them as fixed factors committing pseudoreplication (see [19,20] for a review). A more appropriate method to analyze such data would be generalized linear mixed models (GLMMs) because it combines desirable properties of two statistical frameworks; i.e. linear mixed models, incorporating random effects, and GLM, which handles non-normal data [21]. GLMM could thus match better the structure of the data (Fig 1, arrow "b").
Besides the mixed structure of the data, traditional reviews have also ignored the phylogenetic relationship among the species (but see [16,17]). Conventional methods and GLMMs do not account for phylogenetic dependence on the predictors such as LHT. Related species tend to resemble each other more than species randomly chosen from a phylogenetic tree, as a consequence of stochastic character evolution (Brownian motion) along a phylogeny (phylogenetic signal [22]). Thus, the phenotypes of a set of species may not represent independent samples from populations or lineages therefore such statistical methods may not be appropriate for comparative analysis and hypotheses testing about trait correlation across species [23] due to risk of inflating type I error. Thus GLMMs should be applied only if mixed factors (predictors) do not show phylogenetic signal (Fig 1, arrow "b").
Actually, it is necessary to first test for phylogenetic signal in the studied traits and if they are statistically significant then apply phylogenetically based methods for comparisons and hypotheses testing [24,25] (Fig 1, arrow "c"). When phylogenetic relationships are accounted for, related species tend to have similar levels of genetic diversity and differentiation for nuclear [17] and chloroplast molecular markers [16]. For example, in European Temperate and Boreal-Temperate angiosperms were found significant effects of reproduction system, pollination and dispersal modes, successional status and geographical distribution on population differentiation. [16]. However, when phylogenetic non-independence was considered using Phylogenetic Independent Contrast [23], only seed mass and geographical distribution remained significant [16]. Similarly, significant relationships were found for growth form, plant size, perenniality, seed dispersal mode, seed mass, pollination mode and mating system with genetic differentiation [17]. Yet, only perenniality, breeding and mating systems showed significant relationship with genetic differentiation at nuclear genome when phylogenetic nonindependence was considered [17].
It is fundamental that phylogenetic relationships are previously considered in such reviews and analyses. However, reliable estimates of phylogenetic signal depend on enough sample size, which is not always available throughout reviews. Small samples might bias such estimates by overspread or clumping species throughout phylogenies and undesirably reveal weak or strong phylogenetic signals, respectively. To solve the problem of small sample size, the effect of phylogenetic signal could be directly inferred from GLMM analyses. Because phylogenetic non-independence inflate type I error (i.e., reject null hypotheses more times than expected by chance), only the significant relationships from GLMMs could suffer phylogenetic effects. Then, the robustness of significant GLMM relationships (and not non-significant ones) should be tested in the light of phylogenetic methods as pGLS (Fig 1, arrow "d"). The Neotropics is one of the most diverse ecozones in the world; this vast biodiversity is a large repository of genetic information and has been defined as an active center of evolution [26]. Around 37% of seed plant species in the world occur in the Neotropics [27]. It includes eight of the 25 megadiverse hotspots, and some of the most threatened hotspots [28]. Hence, detection of genetic diversity and differentiation patterns is essential to address future research and conservation strategies in the Neotropics. For instance, geographical restricted species (i.e., presenting small effective population sizes) might experience rapid evolutionary changes due to rapid changes in allelic frequencies (genetic drift and founder effect [6,28]). In addition, previous works have showed that long-lived woody species may have higher genetic diversity and lower differentiation among populations due to larger effective populations sizes [6], however, in Neotropical rainforest, canopy trees usually occur at low density raising the question whether this expectation would hold in the Neotropics.
Furthermore, in Angiospermae, organelle DNA is usually inherited maternally but nuclear DNA is biparentally inherited. As a result, nuclear markers are transmitted via pollen and seeds whereas maternally inherited markers are transmitted via seeds only. Additionally, because of the haploid nature and mode of inheritance, the effective population size of the nuclear genome is four times the size of the chloroplast genome, leading to a stronger effect of genetic drift on population genetic structure based on chloroplast data [29]. Thus, the comparative analysis of nuclear and organelle genomes, with different modes of inheritance, and mutation and evolutionary rates, may provide different patterns of genetic diversity and differentiation and still clarify the relative importance of pollen and seed flow on population differentiation [30].
Because of their underrepresentation in most review studies and their importance for conservation, here, we focused on Neotropical plants addressing the effects of life-history traits, geographical range and habitat on patterns in genetic diversity and differentiation based on a literature survey. We used GLMMs to test the effects of growth form, pollination and dispersal mode, mating system (mixed, outcrossing or selfing), breeding system (monoecious, dioecious, hermaphrodite), geographical range and habitat on patterns of genetic diversity (He S , He, π and h), inbreeding (F IS ), allelic richness (AR) and differentiation among populations (F ST ), for both nuclear and chloroplast genomes in Neotropical plants. In addition, we used phylogenetic generalized least squares (pGLS) to account for phylogenetic relationships and verify the robustness of the results found by significant GLMMs.
Many Neotropical trees are widespread and have outcrossing mating system [6,9], in addition, long-lived woody species have typically larger effective population sizes [6], therefore we hypothesized that this growth form (i.e. trees) will have significant higher genetic diversity and allelic richness, and lower F ST and F IS than the other growth forms. Similarly, we expect that species with long distance dispersal modes might have lower F ST and F IS, as well as species with long distance pollination modes.

Database survey
We surveyed publications of 'population genetics of Neotropical plants' from the Thomson Reuters Web of Science database (http://apps.webofknowledge.com), using Web of Science platform (ISI hereafter), and Scopus Search (http://www.elsevier.com/online-tools/scopus). The survey was performed considering the availability of publications in both databases from 1945 (first register in ISI) to December 2013. We used the most frequent keywords in the area: "population genetics", "phylogeography", "mating system", "reproductive system", "breeding system", and "genetic structure", combined with (AND) "Neotropical trees" and "Neotropical plants". Additional keywords such as "pollen dispersal" and "gene flow" retrieved no additional articles. We excluded phylogenetic studies, reviews and development of molecular markers such as microsatellite primer development or SNPs discovery. We included only Angiospermae due to the low number of Gymnospermae species in the Neotropics (they occur mostly in Patagonia, across southern Argentina and Chile, as well as in highlands across Andes and Mesoamerica). We considered as Neotropics the region comprising the Neotropical Floristic Kingdom [31], which includes southern Florida, lowlands in Mexico, Central America, Caribe and South America, excluding southernmost South America (south Argentina and Chile, mainly Patagonia) and Andean highlands.

Life-history traits and ecological attributes
From each article we compiled information related to the studied species, number of individuals and populations, species life-history traits (i.e. dispersal and pollination modes, mating system, breeding system and growth form) and ecological attributes (i.e. geographical range and habitat). Data on LHT and attributes were obtained from the original articles or from articles about pollination system, seed dispersal or botanical reviews. We avoided creating many categories (for example mammalian seed dispersal by scatter-hoarding or endozoocory) due to the low sample size per category.
We categorized species growth form as epiphytes, herbs, non-woody scandent vines, palms, shrubs, trees and woody long-lived vines. For seed dispersal mode we found species dispersed by autochory, bats (predominantly dispersed by bats), birds, water (hydrocory), terrestrial mammals, mixed (birds and mammals including bats) and wind. For pollination mode, we classified species pollinated by bats, beetles, butterflies, flies, hummingbirds, large bees, moths, small bees, wasps and wind (S1 Table). For breeding system we found monoecious, dioecious and hermaphrodite species. For mating system we found species with outcrossing and mixed systems. Due to the low number of studies for some LHT, we joined the categories: herbs and non-woody scandent vines (herbs hereafter); trees and woody long-lived vines (trees); autochory and wind dispersal (wind); butterfly and moth pollination (Lepidoptera); large and small bees and wasps (Hymenoptera).
For ecological attributes, species geographical range was classified in two categories, narrow (species endemic to a specific habitat or species with restrict geographical range that occur in twenty or fewer localities) and widespread (S2 Table). Because this information is ambiguous in many studies we also checked geographical range using GBIF database (www.gbif.org). For habitat, we found species from deserts, grasslands, mangroves, mixed forests (i.e. species occurring in both rainforests and seasonally dry forests), savannas, seasonally dry forests (SDTFs hereafter), rainforests, rocky fields, rocky savannas and wetlands (S2 Table).

Genetic data
To compile genetic data we first classified the studies according to the molecular marker (S3 Table): dominant nuclear markers (AFLP, ISSR, RAPD); nuclear microsatellites; isozyme/allozyme (isozyme hereafter); nuclear sequences (mainly nrDNA ITS); chloroplast sequences (intergenic spacers); chloroplast microsatellites. For chloroplast, those studies using CAPs (cleaved amplified polymorphism) and similar markers (such as chloroplast RFLP and AFLP) were grouped (CAPs hereafter) due to the similar genetic information provided.
For chloroplast microsatellites and CAPs we compiled only genetic differentiation (F ST ). For nuclear dominant markers (AFLP, ISSR, RAPD); nuclear microsatellites and isozymes we obtained F ST , genetic diversity within population (i.e. He for articles that studied only one population), overall population genetic diversity (i.e. He S for articles that included more than one population), defined by [32] as genetic diversity at species level. For nuclear microsatellites and isozymes we also obtained inbreeding coefficient (F IS ) and allelic richness based on rarefaction (AR). For chloroplast and nuclear sequences we obtained F ST and nucleotide (π) and haplotype (h) diversities. However F ST for nuclear sequences could not be analyzed due to the low sample size. Number of alleles and haplotypes were not analyzed due to the effect of sample size. Since articles not always reported the same genetic parameters, data set could vary among parameters (see Tables 1, 2 and 3).

Data Analysis
We fitted Generalized Linear Mixed Models (GLMMs) to investigate the effects of species LHT and ecological attributes on genetic diversity and differentiation. LHT (i.e. growth form, dispersal and pollination modes and breeding systems), and habitat type were treated as multistate categorical variables, whereas geographical range (i.e. narrow or widespread) and mating system (i.e. mixed or outcrossing) were treated as binary variable. Models were fitted for each genetic parameter. Species LHT and ecological attributes were fitted as fixed factors, and species identity was considered a random factor as multiple variables were measured per species. Analyses were performed using MCMCglmm package [33] implemented in R version 3.2.1 (R core team 2014). MCMCglmm uses a Bayesian framework with Markov Chain Monte Carlo algorithm in which a total of 80,000 iterations chains were used with 2,000 chains of burn-in with a Gaussian distribution.
The effect of molecular markers on genetic parameters, was analyzed by fitting separate models for (1) nuclear dominant markers (i.e. AFLP, ISSR and RAPD); (2) isozymes; (3) nuclear microsatellites; (4) chloroplast markers (i.e. CAPs, chloroplast microsatellite and sequences). Due to the inheritance mode, chloroplast genome (uniparental) has 1/4 the effective population size of nuclear genome (biparental), which may affect the estimation of genetic variability and differentiation parameters, introducing noises in analyses precluding the detection of any pattern due to life-history traits or ecological attributes. When a species was studied more than once with the same molecular marker, the mean of the genetic parameters was used (40 cases in 186 species).

Accounting for phylogenetic relationships
To account for phylogenetic non-independence on the effects of LHT and ecological attributes on genetic diversity and differentiation, we first obtained the reference phylogenetic hypothesis of the species included in each analysis. We used the internal master tree Phylomatic tree R20120829 from the platform Phylomatic [34] to built the phylogenetic hypothesis. Since we had no information on branch length for inclusion in the comparative analyses, all branch lengths were assigned a value of 1, which may not significantly bias the results [35]. Even the exact phylogeny of some taxa is still in debate, improvements on the phylogeny would not modify dramatically the results because polytomies are mainly at terminal nodes and most of phylogenetic relationships are well resolved at deeper levels [36].
To account for phylogenetic relationships we first tested whether the studied life-history traits and ecological attributes have phylogenetic signal (i.e. phylogenetically related species tend to be more similar than distantly related species [22]). We used Abouheif's proximity test of serial independence [37,38] using the function abouheif.moran from the R-package adephylo [39] Then, we fitted Phylogenetic Generalized Least Square Models (pGLS [40]) to the genetic parameters verifying whether GLMM had resulted in robust inferences and hence the pattern persisted when phylogenetic relationships were accounted for. pGLS is a comparative method Table 1. Mean values of genetic diversity and differentiation in Neotropical plants per life-history trait (LHT) and ecological attribute, across all the studies included in the analyses, for nuclear molecular markers. N, number of species analyzed; F ST , genetic differentiation among populations; He S , overall population genetic diversity, He, genetic diversity within population; SD, standard deviation; n, sample size (number of species with available data in the regression analysis).

Life-history trait and ecological attribute
Isozymes He S ± SD(n)

Isozymes
He ± SD(n) --- (Continued) in which the covariance among species resulting from the phylogeny is expressed in the regression error term. It is easily extended to multivariate data and can be used unambiguously when polytomies are found in the phylogeny as in our case. The analyses were carried out using the package caper [41] implemented in R version 3.2.1.

Data description
Our survey retrieved 358 articles comprising studies from 186 species, belonging to 45 families and 116 genera (S4 Table). Tree species were the most studied in Neotropics (109 species, S2 Table), followed by herbs (25 species), shrubs (21), epiphytes (17) and palms (14). Wind-and mammal-dispersed species were the most studied (81 and 60 species, respectively) and only two studied species were bat-dispersed. Most studied species were pollinated by Hymenoptera (i.e. large and small bees and wasps, 130 species), followed by Lepidoptera (butterflies and moths, 14) and wind-, fly-and beetle-pollinated species were underrepresented in the studies performed so far in Neotropical plants (S2 Table). For breeding system, hermaphrodite plants were the most studied (142), followed by monoecious (25) and dioecious (19). For mating system, plants with an outcrossing system (124) were the most studied (S2 Table).
We found high mean values and variation of F ST for all LHT and ecological attributes for both nuclear and chloroplast molecular markers (Table 1). However, plants from deserts and grasslands had low values of F ST at nuclear genome and plants pollinated by flies had high F ST at chloroplast genome. The surveyed data also showed high variation in genetic diversity for both nuclear and chloroplast genomes (Tables 1, 2 and 3).
Isozymes He S ± SD(n)

Genetic patterns
General genetic patterns. Genetic differentiation among populations (F ST ) for nuclear microsatellite markers was significantly related to growth form, dispersal and pollination modes and breeding system (Fig 2, see also S5 Table). F ST was significantly lower for trees and palms than for epiphytes, as well as lower for wind-and mammal-dispersed plants than for bird-dispersed (Fig 2A and 2B). In addition, F ST values were significantly higher for plants pollinated by Hymenoptera and with monoecious and hermaphrodite breeding systems than for dioecious (Fig 2C and 2D, S5 Table). For chloroplast genome shrubs had higher F ST than the other growth forms (S1 Fig, S6 Table). Table 2. Mean values of genetic diversity and differentiation in Neotropical plants per life-history trait (LHT) and ecological attribute, across all the studies included in the analyses, for nuclear molecular markers. N, number of species analyzed; F IS , inbreeding coefficient; AR, allelic richness; h, haplotype diversity; π, nucleotide diversity; SD, standard deviation; n, sample size (number of species with available data in the regression analysis).
Life-history trait and ecological attribute N Isozymes F IS ± Microsatellites Isozymes AR ± Microsatellites Nuclear sequences SD(n) F IS ± SD(n) SD(n) AR ± SD(n) h ± SD(n) π ± SD(n) Inbreeding coefficients (F IS ) retrieved from isozymes were significantly higher for herbs than for the other growth forms (Fig 3A, S7 Table) and for plants inhabiting rocky fields than for plants from rainforests, mixed and SDTFs (Fig 3B, S7 Table).
Plants inhabiting rainforests showed slightly higher genetic diversity overall populations (He S , Fig 4A, S5 Table) for nuclear dominant markers. Furthermore, He S values retrieved from nuclear microsatellite markers were higher in palms and trees than in epiphytes and shrubs, as well as in plants pollinated by Hymenoptera (Fig 5, S5 Table), while He S retrieved from isozymes was not significantly related to any LHT or ecological attribute (S5 Table).
Within population genetic diversity (He) was not significantly related to any LHT or ecological attribute studied (S6 Table), neither were haplotype (h) and nucleotide (π) diversities Table 3. Mean values of genetic diversity and differentiation in Neotropical plants per life-history trait (LHT) and ecological attribute across all the studies included in the analyses, for chloroplast molecular markers. N, number of species analyzed; F ST , genetic differentiation among populations; h, haplotype diversity; π, nucleotide diversity; SD, standard deviation; n, sample size (number of species with available data in the regression analysis).
Life-history trait and ecological attribute N F ST ± SD(n) h ± SD(n) π ± SD(n) from nuclear and chloroplast sequences (S6 and S8 Tables), nor allelic richness retrieved from isozymes. Yet, allelic richness retrieved from nuclear microsatellites showed significant higher values for plants pollinated by beetles than for other pollination modes (Fig 6A, S6 Table). Accounting for phylogenetic relationships in genetic patterns. The test of serial independence showed that some LHT presented a significant phylogenetic signal (Table 4, see also Fig 7). For species with data for He S for dominant markers, pollination mode, breeding system and geographical range had no significant phylogenetic signal (Table 4). For species with data for AR and F ST for nuclear microsatellites, most LHT and ecological attributes had no significant phylogenetic signal (Table 4).
Phylogenetic generalized least squares showed significant effects of pollination mode, mating system and habitat on genetic differentiation (F ST ) for nuclear microsatellites (Fig 2E-2G, Tables 5 and 6). Plants pollinated by Lepidoptera had higher F ST than plants pollinated by bats, beetles, birds and Hymenoptera insects (Fig 2E). In addition, plants with outcrossing mating system had significantly lower F ST than mixed system species (Fig 2F). While, plants inhabiting mangroves, rocky fields and rocky savannas had higher F ST than plants from other habitats (Fig 2G, Table 6). Likewise, habitat significantly affected genetic diversity overall populations (He S ) for dominant markers (Fig 4B). He S values were significantly lower for plants inhabiting mixed, rainforest rocky fields, savannas and wetlands (Table 6).
Allelic richness for nuclear microsatellites had significantly higher values for plants pollinated by beetles than other pollination modes ( Fig 6B, Table 5) and plants with wide geographical range than narrow (Fig 6C, Table 6). However growth form (S9 Table), dispersal mode (S10 Table) and breeding system (S11 Table) had no significant effects on AR when phylogenetic relationships where accounted for. We also found no significant effects of LHT nor ecological attributes on F IS for isozymes, H es for microsatellites and F ST for chloroplast markers when phylogenetic relationships were accounted for.

General genetic patterns
Plants pollinated by Hymenoptera have higher F ST and He S for nuclear microsatellites. Although bees may fly long distances and may potentially promote long distance pollen dispersal they may also display temporary specialization and stay in the same plant patch [42] leading to low gene flow. However, many works using parentage analysis showed long-distance pollen dispersal in Neotropical trees [14]. Though, the higher F ST and He S for plants pollinated by Hymenoptera may be an artifact of phylogenetic signal since it disappears when accounting for phylogenetic independence (see below). Pollination mode was also correlated to allelic richness for nuclear microsatellites. We found data on AR for only one species pollinated by beetles (Annona crassiflora, Annonaceae). Annona species are protogynous [43], which potentially promote long-distance pollen flow increasing allelic richness. Yet, this result may also be an artifact of taxonomic sampling bias due to the low number of species and biased family sampling. Notwithstanding, this relationship was maintained when phylogenetic non-independence was accounted for (see below).
Dispersal mode and breeding system explained differences in genetic differentiation at nuclear microsatellite markers, but the relationships were not recovered when phylogenetic non-independence was accounted for (see below). Wind-and mammal-dispersed species had lower genetic differentiation at nuclear genome. Similar to large sized mammals, wind may promote long distance seed dispersal [44], potentially increasing gene flow and decreasing genetic differentiation among population. Dioecious species had lower F ST at nuclear microsatellites than monoecious and hermaphrodite species. Dioecious species are obligate outcrossing which is correlated to long-distance gene flow [45], decreasing population genetic differentiation. In addition, all dioecious species studied so far in Neotropics are long-lived trees or palms, which also presented lower genetic differentiation. The association between dioecy and woodiness may be an outcome of strong selection for outcrossing in plants with a long life span [46].
Mating or breeding systems did not explained variation in F IS , but growth form and habitat did. Growth form was an important LHT in predicting genetic differentiation at chloroplast genome, inbreeding for isozymes and genetic diversity overall populations for nuclear microsatellites in Neotropical plants. Herbs showed a higher F IS value than other growth forms, it might be due to a sampling effect since we found data on inbreeding only for three herb species (one Poaceae species and two Begoniaceae). For chloroplast genome, shrubs presented higher genetic differentiation than the other life forms. In fact, we found data for chloroplast genome for seven shrub species (six Malvaceae and one Rubiaceae) from different habitats (rainforests, SDTF and rocky savannas), with different pollination (bees, birds, butterflies) and dispersal  modes (birds, wind and mammals). Many Malvaceae species have mixed-mating system with outcrossing rates lower than 80%, and also apomictic seed production (see [32,47] for a review) that may decrease genetic diversity within population and increase genetic differentiation among populations.  Although plants from rocky fields showed significantly higher inbreeding coefficients (F IS ), than other habitats, this result may be the effect of taxonomic sampling bias (i.e. low number of species and biased family sampling in Asteraceae (1) and Veloziaceae (2)). In addition, mean posterior value of F IS for rocky field plants was only slightly higher than zero (see Fig 2B) and the effect was not recovered when phylogenetic dependence was taken into account (see below). Genetic diversity overall populations (He S ) was slightly higher for plants from rainforests for nuclear dominant markers. Due to patterns in seed mortality and spatial recruitment patterns of trees (see [48] for a review], gene flow may reach long distances in rainforests increasing effective population size and genetic diversity [10].

Genetic patterns accounting for phylogenetic non-independence
As expected, the non-independence of LHTs along sister clades (i.e. related species have more similar LHT than expected by chance) had significant effects on the genetic patterns detected in Neotropical plants. When accounted for phylogenetic non-independence in nuclear microsatellites, pollination mode, mating system and habitat had significant effects on genetic differentiation (F ST ) whereas pollination mode and geographical range had a significant effect on allelic richness (AR). Species pollinated by Lepidoptera (butterflies and moths) had higher genetic differentiation than the other pollination modes. Butterflies and moth usually fly short distances (see [49] and references therein) leading to short distance pollen flow increasing genetic differentiation among populations. In addition, the higher allelic richness for species pollinated by beetles was recovered when accounted for phylogenetic non-independence. The relationships of all other LHT (e.g. growth form, breeding system and dispersal mode) on genetic differentiation found by GLMM were not recovered, although these LHT had no significant phylogenetic signal for species analyzed for nuclear microsatellites. For chloroplast F ST , the effect of growth form showed by the GLMM was most likely due to phylogenetic signal because no significant effect was detected when phylogenetic non-independence was taken into account.
Habitat was the only ecological attribute showing significant effects on genetic diversity overall population (Hes) for dominant nuclear markers. Species occurring in rainforest, rocky fields, savannas, wetlands and mixed forests had significant lower values of genetic differentiation than grasslands and SDTFs.
Our findings showed that different results might be retrieved when phylogenetic non-independence is taken into account. However, the surveyed literature was biased for most LHT and attributes, mainly due to the low number of genera (116) and families (45) studied so far in the Neotropics. For instance, all epiphytes are from three families (Araceae, Bromeliaceae, Orchidaceae). For Orchidaceae, only three genera were studied and for Araceae only two genera. For pollination mode, for instance, all wasp-pollinated species are Moraceae (Ficus spp.) and all fly pollinated species are Araceae. Thus, the analysis of pattern of genetic diversity and differentiation in relation to LHT and plant attributes may require larger samples sizes, i.e. large taxonomic sampling and also wide variation in LHTs and ecological attributes. Such information is unfortunately lacking so far and efforts should be guided in this direction in the future.     Moreover, the detection of phylogenetic signal for a given trait depends on the sampling (taxon sampling, number of species) and accuracy of phylogenetic tree [24,50]. Thus, although we found no significant phylogenetic signal for some traits, for instance, dispersal mode and breeding system, it does not mean that the trait has not evolved under a Brownian motion model and rather than a taxonomic sampling bias is what may not allow us the detection of any pattern of phylogenetic signal. It is important to note that traits related to seed dispersal mode (e.g. seed size and number) in Angiospermae have significant phylogenetic signal and evolutionary constrains imposed by reproductive structures [51]. Also, many traits in seed plants have evolutionary interdependencies that may constrain evolutionary and adaptive responses [52]. Actually, the Neotropical plants studied so far showed a significant relationship of pollination mode and mating system with genetic differentiation, yet showed no significant phylogenetic signal in our data.
We found no effect of dispersal mode on genetic differentiation at nuclear microsatellites, despite the non-significant phylogenetic signal in our data. Our findings agree with previous predictions that the general higher rates of pollen flow compared to seed flow could obscure the effects of seed dispersal mode on genetic differentiation at nuclear molecular markers [17,53]. Indeed, many works show higher contribution of pollen dispersal than seed dispersal to gene flow in Angiospermae [53] and in Neotropical plants [54,55]. In the surveyed data, we found genetic differentiation for both chloroplast (FST m ) and nuclear genome (FST b ) for 16 species. The ratio of pollen to seed flow considering inbreeding calculated as proposed by [56], i.e. (pollen flow)/(seed flow) , was higher than 1 for all species except one, indicating higher contribution of pollen than seed to historical gene flow, which may explains our non significant results.
Despite the differences in data set, our results agree with that obtained by [17]. They also found no significant relationships of breeding system for genetic differentiation for both nuclear molecular markers and maternally inherited markers but significant effects for mating system and pollination mode for nuclear molecular markers. They analyzed 150 species from all climate zones worldwide but species from Northern Hemisphere are overrepresented in their data. Here, we focused exclusively on Neotropical species and analyzed a rather larger data set (186 species). Thus, phylogenetic relationships among analyzed species may be different due to taxonomic and evolutionary history in different climatic zones and biomes. However, our finds on the effects of dispersal mode differ from [17].
Many works found relationships between growth form and genetic polymorphism and differentiation for allozymes/isozymes, i.e. long-lived woody perennials have higher levels of allozyme variation and lower genetic differentiation [6][7][8][9]. Our results do not corroborate this pattern. We found significant relationships for isozymes only between inbreeding coefficient and growth form and habitat. Though such relationships did not hold anymore when phylogenetic non-independence was taken into account, most likely due to the phylogenetic signal in growth form and also due to the different taxonomic sampling bias (their data included Gymnospermae and more species from Temperate than Neotropical regions).

Concluding Remarks
Our findings show that LHTs affect substantially the pattern of genetic diversity in Neotropical plants, hence may drive key evolutionary processes along with ecological features. Considering phylogenetic non-independence, pollination mode and mating system were the LHT with significant effects on genetic differentiation (F ST ) and pollination mode for allelic richness (AR) for nuclear microsatellites. Ecological attributes also explained genetic differentiation for nuclear microsatellites. For chloroplast genome, studies published so far using CAPs, sequencing or microsatellites show no significant patterns associated to LHT or ecological attributes in genetic differentiation neither in genetic diversity (h and π). Genetic diversity for nuclear genome (He S and He) could not be explained by any LHT or geographical range. However, habitat was significantly related to genetic diversity overall population (He S ) at nuclear dominant markers. We expected that trees have significant higher genetic diversity and allelic richness, and lower F ST and F IS than the other growth forms. However, we found no significant effect of growth form in any genetic parameter analyzed. Nevertheless, plants with outcrossing mating system had lower F ST .
Since here we showed that some LHT are phylogenetically non-independent and that the relationship between LHT and ecological attributes and genetic parameters might be different when phylogenetic non-independence is taken into account, we call for caution when interpreting results from previous reviews. Most reviews seeking for patterns in genetic diversity and differentiation underestimated (or even did not account for) the effect of phylogenetic non-independence. Spurious relationships may thus arise from conventional statistical analysis by inflating type I error when not accounted for such non-independence. We acknowledge the limitations of our results due to biased taxonomical sampling in the data set and the low number of studied species for some LHT. However, this is a first attempt to find patterns in genetic diversity and genetic differentiation for Neotropical plants using a large data set from Neotropical plants.  Table. Mean values of the posterior distribution of the GLMM for nuclear genome, for genetic parameters. He, genetic diversity within population; F IS , inbreeding coefficient; AR, allelic richness. Note that both genetic parameters are calculated for isozymes and microsatellite makers. Significant values are denoted in bold and grey-shaded. (DOCX) S7 Table. Mean values of the posterior distribution of the GLMM for nuclear genome, for genetic parameters. h, haplotype diversity π, nucleotide diversity. Significant values are denoted in bold. (DOCX) S8 Table. Mean values of the posterior distribution of the GLMM for chloroplast genome, for genetic parameters. F ST , genetic differentiation among populations; h, haplotype diversity; π, nucleotide diversity. Significant values are denoted in bold and grey-shaded. (DOCX) S9 Table. Phylogenetic generalized least squares for growth form, for each genetic parameter analyzed. Significant values are denoted in bold. F ST , genetic differentiation among populations; He S , mean genetic diversity among populations, He, genetic diversity within population; F IS , inbreeding coefficient. SE, standard error. (DOCX) S10 Table. Phylogenetic generalized least squares for dispersal mode, for each genetic parameter analyzed. Significant values are denoted in bold. F ST , genetic differentiation among populations; He S , mean genetic diversity among populations, He, genetic diversity within population; F IS , inbreeding coefficient. SE, standard error. (DOCX) S11 Table. Phylogenetic generalized least squares for breeding system, for each genetic parameter analyzed. Significant values are denoted in bold. F ST , genetic differentiation among populations; He S , mean genetic diversity among populations, He, genetic diversity within population; F IS , inbreeding coefficient. SE, standard error. (DOCX)