Although interfertility is the key criterion upon which Mayr’s biological species concept is based, it has never been applied directly to delimit species under natural conditions. Our study fills this gap. We used the interfertility criterion to delimit two closely related oak species in a forest stand by analyzing the network of natural mating events between individuals. The results reveal two groups of interfertile individuals connected by only few mating events. These two groups were largely congruent with those determined using other criteria (morphological similarity, genotypic similarity and individual relatedness). Our study, therefore, shows that the analysis of mating networks is an effective method to delimit species based on the interfertility criterion, provided that adequate network data can be assembled. Our study also shows that although species boundaries are highly congruent across methods of species delimitation, they are not exactly the same. Most of the differences stem from assignment of individuals to an intermediate category. The discrepancies between methods may reflect a biological reality. Indeed, the interfertility criterion is an environment-dependant criterion as species abundances typically affect rates of hybridization under natural conditions. Thus, the methods of species delimitation based on the interfertility criterion are expected to give results slightly different from those based on environment-independent criteria (such as the genotypic similarity criteria). However, whatever the criterion chosen, the challenge we face when delimiting species is to summarize continuous but non-uniform variations in biological diversity. The grade of membership model that we use in this study appears as an appropriate tool.
Citation: Lagache L, Leger J-B, Daudin J-J, Petit RJ, Vacher C (2013) Putting the Biological Species Concept to the Test: Using Mating Networks to Delimit Species. PLoS ONE 8(6): e68267. https://doi.org/10.1371/journal.pone.0068267
Editor: Enrico Scalas, Universita' del Piemonte Orientale, Italy
Received: January 17, 2013; Accepted: May 28, 2013; Published: June 20, 2013
Copyright: © 2013 Lagache et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: Funding was provided by the LinkTree project (ANR BIODIVERSA), by the EU Network of Excellence EvolTree, by ANR-10-EQPX-16 XYLOFOREST and by the INRA (CJS grant). The genotyping was funded by grants from the Conseil Régional d'Aquitaine n°20030304002FA, n°20040305003FA and from the European Union, FEDER n°2003227. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
According to the biological species concept, the ability to interbreed (i.e. interfertility) is a defining property of species . Yet, to our knowledge, the interfertility criterion has never been used to delimit species on the basis of mating events observed under natural conditions. Only artificial crosses have been used for this purpose, including in fungi (e.g. ), plants , or insects . However, this approach has been criticized (e.g. [5,6]) because artificial crosses bypass some pre-mating barriers to hybridization: mating events observed under artificial conditions might not reflect what would naturally occur. Hence, to date, there is no satisfactory example of the use of the interfertility criterion to delimit species. In fact, the methods used most frequently for species delimitation are not derived from the well-known biological species concept but are derived from other concepts such as the phylogenetic species concept, the genotypic species concept and the morphological species concept. Species definitions according to these concepts and possible associated criteria for species delimitation are listed in Table 1.
|Species concept||Species definition according to this concept||Possible criterion for species delimitation derived from this definition||Possible method of species delimitation using this criterion||First application of this method at the study site|
|Biological species concept||Species are “groups of actually or potentially interbreeding natural populations, which are reproductively isolated from other such groups”. According to Hausdorf , “natural populations” can be replaced by “individuals” in this statement without change of meaning.||Higher natural interfertility between individuals within than among species||Clustering of the network of natural mating events between individuals using Continuous Stochastic Block Model (C-SBM) .||this study|
|Phylogenetic species concept||A species is “a diagnosable cluster of individuals within which there is a parental pattern of ancestry and descent, beyond which there is not, and which exhibits a pattern of phylogenetic ancestry and descent among units of like kind” .||Higher genetic relatedness between individuals within than among species||Clustering of the network of relatedness relationships between individuals using C-SBM .||this study|
|Genotypic species concept||A species is a “genotypic cluster of individuals that can overlap without fusing with its siblings” [17,52]||Higher genotypic similarity between individuals within a species||Clustering of the individuals based on their multilocus genotype with STRUCTURE ||Guichoux et al. 2012 |
|Morphological species concept||Species are “the smallest detected samples of self-perpetuating organisms that have unique sets of characters” [53,54].||Higher morphological similarity between individuals within than among species||Clustering of the individuals based on several morphological traits with a factorial discriminant analysis .||Bacilieri et al. 1996 |
One potential method of species delimitation based on the interfertility criterion is the analysis of mating networks. Mating networks represent mating events between individuals . Nodes of the network represent the individuals and links connect the individuals between whom mating events have occurred. Applying methods of network clustering [8–10] to mating networks should allow the identification of subsets of strongly interconnected nodes that correspond to species. If the biological species concept is strictly interpreted, then a species should correspond to a connected component of the mating network (Figure 1A). A connected component is a subset of nodes within the network that are directly or indirectly connected but are not connected to nodes not contained in the subset. According to a relaxed biological species concept, which allows for some level of hybridization between species [11–13], a species should correspond to a community in the mating network (Figure 1B). Communities are subsets of nodes with a high density of links within the group and a lower density of links between different groups . It is in this latter case, when species hybridize, that species delimitation based on the interfertility criterion is particularly challenging and network analysis may be particularly useful.
Each node of the network, represented by a black star or a white circle, is an individual. Each link of the network, represented by a thin black line, corresponds to a mating event between two individuals. In A, there is no mating event between the two groups of individuals whereas in B, a few mating events occur between groups. Species boundaries according to a strict application of the biological species concept are indicated by a continuous thick black line. Species boundaries according to a relaxed interpretation of the biological species concept, allowing interspecific hybridization, are indicated by a broken red line. In network theory, the continuous black line delimits the connected components of the network whereas the broken red line delimits communities.
The idea of analyzing mating networks to delimit species according to the biological species concept was proposed more than 40 years ago by Sokal and Crovello  but it does not appear to have been put into practice. Building a mating network is indeed a difficult task as it requires a very large data set of mating events collected under natural conditions. The species should be sympatric and have semi-permeable reproductive barriers so that the issue of species delimitation is relevant. Furthermore, the species should be not only outcrossing (with a low selfing rate) but also highly polygamous and have multiple offspring per generation so that actual mating events are representative of potential mating events between individuals at a given time [15–17]. If such data were available, would the analysis of mating networks be an effective method to delimit species based on the interfertility criterion? Would the boundaries between species be the same as those obtained using other species delimitation criteria?
To answer these questions, we investigate the congruence between four methods of species delimitation, derived from the biological, morphological, genotypic and phylogenetic species concepts (Table 1), by applying them to two hybridizing tree species living in sympatry. The study site is a 5 ha mixed stand of Quercus robur and Q. petraea comprising 298 adult trees originating from natural regeneration . As many other closely related plant species , these two oak species hybridize under natural conditions , including in the studied stand [21–23]. To delimit species according to the interfertility criterion, we analyze the network of observed natural mating events between pairs of adult trees by using a method of network clustering. Each node of the mating network corresponds to an adult tree and each link corresponds to at least one mating event between two trees. To cluster individuals, we selected among available methods of network clustering [8–10] the Continuous Stochastic Block Model (C-SBM) recently introduced by Daudin et al. . C-SBM synthesizes the heterogeneity of a real network by producing a simplified version of the network composed of a few virtual nodes, called extremal hypothetical nodes (EHNs). Unlike many methods of network clustering, which assume that each node belongs to only one group, C-SBM allows nodes to exhibit mixed connectivity behavior by assuming that each node of the real network is a mixture of the EHNs. This method is thus particularly suited to our study. Indeed, because the two previously identified oak species [23,25] are known to hybridize , we expected to find some individuals with a mixed reproductive behavior, i.e. breeding with both species. The same method was used to delimit species based on genetic relatedness between individuals. In that case, each node of the network corresponds to an adult tree and links connect the individuals that are considered to be related based on their genotype. Finally, we compare individual assignments obtained by analyzing the mating network and the relatedness network with those previously obtained in the same study site using criteria of morphological and genotypic similarities [23,25]. We then discuss how to summarize continuous but non-uniform variations in biological diversity.
Species Delimitation based on Interfertility
According to the AIC criterion, the best model for the mating network was the one with four EHNs, followed by the models with five and three EHNs (Figure S1 in File S1). We selected the model with three EHNs because the two other models highlighted the structure of the sampling design (Text S1 in File S1). According to the connectivity matrix for the EHNs (Figure 2A), EHN0 corresponds to a virtual node not connected to the whole network. This EHN, which is systematically present in the network models produced by C-SBM , makes it possible to take into account the variation in the number of links attached to the nodes of the real network. The two other EHNs, called EHNB1 and EHNB2, were strongly connected within themselves and were not connected to the other EHNs.
In A, nodes that are on the edge between EHN0 and EHNB1 are classified in group B1 whilst nodes on the edge between EHN0 and EHNB2 are classified in group B2. Other individuals are classified as intermediates (group Bi). In B, nodes that are on the edge between EHN0 and EHNP1 are classified in group P1 whilst nodes on the edge between EHN0 and EHNP2 are classified in group P2. Other individuals are classified as intermediates (group Pi). Connectivity matrices for the EHNs are presented next to each triangular representation. Non-zero values are given in bold.
The nodes of the mating network (each corresponding to an individual) were then represented in a triangle, with one EHN at each point (Figure 2A). The higher the proportion of a given EHN in the mixture of a node, the closer the node was to this EHN in the triangle. According to the connectivity matrix for the EHNs (Figure 2A), the nodes that had a high proportion of EHN0 in their mixture were weakly connected to the mating network. The nodes that had a high proportion of EHNB1 in their mixture belonged to a group of nodes strongly connected to each other and weakly connected to nodes with a high proportion of EHNB2. Conversely, the nodes that had a high proportion of EHNB2 in their mixture belonged to a group of nodes strongly connected to each other and weakly connected to nodes with a high proportion of EHNB1. There were, therefore, two groups of adult trees in the mating network within which mating events were frequent and between which mating events were rare. The graphical representation of the network confirmed this result (Figure 3A). According to the relaxed interpretation of the biological species concept, these two groups of individuals should correspond to two biological species (Figure 1B).
Individuals classified into the B1 group (in A) or the P1 group (in B) are shown in green, individuals belonging to the B2 group (in A) or the P2 group (in B) are shown in yellow, and intermediate individuals are shown in black.
In order to assign the individuals to the two species, we classified the nodes of the mating network according to their relative proportions of EHNB1 and EHNB2. We assumed that an individual belonged to species B1 if the corresponding node was a mixture of EHN0 and EHNB1 and only of these two nodes. Conversely, we assumed that an individual belonged to species B2 if the corresponding node was a mixture of EHN0 and EHNB2. Other individuals were classified as being reproductively intermediate (group Bi). In the triangular representation (Figure 2A), individuals assigned to species B1 were on the edge between EHN0 and EHNB1 (n=78 individuals) whilst individuals assigned to species B2 were on the edge between EHN0 and EHNB2 (n=121 individuals). Intermediate individuals were within the triangle (n=7 individuals). The three groups are shown in different colors in the network representation (Figure 3A).
Species Delimitation based on Relatedness
According to the AIC criterion, the optimal number of EHNs in the relatedness network was six. Models with three, four, five and seven EHNs were also good models (Figure S2 in File S1). As we did not find any satisfactory way to identify the best model (Text S2 in File S1), we selected the model with three EHNs to facilitate a comparison between the relatedness network structure and the mating network structure. According to the connectivity matrix for the EHNs (Figure 2B), EHN0 corresponded to a virtual node not connected to the whole network. The two other EHNs, called EHNP1 and EHNP2, were strongly connected within themselves and were not connected to the other EHNs. Like the mating network, the individuals were, therefore, classified into three groups called P1, P2 and Pi. Group P1 (n=70 individuals located on the edge between EHN0 and EHNP1 in the triangular representation; Figure 2B) and group P2 (n=108 individuals located on the edge between EHN0 and EHNP2; Figure 2B) comprised individuals with high within-group and low between-group degrees of relatedness. The third group Pi (n=28 individuals located within the triangle; Figure 2B) included trees related to both P1 and P2 individuals and trees with few relatives. The three groups are shown in different colors in the network representation (Figure 3B).
Species Delimitation based on Morphology and Multilocus Genotypes
The morphological similarity criterion has previously been used by Bacilieri et al.  to identify all trees from the study site. Based on their results, we assigned the individuals to two pure morphological groups (called M1 and M2 in this study and corresponding to Q. robur and Q. petraea, respectively) and to a morphologically intermediate class (called Mi). Guichoux et al.  used genotypic similarity as a criterion to assign the trees of the study site to species. Based on their results, we classified the adult trees in two purebred groups (hereafter called G1 and G2) and one genetically intermediate class (Gi).
Congruence between the Four Methods of Species Delimitation
In order to assess the congruence between the four methods of species delimitation, we compared the spatial distribution of the three groups of individuals identified with each method. The species boundaries are very similar (Figure 4). Among the 206 adult trees included in the mating network and in the relatedness network, there were 97 trees classified consistently in the B1, P1, G1 and M1 groups and 63 trees classified consistently in the B2, P2, G2 and M2 groups. We therefore re-named groups B1, P1, G1 and M1 Q. robur and groups B2, P2, G2 and M2 Q. petraea. Based on this classification, there were only four species inversions associated with the delimitation methods (Table S1 in File S1). Among the 206 adult trees, 42 were classified as intermediates according to at least one method. Surprisingly, no individual was classified as intermediate according to all four methods. Therefore, 91% of the discrepancies between the four methods were caused by assignments to the intermediate class (Figure S3 and Table S1 in File S1).
In A, B and C, individuals classified into the B1, P1 or G1 species, respectively, are represented by yellow triangles. Individuals classified into the B2, P2 or G2 species are represented by green diamonds. Intermediate individuals are represented by black crosses. In D, individuals classified into M1 are shown in red, individuals classified into M2 in blue and morphologically intermediate individuals are indicated by black crosses. Individuals of the M1 group are assigned to Q. robur and individuals of the M2 group to Q. petraea on the basis of current taxonomical practices.
There were nine discrepancies between the individual assignments according to the genotypic and morphological similarity criteria on the one hand and the interfertility criterion on the other hand. We investigated whether the biotic environment of the individuals might account for these discrepancies. Our hypothesis is that the neighborhood of each tree influences its mating system and might thus influence its assignment to species based on the interfertility criterion, whereas it would hardly affect its assignment to species based on the genotypic and morphological criteria. We therefore examined the neighborhood of each tree for which the assignment to species based on genotypic and morphological similarity criteria were congruent (N= 192). For each tree, we calculated the proportion of allospecific neighbors within a radius of 69m (corresponding to the average distance of pollen dispersal within stand for Q. petraea, the species with the smallest dispersal ability ). We found, by performing a logistic regression, that the proportion of allospecific neighbors had a significant effect on the congruence between the individual assignments according to the genotypic and morphological similarity criteria on the one hand and the interfertility criterion on the other hand (χ2=6.5, df=1, p-value=0.01). The individuals with congruent assignments had fewer allospecific neighbors on average (29%, versus 51% for individuals with incongruent assignments). Hence, individual species assignments based on the interfertility criterion were environment-dependent.
To our knowledge, this is the first time that the interfertility criterion is used successfully to delimit species under natural conditions. The analysis of a network of mating events between pairs of adult trees, constructed on the basis of a powerful paternity analysis of a large number of seedlings produced under natural conditions, allowed us to identify two groups of interfertile individuals with only a few mating events between groups. The two groups that were delimited, corresponding to two species according to a relaxed interpretation of the biological species concept (Figure 1B), were closely congruent with those obtained previously using morphological and genotypic similarity as criteria for species delimitation [23,26]. Indeed, 88% of the individuals were classified consistently according to the interfertility, morphological similarity and genotypic similarity criteria. Our results do not support earlier claims that the interfertility criterion cannot be applied in the field (e.g. [14,15]), particularly in the genus Quercus . They show instead that the analysis of mating networks can be used for delimiting species according to the biological species concept, as first suggested by Sokal and Crovello .
However this method of species delimitation has two main drawbacks. First, adequate network data are difficult to assemble. In our study we performed a paternity analysis on as many as 3046 offspring produced by 51 mothers in order to construct the mating network for adult trees. Despite the very large number of offspring, our network data did not allow us to assign all the individuals in the forest stand to species. Not all individuals sired offspring and some sired too few offspring to be reliably connected to the network. For example, three of the individuals whose assignment based on the interfertility criterion differed from that based on the three other criteria were represented by a single offspring in the progeny test. They were thus connected to the mating network through just a single link. Second, the sampling design may generate some heterogeneity in the network structure that blurs the biological heterogeneity caused by the existence of different species. This happened in our network data because we harvested the offspring of only 20% of the trees in the stand. The harvested trees (i.e. mother-trees), therefore, had more links in the mating network than the other trees. To solve both problems, one would have to harvest seeds from all the individuals in the stand, assuming that all of them produced seeds. In principle, this goal could be achieved with our biological system by extending sampling over multiple years, because oak species are perennial and monoecious. However this would be impossible for annual or dioecious species. Another possibility to reduce the noise caused by sampling would be to introduce the sampling structure as a covariate in the statistical model (e.g. ). Unfortunately, the Continuous Stochastic Block Model , which was selected for this study because it allows modeling continuous variations in the connectivity properties of the nodes, does not currently allow the incorporation of covariates.
Our results further show that the analysis of the network of contemporary relatedness relationships is a relevant method for delimiting species. The two groups found in our study might be interpreted as corresponding to two different ‘phylogenetic species’ , if phylogenetic relationships are considered in a broad sense so as to include contemporary pedigree relationships. Methods of species delimitation derived from the phylogenetic species concept have almost exclusively focused on deep ancestry using tree-based phylogenetic methods (reviewed in 30, but see 31). These methods are not well-suited for delimiting hybridizing species because horizontal gene transfers between species, caused by hybridization and subsequent backcrossing events, produce conflicts between gene trees and species trees [32,33]. Compared to data on mating events, data on relatedness were easier to acquire and there was no sampling issue. The analysis of the relatedness network revealed two groups of individuals with high within-group and low between-group degrees of relatedness. These two groups were highly congruent with those obtained using interfertility, morphological similarity and genotypic similarity as criteria, indicating that the analysis of relatedness networks may have potential for species delimitation. However, this method also has some drawbacks: the best model had five groups of related individuals and we did not find any hypothesis accounting for their origin; the number of species should thus be known in advance in order to apply this method.
By comparing the results obtained with the four criteria used for species delimitation (i.e. interfertility, relatedness relationships, morphological or genotypic similarities), we showed that the species boundaries were largely congruent across methods of species delimitation. Our analyses confirmed the existence of two groups of individuals that were both morphologically and genetically differentiated. We also showed that the individuals of each group preferentially mated and were more related with each other than with individuals from the other group. Therefore, there were two ‘evolutionary lineages’ in the studied stand. The Lineage Species Concept introduced by Simpson [34,35], then taken up by Wiley  and de Queiroz [16,37,38], focuses on the question of congruence among methods of species delimitation. For these authors, modern species concepts (e.g. morphological, phylogenetic, genotypic and biological) assimilate, explicitly or implicitly, species ‘to separately evolving (segments of) metapopulation lineages’ and are thus all by-products of the lineage species concept [16,17]. This should account for the high degree of congruence among species delimitation methods.
Another important result of this comparison is that, irrespective of the criterion used for delimiting species, we found intermediate individuals that had features of both species. Interestingly, the individuals classified as intermediates often differed across methods. In particular, no individual was consistently classified as intermediate according to all four methods. These discrepancies might be explained by the thresholds that were chosen empirically to delimit purebred species and by data quality problems. As mentioned above, examining more offspring per parent tree may improve species delimitation based on the interfertility criterion. Similarly, a greater number of molecular markers  may improve methods of species delimitation based on the genotypic and relatedness criteria. Likewise, a larger number of morphological markers  may improve morphological species delimitation. However, we believe that these discrepancies may also reflect a biological reality. Indeed, as shown in other studies [40–42], including in oaks [22,43], species relative abundance affects hybridization rate. An individual tends to reproduce with its neighbours. If it is surrounded by numerous allospecifics and few conspecifics (e.g. [22,42]), this can result in much hybridization. Such an individual will tend to be assigned to another species or to a reproductively intermediate class, according to methods of delimitation based on interfertility. Therefore, we expect some discrepancies in species assignments between methods based on environment-dependent criteria (such as that based on the interfertility criterion) and methods based on environment-independent criteria (such as that based on the genotypic similarity criterion). Because of these fundamental differences among methods, it is impossible to compute a reference dataset that would give the correct assignment of each individual. Our results thus cannot be used to identify one method of species delimitation that would produce more reliable assignments than the others. Instead, our results show that different methods of species delimitation produce slightly different results when applied to real biological data.
Our results confirmed the existence of two differentiated groups of individuals at the study site, corresponding to two species: Quercus robur and Q. petraea. However, depending on the criterion used for assigning individuals to species (i.e. interfertility, relatedness, morphological or genotypic similarities), the boundary between species was not exactly the same. Most of the differences stem from assignment of individuals to an intermediate category. This finding illustrates the continuous nature of variation between species. The model we used, which belongs to a category called ‘grade of membership models’ (reviewed in 10) is appropriate for synthesizing continuous (but not uniform) variations in biological diversity. However, to get closer to the species concepts, which generally define species as groups of individuals, we finally classified the individuals into non-overlapping groups. Our approach, therefore, illustrates the influence of concepts on our (mis)representation of species and on our understanding of biological diversity. Frost and Hillis , as well as Mayr , proposed defining species as ‘a whole’ instead of as a group of individuals. According to our study, species could also be defined as an ‘extreme point’ to which individuals are more or less close, thus allowing the possibility of an individual being a mixture of two different species.
Materials and Methods
Species Delimitation based on Interfertillity
To construct the mating network for the adult trees, we made use of a progeny test involving 3046 offspring resulting from open pollination, harvested from 51 mother-trees distributed across the entire stand (Figure S4 in File S1). A paternity analysis was conducted  by genotyping all the offspring from the test and all the adults trees for which DNA was available, using 12 multiplexed microsatellite (SSR) markers developed by Guichoux et al. . According to the paternity analysis, 1575 offspring had only one possible father in the stand, 54 offspring had several potential fathers in the stand and 1417 offspring had no father in the stand . Based on the offspring for which only a single father was found, we identified 198 father-trees in the stand. These trees included 43 trees that were also mothers, because oak trees are monoecious. Based on these results, we reconstructed 1629 mating events between 206 adult trees within the stand. These mating events allowed us to identify 751 couples of trees that mated at least once, indicating that they were interfertile under natural conditions. These data were represented by an undirected and unweighted network in which each of the 206 nodes corresponded to an adult tree and each of the 751 links corresponded to at least one mating event between two trees.
Then, the network was modeled with C-SBM . The parameters of the model are the connectivity coefficients between the EHNs and the coefficients of the mixture of EHNs for each node of the real network. For each possible number of EHNs, the parameters of the model were inferred by the maximum likelihood method, derived using the MATLAB program C-Mixnet (available at http://www.agroparistech.fr/mia/doku.php?id=productions:logiciels/). Then, the optimal number of EHNs in the network was determined by using the AIC criterion . The results were visualized with the software pajek .
Species Delimitation based on Relatedness
In order to build the relatedness network, we estimated the relatedness of the 206 adult trees included in the mating network. The estimation was performed with the software COANCESTRY , which offers seven different estimators of relatedness. As recommended by Wang , we used the 1629 offspring for which both parents were known to determine the most suitable estimator. The triadic likelihood estimator (denoted TrioML in COANCESTRY ) was selected because it produced relatedness values closest to zero for unrelated offspring, closest to 0.25 for half-sibs and closest to 0.5 for full-sibs. With this estimator, the highest relatedness value between two unrelated offspring was 0.22. We therefore treated 0.22 as a threshold: trees whose relatedness value was higher than this were considered to be related individuals and the other trees were considered to be unrelated. The relatedness relationships were then represented by an unweighted and undirected network with 206 nodes, each corresponding to an adult tree, and 1078 links connecting the individuals considered to be related. As in the case of the mating network, we modeled the network structure using C-SBM  and we visualized the results with the software pajek .
Species Delimitation based on Morphology
The morphological similarity criterion has previously been used by Bacilieri et al.  to identify all trees from the study site. These authors performed a factorial discriminant analysis (FDA) based on 31 leaf morphological traits to delimit the species. Their study revealed the presence of two groups of individuals differing in their morphology. The first axis of the FDA accounted for 33% of the total variance and was highly correlated to the morphological markers traditionally used by taxonomists to distinguish Q. robur from Q. petraea. The distribution of the individuals along this axis was used to assign, graphically, the individuals to two pure morphological groups (called M1 and M2 in this study and corresponding to Q. robur and Q. petraea respectively) and to a morphologically intermediate class (called Mi). Among the 206 adult trees included in the mating and relatedness networks, 123 trees were assigned to M1, 80 to M2 and 3 to Mi (Figure S5 in File S1).
Species Delimitation based on Multilocus Genotypes
Guichoux et al.  used genotypic similarity as a criterion to assign the trees of the study site to species. These authors genotyped the adult trees with the multiplex of 12 SSRs developed by Guichoux et al.  and with a chip of 262 single-nucleotide polymorphisms (SNP) enriched with markers highly differentiated between species . They used the software structure  to group the individuals into genotypic clusters but did not formally determine the optimal number of genotypic clusters in the stand before performing the clustering. Here we used the ΔK statistic  to identify the number of genetically different groups. The optimal number of clusters was two (Figure S6 in File S1), as previously assumed by Guichoux et al. . The adult trees were therefore classified in two purebred groups and one genetically intermediate class. Among the 206 adult trees included in the mating and relatedness networks, 78 trees were assigned to the first purebred group (hereafter called G1), 118 to the second purebred group (G2) and 10 to the genetically intermediate class (Gi) (Figure S7 in File S1).
We are grateful to Alexis Ducousso who established the Petite Charnie progeny test and shared information about the stand, and for his help, together with that of Stefanie Wagner, during sampling. We thank ONF for the management of the stand. Patrick Léger greatly helped with microsatellite genotyping. The genotyping was performed at the Genome-Transcriptome facility of Bordeaux. We are particularly grateful to François Hubert for helpful discussions about phylogeny and to Bastien Castagneyrol, Virgil Fievet, Cyril Dutech, Antoine Kremer and two anonymous reviewers for their constructive comments on the manuscript.
Conceived and designed the experiments: LL RJP. Performed the experiments: LL. Analyzed the data: LL JBL JJD CV. Contributed reagents/materials/analysis tools: JBL JJD. Wrote the manuscript: LL JBL JJD RJP CV.
- 1. Mayr E (1942) Systematics and the origin of species. New York: Columbia University Press.
- 2. Dettman JR, Jacobson DJ, Turner E, Pringle A, Taylor JW (2003) Reproductive isolation and phylogenetic divergence in Neurospora: comparing methods of species recognition in a model eukaryote. Evolution 57: 2721-2741. doi:https://doi.org/10.1554/03-074. PubMed: 14761052.
- 3. Marcussen T, Borgen L (2011) Species delimitation in the Ponto-Caucasian Viola sieheana complex, based on evidence from allozymes, morphology, ploidy levels, and crossing experiments. Plant Syst Evol 291: 183-196. doi:https://doi.org/10.1007/s00606-010-0377-z.
- 4. Marin J, Crouau-Roy B, Hemptinne J-L, Lecompte E, Magro A (2010) Coccinella septempunctata (Coleoptera, Coccinellidae): a species complex? Zool Scripta 39: 591-602. doi:https://doi.org/10.1111/j.1463-6409.2010.00450.x.
- 5. Hibbett DS, Fukumasa-Nakai Y, Tsuneda A, Donoghue MJ (1995) Phylogenetic diversity in shiitake inferred from nuclear ribosomal DNA sequences. Mycologia 87: 618-638. doi:https://doi.org/10.2307/3760806.
- 6. Taylor JW, Jacobson DJ, Kroken S, Kasuga T, Geiser DM et al. (2000) Phylogenetic species recognition and species concepts in fungi. Fungal Genet Biol 31: 21-32. doi:https://doi.org/10.1006/fgbi.2000.1228. PubMed: 11118132.
- 7. Fortuna MA, García C, Guimarães PR Jr, Bascompte J (2008) Spatial mating networks in insect-pollinated plants. Ecol Lett 11: 490-498. doi:https://doi.org/10.1111/j.1461-0248.2008.01167.x. PubMed: 18318718.
- 8. Newman M (2003) The structure and function of complex networks. Siam Rev 45: 167-256. doi:https://doi.org/10.1137/S003614450342480.
- 9. Fortunato S (2010) Community detection in graphs. Phys Rep 486: 75-174. doi:https://doi.org/10.1016/j.physrep.2009.11.002.
- 10. Leger J-B, Vacher C, Daudin J-J (2013) Detection of structurally homogeneous subsets in graphs. Statist Comput (in press).
- 11. Orr HA (2001) Some doubts about (yet another) view of species. J Evol Biol 14: 870-871. doi:https://doi.org/10.1046/j.1420-9101.2001.00340.x.
- 12. Noor MAF (2002) Is the Biological Species Concept showing its age? Trends Ecol Evol 17: 153-154. doi:https://doi.org/10.1016/S0169-5347(02)02452-7.
- 13. Coyne JA, Orr HA (2004) Speciation. Sunderland, Mass., USA: Sinauer Associates.
- 14. Sokal RR, Crovello TJ (1970) The Biological Species Concept: a critical evaluation. Am Nat 104: 127-153. doi:https://doi.org/10.1086/282646.
- 15. de Meeûs T, Durand P, Renaud F (2003) Species concepts: what for? Trends Parasitol 19: 425-427. doi:https://doi.org/10.1016/S1471-4922(03)00195-8. PubMed: 14519574.
- 16. de Queiroz (2005) Ernst Mayr and the modern concept of species. Proc Natl Acad Sci U S A 102: 6600-6607. doi:https://doi.org/10.1073/pnas.0502030102. PubMed: 15851674.
- 17. Hausdorf B (2011) Progress toward a general species concept. Evolution 65: 923-931. doi:https://doi.org/10.1111/j.1558-5646.2011.01231.x. PubMed: 21463293.
- 18. Streiff R, Ducousso A, Kremer A (1998) Spatial genetic structure and pollen gene flow in a mixed oak stand. Genet Sel Evol 30: S137-S152. doi:https://doi.org/10.1186/1297-9686-30-S1-S137.
- 19. Rieseberg LH, Carney SE (1998) Plant hybridization. New Phytol 140: 599-624. doi:https://doi.org/10.1046/j.1469-8137.1998.00315.x.
- 20. Petit RJ, Bodénès C, Ducousso A, Roussel G, Kremer A (2003) Hybridization as a mechanism of invasion in oaks. New Phytol 161: 151-164. doi:https://doi.org/10.1046/j.1469-8137.2003.00944.x.
- 21. Lepais O, Gerber S (2011) Reproductive patterns shape introgression dynamics and species succession within the European white oak species complex. Evolution 65: 156-170. doi:https://doi.org/10.1111/j.1558-5646.2010.01101.x. PubMed: 20722727.
- 22. Lagache L, Klein EK, Guichoux E, Petit RJ (2013) Fine-scale environmental control of hybridization in oaks. Mol Ecol 22: 423-436. doi:https://doi.org/10.1111/mec.12121. PubMed: 23173566.
- 23. Guichoux E, Garnier-Géré P, Lagache L, Lang T, Bourry C et al. (2013) Outlier loci highlight the direction of introgression in oaks. Mol Ecol 22: 450-462. doi:https://doi.org/10.1111/mec.12125. PubMed: 23190431.
- 24. Daudin J-J, Pierre L, Vacher C (2010) Model for heterogeneous random networks using continuous latent variables and an application to a tree–fungus network. Biometrics 66: 1043-1051. doi:https://doi.org/10.1111/j.1541-0420.2009.01378.x. PubMed: 20105159.
- 25. Bacilieri R, Ducousso A, Petit RJ, Kremer A (1996) Mating system and asymmetric hybridization in a mixed stand of European oaks. Evolution 50: 900-908. doi:https://doi.org/10.2307/2410861.
- 26. Bacilieri R, Ducousso A, Kremer A (1996) Comparison of morphological characters and molecular markers for the analysis of hybridization in sessile and pedunculate oak. Ann Sci Forestières 53: 79-91. doi:https://doi.org/10.1051/forest:19960106.
- 27. Donoghue MJ (1985) A critique of the Biological Species Concept and recommendations for a phylogenetic alternative. Bryologist 88: 172-181. doi:https://doi.org/10.2307/3243026.
- 28. Mariadassou M, Robin S, Vacher C (2010) Uncovering latent structure in valued graphs: a variational approach. Annals Appl Statistics 4: 715-742. doi:https://doi.org/10.1214/10-AOAS361.
- 29. Eldredge N, Cracraft J (1980) Phylogenetic patterns and the evolutionary process. New York: Columbia University Press.
- 30. Sites JJW, Jonathon CM (2004) Operational criteria for delimiting species. Annu Rev Ecol Evol Syst 35: 199-227. doi:https://doi.org/10.1146/annurev.ecolsys.35.112202.130128.
- 31. Moalic Y, Arnaud-Haond S, Perrin C, Pearson GA, Serrao EA (2011) Travelling in time with networks: revealing present day hybridization versus ancestral polymorphism between two species of brown algae, Fucus vesiculosus and F. spiralis. BMC Evol Biol 11: 33. doi:https://doi.org/10.1186/1471-2148-11-33. PubMed: 21281515.
- 32. Edwards SV (2009) Is a new and general theory of molecular systematics emerging? Evolution 63: 1-19. doi:https://doi.org/10.1111/j.1558-5646.2008.00549.x. PubMed: 19146594.
- 33. Maddison WP (1997) Gene trees in species trees. Syst Biol 46: 523-536. doi:https://doi.org/10.1093/sysbio/46.3.523.
- 34. Simpson GG (1951) The species concept. Evolution 5: 285-298. doi:https://doi.org/10.2307/2405675.
- 35. Simpson GG (1961) Principles of animal taxonomy. New York: Columbia University Press.
- 36. Wiley EO (1978) The Evolutionary Species Concept reconsidered. Syst Biol 27: 17-26.
- 37. de Queiroz K (2007) Species concepts and species delimitation. Syst Biol 56: 879-886. doi:https://doi.org/10.1080/10635150701701083. PubMed: 18027281.
- 38. de Queiroz K (1998) The General Lineage Concept of species, species criteria, and the process of speciation. Oxford University Press.
- 39. Vähä J-P, Primmer CR (2006) Efficiency of model-based Bayesian methods for detecting hybrid individuals under different hybridization scenarios and with different numbers of loci. Mol Ecol 15: 63-72. PubMed: 16367830.
- 40. Field DL, Ayre DJ, Whelan RJ, Young AG (2008) Relative frequency of sympatric species influences rates of interspecific hybridization, seed production and seedling performance in the uncommon Eucalyptus aggregata. J Ecol 96: 1198-1210. doi:https://doi.org/10.1111/j.1365-2745.2008.01434.x.
- 41. Focke WO (1881) Die Pflanzenmischlinge. Berlin: Bornträger.
- 42. Hubbs CL (1955) Hybridization between fish species in nature. Systematics Zoology 4: 1-20.
- 43. Lepais O, Petit RJ, Guichoux E, Lavabre JE, Alberto F et al. (2009) Species relative abundance and direction of introgression in oaks. Mol Ecol 18: 2228-2242. doi:https://doi.org/10.1111/j.1365-294X.2009.04137.x. PubMed: 19302359.
- 44. Frost DR, Hillis DM (1990) Species in concept and practice: herpetological applications. Herpetologica 46: 86-104.
- 45. Mayr E (1992) A local flora and the Biological Species Concept. Am J Bot 79: 222-238. doi:https://doi.org/10.2307/2445111.
- 46. Guichoux E, Lagache L, Wagner S, Léger P, Petit RJ (2011) Two highly validated multiplexes (12-plex and 8-plex) for species delimitation and parentage analysis in oaks (Quercus spp.). Mol Ecol Resour 11: 578-585. doi:https://doi.org/10.1111/j.1755-0998.2011.02983.x. PubMed: 21481218.
- 47. Batagelj V, Mrvar A (1998) PAJEK - Program for large network analysis.
- 48. Wang JL (2010) Coancestry: a program for simulating, estimating and analysing relatedness and inbreeding coefficients. Mol Ecol Resour 11: 141-145. PubMed: 21429111.
- 49. Wang JL (2007) Triadic IBD coefficients and applications to estimating pairwise relatedness. Genet Res 89: 135-153. PubMed: 17894908.
- 50. Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155: 945-959. PubMed: 10835412.
- 51. Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software structure: a simulation study. Mol Ecol 14: 2611-2620. doi:https://doi.org/10.1111/j.1365-294X.2005.02553.x. PubMed: 15969739.
- 52. Mallet J (1995) A species definition for the modern synthesis. Trends Ecol Evol 10: 294-299. doi:https://doi.org/10.1016/0169-5347(95)90031-4. PubMed: 21237047.
- 53. Nelson G, Plantick N (1981) Systematics and biogeography : cladistics and vicariance. New York: Columbia University Press.
- 54. Mishler BD (1985) The morphological, developmental, and phylogenetic basis of species concepts in bryophytes. Bryologist 88: 207-214. doi:https://doi.org/10.2307/3243030.
- 55. Legendre G, Legendre P (1984) Ecologie numérique. Paris, France: Masson.