Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Are you with me? Co-occurrence tests from community ecology can identify positive and negative epistasis between inversions in Mimulus guttatus

  • Luis J. Madrigal-Roca ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    madrigalrocalj@yahoo.com

    Affiliation Ecology and Evolutionary Biology’s Department of the University of Kansas, Lawrence, Kansas, United States of America

  • John K. Kelly

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Supervision, Validation, Writing – original draft, Writing – review & editing

    Affiliation Ecology and Evolutionary Biology’s Department of the University of Kansas, Lawrence, Kansas, United States of America

Abstract

Chromosomal inversions are structural genetic variants that can play a crucial role in adaptive evolution and speciation. Patterns of attraction and repulsion among unlinked inversions — whether they tend to be carried by the same or different individuals— can indicate how selection is acting on these polymorphisms. In this study, we compare analytical techniques using data from 64 inversions that segregate among 1373 F2 plants of the yellow monkeyflower Mimulus guttatus. Mendelian assortment provides a strong null hypothesis for contingency tests. Here, we show how co-occurrence metrics used in community ecology can provide additional insight regarding coupling and repulsion of inversions at genotypic level. The centered Jaccard/Tanimoto index and the affinity score describe the specific way that inversions interact to generate epistasis for plant survival. We further explore the use of network analysis to visualize inversion interactions and to identify essential third and fourth order interactions, which expand the traditional pairwise scope of the co-occurrence metrics. We suggest that a combination of different statistical approaches will provide the most complete characterization of the fitness effects, both for inversions and other polymorphisms essential to adaptation and speciation.

Introduction

Co-occurrence analysis traces its origins to the 18th century, evolving from probabilistic frameworks to the development of contingency tables and formal statistics in the 20th century [1]. This analytical approach now serves as a fundamental tool across biological disciplines, quantifying similarities between diverse sample types including community observations, diseases, and genes. In both ecology and genetics, co-occurrence analysis has become instrumental in decoding complex interaction patterns, building on the hypothesis that co-occurring entities share functional relationships [2]. The approach illuminates community and landscape-level interactions in ecology, while in genetics, linkage disequilibria statistics characterize non-random relationships between loci as examples of co-occurrence metrics [3]. Though terminology varies across disciplines, these associations share theoretical foundations, enabling the reciprocal application of ecological statistics to genetic data and vice versa.

The mechanisms driving co-occurrence differ between community ecology and genetics. Species coexistence or repulsion across locations may stem from environmental preferences [4], biotic interactions [5], or historical artifacts [6]. In the genetic context, associations between alleles at different loci can arise from physical linkage or through natural selection maintaining associations when loci interact to determine fitness. This paper explores statistical approaches for detecting and characterizing associations, leveraging an experimental design where co-occurrence among genetically unlinked loci must result from differential survival or gametic selection. This methodology offers fresh insights into genomic data relationships, paralleling applications in ecological studies and advancing our understanding of genetic interactions and regulatory networks.

In genetics, co-occurrence is usually characterized as linkage disequilibrium—the non-random association between alleles at two genetic loci [3,7]. The original measure, D, quantifies the covariance of allelic state (0 or 1) between loci across gametes for biallelic loci. This haploid-level metric [8] spawned normalized derivatives such as D’ [9] and r2 [10]. These statistics score associations at the allelic level, which is particularly significant given that genotype data typically comes from diploid individuals where the phase of alleles at different loci remains unknown. We examine the centered Jaccard/Tanimoto index (cJ/T) [11,12] and the affinity score [13] as measures of association at the diploid genotype level. The cJ/T evaluates associations by comparing observed co-occurrence against independence-based null expectations, revealing positive (cJ/T > 0) or negative (cJ/T < 0) associations. The affinity score, expressed as a log odds ratio (α), similarly captures co-occurrence patterns, with positive values indicating coupling and negative values suggesting genetic repulsion. These metrics establish a sophisticated framework for analyzing complex genetic interactions, particularly valuable where traditional LD measures prove inadequate.

The co-occurrence metrics we examine are applicable to any genetic polymorphism, though we focus on chromosomal inversion genotype data. Inversions represent large-scale structural variants that can encompass substantial chromosome portions, containing more genetic material than single nucleotide polymorphisms (SNPs). The Australasian snapper Chrysophrys auratus exemplifies this, with inversions encompassing three times more base pairs than reported SNPs [14]. As adaptive characters subject to selection, inversions frequently exert significant fitness effects [15]. In our study species Mimulus guttatus, a 4.5mb inversion on chromosome 6 influences both morphological traits and fitness measures [16]. While inversions are known to suppress recombination between closely linked, co-adapted loci contributing to adaptation [17,18] less research has addressed how unlinked inversions segregating within the same population might affect fitness. Our recent work identified 64 inversion polymorphisms segregating within a large experiment using crosses between inbred yellow monkeyflower lines [19]. We now evaluate interaction effects (epistasis in genetic terminology), specifically examining whether inversions generate genetic incompatibilities—negative interactions that reduce fitness or cause lethality when different inversions co-occur [20].

Pairwise association tests face an inherent limitation in examining only two entities simultaneously, despite biological systems often involving complex, multi-way interactions. Network analysis has emerged as an increasingly important tool in biology [21,22], offering an intuitive representation of complex data that emphasizes entity interactions. This association mapping approach represents entities (e.g., genes and structural variants) as vertices or nodes, with their interactions depicted as links or edges [23]. By synthesizing pairwise tests, network analysis provides a comprehensive view of the interaction landscape. In this paper, we demonstrate how network analysis can reveal higher-order interactions among genetic loci in determining plant survival. The combination of co-occurrence indexes and network analysis enables us to transcend pairwise comparisons and understand the complex interplay between multiple inversions.

Using the Mimulus inversion dataset, we investigate whether combining ecological co-occurrence analysis with network analysis can reveal novel patterns of positive and negative epistasis —coupling and repulsion in linkage nomenclature. We focus on inversions on different chromosomes, ensuring that any detected associations must arise from differential plant survival. We also examine individual inversion effects on survival through segregation distortion analysis and explore potential “domino effects” on subsequent co-occurrence analysis. These patterns illuminate the functional relationships and coevolutionary dynamics of inversions in Mimulus. Understanding inversion interactions advances our knowledge of adaptation mechanisms, speciation, and genome evolution in plants. Furthermore, our methodological approach, which integrates ecological and genetic perspectives with network analysis, establishes a valuable framework for studying complex genomic interactions across species, potentially advancing our understanding of adaptation and evolution across diverse taxa.

Materials and methods

Data preparation and inversion identification

The raw data analyzed in our study were published by [24] as part of a replicated F2 mapping experiment using inbred lines of the yellow monkeyflower, Mimulus guttatus, from the Iron Mountain (IM) population in Oregon’s Cascade Mountains (44.402217N, 122.153317W). From a panel of 187 whole-genome sequenced inbred lines, they selected a reference line (IM767) and nine alternative lines that were genetically unrelated to both IM767 and each other (based on genome-wide nucleotide diversity). Each alternative line was crossed with IM767 (as pollen donor) to generate F1 hybrids. A single F1 plant from each cross was self-fertilized to produce F2 seeds. Both parental lines and F2 progeny were grown in controlled greenhouse conditions. All parental lines were scored for orientation at 64 inversion polymorphisms [19]. For each segregating inversion in a cross, F2 individuals were classified as RR (homozygous for the reference orientation), RI (heterozygous), or II (homozygous for the inverted orientation). We present the complete genotype matrix for F2 individuals from the nine crosses as Supplemental Excel Spreadsheet 1 for future reference. The number of F2s varied among the crosses: IM62 (n = 92), IM155 (n = 188), IM444 (n = 148), IM502 (n = 163), IM541 (n = 164), IM664 (n = 157), IM909 (n = 148), IM1034 (n = 151), and IM1192 (n = 162). Since not all inversions segregate in all crosses, we detected 941 unique valid combinations from the 2016 possible pairs of inversions (64 choose 2), representing 47% of the combinatorial space. When each inversion was decomposed into heterozygous or homozygous states and their presence/absence analyzed in a genotypic context, the combinatorial space expanded to 8128 possible combinations. Of these, we detected 3764 valid contrasts (approximately 46% of the combinatorial space) between loci on different chromosomes. All analyses were performed using R 4.4.2 [25].

Segregation distortion analysis for individual inversions

To test for segregation distortion (SD) at individual inversion loci (Fig 1A), we performed a goodness-of-fit test to determine if inversion genotype frequencies (RR, RI, II) followed Mendelian proportions for an F2 population (0.25/0.50/0.25). We used the GTest function from the R package DescTools [26], applying the Williams’ correction. We also characterized the pattern of selection in cases of SD by employing a sequential analysis proposed by [27]. We used the segregation distortion value (SDV), defined as the negative decimal logarithm of the SD-associated p-values, to explore the relationship between individual tests and the pairwise contrasts performed subsequently. This association was evaluated because segregation distortion involves complex relationships between genes, which can shape the transmission advantage of certain haplotypes (determined in our context by the combination of inversions) and affect their genetic associations.

thumbnail
Fig 1. Relationship between the tabulated information employed for hypothesis testing under the different approaches presented in this study.

A. Goodness-of-fit approach to assess the presence of segregation distortion (SD) for each inversion. B. 3x3 table framework used for performing tests of independence between dosage levels of pairs of inversions. Note that f referrers to the frequency of each genotype following Mendelian proportions, and N is the total number of individuals. To the right is depicted an example scenario where a deviation from independence exists. In that example, below the diagonal there are tabulated the expected counts and above it the observed ones. With red color is represented a case of deflation of the counts in relation to the expectation (repulsion) and in blue a case of inflation in the counts (coupling). C. 2x2 table framework used as basis for co-occurrence analysis with ecological indexes.

https://doi.org/10.1371/journal.pone.0321253.g001

Contingency table analysis

We applied contingency analysis to each pair of inversions segregating within each cross to evaluate the null hypothesis of independence between inversion genotypes. The independence between each pair of inversions was evaluated using an omnibus χ² test (applied to the 3×3 table in Fig 1B, left). For reference, a representation of the data structure is provided in Fig 1B for 400 individuals (right). We used chisq.test from the base R package and computed the p-value associated with the χ² statistic through bootstrapping (10,000 iterations). We used bootstrapping because small expected frequencies were observed in some tests which can cause poor approximation of the χ² statistic to its theoretical distribution. Additionally, we employed the standardized residual method and analyzed the relative contribution of each cell in the 3×3 table to assess between which dosages of inversions there were significant deviations from expectations. For this analysis, we used the chisq.posthoc.test function from the R package of the same name [28].

Co-occurrence metrics

Two different indexes were employed for quantifying the co-occurrence and repulsion patterns: the centered Jaccard/Tanimoto index (cJ/T) [11] and the affinity score described recently by [1]. The cJ/T index and the hypothesis testing framework for it were analyzed using jaccard.test.pairwise from the jaccard R package [12]. The p-values were estimated using the bootstrap method (B = 10,000 replicates), which provides an accurate and efficient framework of analysis when compared with other available approaches [11]. For the affinity estimation, we used the Co-occurrenceAffinity R package [13], specifying the data type as binary in the Co-occurrenceAffnity function. In the context of this study, these two indices decompose the 3×3 contingency table and perform hypothesis testing on the information contained in the four submatrices reported in Fig 1C. Given the lower number of degrees of freedom of the 2×2 tables, the tests based on these should be less powerful in terms of signal detection but inform the direction of co-occurrence patterns.

We utilized these two co-occurrence frameworks because both are 0-centered, which naturally facilitates the identification of coupling or repulsion patterns between the inversions analyzed (Fig 2). The affinity score method provides an important statistical advantage over conventional similarity indexes (uncentered Jaccard/Tanimoto, Sørensen-Dice, and Simpson). When using traditional indexes, the strength and direction of associations can be distorted by how frequently entities occur in the dataset. Affinity score eliminates this prevalence bias [1], producing consistent measurements of association strength regardless of whether the analyzed entities are common or rare in the population. Nevertheless, it is important to note that these measures are also susceptible to false discoveries common in genomic applications, and careful interpretation is required in cases where independence between tests is not achieved (e.g., there will be four tests associated with the same pair of inversions, as seen in Fig 1C after decomposing the 3×3 table depicted in Fig 1D).

thumbnail
Fig 2. Patterns of inversion genotypes co-occurrence across F2 individuals of Mimulus guttatus explored through the affinity and centered Jaccard/Tanimoto indexes.

Each panel shows a representative hypothetical pair of chromosomes from diploid individual plants (squares), illustrating three scenarios of association between Inversion 1 (in homozygous state) and Inversion 2 (in heterozygous state). In repulsion (left panel), inversion genotypes occur together less frequently than expected by chance (Affinity or centered Jaccard/Tanimoto < 0), suggesting potential negative fitness effects of their co-occurrence. Under random association (middle panel), inversions assort independently (Affinity or centered Jaccard/Tanimoto ≈ 0), following Mendelian expectations. In coupling (right panel), inversion genotypes co-occur more frequently than expected by chance (Affinity or centered Jaccard/Tanimoto > 0), indicating possible positive epistatic interactions (coupling). Blue and purple segments represent inverted regions. Gray and yellow ellipses represent the chromosomes, and two per individual are depicted according to the ploidy level of our model plant.

https://doi.org/10.1371/journal.pone.0321253.g002

Furthermore, a significant distinction from traditional linkage disequilibrium (LD) measures is that these two indexes enable us to analyze the relationship between inversions at the genotypic level (Fig 2). In contrast, traditional methods, such as D and D’, focus on a broader allele scope through haplotypes. The genotypic scope is preferred when using Chi-squared approaches and odds ratios tests to avoid potential biases introduced by summing allele frequencies [29]. Nevertheless, the main reason we avoided direct use of LD metrics relates to their variability with the allele frequencies at each locus and linear to non-linear shifts in the LD structure between loci [8]. We included a simulation-based contrast between two widely used LD metrics (D and D’) and the co-occurrence indexes employed in this study, which is explained in the Extended Methods and presented as Supporting Information (S3 Table in S1 Word Doc and S4 Fig).

Stochastic simulations demonstrate the ability of alternative approaches to detect interactions

To assess the behavior of different statistical frameworks (one global approach using 3×3 tables and two specific approaches using 2×2 tables), we developed a simulation in R 4.3.2 [30] using the script Aux1_Contingency_tables_simulation. The null model distributed 400 individuals across 9 cells of a contingency table, representing combinations of genotypes from two inversions (Fig 1B, left). We populated 100,000 simulated observed tables using Mendelian expectations (Fig 1B, right). These simulated tables were compared against expected counts using an omnibus χ² test. Each table was also decomposed into four submatrices, with information transformed into binary vectors (presence/absence) to apply Jaccard and affinity measures. For each generated table, we obtained and stored one global p-value from the omnibus χ² test and four p-values per co-occurrence index (one per submatrix). We then analyzed the distribution of all these p-values.

We simulated two alternative scenarios beyond the null model. Expected frequencies were modified by multiplying the Mendelian expectation by a survival rate and normalizing by dividing by the sum of that product in each cell (Equation 1). The survival rate represents the probability that an individual with a given genotype survives to be observed relative to Mendelian inheritance expectations. A survival rate of 1 indicates no deviation from Mendelian expectations, while values below 1 indicate reduced survival (deflation). In the first alternative model, we deflated the 9th cell counts using a survival rate of 0.2 (relative survival for other cells was 1). In the second model, we deflated the 8th cell counts (survival rate 0.2), inflated the 9th cell counts (survival rate 1), and set the relative survival for remaining cells at 0.7. Statistical tests were performed as previously described, retaining only the p-values for distribution analysis. We visualized frequency histograms and cumulative distribution functions using the ggplot2 framework in R [31].

(1)

Where is the Mendelian expected frequency, and is the expected frequency modified by the survival rate. These specific survival rates were chosen to test the ability of each statistical method to detect deviations from additivity. This allows us to establish proof-of-principle for our methods before applying them to real data, where the deviations may be more subtle.

Congruence between methods

We explored the level of congruence among statistical frameworks by linking values from the standardized residuals derived from the χ² post hoc test, the cJ/T, and affinity scores through adjusted linear models using the R base function lm. Since the residuals did not follow a normal distribution in any of the paired models, we implemented robust linear regression using the rlm function from the MASS package [32]. We estimated the p-values associated with model coefficients using the rob.pvals function from the repmod R package [33]. After visualization, we removed outliers using a cutoff threshold of 0.15 for the weights derived from the regression model.

Network visualization and analysis

The pairwise centered Jaccard/Tanimoto indexes served as reference for building interaction networks using the igraph [34] package in R. We used only one of the co-occurrence indexes since they are linearly related (see results below). Networks provide an effective visualization of repulsion and coupling patterns, alongside other metadata such as non-significant relations volume, chromosomal membership of the variants studied, etc. We considered patterns significant when raw p-values of the centered Jaccard/Tanimoto (used for weighing network edges) were smaller than 0.05. Edges belonging to significant omnibus χ² tests were highlighted to demonstrate the difference between significant and non-significant contrast weights according to the establish convention for this study.

We then extracted subgraphs defined by edges with raw index-associated p-values less than 0.05. Using these subgraphs, we assessed for significant third and fourth order motifs (e.g., triangles, cycles, etc.). In this context, a motif represents an association pattern defined by nodes connected in a specific manner with frequency higher than expected by chance [35,36]. Node significance was assessed through a permutational procedure that rewired each network multiple times, counting motif appearances. This generated a null distribution of counts per motif, allowing us to determine the probability of observing counts as extreme or more extreme than those observed under the null hypothesis (randomness). For simplicity, we evaluated only overrepresented motifs with a minimum count of 200, excluding scarce motifs. With null distributions established, we determined the most significant motifs across networks (as the negative decimal logarithm of adjusted p-values), the observed-to-expected ratio, and Z-scores associated with observed counts.

Heterogeneity test for the shared inversions (study case)

Three inversions (29, 32, and 40) segregate in all nine crosses. For these inversions, we generated networks to visually inspect whether configurations were variable or conserved. We employed a replicated G-based goodness of fit test [37] to assess whether relationship patterns were conserved or variable across the nine genetic families (crosses). For both 3×3 and 2×2 table frameworks, we determined the Gtotal, Gpooled, and Ghet statistics and contrast them against the distribution with appropriate degrees of freedom.

Relationship between SD and co-occurrence patterns

We also assessed if inversions showing significant segregation distortion were more or less likely to be involved in interactions with other inversions. For this, we used the results from the omnibus independence tests and the SDV. We explored the Spearman correlation between an inversion’s SDV value and two metrics: (1) the frequency with which the inversion had raw p values below 0.05 in independence tests with other inversions, and (2) the mean p value of such tests. Additionally, we fitted a generalized linear model to analyze the number of significant tests as a function of the SDV, controlling for genetic family (cross), using a quasi-Poisson family in R’s GLM framework.

p-values correction

We implemented multiple methods to control false discoveries in our statistical analyses. For the omnibus test, we applied the Benjamini-Hochberg procedure to correct p-values within each line [38]. We also employed this method for FDR control during the motif discovery phase of our network analysis. For post hoc tests, we implemented the Benjamini-Yekutieli method since the contrasts from the 3×3 table are not independent [39].

However, these corrections proved problematic for our pairwise co-occurrence metrics and the omnibus test. Standard methods were excessively conservative given our study design: with 150–200 F2 individuals per cross and our bootstrap-based omnibus test, there exists an inherent lower limit to possible p-values for each cross and inversion pair combination. Traditional family-wise error rate controls (like the Bonferroni correction) were also overly conservative given the substantial number of interdependent pairwise tests. We attempted to use the q-value approach described by [40] to control the FDR. However, this method relies on p0 (the estimated proportion of true null hypotheses), which requires a uniform distribution of p-values above 0.5. Our p-value distribution violated this assumption, rendering the method unsuitable for our analysis.

Results

Testing of methods via simulation

Regarding the co-occurrence tests, simulations confirmed the close relationship between our different analytical approaches. Under the null model of independent assortment (Mendelian expectations), the omnibus χ² test and all co-occurrence indexes (centered Jaccard/Tanimoto index and affinity score) yielded results consistent with expected p-value distributions. In all cases, we retrieved the uniform distribution, although slight deviations were detected due to the discrete nature of the data. Nevertheless, analysis of the cumulative distribution function and frequency histograms demonstrated the appropriate behavior of all statistical frameworks (Fig S1).

Under the first alternative scenario, where double homozygotes for the inversions had lower viability, the p-values derived from the omnibus χ² test showed an evident deviation from the uniform distribution (Fig S2A), with a shift toward more extreme and lower p-values compared to the previously described null distribution. Similarly, the centered Jaccard/Tanimoto index and affinity score for submatrix 2 (containing the deflated cell counts) also showed a shift toward lower p-values (Fig S2C, G), while preserving the null distribution in the other submatrices (Fig S2B, D, E, F, H, I). For the final simulation, an equivalent pattern was observed, with the histograms and cumulative distribution plots showing that two submatrices (sub2 and sub4) exhibited deviation from the null expectations for the centered Jaccard/Tanimoto index and affinity score, effectively detecting both introduced perturbations (Fig 3C, E, G, I).

Segregation distortion of inversions

From the 64 inversions analyzed in this study, 10 showed a significant pattern of segregation distortion in at least one cross (Table 1). The most extreme case is inversion 49 in cross 444, where homozygotes for the inversion were completely absent among F2 plants. This pattern suggests that the allele is a recessive lethal, but remarkably, this same inversion occurs frequently in homozygotes in six other crosses and it is homozygous in the IM444 line itself, which is a viable and fertile genotype. Inversion 53 is significantly distorted in three crosses, although the pattern of distortion varies (inverted homozygote deficiency in two cases, reference homozygote deficiency in the other). All other cases in Table 1 are inversions that are distorted in only one cross. Finally, all significant cases favor the zygotic selection model over the gametic model, although this may reflect restrictive assumptions for sequential test employed to detect them.

thumbnail
Table 1. Significant segregation distortion for inversions segregating in a natural population of Mimulus guttatus from Iron Mountain, Oregon, USA.

https://doi.org/10.1371/journal.pone.0321253.t001

Co-occurrence of Mimulus inversions

Applying the tests to the Mimulus’ data, we find that 73 of 941 valid contrasts (~ 8%) yielded raw p < 0.05 for the omnibus test. Among contrasts that distinguish between homozygous and heterozygous inversion states, we detected 355 significant associations (~9% of valid combinations) using at least one of three analytical approaches: cJ/T index, affinity coefficients, or post hoc tests (based on uncorrected p-values). Nevertheless, a lower number of contrasts were detected as significant (Table 2) for all 2x2 table-based approaches and the omnibus test (67, accounting for ~ 2% of the valid combinatorial space). S2 Table in S2 Word Doc shows all the valid contrasts that were detected by at least one of the 2x2 table-based approaches employed here. Importantly, there are 174 cases where the omnibus test fails to detect a signal, but the 2x2-based analyses capture them.

thumbnail
Table 2. Significant contrasts according to the raw p-values that were detected by the three 2x2 table-based approaches employed to assess the co-occurrence patterns of pairs of inversions in lines of Mimulus guttatus from Iron Mountain, Oregon.

https://doi.org/10.1371/journal.pone.0321253.t002

As a general summary, aggregating results that strongly deviated from the null hypothesis and were detected by all three 2×2-based approaches (67 in total), we observed distinct patterns in the frequency of inversion combinations. The most frequent deviations from independence occurred in heterozygous-homozygous combinations (~58%), with 20 negative and 19 positive associations (combining both arrangements of homozygous and heterozygous inversions). Double heterozygous combinations showed 16 deviations (9 negative vs. 7 positive, 24%). The least frequent deviations appeared in double homozygous combinations with 12 cases (7 positive vs. 5 negative, 18%).

While most inversion pairs showed no evidence of interaction, several notable exceptions emerged. For example, inversion 35 on chromosome 8 exhibited an interaction with inversion 62 on chromosome 14 within the 62 cross (Table 3, top). This interaction was driven by a strong deficiency of “mixed genotypes” —combinations of heterozygous at one inversion but homozygous at the other— indicated by negative cJ/T and alpha statistics for these genotype combinations. Inversion 35 demonstrated an even stronger interaction with another locus (inversion 43) in the 444 cross (Table 3, lower). However, the nature of this interaction differed, being driven entirely by a lower-than-expected count for the double inversion homozygote.

thumbnail
Table 3. Example contrasts of the inversion 35 and its dosages states with another inversion for two of the nine analyzed crosses of Mimulus guttatus, from lines of Iron Mountain, Oregon, USA.

https://doi.org/10.1371/journal.pone.0321253.t003

Congruence between statistical frameworks

There was strong agreement between the 2×2 table-based methods (χ² post hoc approach, cJ/T, and Affinity), as depicted in Fig S3. The most significant signals were simultaneously detected by at least two of the three methods based on the submatrices depicted in Fig 1 (Jaccard/Tanimoto, affinity, and χ² post hoc test). This congruence was expected given that robust regression analysis revealed significant linear associations between the three measures, supporting the hypothesis testing under each approach (Fig S3).

Visualization of inversion associations as networks

Networks revealed idiosyncratic patterns of repulsion and attraction; for the same inversion pair, patterns varied from cross to cross. The importance of most inversions within a cross is indicated by node sizes in Fig 4, based on eigenvector centrality measures. While most interactions between edges did not deviate from null expectations (gray edges), there were significant signals of coupling and repulsion according to raw p-values derived from the cJ/T index-based hypothesis testing framework. The strongest signals were more abundant in lines 155, 444, 664, and 502, while they were scarcer in the remaining lines.

Network motif analysis revealed significant patterns of inversion interactions across different lines (Fig 5). We found that specific motifs, particularly s.4.4, s.4.6, and s.3.2, were consistently enriched across multiple crosses, with observed/expected ratios close to or exceeding 1.5. The strongest signals were detected in crosses 502 and 909, where certain motifs showed high statistical significance (-log10(adj. P) > 1.25) and substantial enrichment compared to random expectations. Z-score analysis (Fig 5D) further supported the non-random distribution of these motifs, with several network-motif pairs showing significant deviations from expected frequencies. The relationship between observed and expected motif counts demonstrated clear structural patterns in the inversion co-occurrence networks, suggesting organized rather than random associations between inversions (Fig 5E).

The most prominent result derived from the estimated network was the variability in coupling/repulsion patterns. In some cases, a pair of inversions interacted positively in one cross but negatively in another. As an example, we examined inversions 29, 32, and 40, which appeared in all studied lines. There was no consistent relationship between nodes in the networks across lines. In all cases, inversion dosage combinations exhibited different patterns of repulsion and attraction as a function of the genetic family – cross (Fig 6).

thumbnail
Fig 3. Distribution of the p-values derived from the omnibus test, the affinity analysis, and the centered Jaccard/Tanimoto-based analysis under a situation with a deflation in the counts of the 8th cell and an inflation of those in the 9th cell.

Those coordinates are based on Fig 1B. There is a shift in the distribution of the p-values to the left in the case of the omnibus test (both signals are merged), while in the cases of the submatrices 2 and 4 analyzed with the affinity and Jaccard/Tanimoto indexes, both events are captured as deviations from the null model (Affinity_sub2, Affinity_sub4, cJ/T_sub2, and cJ/T_sub4). Histograms showing the density of each bin for the log-transformed p-values are complemented with a subplot depicting the cumulative distribution function (CDF).

https://doi.org/10.1371/journal.pone.0321253.g003

This finding was further supported by a formal replicated independence G test, in which we found no significant pooled effect (Table 4), suggesting fluctuating relationships between inversions across lines. Additionally, we found evidence of heterogeneity for one of the analyzed contrasts. In this case, the relationship between inversion 40 in heterozygous state and inversion 32 in homozygous state appeared to differ significantly among genetic families (crosses).

thumbnail
Table 4. Replicated G test of independence between inversion dosages present in all nine crosses performed using Mimulus guttatus lines from a natural population of Iron Mountain, Oregon, USA.

https://doi.org/10.1371/journal.pone.0321253.t004

Interestingly, we found no relationship between single inversion distortion and inter-inversion interaction. The SDV, a measure directly associated with the effect size of segregation distortion, was not significantly associated with the number of significant tests with raw p values less than 0.05 where the distorted inversion appears (t = 0.66, p = 0.507). Additionally, Fig 7 shows a weak association between these two measures and between the mean p value of the distorted inversions and their own SDV.

thumbnail
Fig 4. Networks based on the Jaccard/Tanimoto score for nine genetic families of Mimulus guttatus from Iron Mountain, Oregon, USA.

Significant results are those detected by all three 2x2 table-based approaches. In gray are represented the edges that were not recognized by the omnibus test. A) C_1034. B) C_1192. C) C_155. D) C_444. E) C_502. F) C_541. G) C_62. H) C_664. I) C_909.

https://doi.org/10.1371/journal.pone.0321253.g004

thumbnail
Fig 5. Systematic analysis of network motifs in inversion co-occurrence networks across six Mimulus guttatus lines from Iron Mountain, Oregon, USA.

(A) Third and fourth order motifs. (B) Statistical significance of different motifs across networks shown as -log10 of adjusted p-values. Tiles in blank correspond to motif-cross combination excluded from analysis with observed counts below 200. (C) Enrichment analysis showing the ratio of observed to expected motif frequencies for each line-motif combination. (D) Z-score distribution for network-motif pairs, indicating deviation from random expectation. (E) Scatter plot of observed versus expected motif counts; point size indicates absolute Z-score and color denotes statistical significance (TRUE/FALSE, FDR < 0.05). Crosses analyzed: C_155, C_444, C_502, C_541, C_664, and C_909. Motifs analyzed include s.3.2, s.4.4, and s.4.6, representing different patterns of inversion co-occurrence.

https://doi.org/10.1371/journal.pone.0321253.g005

thumbnail
Fig 6. Co-occurrence network for the homozygous and heterozygous states of inversions 29, 32, and 40 in nine crosses of a population of Mimulus guttatus, Iron Mountain, Oregon, USA.

In blue are depicted the attraction interactions, while in red are depicted the repulsion ones. The size of the nodes is based on the eigenvectors (measure of importance in the network). A star symbol signals those interactions that are significant according to the centered Jaccard/Tanimoto. Red labels correspond to inversions which interaction pattern was detected as heterogeneous through a replicated independence G-test ( = 17.766, p = 0.023).

https://doi.org/10.1371/journal.pone.0321253.g006

thumbnail
Fig 7. Correlation space between the segregation distortion value of the inversion (SDV), the number of times the inversion is implicated in a pairwise contrast with a raw p value less than 0.05, and the average p value of that inversion across all valid pairwise contrasts.

The inversions studied are from a natural population of Mimulus guttatus from Iron Mountain, Oregon, USA. The correlation coefficients are derived from the Spearman test (rank-based approach).

https://doi.org/10.1371/journal.pone.0321253.g007

Discussion

Genomic inversions are structural mutations that reverse the orientation of chromosomal segments, reducing the recombination rate in heterozygotes The suppression of recombination acts as an effective barrier to genetic exchange between different orientations, except for cases of gene conversion or ‘gene flux’ [18,41]. Despite recent interest in the evolutionary role of inversions [4244], understanding of these and other structural variants remains limited, although they are increasingly recognized as a substantial source of genetic variability in natural populations [45,46]. Inversions can also be central elements in reproductive isolation, given their major effect as recombination suppressors [20]. Many studies in the last decade have focused on understanding the role of inversions in phenotypic polymorphisms [47], behavior [48], the evolution of sex chromosomes [49], and other phenomena [46].

Traditionally, inversions have been considered in isolation, even though they are more common within populations than previously thought. In many species, single individuals may carry inversions at multiple loci [19,50], making it imperative to consider the interaction networks among inversions. Co-occurrence analysis offers a powerful approach to unravel potential epistatic relationships between inversions. The study of epistatic interactions between inversion loci could shed light on diverse phenomena. For instance, inversion interactions could shape hybrid incompatibilities, as minor parent ancestry is less likely to persist in genome regions with low rates of hybridization [51]. An inversion can prevent gene flow between populations at the genetic loci contained within it, consequently preventing deleterious reshuffling of alleles between local and migrant entities [52]. From a more global perspective, we can explore the degree of coadaptation of a given set of inversions [15]. For the data considered in this paper, with plants produced from controlled crosses, gametic competition or the differential germination/survival of F2 plants must be the cause of co-occurrence, which makes the interpretation of results straightforward.

As a key finding of this research, we observed a significant impact of specific inversion groups on plant survival, using segregation distortion values of individual inversions as a proxy. Among the significant signals detected, most inversions were cross-specific, though inversion 53 showed common distortion across three crosses, suggesting a moderately pervasive effect in the population. These observations of significant departures from Mendelian inheritance across multiple chromosomal inversions provide compelling evidence for investigating genotypic co-occurrence patterns in this system. Our sequential χ² testing approach revealed consistent signatures of zygotic selection rather than gametic selection, indicating that selective pressures act on specific genotype combinations after fertilization. This is particularly evident in inversions such as Inv_49, Inv_55, and Inv_53, which exhibit highly significant G-statistics and substantial segregation distortion values.

The observed genotypic frequency patterns demonstrate strong asymmetries, with certain genotype combinations consistently underrepresented across different inversions. For instance, Inv_49 shows a striking disparity between RR (92) and II (0) genotypes, suggesting strong selective pressures against specific genotypic states. These patterns, combined with significant zygotic selection signatures, indicate that the inversions are not evolving independently but are subject to functional interactions influencing their co-occurrence. Selection strength varies among inversions, as evidenced by segregation distortion values ranging from 114.38 to 2.2. This variation suggests a complex selective landscape where different inversions experience varying degrees of selective pressure. Such heterogeneity in selection intensity further supports the need for genotypic-based co-occurrence analysis, as it could reveal whether inversions under stronger selection tend to show specific association patterns with other inversions.

Given these observations, genotypic-based co-occurrence analysis becomes crucial for understanding the evolutionary dynamics of these inversions in current evolutionary studies. This approach could reveal potential epistatic interactions between inversions, identify functionally related inversion modules, and uncover possible developmental constraints influencing which inversion combinations can coexist. Furthermore, such analysis could provide insights into how these inversions might contribute to adaptation, particularly if certain inversion combinations tend to be maintained together more frequently than expected by chance.

The clear evidence for zygotic selection also suggests these inversions might play important roles in development or fitness-related traits. Understanding their genotypic co-occurrence could therefore provide valuable insights into the functional architecture of adaptation in this system. This knowledge is particularly relevant for understanding how structural variants like inversions contribute to evolutionary processes and genetic variation maintenance in natural populations.

The utility of co-occurrence metrics

Applied to this study, co-occurrence metrics reveal patterns of putative coupling (positive epistasis) and repulsion (incompatibilities) detectable within a genotypic scope. Our simulations demonstrate the advantages of these metrics, which have been primarily used in ecology. Traditional tests analyze the full 3-by-3 genotype table (Fig 1), whereas Affinity and cJ/T metrics operate on “slices” of this table. An omnibus test provides greater power in detecting overall signals related to expectation deviations. Co-occurrence metrics offer more specificity, providing granular resolution of the underlying causes of such deviations. Fig 3 illustrates the practical potential of these indexes. When simultaneous inflation of one category and deflation of another occur, omnibus tests conflate these effects. In contrast, the affinity index and Jaccard/Tanimoto measures detect distinct signals and identify interactions as either positive or negative.

When applied to the Mimulus data, co-occurrence metrics reveal a pattern of variable epistasis (Table 2), with network representation providing a clear picture of this complexity (Fig 4). This depiction encompasses all inversions present in each line and all valid interactions established between loci. The architecture of these networks is cross-specific, and the importance of each inversion varies across crosses. Interactions between inversion pairs are modified by the overall genetic background. Almost every set of inversions in each genetic family is unique, consequently altering the association network as a function of the present inversion subset. These findings align with the concept explored by [53] of cryptic genetic variation and how background-dependent effects may be more prevalent than our and other current findings suggest. This concept fundamentally posits that a mutation’s effect on the organismal phenotype space (survival in this study) depends not only on the focal component associated with it (inversions and their interactions) but also on the interplay and epistatic relationships established in the genetic background space and intermediate phenotype level (gene expression level).

Overall, the deviation patterns indicate that heterozygous-homozygous combinations exhibit the highest number of significant deviations from independence. However, we cannot directly associate this result with a determined expectation since, due to our experimental design, we observed only the immediate product of selection in F2 plants in terms of survival. We lacked prior information about how specific genotypic combinations between inversions might have shaped the survivorship space of M. guttatus. Notably, none of the deviations displayed an unequivocal directional bias, meaning that coupling and repulsion interactions occurred at equivalent frequencies.

The network motif analysis revealed consistent patterns of inversion interactions that provide insights into the structural organization of chromosomal inversions in M. guttatus. Specifically, motifs s.4.6 and s.3.2 (Fig 5) showed significant enrichment across multiple lines, suggesting these association patterns are not random but likely reflect underlying biological constraints or advantages. In the context of chromosomal inversions, these undirected motifs can represent different scenarios of genetic architecture integration. For instance, highly prevalent patterns (as observed in crosses 909 and 502) may indicate the group sizes and overall structural shape of interaction networks between inversions that must be inherited together to maintain adaptive allele combinations or prevent deleterious genetic combinations. The variation in motif enrichment across lines further suggests that these association patterns are influenced by the genetic background, consistent with previous results. Importantly, the detection of significant motifs, even without considering the direction of interactions (coupling or repulsion), indicates fundamental organizational principles in how inversions can be combined within a genome. This network-based analysis provides a robust method for exploring multiway interactions that extends beyond the classic pairwise tests traditionally used in linkage disequilibrium and association studies [54].

From all pairwise contrasts, we extracted a case study featuring three inversions (29, 32, and 42) that segregate in all nine crosses (Fig 6). The results clearly demonstrate how specific genotype interactions vary between crosses, thus illustrating how selection changes as a function of the genetic background. The replicated independence test confirms the significance of this variation (Table 4), while the Jaccard/Tanimoto metrics quantify the nature of coupling (positive) and incompatible (negative) interactions. Regarding the pooled effect, we observed no significant signal—there is no directional epistasis. In network terminology, there are no conserved edges. The study identified two key inversions for which the heterogeneity test revealed significant effects: inversion 32 (in homozygous state) and inversion 40 (in heterozygous state). This suggests that epistatic interactions between those inversions are not consistent across different genetic backgrounds. This indicates that a particular combination of inversions might be advantageous in one genetic context but disadvantageous in another. Therefore, when evaluating the selective advantage of inversions, it is crucial to consider how they interact within the entire genetic system rather than studying them in isolation [15].

Importantly, inversions showing segregation distortion were not more or less likely to interact with other inversions. The association between the SDV, a measure of each inversion’s individual effect, and the association relationships was minimal (Fig 7). The SDV correlates weakly with both the number of pairwise contrasts in which that inversion is implicated with a raw p value under 0.05 and the mean p value of such pairwise contrasts. We had anticipated observing a positive association regarding the number of strong interactions, assuming that a highly distorted loci might be more implicated in coupling and repulsion within the genomic context. This addresses an important question in QTL mapping: are regions showing a marginal (single locus) effect more likely to exhibit interactions with other parts of the genome? Our results are negative in this regard. This decoupling of individual and interactive effects aligns with modern perspectives on the modular nature of genetic architectures (Wagner et al., 2007) and the complex relationship between individual allelic effects and epistatic networks [55].

When a particular inversion at a given dosage level occurs more frequently alongside another inversion at a specific dosage level, we can conceptualize this relationship as a putative functional interaction. Patterns of epistatic interactions in the context of inversions have been previously identified using linkage disequilibrium-based approaches. For instance, results from [56] suggest that inversions in Drosophila pseudoobscura contribute to maintaining positive epistatic interactions between loci included in individual inversions. However, according to our literature review, findings regarding epistatic interactions strictly between inversions (functional association between genes from different inversions) are rare, particularly in plants.

Most research in this area has been conducted with Drosophila spp. [15]. Among studies investigating linkage disequilibrium between inversions, [57] reported interactions between different inversions in Drosophila subobscura, although evidence only supported intrachromosomal interactions. This limitation might stem from methodological constraints, as epistatic interactions between unlinked inversions in different chromosomes have been documented in Drosophila melanogaster [58].

Future directions

Our study demonstrates the potential of co-occurrence analysis in unraveling the complex interactions between inversions. This approach offers a novel perspective on several key aspects of inversion biology:

1) Capturing complex genetic interactions.

Co-occurrence analysis provides an approach to understanding genetic systems [57]. Directed network analysis of motifs in the observed networks could also provide additional information about the directional nature of regulatory structures, improving the understanding of how these structures shape the association of genetic loci. This particular aspect is an active effort in bioinformatics, given that is a challenging problem which complexity increase with the number of nodes under consideration [36].

2) Identifying potential supergene associations.

Co-occurrence analysis could help identify interactions between supergenes by revealing consistent patterns of association between specific inversions and gene clusters. Since supergenes consist of coadapted genes locked within genomic inversions and inherited as single units [59], persistent co-occurrence patterns between different supergenes could indicate functional relationships or selective advantages of specific supergene combinations. This could reveal how supergenes work together at a broader genomic level.

3) Exploring inversion dynamics.

Our approach provides insights into the evolutionary dynamics of genomic inversions across different chromosomal locations. While inversions are known to suppress recombination between arrangements when in heterozygous state [41,60,61], our co-occurrence analysis focuses on a different scale - examining how different inversion loci are inherited together across the genome. This allows us to understand potential functional relationships between inversions and their collective role in the evolutionary history of the species[45,46].

4) Refining our understanding of inversion evolution.

Despite recent interest in the evolutionary role of inversions [42], our understanding remains limited. Co-occurrence analysis offers a new tool to explore how inversions evolve and interact, potentially revealing patterns that other methods might miss. Future research should aim to expand our approach, potentially integrating co-occurrence analysis with other genomic and phenotypic data. This could provide a more comprehensive understanding of inversion effects and their role in evolution. Additionally, applying this method to larger datasets and diverse species could reveal broader patterns in inversion dynamics and evolution.

Conclusions

This study demonstrates the application of co-occurrence statistics—traditionally used in community ecology—to investigate interactions between chromosomal inversions in Mimulus guttatus. By employing multiple statistical approaches, including contingency table analysis, centered Jaccard/Tanimoto index, and affinity scores, we detected patterns of coupling and repulsion between inversions across different experimental crosses. Our findings reveal a complex landscape of inversion interactions, with some pairs co-occurring more frequently than expected by chance among F2 plants, while others exhibit patterns of repulsion. These interactions varied across different genetic backgrounds, highlighting the context-dependent nature of inversion effects.

The network analysis approach provided a novel visualization of these complex interactions, revealing line-specific patterns and centrality of certain inversions within the genomic architecture. This method offers a promising tool for future studies aiming to unravel the functional relationships between structural variants in the genome. Our study also underscores the importance of considering dosage effects in inversion studies, as we observed different interaction patterns for homozygous and heterozygous states of inversions. This nuanced approach provides a more comprehensive understanding of how inversions may influence fitness and adaptation.

While our research provides valuable insights into inversion dynamics in M. guttatus, it also highlights the need for larger sample sizes and more comprehensive genomic data to fully capture the complexity of these interactions. Future studies should aim to integrate this co-occurrence approach with phenotypic data and functional genomics to elucidate the mechanisms underlying these interaction patterns. In conclusion, this study demonstrates the utility of ecological co-occurrence methods in genomics research and opens new avenues for investigating the role of structural variants in evolution. By bridging concepts from ecology and genetics, we provide a framework for future research on the complex interplay between genomic architecture and adaptive evolution.

Supporting information

S1 Fig. Distribution of the p-values derived from the omnibus test, the affinity analysis, and the centered Jaccard/Tanimoto-based analysis under the null model.

Histograms showing the density of each bin for the log-transformed p-values are complemented with a subplot depicting the cumulative distribution function (CDF).

https://doi.org/10.1371/journal.pone.0321253.s001

(EPS)

S2 Fig. Distribution of the p-values derived from the omnibus test, the affinity analysis, and the Jaccard/Tanimoto-based analysis under a situation with a deflation of the 9th cell counts.

Those coordinates are based on Fig 1A. There is a shift in the distribution of the p-values to the left, given the deviation that this modification implies for the null hypothesis. Histograms showing the density of each bin for the log-transformed p-values are complemented with a subplot depicting the cumulative distribution function (CDF).

https://doi.org/10.1371/journal.pone.0321253.s002

(EPS)

S3 Fig. Linear Associations Among Various Metrics.

This figure illustrates the linear relationships between different pairs of metrics. Plot A shows the relation between the standardized residuals from the post hoc test and the centered Jaccard/Tanimoto scores. Plot B depicts the relationship between the standardized residuals from the post hoc test and the affinity score. Lastly, Plot C demonstrates the linear relationship between the centered Jaccard/Tanimoto score and the affinity score. Each plot includes a trend line, indicating the direction and strength of the linear relationship between the two metrics. The summary of the model is included in the superior edge of each subplot.

https://doi.org/10.1371/journal.pone.0321253.s003

(EPS)

S4 Fig. Mean value variation of various metrics across the allele frequency landscape defined by two hypothetical inversions.

Uppermost left panel, D. Uppermost right panel, D’. Bottom left panel, cJ/T. Bottom right panel, α.

https://doi.org/10.1371/journal.pone.0321253.s004

(EPS)

S1 Excel Spreadsheet. Plant genotypes for each of the inversions analyzed in this study.

https://doi.org/10.1371/journal.pone.0321253.s007

(XLSX)

Acknowledgments

We thank the ongoing Mimulus pan-genome project for the early access to IM767 and IM62 genomes, specially John Willis and Lila Fishman. Also, we thank the journal editor and the reviewers for their valuable feedback and recommendations, which significantly improved the content and the way we convey the information in the present paper.

References

  1. 1. Mainali KP, Slud E, Singer MC, Fagan WF. A better index for analysis of co-occurrence and similarity. Sci Adv. 2022;8(4):eabj9204. pmid:35080967
  2. 2. Müller H, Mancuso F. Identification and analysis of co-occurrence networks with NetCutter. PLoS One. 2008;3(9):e3178. pmid:18781200
  3. 3. Weir BS. Linkage Disequilibrium and Association Mapping. Annu Rev Genom Hum Genet. 2008 Sep 1;9(1):129–42.
  4. 4. Bimler M, Stouffer D, Lai H, Mayfield M. Accurate predictions of coexistence in natural systems require the inclusion of facilitative interactions and environmental dependency. Journal of Ecology. 2018;106(5):1839–52.
  5. 5. Wright JS. Plant diversity in tropical forests: a review of mechanisms of species coexistence. Oecologia. 2002;130(1):1–14. pmid:28547014
  6. 6. Villalobos F, Carotenuto F, Raia P, Diniz-Filho J. Phylogenetic fields through time: temporal dynamics of geographical co-occurrence and phylogenetic structure within species ranges. Philosophical Transactions of the Royal Society B: Biological Sciences. 2016;371(1691):20150220.
  7. 7. Kang JTL, Rosenberg NA. Mathematical Properties of Linkage Disequilibrium Statistics Defined by Normalization of the Coefficient D = pAB - pApB. Hum Hered. 2019;84(3):127–43. pmid:32045910
  8. 8. Smith RD. The nonlinear structure of linkage disequilibrium. Theor Popul Biol. 2020;134:160–70. pmid:32222435
  9. 9. Lewontin R. The interaction of selection and linkage. I. General considerations; heterotic models. Genetics. 1964;49(1):49–67.
  10. 10. Hartl DL, Clark AG. Principles of Population Genetics [Internet]. Sinauer Associates; 1997. Available from: https://books.google.com/books?id=4ypuQgAACAAJ
  11. 11. Chung NC, Miasojedow B, Startek M, Gambin A. Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data. BMC Bioinformatics. 2019;20(Suppl 15):644. pmid:31874610
  12. 12. Chung NC, Miasojedow B, Startek M, Gambin A. jaccard: Test Similarity Between Binary Data using Jaccard/Tanimoto Coefficients [Internet]. 2018. Available from: https://CRAN.R-project.org/package=jaccard
  13. 13. Mainali K, Slud E. CooccurrenceAffinity: Affinity in Co-Occurrence Data [Internet]. 2023. Available from: https://CRAN.R-project.org/package=CooccurrenceAffinity
  14. 14. Catanach A, Crowhurst R, Deng C, David C, Bernatchez L, Wellenreuther M. The genomic pool of standing structural variation outnumbers single nucleotide polymorphism by threefold in the marine teleost Chrysophrys auratus. Mol Ecol. 2019;28(6):1210–23. pmid:30770610
  15. 15. Singh BN. Chromosome inversions and linkage disequilibrium in Drosophila. Current Science. 2008;94(4):459–64.
  16. 16. Lee Y, Fishman L, Kelly J, Willis J. A segregating inversion generates fitness variation in yellow monkeyflower (Mimulus guttatus). Genetics. 2016;202(4):1473–84.
  17. 17. Kirkpatrick M, Barton N. Chromosome inversions, local adaptation and speciation. Genetics. 2006;173(1):419–34.
  18. 18. Faria R, Johannesson K, Butlin R, Westram A. Evolving inversions. Trends in Ecology & Evolution. 2019;34(3):239–48.
  19. 19. Veltsos P, Madrigal-Roca L, Kelly J. Testing the evolutionary theory of inversion polymorphisms in the yellow monkeyflower (Mimulus guttatus). Nature Communications. 2024;15(1):10397.
  20. 20. Sloan DB, Warren JM, Williams AM, Kuster SA, Forsythe ES. Incompatibility and Interchangeability in Molecular Evolution. Genome Biol Evol. 2023;15(1):evac184. pmid:36583227
  21. 21. Gosak M, Markovič R, Dolenšek J, Slak Rupnik M, Marhl M, Stožer A, et al. Network science of biological systems at different scales: A review. Phys Life Rev. 2018;24:118–35. pmid:29150402
  22. 22. Proulx SR, Promislow DEL, Phillips PC. Network thinking in ecology and evolution. Trends Ecol Evol. 2005;20(6):345–53. pmid:16701391
  23. 23. Charitou T, Bryan K, Lynn DJ. Using biological networks to integrate, visualize and analyze genomics data. Genet Sel Evol. 2016;48:27. pmid:27036106
  24. 24. Veltsos P, Kelly JK. The quantitative genetics of gene expression in Mimulus guttatus [Internet]. 2023 [cited 2024 Nov 2]. Available from: http://biorxiv.org/lookup/doi/10.1101/2023.11.21.568003
  25. 25. R Core Team. R: A Language and Environment for Statistical Computing [Internet]. Vienna, Austria: R Foundation for Statistical Computing; 2024. Available from: https://www.R-project.org/.
  26. 26. Signorell A. DescTools: Tools for Descriptive Statistics [Internet]. 2014 [cited 2025 Jan 30]. p. 0.99.59. Available from: https://CRAN.R-project.org/package=DescTools
  27. 27. Fu Q, Meng X, Luan S, Chen B, Cao J, Li X, et al. Segregation distortion: high genetic load suggested by a Chinese shrimp family under high-intensity selection. Sci Rep. 2020;10(1):21820. pmid:33311524
  28. 28. Ebbert D. chisq.posthoc.test: A Post Hoc Analysis for Pearson’s Chi-Squared Test for Count Data [Internet]. 2019. Available from: https://CRAN.R-project.org/package=chisq.posthoc.test
  29. 29. Sasieni PD. From genotypes to genes: doubling the sample size. Biometrics. 1997;53(4):1253–61. pmid:9423247
  30. 30. R Core Team. R: A Language and Environment for Statistical Computing [Internet]. Vienna, Austria: R Foundation for Statistical Computing; 2023. Available from: https://www.R-project.org/.
  31. 31. Wickham H. ggplot2: Elegant Graphics for Data Analysis [Internet]. New York: Springer-Verlag; 2016. Available from: https://ggplot2.tidyverse.org
  32. 32. Venables WN, Ripley BD. Modern Applied Statistics with S [Internet]. Fourth. New York: Springer; 2002. Available from: https://www.stats.ox.ac.uk/pub/MASS4/.
  33. 33. Marin DH. repmod: Create Report Table from Different Objects [Internet]. 2021. Available from: https://CRAN.R-project.org/package=repmod
  34. 34. Csárdi G, Nepusz T, Traag V, Horvát S, Zanini F, Noom D, et al. igraph: Network Analysis and Visualization in R [Internet]. 2024. Available from: https://CRAN.R-project.org/package=igraph
  35. 35. Wernicke S, Rasche F. FANMOD: a tool for fast network motif detection. Bioinformatics. 2006;22(9):1152–3.
  36. 36. Wong E, Baur B, Quader S, Huang C-H. Biological network motif detection: principles and practice. Brief Bioinform. 2012;13(2):202–15. pmid:22396487
  37. 37. Sokal RR, Rohlf FJ. Biometry: the principles and practice of statistics in biological research. 3 edition. 11 print. New York (NY): Freeman; 2010. 887 p.
  38. 38. Benjamini Y, Hochberg Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society: Series B (Methodological). 1995;57(1):289–300.
  39. 39. Benjamini Y, Yekutieli D. The control of the false discovery rate in multiple testing under dependency. Annals of Statistics. 2001;29(4):1165–88.
  40. 40. Storey J, Tibshirani R. Statistical significance for genomewide studies. Proc Natl Acad Sci USA. 2003;100(16):9440–5.
  41. 41. Berdan EL, Barton NH, Butlin R, Charlesworth B, Faria R, Fragata I, et al. How chromosomal inversions reorient the evolutionary process. J Evol Biol. 2023;36(12):1761–82. pmid:37942504
  42. 42. Charlesworth B, Flatt T. On the fixation or nonfixation of inversions under epistatic selection. Mol Ecol. 2021;30(16):3896–7. pmid:34218481
  43. 43. Mérot C. Making the most of population genomic data to understand the importance of chromosomal inversions for adaptation and speciation. Mol Ecol. 2020;29(14):2513–6. pmid:32497331
  44. 44. Singh BN. Hundred Years of Research on Inversion Polymorphism in Drosophila. Current Science. 2019;117(5):761.
  45. 45. Harringmeyer O, Hoekstra H. Chromosomal inversion polymorphisms shape the genomic landscape of deer mice. Nature Ecology & Evolution. 2022;6(12):1965–79.
  46. 46. Wellenreuther M, Mérot C, Berdan E, Bernatchez L. Going beyond SNPs: The role of structural genomic variants in adaptive evolution and species diversification. Mol Ecol. 2019;28(6):1203–9. pmid:30834648
  47. 47. Sanchez-Donoso I, Ravagni S, Rodríguez-Teijeiro JD, Christmas MJ, Huang Y, Maldonado-Linares A, et al. Massive genome inversion drives coexistence of divergent morphs in common quails. Curr Biol. 2022;32(2):462-469.e6. pmid:34847353
  48. 48. Koch EL, Morales HE, Larsson J, Westram AM, Faria R, Lemmon AR, et al. Genetic variation for adaptive traits is associated with polymorphic inversions in Littorina saxatilis. Evol Lett. 2021;5(3):196–213. pmid:34136269
  49. 49. Cheng C, Kirkpatrick M. Inversions are bigger on the X chromosome. Mol Ecol. 2019;28(6):1238–45. pmid:30059177
  50. 50. Zhou Y, Yu Z, Chebotarov D, Chougule K, Lu Z, Rivera LF, et al. Pan-genome inversion index reveals evolutionary insights into the subpopulation structure of Asian rice. Nat Commun. 2023;14(1):1567. pmid:36944612
  51. 51. Schumer M, Xu C, Powell D, Durvasula A, Skov L, Holland C, et al. Natural selection interacts with recombination to shape the evolution of hybrid genomes. Science. 2018;360(6389):656–60.
  52. 52. Coughlan JM, Willis JH. Dissecting the role of a large chromosomal inversion in life history divergence throughout the Mimulus guttatus species complex. Mol Ecol. 2019;28(6):1343–57. pmid:30028906
  53. 53. Chandler CH, Chari S, Dworkin I. Does your gene need a background check? How genetic background impacts the analysis of mutations, genes, and evolution. Trends Genet. 2013;29(6):358–66. pmid:23453263
  54. 54. Gerard D. Pairwise linkage disequilibrium estimation for polyploids. Mol Ecol Resour. 2021;21(4):1230–42. pmid:33559321
  55. 55. Phillips PC. Epistasis--the essential role of gene interactions in the structure and evolution of genetic systems. Nat Rev Genet. 2008;9(11):855–67. pmid:18852697
  56. 56. Schaeffer S, Goetting-Minesky M, Kovacevic M, Peoples J, Graybill J, Miller J. Evolutionary genomics of inversions in Drosophila pseudoobscura: Evidence for epistasis. Proceedings of the National Academy of Sciences of the United States of America. 2003;100(14):8319–24.
  57. 57. Sperlich D, Feuerbach-Mravlag H. Epistatic gene interaction, crossing over, and linked and unlinked inversions in drosophila subobscura. Evolution. 1974;28(1):67–75. pmid:28563027
  58. 58. Singh B, Das A. Epistatic interaction between unlinked inversions in Indian natural populations of Drosophila melanogaster. Genetics Selection Evolution. 1991;23(5):371.
  59. 59. Dobzhansky T. Genetics of the evolutionary process. Vol. 139. Columbia University Press; 1970.
  60. 60. Jay P, Jeffries D, Hartmann F, Véber A, Giraud T. Why do sex chromosomes progressively lose recombination?. Trends in Genetics. 2024.
  61. 61. Stevison L, Hoehn K, Noor M. Effects of inversions on within- and between-species recombination and divergence. Genome Biology and Evolution. 2011;3(1):830–41.