Figures
Abstract
Sex determination is a flexible process in fish, controlled by genetics or environmental factors or a combination of both depending on the species. Revealing the underlying molecular mechanisms may have important implications for research on reproductive development in vertebrates, as well as sex-ratio control and selective breeding in fish. Phenotypic sex in rainbow trout (Oncorhynchus mykiss) is primarily controlled by a XX/XY male heterogametic sex determination system. Unexpectedly in genetically XX all-female farmed populations, a small proportion of males or intersex individuals are regularly observed. Spontaneous masculinisation is a highly heritable trait, controlled by minor sex-modifier genes that remain unknown, although several Quantitative Trait Loci (QTL) were detected in previous studies. In the current work we used genome-based approaches and various statistical methods to further investigate these QTL. DNA markers that were previously identified in a French commercial population on chromosomes Omy1, Omy12 and Omy20 were validated in six different farmed trout populations. Functional candidate genes that may be involved in spontaneous masculinisation by reducing germ cell proliferation and repressing oogenesis of XX-rainbow trout in the absence of the master sex determining gene were identified. In particular, syndig1, tlx1 and hells on Omy1, as well as khdrbs2 and csmd1 on Omy20 deserve further investigation to validate their potential sex-modifier roles as well as their interaction with rearing temperature. Those findings could be used to produce all-female populations that are preferred by farmers due to a delayed maturation of females and higher susceptibility of male trout to diseases.
Citation: Dehaullon A, Fraslin C, Bestin A, Poncet C, Guiguen Y, Quillet E, et al. (2025) In-depth investigation of genome to refine QTL positions for spontaneous sex-reversal in XX rainbow trout. PLoS One 20(5): e0313464. https://doi.org/10.1371/journal.pone.0313464
Editor: Arnar Palsson, University of Iceland, ICELAND
Received: October 24, 2024; Accepted: March 31, 2025; Published: May 7, 2025
Copyright: © 2025 Dehaullon et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Raw sequence data that were used in this study are deposited in the ENA (Project PRJEB75960 in https://www.ebi.ac.uk/ena/browser/view/PRJEB75960).
Funding: The European Maritime and Fisheries Fund and the French National Government supported the NeoBio project (R FEA470016FA1000008). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Sex reversal has been defined as a mismatch between the phenotypic and the genetic sex of an individual [1]. In fish, sex reversal can be induced by hormonal treatments (estrogens to feminize and androgens to masculinize) during critical fry stages [2,3]. Manipulating the sexual phenotype of fish has been used to prevent early maturity and reproduction, as well as increase production in various cultured species or create subsequent sterile generations for fisheries management [3]. Sex-reversed individuals can be used to produce populations that are genetically all-male or all-female (also known as ‘monosex’). Rainbow trout (Oncorhynchus mykiss) is a worldwide cultured salmonid species. The production of large rainbow trout over 1 kg body weight has become a growing commercial interest with the development of a market for fresh and smoked fillets. A major bottleneck for this commercial market is the early maturation of males (1–2 year) relative to females (2–3 years) that causes stopped growth, increased sensitivity to pathogens (Saproleignia spp.), and reduced flesh quality [4,5,6]. Therefore, all-female rainbow trout stocks are often preferred in aquaculture. These stocks are currently produced by crossing hormonal sex-reversed genetically XX-females, known as neomales (XX males), with XX-females [2]. The growing concerns or potential risks to human and environmental health make the prevalent approach of producing neomales via the oral application of the hormone 17-alpha methyltestosterone less sustainable. The finding of a sex control alternative to hormonal treatment based on a safe, consumer and environmentally friendly method is a major challenge for the production of all-female populations [7].
Sex determination and sex differentiation mechanisms in fish are diverse and complex. Most of the teleosts are gonochorists meaning that all individuals within a species develop either as males or females and remain the same throughout their lives. Various systems of sex determination have been observed in gonochoristic fishes [1], spanning from a genetic sex determination (GSD) system with a genetic monofactorial system, XX/XY or ZZ/ZW, to different multifactorial systems, or environmental sex determination (ESD). During sex differentiation, the testis and ovary pathways are mutually antagonistic and compete for control of gonad fate. Understanding sex-differentiation processes requires to unravel the origin and developmental pathways of cells and organs involved in the formation of the primordial gonad. During primary sex differentiation in vertebrates, the primordial bipotential gonad commits to either an ovary or a testis developmental fate. Within both ovary and testis, a clear distinction between germ and somatic cells can be made, with the former having the potential to mitotically divide and enter meiosis, and the latter differentiating into associated structural and endocrine cells. In teleosts, primordial germ cells (PGCs) are established and specialized at the early blastula stage and migrate to the genital ridge during embryonic development [8].
Spontaneous sex reversal occurs because of a failure to maintain the initiated pathway or a failure to repress the opposite pathway [9], switching the sexual phenotype of the organism to the opposite sex. Once the gonad has committed to a particular fate, it begins to produce sex-specific hormones that will have local and direct effects on germ cell development, but will also drive the secondary process of sex differentiation in which the somatic tissues of the organism differentiate toward one sex or the other. The developmental pathway leading to steroid production in gonadal somatic cells requires complex regulation of multiple genes [10] involved in the differentiation of steroid-producing cells.
Over the last two decades evidence has accumulated that both somatic and germ cells play critical roles in fish gonadal differentiation. However, the exact mechanisms and factors involved remain to be elucidated [11,12]. As evidenced from studies in mammals [13], somatic cells may first differentiate in response to the activation of master sex-determining genes, and subsequent differentiation of PGCs into male or female gametes follows in the differentiating testis or ovary in response to signals derived from surrounding somatic cells. Alternatively, it is possible that, first, PGCs interpret internal genetic or external environmental cues and directly transform into spermatogonia or oogonia; then, the surrounding somatic cells may be induced by the germ cells to differentiate accordingly to provide an appropriate hormonal environment for further gonadal differentiation [14]. It is thought that control of estradiol synthesis could play a key role not only for ovarian, but also for testicular differentiation and sex change in fish [15]. This working hypothesis states that the gonadal aromatase gene, cyp19a1a, up-regulation would be needed not only for triggering, but also for maintaining ovarian differentiation; and that cyp19a1a down-regulation would be the only necessary step for inducing a testicular differentiation pathway. In Atlantic salmon as well as in loach and goldfish, it was shown that germ cells are not essential for gonadal sex differentiation [16]. Germ cell-free female gonads develop an ovarian somatic structure in these species, unlike in zebrafish or medaka where all germ cell-free fish develop somatic testes. Evidence from both zebrafish and medaka models [17,18,19,20] suggests important feminizing roles of the germ cells in gonadal differentiation and thus corroborates the alternative hypothesis. In particular, when germ cells are ablated in medaka, XX fish show female-to-male sex reversal, while XY fish exhibit male-to-female sex reversal [19]. Any disruption of this germ cell/somatic cell cross-talk would then disrupt ovarian somatic cell differentiation, leading to an absence of estrogen synthesis and a subsequent masculinization. More subtle regulations not involving the complete loss of germ cells, but instead some differential germ cell proliferation rates, could be also suggested as important triggers of gonadal sex differentiation. For example, the Japanese medaka sex determining gene, dmrt1Y, has been found to be an inhibitor of germ cell proliferation [21] and is only expressed during testicular differentiation [22].
In GSD species, the master sex determining trigger is encoded by a gene on a sex chromosome, which activates a network of downstream regulators of sex differentiation. Currently, there have been more than 20 sex determining genes discovered in fish, mammals, birds and frogs, and as many as 13 are derived from the transforming growth factor Beta (TGF-β) pathway [23]. This important pathway in sex determination/differentiation of fishes potentially regulate the number of germ cells, and/or inhibit aromatase activity to determine and maintain sex (e.g., Chen et al. [24]). Interestingly, sdY (sexually dimorphic on the Y-chromosome gene), the master sex-determing gene of rainbow trout and other salmonid species, does not belong to the usual master sex gene families (the DM domain and Sox protein families), nor does it belong to TGF-β signaling pathway. It is a truncated duplicated copy of the immune response gene irf9 (Interferon Regulatory Factor 9) [25]. While very specific to salmonids, the gene sdY is acting directly on the classical sex differentiation regulation network in all vertebrates as its protein has been shown to interact with foxl2 (Forkhead Box L2) [26] that is a well-known conserved female differentiation factor [27]. foxl2 was shown to be involved in the goat polled intersex syndrome, a deletion of the gene triggering early testis differentiation and XX female-to-male sex reversal [28,29].
Although phenotypic sex in rainbow trout is primarily determined by a male heterogametic XX/XY GSD system, spontaneous masculinization of XX fish is a phenomenon that has been repeatedly observed in various rainbow trout populations, at generally limited frequencies (~1–2%) with some individuals being only partially affected (intersex) [30]. Spontaneous sex reversal is a highly heritable trait [30] and several Quantitative Trait Loci (QTL) associated with this trait were detected [27,31], highlighting the existence of several minor sex-determining genes that are independent of the major sex determinant carried by the sex chromosome. In particular, in a previous study [30], we identified in a French trout population two QTL located on chromosome 1 with a strong evidence while two suggestive QTL were identified on chromosomes 12 and 20 using the rainbow trout reference genome of Swanson line [32]. All SNP variants from these four QTL regions that covered tens of genes were further investigated in the present study.
Firstly, we sought to validate the existence of the four QTL in six different French rainbow trout populations, and secondly, we wanted to refine their location in the initial discovery population by combining machine learning and principal components approaches, and considering the new reference genome derived for Arlee line [33].
Materials and methods
Sequenced samples from the discovery population
The first data set was initially produced by Fraslin et al. [30]. The 4 QTL regions that were discovered were further studied in depth in this work (Fig. 1).
The fish samples and records came from the French rainbow trout farm “Les Fils de Charles Murgat” (Beaufort, France; UE approval number FR 38 032 001). The 60 dams sequenced genomes were initially aligned against the reference assembly genome Omyk_1.0 (GenBank assembly accession GCA_002163495.1) of the double haploid line Swanson from Washington State University [32]. Following the same procedure as described in Fraslin et al. [30], these 60 sequences were realigned against the reference genome USDA_OmykA_1.1 of the WSU double haploid line Arlee (GenBank assembly accession GCA_013265735.3) [33]. This realignment was performed because French rainbow trout populations were known to be phylogenetically closer to Arlee’s genome that to Swanson’s one [34]. On average, 9.252E + 7 paired reads (1.456E + 7 sd) were mapped on the Arlee genome, corresponding to a mean coverage of 12X. While the percentage of properly paired sequence reads over all sequenced pairs was in average 87.3% for the alignment against Swanson’s genome assembly, these statistics went up to 94.1% for the alignment against Arlee’s genome assembly.
After mapping, duplicates were marked using the MarkDuplicate tool, and base quality score where recalibrated using ApplyBQSR tool from GATK 4.2.2.0 [35], then variants were called independently with the three different variants calling tools HaplotypeCaller from GATK, Freebayes 1.3.5 [36] and SAMtools Mpileup 1.11 [37]. Recommended quality filters were used to ignore low-confidence alignments: a minimum mapping quality of 30, a minimum base quality required to consider a base for calling of 10X reads, minimum phred-scaled confidence threshold at which variants should be called of 30. A total of 29,229,949 variants that were obtained by the three different variant calling tools were kept for further quality controls using vcftools 1.15 and keeping indels and SNPs located only on known chromosomes, removing variants located on un-located contigs or mitochondrial chromosome. Further quality filtering on variant coverage were performed using SelectVariant and Variant Filtration tools (GATK) according to the hard-filtering recommendations [38]. Briefly QD < 2.0 (to normalize variant quality), FS > 60.0 (measure of the Phred-scaled probability that there is strand bias at the site), MQ < 40.0 (mapping quality), SOR > 3.0 (Strand Odds Ratio), MQRankSum < -12.5 (u-based z-approximation from the Rank Sum Test for mapping qualities) and ReadPosRankSum < -8.0 (u-based z-approximation from the Rank Sum Test for site position within reads). Final numbers of 21,517,540 SNPs and 4,850,647 indels, for a total of 26,368,187 variants, were kept for further analysis.
Among the 60 sequenced females, a subset of 23 dams with at least 10 progeny with phenotypic sex recorded and extreme proportions of sex-reversed offspring, either at least 25% or below 6% of neomales among their XX daughters (see Table A in S1 Tables) were used in the current study. For each of the 3 chromosomes investigated in the present study, all SNPs that exhibited polymorphism among the genomes of those 23 dams were kept for the final analysis, i.e., 854,157 SNPs for Omy1, 965,320 SNPs for Omy12 and 492,959 SNPs for Omy20.
Genotyped samples from six validation populations
We sought to validate the existence of the four QTL detected by Fraslin et al. [30] as linked to spontaneous sex-reversal of XX trout in six diverse French populations of rainbow trout (Table 1, Fig. 1). Population A was composed of XX sibs from the same birth cohort of the parents used to produce the initial QTL discovery population [30]. Fish from populations B and C, and those from populations E and F came from two other French breeding compagnies, while fish from population D came from a commercial site using fry from unknown origin.
Two 96 SNPs genotyping arrays using microfluidic real-time PCR Fluidigm Kasp chemistry were designed. A set of 192 SNPs identified in Fraslin et al [30] (see Table B in S1 Tables) were selected, corresponding to: 140 SNPs in the two main QTL identified on chromosome Omy1, 19 SNPs in a putative QTL region detected on Omy12 and 33 SNPs in a large 6 Mb region with putative QTL on Omy20.
Pieces of caudal fin sampled from 315 fish, corresponding to at least 30 XX fish per population, including a minimum of eight neomales (Table 1), were sent to Gentyane genotyping platform (INRAE, Clermont-Ferrand, France) for genotyping after DNA extraction using the DNA advance kit from Beckman Coulter following manufacturer instructions. After quality control, 19 SNPs were eliminated (including 14 from Omy1) from the analyzes of all the populations because their genotyping rate was below 90%, 173 SNPs were thus kept for QTL validation.
Statistical analysis for QTL confirmation in six validation populations
An exact Fisher test was run for each of the six validation populations using the fisher.test function in R version 4.3.1 [39] which implements the method developed by Mehta and Patel [40,41] and improved by Clarkson, Fan and Joe [42] to test the independence of genotypes at each SNP and phenotypic sex of fish.
This test was applied for the set of 173 SNPs across all six populations (Fig. 1). In any of these populations, ~ 170 SNPs were polymorph and had a genotyping rate above 90%. To account for multiple testing using Bonferroni correction, SNPs were considered significant at the genome level in a given population when p_value < 0.0005 (i.e., log10(p_value) < 3.3, assuming that about 100 SNPs were not highly redundant among ~170 tested for each population). SNPs only had a putative effect when p_value < 0.005 (i.e., log10(p_value) < 2.3).
Exact fisher test on all SNPs of QTL in 23 extreme dams of the discovery population
In order to refine the QTLs boundaries and confront the results between the validation and discovery populations (Fig. 1), an exact Fisher test was run to detect the association of all the sequence variants observed in the four extended QTL regions with the average progeny sex-ratios of the 23 extreme dams of the discovery population.
A first putative threshold of p_value < 0.005 (i.e., log10(p_value) < 2.3) was used to identify putative QTL. A second, more stringent, threshold (p_value < 0.002) was considered to focus on the main significant associations with sequence variants in the discovery population. Because of the strong linkage disequilibrium observed across SNPs along the chromosome [43], the QTL region on Omy20 was expanded to test, one-by-one, all the variants spanning the chromosome from 27 to 38 Mb.
Statistical approaches for refinement of QTL locations in the discovery population
To refine the QTL locations in the discovery population using the latest genome assembly we i) used Random Forests approaches with haplotypes and ii) applied a Discriminant Analysis of Principal Components on the sequence data from the 23 dams selected above (Fig. 1.).
Machine learning approach using random forests
Random forest (RF) is a machine learning method that aggregates complementary information from an ensemble of classification or regression trees trained on different bootstrap samples (animals) drawn with replacement from the original data set [44]. First, every tree is built using a bootstrap sample of the observations. Second, at each node, a random subset of all predictors (the size of which is referred to as mtry hereafter) is chosen to determine the best split rather than the full set. Therefore, all trees in a forest are different. A particularity of the RF is the out-of-bag data, which corresponds to the animals not included (roughly 1/3) in the bootstrap sampling for building a specific tree. It can be used as an internal validation set for each tree, which allows the computation of RF error rate based on misclassified animals in the out-of-bag data. RF can be applied successfully to “large p, small n” problems, and it is also able to capture correlation as well as interactions among independent variables [45].
Five genomic zones were considered to define haplotypes of subsequent SNPs spanning all the QTL regions from 67 to 69Mb on Omy1, from 8.5 to 9.5Mb on Omy12, and from27 to 29Mb, 33.5 to 35.5 Mb, and 36–37Mb on Omy20. In total 21,310 variants for Omy1, 12,191 for Omy12, and 56,643 for Omy20, were sonsidered in the RF analysis.
The genotypes were recorded depending of the number of reference alleles present in the genotype: “2” for a homozygous reference genotype, “1” for a heterozygous genotype, “0” for the alternative homozygote; missing genotypes were encoded “5”.
To minimize the RF error rates, a preliminary set of analyses was run to establish the optimum SNP window size (swind), the number of sampled trees (ntree), and the number of sampled predictor variables (mtry) chosen at random to split each node. In any analysis, when variables are highly correlated, they can act as substitutes for one another. This weakens the evidence of association for any single correlated variable with the outcome if all variables are included in the same model.Therefore, to avoid redundancy of information and limit the number of variables to analyze, haplotypic genotypes were built by retaining one SNP out of two and gathering them in the same haplotype across a window of 80 initial SNPs (correspond to swind = 40). We varied swind from 20 to 200 subsequent SNPs, ntree from 100 to 1,000 and mtry from 5 to 20. The final parameter set (swind = 40, ntree = 400 and mtry = 7) was chosen as consistently giving the lower median error rate over 50 runs for any of the three tested chromosomes. A final set of 246 haplotypes for Omy1 (131 for Omy1_a and 115 for Omy1_b), 152 haplotypes for Omy12 and 707 haplotypes for Omy20 (247, 288, and 172, respectively for Omy20_a, Omy20_b and Omy20_c) were then analyzed by averaging results across 100 RF runs. We used the package RandomForest 4.18 [46] in R version 4.3.1 [39].
Haplotypes were ranked by decreasing order of importance in the prediction model, using the mean decrease in Gini index. The Gini index measures the probability for a specific feature of being misclassified when chosen randomly. The higher is the value of mean decrease in Gini index, the higher is the importance of the variable in the model.
A list of 45 positional candidate genes was established based on results of the top-ranked haplotypes as well as the preliminary Fisher exact tests on SNP.
Discriminant analysis of principal components to identify the most promising variants
The Discriminant Analysis of Principal Components (DAPC) helps explore the genetic structure of biological populations. This multivariate method consists in a two-step procedure: 1) genetic data are centred and used in a Principal Component Analysis (PCA); 2) a Linear Discriminant Analysis is applied to the principal components of PCA. A basic matrix operation is used to express discriminant functions as linear combination of alleles, allowing one to compute allele contributions, i.e., to identify the most interesting variants [47].
The 33,731 SNPs that were in the list of genes identified by RF or Fisher’s exact test as well at the SNPs in the close upstream and downstream regions of theses genes (±2 kb) were used in the DACP. Filtering out SNPs with call rate below 100% using PLINK 1.9 [48], DAPC was applied on a final set of 27,828. The adegenet R package [49] was used to identify, by DAPC, the variants that best discriminate between dams with high and dams with low sex-reversal ratios in their offspring.
Annotation of genes and variants
Genes within QTL regions were annotated using the NCBI O. mykiss Arlee genome assembly USDA_OmykA_1.1. (GCA_013265735.3) (Gao et al. 2021). In addition to NCBI gene summaries, functional information for genes was extracted from the human gene database GeneCards® (https://www.genecards.org/) that also includes protein summaries from UniProtKB/Swiss-Prot (https://www.uniprot.org/uniprotkb/).
SNP annotation was performed for the 33,731 SNPs identified for the list of genes of interest, applying the software SNPEff4.3T [50] on the NCBI annotation file annotation released for O. mykiss Arlee genome reference assembly (GCA_013265735.3). These annotations permitted to retrieve any information about the predicted impact of the variants on the genes, and, in some cases, on the resulting proteins. Relevant variants were further studied using the Genome Data Viewer of NCBI to indicate their intronic or exonic positions within the genes.
Ethics statement
All applicable international and national guidelines for sampling, care, and experimental use of organisms for the study have been followed. As part of the standard breeding practices for a commercial breeding program, the handling of fish was not subject to oversight by an institutional ethic committee. Animals were treated in compliance with the European Communities Council Directive (9858/CEC) for the care and use of farm animals.
Results
Validation of the existence of the four QTL in various French rainbow trout populations
Among the 173 SNPs that could be tested for their association with spontaneous sex-reversal in the validation populations, 117 had a significant effect in at least one population with 50 SNPs on Omy1 having an effect in population A, that was genetically close to the QTL discovery population. Most of these 50 SNPs had also an effect observed in populations B, C and E. However, none of the SNPs tested on Omy12 or Omy20 had an effect in population A, probably because of sampling issues for low frequency variants in the population but 7 SNPs on Omy12 (out of 17 tested) had an effect in populations C, D, E and F; and 13 SNPs on Omy20 (out of 30 tested) had an effect in populations C, D and F.
The list of the 90 SNPs that had at least a putative associated effect in two validations populations is given in Table C in S1 Tables. The 63 SNPs that had at least a putative effect in four validation populations, or a clear significant effect in two or more populations is given in Table 2 along with the NCBI annotation of 50 SNPs located within or in the close vicinity (< 5kb) of genes.
Extension of the QTL region boundaries in the discovery population
Because most of the SNPs located at the boundaries of the QTL regions reported by Fraslin et al. [30] had a significant effect in some of the validation populations, a Fisher’s exact test was applied on the dams’ average progeny phenotypes in the discovery population for all sequence variants located in QTL regions (enlarged by 0.5 Mb at least on each side). Based on these tests, new extended boundaries for the QTL were defined (see Table 3). In particular, QTL in three different regions of Omy20 were enlarged, spanning from 27 to 29 Mb, 33.5 to 35.5 Mb, and 36–38 Mb, respectively for Omy20_a, Omy20_b and Omy20_c.
In total, 246 SNPs had a p_value ≤ 0.005 (see Table D in S1 Tables), and the 19 SNPs with a more stringent p_value ≤ 0.002 were mainly located within genes or in their close vicinity (Table 4).
Identification of the most relevant haplotypes and genes explaining sex-reversal in the discovery population
Initially, a Random Forest (RF) analysis was performed for each of the three chromosomes with all their QTL regions jointly analyzed. Respectively for the RF on Omy1, Omy12 and Omy20, the 30, 15 and 30 top- ranked haplotypes based on Gini index as well as the corresponding genes are reported in Tables E, F and G in S1 Tables. For the 2-Mb region on Omy1 (246 haplotypes spanning from 67 to 69 Mb), the median error rate was 28% with a minimum error rate of 15% and a maximum rate of 37% of fish phenotypes wrongly classified as the opposite sex across 100 RF simulations. While the RF analysis of error rate indicated that the 2 Mb tested on Omy1 were really useful to discriminate the dams with high and low progeny sex-reversal ratios, the median error rate derived from the RF analysis on Omy12 (152 haplotypes spanning 8.5 to 9.5 Mb) was 42% (ranging from 29% to 51% across the 100 RF runs), indicating that the best haplotypes and genes identified on this chromosome were not so strongly associated with sex-reversal. In terms of error rate, the RF analysis on the 707 haplotypes covering the QTL regions on Omy20 (247 haplotypes from 27 to 29 Mb, 288 haplotypes from 33.5 to 35.5 Mb, and 172 haplotypes from 36 to 37 Mb) gave intermediate results in-between the two previous analyses, misclassifying one third of the dams on average with a minimum error rate of 29% and a maximum rate of 38% across 100 RF simulations.
These results are consistent with the higher proportions of genetic variance explained by QTL on Omy1 than on Omy12 or Omy20 [30]. In addition, the 1,105 haplotypes covering the QTL regions along the three chromosomes were considered all together in a subsequent full RF analysis. The empirical distribution of error rates for the full RF analysis gave median, minimum and maximum values similar to the ones of the RF on Omy20 (33%, 29%, and 38%, respectively) while we were expecting that these values will be at least as low as for the RF on Omy1. Due to the very small size of our dataset (n = 23), it is likely that having over 1,100 haplotypes to consider in the full RF analysis created some “noise” in the classification process.
The top ten RF ranked haplotypes in the 2 Mb evaluated on Omy1 (see Table E in S1 Tables) fell within the nine following genes (Fig.2): sdhaf4, tlx1, syndig1 and vsx1 genes located in QTL Omy1_a; cep68, fbxw4, hells, LOC110527930, and gbf1 in QTL Omy1_b. Six of those genes have been identified with significant effects on sex-reversal in several validation populations (Table 2) or within some significant SNPs in the discovery population (Table 4). The first top-ranked haplotype included two genes from QTL Omy1_b (sdhaf4 and tlx1), while the second and third top-ranked haplotypes were in the sequence of the syndig1 gene on Omy1_a.
A) QTL region a on chromosome Omy1. B) QTL region b on chromosome Omy1. Results of the Fisher exact test in the Discovery population (Fisher Discovery, in pink) and the validation populations (Fisher Validation, in purple) and of the Discriminant Analysis of Principal Components (DAPC) in the discovery population (in yellow). Each bar represents a SNP that was tested for association (* p_value < 0.005; ** p_value < 0.002). All genes and candidate genes (based on the combination of test performed in this manuscript) are presented along the QTL. Haplotypes are ranked using the Mean Gini Index value over 100 runs of Random Forrest analysis performed within chromosome Omy1 (dark red: highest Gini index to dark blue: lowest Gini index, values available in Table E in S1 Tables).
These results are not fully consistent with our first GWAS [30] using progeny imputed genotypes and Bayesian mixed model methodologies. In this previous analysis, the syndig1 gene was not included in the credibility interval for Omy1_a. Although there was no SNP within this gene among the most significant SNPs (p_value ≤ 0.002) in the discovery population, two SNPs had, however, a p_value < 0.005 (see Table D in S1 Tables). In addition, syndig1 exhibited highly significant SNPs in four validation populations, including population A. These results make syndig1 a very plausible candidate gene for QTL Omy1_a. As regards to the initial QTL Omy1_b, there is now an extended credibility interval, including genes before slc2a15a such as tlx1 and cep68, and genes after the uncharactezized LOC110527930 such as gbf1. While LOC110527930 initially was the best candidate gene for Omy1_b, the top-ranked haplotypes now point towards genes such as tlx1, cep68, fbxw4 and hells that are now prefered within Omy1_b region (Fig. 2).
On the 1 Mb tested on Omy12 (see Table F in S1 Tables), the first three haplotypes most likely explain sex-reversal were within the same two hcn1 and ccbe1 genes that were also identified in three populations unrelated to the discovery population (see Table C in S1 Tables).
On Omy20 (Fig.3) the best ten ranked haplotypes in a 5Mb window investigated in the RF analysis (see Table G in S1 Tables) were overlapping with the following eight genes, presented in decreasing order of importance: lrrc59, csmd1, khdrbs2, abcg5, akt3, rgs17, caskin2 and dst. No variants from this region could be tested in the validation populations, however, some highly significant SNPs are detected in the discovery population for four of these genes (akt3, khdrbs2, dst and csmd1, see Table 4).
A) QTL region a on chromosome Omy20. B) QTL region b on chromosome Omy20. C) QTL region c on chromosome Omy20. Results of the Fisher exact test in the Discovery population (Fisher Discovery, in pink) and the validation populations (Fisher Validation, in purple) and of the Discriminant Analysis of Principal Components (DAPC) in the discovery population (in yellow). Each bar represents a SNP that was tested for association (* p_value < 0.005; ** p_value < 0.002). All genes and candidate genes (based on the combination of test performed in this manuscript) are presented along the QTL. Haplotypes are ranked using the Mean Gini Index value over 100 runs of Random Forrest analysis performed within chromosome Omy20 (dark red: highest Gini index to dark blue: lowest Gini index, values available in Table G in S1 Tables).
When all the haplotypes were analyzed together across all the QTL regions spanning the 3 chromosomes (see Table H in S1 Tables) and ranked through RF, the first one was positionned within lrrc59 gene on Omy20_c, the second and third ones included sdhaf4 and tlx1 on Omy1_b, and the fourth one was located within the csmd1 gene on Omy20_b. The next four top-ranked haplotypes were all on Omy20, located within abcg5, caskin2, khdrbs2, and dst genes. Some genes identified on Omy1 in validation populations (syndig1, hells, fbxw4 and LOC110527930) appeared inn the RF haplotype analyses ranked with very similar Gini indices (Fig 2). Noticeably, most of the top-ranked haplotypes invovled sequences located within genes, and the ranking was consistent between the full RF analysis and the RF limited to their chromosomic location.
Based on the results of the 60 top-ranked haplotypes on the Gini index of this full RF analysis, as well as the additional results from Fisher’s exact tests (Tables 2 and 4), we identified a list of 45 genes (see Table I in S1 Tables) whose variants in their sequence or in their close vicinity (2kb) were the most likely to explain a significant part of sex-reversal phenotypes.
Identification of the potential causative variants combining DAPC and SNP annotations
In order to find the best 100 SNPs to discriminate among the 23 dams the ones with high progeny sex-reversal ratios from those with low ratios, the sequence variants of those 45 genes (±2 kb) were considered in a DAPC analysis.. Running a simple PCA on all those 27,828 variants (without missing genotypes out of 33,731 SNPs) did not allow to correctly discriminate the two groups of dams (Fig 4A) whereas the DAPC perfectly discriminated the two groups (Fig 4B).
A. PCA of the 27,828 variants in the 45 candidate genes (±2 kb) and B. PCA of the 100 top variants selected by DAPC to discriminate the “Fem” group of the seven dams with low progeny sex-reversal ratios and the “Mal” group of the 16 dams with high progeny sex-reversal ratios.
The genotypes for the 100 top SNPs (from 27,828, SNPs) that were genotyped for the 23 dams, as well as for five additional dams, are given in Table J in S1 Tables. These five dams, with at least six offspring with a record for phenotypic sex the discovery population (see Table A in S1 Tables), were used as an internal validation: three dams (noted AS, AT and AU) with high progeny sex-reversal ratio and two dams (noted BS and BT) with no sex-reversed offspring. The top-ranked 100 SNPs were located within or nearby the downstream or upstream regions of 16 genes belonging to the list of the 45 genes previously identified by RF and Fisher’s exact test: on Omy1, syndig1, vsx1, and myoz1; on Omy12, hcn1 and ccbe1; on Omy20, arfgef3, cep170aa, abcg5, khdrbs2, dst, hmgcll1, cilk1, rps6ka2, LOC110499423, lrrc59 and caskin2. Most of these 100 SNPs were located in intronic regions of the genes, two were synonymous mutations and none were splice donor variants in introns or missense variants in exons. When focusing on the 50 best SNPs to discriminate between the two groups of dams, only hcn1 remained on Omy12; syndig1 and myoz1 were kept for Omy1_a and Omy1_b, respectively; and, all genes, except LOC110499423, remained for QTL regions on Omy20.
Discussion
Significance of the different QTL regions across French rainbow trout populations
All the QTL regions previously identified [30] were confirmed as associated with female to male sex-reversal in two or more populations different from the discovery one. Using the list of the 100 top-ranked SNPs under DAPC in the discovery population we confirmed the hypothesis of two QTL on Omy1, one QTL on Omy12, and we now suggest the existence of three distinct QTL on Omy20.
Regarding QTL Omy1_a, the most relevant candidate genes to explain sex-reversal across populations were syndig1, acss1, vsx1, entpd6, pygb and ninl. In the initial study [30], the peak SNPs for this QTL were located either within the pygb or ninl gene, depending on the statistical approach. In particular, Fraslin et al. [30] mentioned a SNP located at 63,543,061 bp (on Swanson reference genome) within the ninl gene and annotated has a missense variant. This SNP was not genotyped in the validation population, but in this study, a very close SNP located at 63,530,416 bp on Swanson reference genome (corresponding to 67,434,621 bp on Arlee reference genome) has a putative effect in population A and a significant effect at the genome level in populations B and C (see Table C in S1 Tables).
In our previous study [30], all peak SNPs from the main QTL Omy1_b were located in the intergenic region between the hells gene and the uncharacterized protein LOC110527930, the latter being proposed as the most relevant positional candidate for explaining 14% of the genetic variance of sex-reversal in the discovery population. However, in the present study, other genes should also be considered as candidate genes in several validation populations. Some were located before the hells gene (slc2a15a, fgf8, and fbxw4), while the most distal one, col13a1, was located after LOC110527930. This observation, as well as the identification of syndig1 in QTL Omy1_a that was positioned before the first QTL interval defined in Fraslin et al. [30] were the motivations to enlarge the genomic region investigated in the current study.
In the previous study [30], the QTL on Omy12 was detected using the 57K chip, but not when using imputed sequence variants in the discovery population. In the present study, this QTL is validated in population D and F with a significant SNP identified within hcn1 gene but is not validated in population A although it was closely related to the discovery population. We can hypothezise that the QTL is segregating in population A, but either at a too low frequency or too small effect to be statistically detected.
The QTL region on Omy20 was not validated in population A, but two QTL are segregated on Omy20 in populations C, D and F, with significant SNPs located in two distant genes: hmgcll1 located around 34.4 Mb, and ENSOMYG00000068776 gene located around 37.5 Mb. The same hypothesis as for the QTL on Omy12 can thus be proposed for the absence of statistical validation in population A. In addition, in an INRAE experimental rainbow trout line, a QTL for spontaneous sex-reversal was also identified on Omy20 (previously named RT17) but with a very imprecise position [31].
In the next paragraphs, we describe the putative functional roles of our top positional candidate genes for the two QTL on Omy1 (Fig 2) as well as for the most promising genes for the three different QTL on Omy20 (Fig 3). Other positional candidate genes on Omy12, Omy20_a and Omy20_b with less functional evidence are presented in an additional section [see S1 Appendix].
syndig1 as the best positional and functional candidate gene for QTL Omy1_a
syndig1 encodes a protein that belongs to the interferon-induced transmembrane family of proteins and plays a critical role during synapse development as well as in postsynaptic development and maturation [51,52]. Although syndig1 expression is mainly restricted in the embryonic retina in mice [53], its expression has been detected also in granulosa cells at the small antral stage in sheep [54] suggesting a potential role in follicular development in vertebrates. It is most likely that one or a few variants in the syndig1 (synapse differentiation-inducing gene protein 1) sequence or in its close downstream region (< 1kb) are the causative variants explaining part of the sex-reversal phenotype. Indeed, four SNPs were significant in the Fisher’s exact test in two to four validation populations. In the discovery population, the RF analysis highlighted three haplotypes in this gene among which three missense variants were annotated, and, finally, the DAPC ranked three SNPs within or downstream of syndig1 among the 100 best SNPs to discriminate the low and high sex-reversed progeny dams (see Tables C, E, and J in S1 Tables). However, among the 50 best SNPs, the only two retained were located in downstream intergenic positions. We speculate that these latter positions were kept to the detriment of SNPs within syndig1 due to linkage disequilibrium between SNPs of the two close genes syndig1 and vsx1 (visual system homeobox 1).
A unique SNP was retained from vsx1 in the list of the 100 best SNPs and was filtered out in the list of the 50 best ones. We may speculate that vsx1 may play a minor additional role to explain sex-reversal as significant SNPs (including five missense variants) were detected in the discovery population and in four validation populations. However, as far as we know, no direct link has been suggested between vsx1 and sex differentitation in any species.
Multiple candidate genes for QTL Omy1_b, with tlx1 and hells as best functional candidates
While this QTL was identified as the one explaining the largest proportion of genetic variance of spontaneous sex-reversal [30], our previous study pointed on different candidate genes than in the current one. In our first study, the three putative functional candidate genes were fgf8a, cyp17a1 and LOC110527930 based on progeny imputed genotypes, while the current analysis of dams’ sequences using an updated genome assembly pointed to genes located upstream, prioritizing tlx1 (T cell leukemia homeobox 1) for RF analysis and myoz1 (myozenin-1) for DAPC. It seems, in fact, that two different QTL should be distinguished within Omy1_b, Omy1_b1 and Omy1_b2, on each side of cyp17a1. The new analyses highlighted myoz1 and tlx1 that are located in Omy1_b1 before the start of the QTL region previously defined [30]. Although cyp17a1 is known to be involved in gonad masculinization in the common carp and zebrafish [55,56], no variant in this gene was significantly associated with sex-reversal in rainbow trout, neither in the discovery nor in the validation populations. However, it is possible that a cis-regulatory element within the QTL region could control, at long-distance, the expression of cyp17a1, influencing sex differentiation in rainbow trout. Such long-distance cis-regulatory elements have been found to control sex like in the Sry-deficient Amami spiny rat where the male-specific upregulation of sox9 is controlled by an enhancer region 0.5 Mb upstream of sox9 [57].
The initial Omy1_b from [30] is now defined as a second genomic region Omy1_b2, spanning from fbxw4 to gbf1 (including hells and LOC110527930). The RF analysis identified some haplotypes among the 10 best ones to explain sex-reversal on Omy1, putting similar emphasis to four genes: fbxw4, hells, LOC110527930 and gbf1. In addition, Fisher’s exact test in the discovery population identified a significant SNP within gbf1 (Golgi-specific brefeldin A-resistance guanine nucleotide exchange factor 1). SNPs in this gene were not tested in the validation populations as outside the limits of the initial boundaries for the QTL [30]. gbf1, is one of the guanine-nucleotide exchange factors (GEF) for members of the Arf family of small GTPases involved in vesicular trafficking in the early secretory pathway (see Table I in S1 Tables). A gbf1 expression was shown throughout spermatocyte development in the Golgi apparatus with expression extending into early spermatids where it localizes also in the acrosome [58]. No direct link with gonad development, oogonia or spermatogognia could be established for fbxw4 (F-box and WD repeat domain containing 4). This gene is a member of the F-box/WD-40 gene family that may participate in Wnt signaling which is a pathway involved in gonadal differentiation (see Table I in S1 Tables). The hells (helicase lymphoid specific) gene (also known as lsh) is a very convincing functional candidate gene for sex-reversal. It is a member of the snf2 helicase family that is required for normal development and survival in the mouse [59], and is implicated in the control of genome-wide DNA methylation [60]. Analysis of ovarian explants obtained from hells mutant females demonstrates that lack of hells function is associated with severe oocyte loss and lack of ovarian follicle formation [61]. Although hells is expressed in undifferentiated embryonic stem cells, it is significantly downregulated upon differentiation [62]. Using allografting of testis tissue from hells -/- mice to study postnatal male germ cell differentiation, Zeng et al. [63] showed that proliferation of spermatogonia was reduced in the absence of hells, and germ cell differentiation arrested at the midpachytene stage, implicating an essential role for hells during male meiosis.
Although Omy1_b1 was a strong candidate identified in both RF and DAPC analyses, there is no direct evidence for a potential role of myoz1 gene in sex-reversal (see Table I in S1 Tables). The second-best candidate gene tlx1 (also known as hox11), encodes a nuclear transcription factor that belongs to the NK-linked or NK-like (nkl) subfamily of homeobox genes, essential for cell survival during spleen development in fish [64] and mammals [65,66]. It is also involved in specification of neuronal cell fates during embryogenesis (see Table I in S1 Tables), and tlx1 can either activate or repress gene transcription [67]. tlx1-dependent regulation of retinoic acid (RA) metabolism is critical for spleen organogenesis [68]. RA is the active metabolite of vitamin A that is required for vertebrate embryogenesis [69,70]. In mice, the transcription factor steroidogenic factor 1 (nr5a1 also known as SF1) regulates RA metabolism during germ cell development [71] and is essential for sexual differentiation and formation of the primary steroidogenic tissues. Indeed, mutations within nr5a1 underlie different disorders of sexual development, including sex reversal. In patients with disorders of sexual development, a mutant form of nr5a1 was shown to be defective in activating tlx1 transcription [72]. Thus, it can be hypothetized that genetic variant in tlx1 may also directly deregulate RA metabolism, inducing sex-reversal.
Gathering all the information presented here for the QTL on Omy1, we concluded that there is a strong statistical and biological evidence of significant involvment in the sex determination and differentiation cascade of syndig1, tlx1, hells and gbf1, and, putatively also of vsx1, and myoz1. Notably, at the only exception of myoz1, all these proteins are relatively well conserved in terms of protein identity across various fishes and mammals (See Table K in S1 Tables).
arfgef3 as the best functionnal candidate for Omy20_a due to its activation of the estrogen/ERα signaling pathway
arfgef3 (ARFGEF family member 3) gene (named also big3) is a member of the big1/sec7p subfamily of ADP ribosylation factor-GTP exchange factors (ARF-GEFs). The protein is very well conserved across vertebrates (see Table K in S1 Tables). arfgef3 overexpression has been identified as one of the important mechanisms causing the activation of the estrogen/ERα signaling pathway in the hormone-related growth of breast cancer cells [73]. Control of estradiol synthesis could play a key role not only for ovarian, but also for testicular differentiation and sex change in fish [12].
khdrbs2, dst, hmgcll1 and csmd1 as functional candidates for QTL Omy20_b
Implication of khdrbs genes in sex determination and regulation of alternative splicing.
In nematodes, gld-1, the ortholog of khdrbs (KH domain-containing, RNA-binding, signal transduction-associated protein 2), is indispensable for oogenesis and meiotic prophase progression and it can stimulate sex determination of males in the hermaphrodite germ line [74, 75, 76]. In fruit fly, nsr, the ortholog of khdrbs regulates some male fertility genes in the primary [77]. In vertebrates, khdrbs (khdrbs1, khdrbs2 and khdrbs3) genes encode respectively for the SAM68, SLM1 and SLM2 proteins that have RNA binding and signal transduction activities [78,79]. In fish, the khdrbs1 was further duplicated through the teleost-specific whole genome duplication, forming khdrbs1a and khdrbs1b [80]. In zebrafish, all the khdrbs genes were found to be primarily expressed in the brain during early development and at the adult stage. khdrbs1a was also found to be expressed in the gonad primordium as well as in the gonads of adult fish along with khdrbs1b, and khdrbs3 [80]. In our study, eight intronic khdrbs2 variants were ranked among the 100 top SNPs in DAPC (see Table J in S1 Tables), and an indel/SNP variant was identified with a p-value < 0.002 in the discovery population (Table 4). In mouse, khdrbs1 and khdrbs3 are predominantly expressed in the brain and testis, while khdrbs2 is exclusively expressed in the brain [81]. khdrbs2 knockout mice are fertile whereas khdrbs1 knockout mice show impaired fertility in males [81]. khdrbs3 is involved in spermatogenesis via directly binding to RNA [81]. In addition, khdrbs1 knockout females displayed a reduction in the number of developing ovarian follicles, alteration of estrous cycles, and impaired fertility (e.g., [82]). It seems that the invertebrate genes gld-1 and nsr that are orthologs of khdrbs1 and khdrbs3, respectively, have similar roles in the regulation of gametogenesis [80].
dst and hmgcll1 as novel players in sex determination and differentiation in fish?
dst (dystonin) encodes a member of the plakin protein family of adhesion junction plaque proteins. Low temperatures significantly impact dst splicing in killifish, stickleback and zebrafish [83]. As far as we know, dst has not yet been demonstrated as being involved in sex determination or differentiation. Nevertheless, there is no statistical doubt that this gene is invoved in sex-reversal in rainbow trout. A SNP was highly significant (p-value = 0.0006) in the discovery population (Table 4) with homozygous TT dams exhibiting high ratios of progeny sex-reversal (see Table J in S1 Tables). In addition, four haplotypes within the gene ranked among the 60 top in the full RF analysis (see Table H in S1 Tables), and 15 intronic SNPs were ranked among the 100 top in DAPC, 10 remaining in the 50 top SNPs (see Table J in S1 Tables).
hmgcll1 (3-hydroxymethyl-3-methylglutaryl-CoA lyase-like 1) was first characterized as a lyase activity enzyme localized in extra-mitochondrial region involved in ketogenesis for energy production in nonhepatic animal tissues (see Table I in S1 Tables). A novel role of hmgcll1 in cell cycle regulation was recently suggested [84] that could be linked with a putative functional role on sex-reversal. Statistically, there is no doubt about the effect of this gene on sex-reversal in rainbow trout. Two SNPs within the gene were significant in two validation population (Table 3). In the discovery population, an upstream intergenic variant at position 34,345,558 bp was among the 50 top-ranked in DAPC, with homozygous TT dams exhibited high ratios of progeny sex-reversal (see Table J in S1 Tables).
csmd1 is associated to gonadal failure in mammals
The csmd1 (CUB and sushi multiple domains protein 1) gene was one of the best positional candidates in our RF and DAPC analyses with 8 SNPs being among the 50 top ones to discriminate the two groups of extreme phenotypes. The family of proteins csmd is one of the three types of the complement system-related proteins involved in the recognition of molecules in innate immune-system and in the central nervous system and csmd1 is predicted to act on several processes, including reproductive structure development (reviewed in [85]). The large transmembrane protein encoded by csmd1 is highly conserved between vertebrates with 89% amino acid sequence identity between trout and medaka and still 77% between trout and human or mouse (see Table K in S1 Tables). In humans and mice, csmd1 was associated with gonadal failure in both sexes [86]. Significant association between early menopause and rare 5′-deletions of csmd1 was detected in humans. In mice, knockout females show significant reduction in ovarian quality and breeding success, and csmd1-knockout males show increased rates of infertility and severe histological degeneration of the testes. No variants predicted with moderate or high impacts on protein expression in this gene were observed. But, as indicated by Lee at al. [86], intronic variants mainly located in introns 1 and 2 likely harbored functional elements that can influence csmd1 expression in testis.
lrrc59 and caskin2 as the best positional candidates for QTL Omy20_c
As far as we know, no direct effect of lrrc59 (leucine-rich repeat-containing protein 59) on sex determination is known. This gene enables RNA binding activity and cadherin binding activity and plays a key role in the regulation of local translation in the endoplasmic reticulum [87] (see Table I in S1 Tables). Statistically, it was the top-ranked haplotype on the full RF analysis (see Table H in S1 Tables), indicating a strong importance of lrrc59 to predict sex-reversal.
caskin2 (CASK Interacting Protein 2) downregulates genes associated with endothelial cell activation and upregulates genes associated with endothelial cell quiescence [88]. Morpholino knockdown of caskin2 in zebrafish results in abnormal vascular development, while caskin2 knockout mice are viable and fertile [88]. caskin2 is not well conserved across vertebrates, including only 54% AA identity between rainbow trout and either Medaka or Zebrafish (see Table K in S1 Tables). Statistically, two haplotypes including parts of caskin2 were among the top-ranked haplotypes in the full RF analysis (see Table H in S1 Tables). In addition, three SNPs were among the 50 top- ranked in DAPC (see Table J in S1 Tables). The expression of caskin2 was down-regulated in gonads of Gobiocypris rarus of both sexes, when exposed to a 7-day exposure to 17α-methyltestosterone which had a sex-reversal effect [89]. In Coho salmon, androgens play a major role in stimulating primary ovarian follicle development and in the transition into secondary follicles, in particular inducing a differential expression of caskin2 after 3-day treatment with non-aromatizable androgen 11-ketotestosterone [90].
Putative role of foxl3 in germ cell fate decision
Intriguingly, foxl3 (Forkhead Box L3), a co-paralog gene of foxl2, is located on Omy20 spanning from 22,942,515–22,948,650 bp, upstream the QTL regions we studied on the same chromosome. Although foxl3 is expressed in the gonads of teleosts, no variant within foxl3 were associated to sex-reversal in our study. However, we can speculate that foxl3 is involved in germ cell fate decision in rainbow trout. Indeed, foxl3 has been suggested to be involved in the onset of oocyte meiosis, and/or in the regulation of male specific genes during late testis development and testis maturation in salmonids [91, 92, 93]. It was also demonstrated in medaka that foxl3 expression is only maintained in female germ cells, and that functional sperm is produced in the ovary of a foxl3 loss-of-function mutant [94]. In addition, foxl3 was shown to initiate oogenesis in medaka through two genetically independent pathways, meiosis and follicular development, and that a third pathway indepentdent of foxl3 may also be hypothetized [95]. Last, foxl3 expression levels were strongly increased in gonads during the natural female-to-male sex reversal in the rice field eel (Monopterus albus). foxl3 seemed to be involved in physiological processes that promote oocyte degeneration in the ovotestis and stimulating spermatogenesis in spermatogonia [96]. Therefore, we can hypothetize that expression of foxl3 may be regulated by some other genes, presumably located in the QTL regions identified on Omy20 in our study, similarly to the long-distance cis-regulation observed for sox9 expression [57]. Further investigation should aim at validating such speculations, as well as the causative variants and precise functional roles of the identified genes in our QTL regions.
Interaction effects between spontaneous masculinization and rearing temperature in farming and ecological contexts
In addition to the genetic influence, it is well known that water temperature during development can modulate sex differentiation in many fish species and, in particular, high temperature is associated with masculinization [97]. In rainbow trout, high rearing temperature during the early phase of fry development after hatching was shown to increase the frequency of sex reversal in XX females to a greater or lesser extent depending on the genetic background of the trout lines [98,99]. From a fish farmer perspective, further investigation is still needed before any operational use of the genotypes at minor sex-modifier genes either to eradicate spontaneous sex-reversal in all-female production stocks or to produce progeny from free-hormone neomales in rainbow trout. Indeed, the high heritability of spontaneous maleness in XX individuals [30] suggests that the use of spontaneously masculinised individuals as progenitors of all-female stocks would increase the frequency of undesirable masculinised progeny. Improved knowledge about the joint effects of genetic basis and environmental factors determining spontaneous maleness in all-female stocks is needed to search for a trade-off combining together genetic and environmental control of gonad masculinization according to the destination of the fish (broodstock vs all-female production stock).
From an ecological point of view, the question of the extent of spontaneous sex reversal arises as a key one for conservation biology and wild fish population sex ratios in a global warming context [100]. A study on zebrafish [100] compared the impact of elevated-temperature on sex reversal in different strains with chromosomal or polygenic GSD. They showed that strains with a major sex-determining locus may not be immune to temperature driven sex reversal. Spontaneous sex reversed females in chinook salmon (O. tshawytscha), Nile tilapia (Oreochromis niloticus), zebrafish (Danio rerio) and lake trout (Salvelinus namaycush), have been observed at low frequency in the wild [1,25,100–103]. The impact of spontaneous masculinization in wild rainbow trout on population dynamics and fitness remains unknown and deserves more attention as this highly heritable phenomenon, driven by environmental factors, may become more frequent with warmer water temperature.
Conclusions
In the context of global warming, it is essential to better understand the genetic by environment interactions controlling spontaneous masculinization of rainbow trout, both in farmed environment but also in wild populations. While considerable scientific effort has been made over the last fifty years to unravel mechanisms for sex-determination and differentiation, the nature of genetic variations that regulate these mechanisms in fish is still not well-known. We confirmed on several French commercial rainbow trout populations the importance of some genomic regions on chromosomes Omy1, Omy12 and Omy20 for spontaneous masculinization of XX-individuals. Those regions can be considered as minor sex-determining regions in rainbow trout. Together, our results are consistent with a model in which spontaneous female-to-male sex-reversal in rainbow trout is associated with genetic factors able to reduce germ cell proliferation and arrest oogenesis. The main candidate genes that we suggested based on positional and functional information are syndig1, tlx1, hells and gbf1 on Omy1, and arfgef3, khdrbs2, dst, hmgcll1, csmd1, lrrc59 and caskin2 on Omy20. Most of these genes were previously indicated as playing roles in developmental pathways in vertebrates.
Supporting information
S1 Appendix.
Positional candidate genes on Omy12 and Omy20 without known evidence of a functional role linked to sex-reversal
https://doi.org/10.1371/journal.pone.0313464.s001
(DOCX)
S1 Tables.
This excel file includes 11 supplementary tables whose titles are listed below. Table A. Number of progeny (female, intersex and male) and proportion of neomales per sequenced dam. Table B. Positions on the Swanson reference genome and flanking sequences of the 192 SNPs used in validation. Table C. List of the 192 SNPs detected in 6 validation populations (named A to F) with their positions on Swanson (SW) and Arlee (AR) reference genomes. Table D. List of most significative SNPs found in Omy1, Omy12 and Omy20 QTL. Table E. Mean, Minimum and Maximum values of the 30 haplotypes with the highest Gini index over 100 runs of Random Forests for Omy1 QTL. Table F. Mean, Minimum and Maximum values of the 10 haplotypes with the highest Gini index over 100 runs of Random Forests for Omy12 QTL. Table G. Mean, Minimum and Maximum values of the 30 haplotypes with the highest Gini index over 100 runs of Random Forests for the 3 Omy 20 QTL (Z3, Z4, Z5). Table H. Mean values of the 60 haplotypes with the highest Gini index over 100 runs of Random Forests for the QTL regions of Omy1 (Z1), Omy12 (Z2) and Omy20 (Z3 to Z5). Table I. List of potential candidate genes for QTL on Omy1, Omy12 and Omy20 with their descriptions on GeneCards, UniProtKB/Swiss-Prot and NCBI databases. Table J. Location, annotation, alleles and genotypes for sequenced dams of the 100 most important SNPs in DAPC analysis. Genotypes in blue are considered as male genotypes, genotypes in orange as female genotypes and genotypes in light orange are heterozygous. Table K. Protein sequence identities of genes associated to sex-reversal in rainbow trout across six species in Vertebrates.
https://doi.org/10.1371/journal.pone.0313464.s002
(XLS)
Acknowledgments
The authors are grateful to the companies “Les Fils de Charles Murgat”, “Aqualande”, “Bretagne Truite”, “Font Rome” and “Viviers de Sarrance” for fish samples and data collection.
References
- 1. Baroiller JF, D’Cotta H. The reversible sex of gonochoristic fish: insights and consequences. Sex Dev. 2016;10(5–6):242–66. pmid:27907925
- 2. Pandian TJ, Sheela SG. Hormonal induction of sex reversal in fish. Aquaculture. 1995;138(1–4):1–22.
- 3. Piferrer F. Endocrine sex control strategies for the feminization of teleost fish. Aquaculture. 2001;197(1–4):229–81.
- 4. Aksnes A, Gjerde B, Roald SO. Biological, chemical and organoleptic changes during maturation of farmed Atlantic salmon, Salmo salar. Aquaculture. 1986;53(1):7–20.
- 5. Cleveland BM, Kenney PB, Manor ML, Weber GM. Effects of feeding level and sexual maturation on carcass and fillet characteristics and indices of protein degradation in rainbow trout (Oncorhynchus mykiss). Aquaculture. 2012;338–341:228–36.
- 6. Manor ML, Weber GM, Salem M, Yao J, Aussanasuwannakul A, Kenney PB. Effect of sexual maturation and triploidy on chemical composition and fatty acid content of energy stores in female rainbow trout, Oncorhynchus mykiss. Aquaculture. 2012;364–365:312–21.
- 7. Azuma T, Takeda K, Doi T, Muto K, Akutsu M, Sawada M, et al. The influence of temperature on sex determination in sockeye salmon Oncorhynchus nerka. Aquaculture. 2004;234(1–4):461–73.
- 8. Robles V, Riesco MF, Psenicka M, Saito T, Valcarce DG, Cabrita E, et al. Biology of teleost primordial germ cells (PGCs) and spermatogonia: Biotechnological applications. Aquaculture. 2017;472:4–20.
- 9. Weber C, Capel B. Sex reversal. Curr Biol. 2018;28(21):R1234–6. pmid:30399341
- 10. Swain A, Lovell-Badge R. A molecular approach to sex determination in mammals. Acta Paediatr Suppl. 1997;423:46–9. pmid:9401538
- 11. Nagahama Y, Chakraborty T, Paul-Prasanth B, Ohta K, Nakamura M. Sex determination, gonadal sex differentiation, and plasticity in vertebrate species. Physiol Rev. 2021;101(3):1237–308. pmid:33180655
- 12. Herpin A, Schartl M. Plasticity of gene-regulatory networks controlling sex determination: of masters, slaves, usual suspects, newcomers, and usurpators. EMBO Rep. 2015;16(10):1260–74. pmid:26358957
- 13. Lin YT, Capel B. Cell fate commitment during mammalian sex determination. Curr Opin Genet Dev. 2015;32:144–52. pmid:25841206
- 14. Devlin RH, Nagahama Y. Sex determination and sex differentiation in fish: an overview of genetic, physiological, and environmental influences. Aquaculture. 2002;208:191–364.
- 15. Guiguen Y, Fostier A, Piferrer F, Chang CF. Ovarian aromatase and estrogens: a pivotal role for gonadal sex differentiation and sex change in fish. Gen Comp Endocrinol. 2010;165(3):352–66. pmid:19289125
- 16. Wargelius A, Leininger S, Skaftnesmo KO, Kleppe L, Andersson E, Taranger GL, et al. Dnd knockout ablates germ cells and demonstrates germ cell independent sex differentiation in Atlantic salmon. Sci Rep. 2016;6:21284. pmid:26888627
- 17. Slanchev K, Stebler J, de la Cueva-Méndez G, Raz E. Development without germ cells: the role of the germ line in zebrafish sex differentiation. Proc Natl Acad Sci U S A. 2005;102(11):4074–9. pmid:15728735
- 18. Siegfried KR, Nüsslein-Volhard C. Germ line control of female sex determination in zebrafish. Dev Biol. 2008;324(2):277–87. pmid:18930041
- 19. Kurokawa H, Saito D, Nakamura S, Katoh-Fukui Y, Ohta K, Baba T, et al. Germ cells are essential for sexual dimorphism in the medaka gonad. Proc Natl Acad Sci U S A. 2007;104(43):16958–63. pmid:17940041
- 20. Nishimura T, Yamada K, Fujimori C, Kikuchi M, Kawasaki T, Siegfried KR, et al. Germ cells in the teleost fish medaka have an inherent feminizing effect. PLoS Genet. 2018;14(3):e1007259. pmid:29596424
- 21. Herpin A, Schindler D, Kraiss A, Hornung U, Winkler C, Schartl M. Inhibition of primordial germ cell proliferation by the medaka male determining gene Dmrt I bY. BMC Dev Biol. 2007;7:99. pmid:17760954
- 22. Kobayashi T, Matsuda M, Kajiura-Kobayashi H, Suzuki A, Saito N, Nakamoto M, et al. Two DM domain genes, DMY and DMRT1, involved in testicular differentiation and development in the medaka, Oryzias latipes. Dev Dyn. 2004;231(3):518–26. pmid:15376325
- 23. Pan Q, Kay T, Depincé A, Adolfi M, Schartl M, Guiguen Y, et al. Evolution of master sex determiners: TGF-β signalling pathways at regulatory crossroads. Philos Trans R Soc Lond B Biol Sci. 2021;376(1832):20200091. pmid:34247498
- 24. Chen J, Zhu Z, Hu W. Progress in research on fish sex determining genes. Water Biology and Security. 2022;1(1):100008.
- 25. Yano A, Nicol B, Jouanno E, Quillet E, Fostier A, Guyomard R, et al. The sexually dimorphic on the Y-chromosome gene (sdY) is a conserved male-specific Y-chromosome sequence in many salmonids. Evol Appl. 2013;6(3):486–96. pmid:23745140
- 26. Bertho S, Herpin A, Branthonne A, Jouanno E, Yano A, Nicol B, et al. The unusual rainbow trout sex determination gene hijacked the canonical vertebrate gonadal differentiation pathway. Proc Natl Acad Sci U S A. 2018;115(50):12781–6. pmid:30463951
- 27. Bertho S, Pasquier J, Pan Q, Le Trionnaire G, Bobe J, Postlethwait JH, et al. Foxl2 and its relatives are evolutionary conserved players in gonadal sex differentiation. Sex Dev. 2016;10(3):111–29. pmid:27441599
- 28. Pailhoux E, Vigier B, Chaffaux S, Servel N, Taourit S, Furet JP, et al. A 11.7-kb deletion triggers intersexuality and polledness in goats. Nat Genet. 2001;29(4):453–8. pmid:11726932
- 29. Pannetier M, Elzaiat M, Thépot D, Pailhoux E. Telling the story of XX sex reversal in the goat: highlighting the sex-crossroad in domestic mammals. Sex Dev. 2012;6(1–3):33–45. pmid:22094227
- 30. Fraslin C, Phocas F, Bestin A, Charles M, Bernard M, Krieg F, et al. Genetic determinism of spontaneous masculinisation in XX female rainbow trout: new insights using medium throughput genotyping and whole-genome sequencing. Sci Rep. 2020;10(1):17693. pmid:33077766
- 31.
Guyomard R, Guiguen Y, Bernard M, Charlet A, Dechamp N, Hervet C, et al. RAD-seq mapping of spontaneous masculinization in XX doubled haploid rainbow trout lines. 10th WCGALP, Vancouver, BC, Canada, 2014.
- 32. Pearse DE, Barson NJ, Nome T, Gao G, Campbell MA, Abadía-Cardoso A, et al. Publisher Correction: Sex-dependent dominance maintains migration supergene in rainbow trout. Nat Ecol Evol. 2020;4(1):170. pmid:31819240
- 33. Gao G, Magadan S, Waldbieser GC, Youngblood RC, Wheeler PA, Scheffler BE, et al. A long reads-based de-novo assembly of the genome of the Arlee homozygous line reveals chromosomal rearrangements in rainbow trout. G3 (Bethesda). 2021;11(4):jkab052. pmid:33616628
- 34. Palti Y, Gao G, Miller MR, Vallejo RL, Wheeler PA, Quillet E, et al. A resource of single-nucleotide polymorphisms for rainbow trout generated by restriction-site associated DNA sequencing of doubled haploids. Mol Ecol Resour. 2014;14(3):588–96. pmid:24251403
- 35. Poplin R, Ruano-Rubio V, DePristo MA, Fennell TJ, Carneiro MO, Van der Auwera GA, et al. Scaling accurate genetic variant discovery to tens of thousands of samples. bioRxiv. 2017.
- 36. Garrison E, Marth G. Haplotype-based variant detection from short-read sequencing. 2012.
- 37. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, et al. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10(2):giab008. pmid:33590861
- 38. DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43(5):491–8. pmid:21478889
- 39.
R Development Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria2023Conference
- 40. Mehta CR, Patel NR. A network algorithm for performing Fisher’s exact test in r x c contingency tables. J Am Stat Assoc. 1983; 78:427–34.
- 41. Mehta CR, Patel NR. Algorithm 643: FEXACT, a FORTRAN subroutine for Fisher's exact test on unordered r x c contingency tables. ACM Trans Math Softw. 1986;12(2):154–61.
- 42. Clarkson DB, Fan Y, Joe H. A remark on algorithm 643: FEXACT: an algorithm for performing Fisher's exact test in r x c contingency tables. ACM Trans Math Softw. 1993;19(4):484–8.
- 43. Bernard M, Dehaullon A, Gao G, Paul K, Lagarde H, Charles M, et al. Development of a High-Density 665 K SNP Array for Rainbow Trout Genome-Wide Genotyping. Front Genet. 2022;13:941340. pmid:35923696
- 44. Breiman L. Random Forests. Machine Learning. 2001;45:5–32.
- 45. Chen X, Ishwaran H. Random forests for genomic data analysis. Genomics. 2012;99(6):323–9. pmid:22546560
- 46. Liaw A, Wiener M. Classification and Regression by randomForest. 2002 [cited 6 Feb 2024]. In: R News, 2(3), 18-22. Available from: https://CRAN.R-project.org/doc/Rnews/
- 47. Jombart T, Devillard S, Balloux F. Discriminant analysis of principal components: a new method for the analysis of genetically structured populations. BMC Genet. 2010;11:94. pmid:20950446
- 48. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. pmid:17701901
- 49. Jombart T. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics. 2008;24(11):1403–5. pmid:18397895
- 50. Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 2012;6(2):80–92. pmid:22728672
- 51. Kalashnikova E, Lorca RA, Kaur I, Barisone GA, Li B, Ishimaru T, et al. SynDIG1: an activity-regulated, AMPA- receptor-interacting transmembrane protein that regulates excitatory synapse development. Neuron. 2010;65(1):80–93. pmid:20152115
- 52. Lovero KL, Blankenship SM, Shi Y, Nicoll RA. SynDIG1 promotes excitatory synaptogenesis independent of AMPA receptor trafficking and biophysical regulation. PLoS One. 2013;8(6):e66171. pmid:23785483
- 53. Chenaux G, Matt L, Hill TC, Kaur I, Liu X-B, Kirk LM, et al. Loss of SynDIG1 Reduces Excitatory Synapse Maturation But Not Formation In Vivo. eNeuro. 2016;3(5):ENEURO.0130-16.2016. pmid:27800545
- 54. Bonnet A, Servin B, Mulsant P, Mandon-Pepin B. Spatio-temporal gene expression profiling during in vivo early ovarian folliculogenesis: integrated transcriptomic study and molecular signature of early follicular growth. PLoS ONE. 2015;10(11):e0141482.
- 55. Ruane NM, Lambert JG, Goos HJ, Komen J. Hypocorticism and interrenal hyperplasia are not directly related to masculinization in XX mas(-1)/mas(-1) carp, Cyprinus carpio. Gen Comp Endocrinol. 2005;143(1):66–74. pmid:15993106
- 56. Zhai G, Shu T, Xia Y, Lu Y, Shang G, Jin X, et al. Characterization of Sexual Trait Development in cyp17a1-Deficient Zebrafish. Endocrinology. 2018;159(10):3549–62. pmid:30202919
- 57. Terao M, Ogawa Y, Takada S, Kajitani R, Okuno M, Mochimaru Y, et al. Turnover of mammal sex chromosomes in the sry-deficient Amami spiny rat is due to male-specific upregulation of Sox9. Proc Natl Acad Sci U S A. 2022;119(49):e2211574119. pmid:36442104
- 58. Au CE, Hermo L, Byrne E, Smirle J, Fazel A, Simon PHG, et al. Expression, sorting, and segregation of Golgi proteins during germ cell differentiation in the testis. Mol Biol Cell. 2015;26(22):4015–32. pmid:25808494
- 59. Geiman TM, Tessarollo L, Anver MR, Kopp JB, Ward JM, Muegge K. Lsh, a SNF2 family member, is required for normal murine development. Biochim Biophys Acta. 2001;1526(2):211–20. pmid:11325543
- 60. Dennis K, Fan T, Geiman T, Yan Q, Muegge K. Lsh, a member of the SNF2 family, is required for genome-wide methylation. Genes Dev. 2001;15(22):2940–4. pmid:11711429
- 61. De La Fuente R, Baumann C, Fan T, Schmidtmann A, Dobrinski I, Muegge K. Lsh is required for meiotic chromosome synapsis and retrotransposon silencing in female germ cells. Nat Cell Biol. 2006;8(12):1448–54. pmid:17115026
- 62. Xi S, Geiman TM, Briones V, Guang Tao Y, Xu H, Muegge K. Lsh participates in DNA methylation and silencing of stem cell genes. Stem Cells. 2009;27(11):2691–702. pmid:19650037
- 63. Zeng W, Baumann C, Schmidtmann A, Honaramooz A, Tang L, Bondareva A, et al. Lymphoid-specific helicase (HELLS) is essential for meiotic progression in mouse spermatocytes. Biol Reprod. 2011;84(6):1235–41. pmid:21349825
- 64. Xie L, Tao Y, Wu R, Ye Q, Xu H, Li Y. Congenital asplenia due to a tlx1 mutation reduces resistance to Aeromonas hydrophila infection in zebrafish. Fish Shellfish Immunol. 2019;95:538–45. pmid:31678534
- 65. Roberts CW, Shutter JR, Korsmeyer SJ. Hox11 controls the genesis of the spleen. Nature. 1994;368(6473):747–9. pmid:7908720
- 66. Dear TN, Colledge WH, Carlton MB, Lavenir I, Larson T, Smith AJ, et al. The Hox11 gene is essential for cell survival during spleen development. Development. 1995;121(9):2909–15. pmid:7555717
- 67. Rice KL, Izon DJ, Ford J, Boodhoo A, Kees UR, Greene WK. Overexpression of stem cell associated ALDH1A1, a target of the leukemogenic transcription factor TLX1/HOX11, inhibits lymphopoiesis and promotes myelopoiesis in murine hematopoietic progenitors. Leuk Res. 2008;32(6):873–83. pmid:18082256
- 68. Lenti E, Farinello D, Yokoyama KK, Penkov D, Castagnaro L, Lavorgna G, et al. Transcription factor TLX1 controls retinoic acid signaling to ensure spleen development. J Clin Invest. 2016;126(7):2452–64. pmid:27214556
- 69. Kashimada K, Svingen T, Feng C-W, Pelosi E, Bagheri-Fam S, Harley VR, et al. Antagonistic regulation of Cyp26b1 by transcription factors SOX9/SF1 and FOXL2 during gonadal development in mice. FASEB J. 2011;25(10):3561–9. pmid:21757499
- 70. Rhinn M, Dollé P. Retinoic acid signalling during development. Development. 2012;139(5):843–58. pmid:22318625
- 71. Bowles J, Knight D, Smith C, Wilhelm D, Richman J, Mamiya S, et al. Retinoid signaling determines germ cell fate in mice. Science. 2006;312(5773):596–600. pmid:16574820
- 72. Zangen D, Kaufman Y, Banne E, Weinberg-Shukron A, Abulibdeh A, Garfinkel BP, et al. Testicular differentiation factor SF-1 is required for human spleen development. J Clin Invest. 2014;124(5):2071–5. pmid:24905461
- 73. Kim JW, Akiyama M, Park JH, Lin ML, Shimo A, Ueki T, et al. Activation of an estrogen/estrogen receptor signaling by BIG3 through its inhibitory effect on nuclear transport of PHB2/REA in breast cancer. Cancer Sci. 2009;100(8):1468–78. pmid:19496786
- 74. Jones AR, Schedl T. Mutations in gld-1, a female germ cell-specific tumor suppressor gene in Caenorhabditis elegans, affect a conserved domain also found in Src-associated protein Sam68. Genes Dev. 1995;9(12):1491–504. pmid:7601353
- 75. Lee MH, Schedl T. Identification of in vivo mRNA targets of GLD-1, a maxi-KH motif containing protein required for C. elegans germ cell development. Genes Dev. 2001;15(18):2408–20. pmid:11562350
- 76. Lee MH, Schedl T. C. elegans star proteins, GLD-1 and ASD-2, regulate specific RNA targets to control development. Adv Exp Med Biol. 2010;693:106–22. pmid:21189689
- 77. Ding Y, Zhao L, Yang S, Jiang Y, Chen Y, Zhao R, et al. A young Drosophila duplicate gene plays essential roles in spermatogenesis by regulating several Y-linked male fertility genes. PLoS Genet. 2010;6(12):e1001255. pmid:21203494
- 78. Frisone P, Pradella D, Di Matteo A, Belloni E, Ghigna C, Paronetto MP. SAM68: signal transduction and RNA metabolism in human cancer. Biomed Res Int. 2015;2015:528954. pmid:26273626
- 79. Ehrmann I, Fort P, Elliott DJ. STARs in the CNS. Biochem Soc Trans. 2016;44(4):1066–72. pmid:27528753
- 80. Wang S, Yang Q, Wang Z, Feng S, Li H, Ji D, et al. Evolutionary and expression analyses show co-option of khdrbs genes for origin of vertebrate brain. Front Genet. 2018;8:225. pmid:29354154
- 81. Zhang LY, Zeng M, Chen P, Sun HQ, Tao DC, Liu YQ, et al. Identification of messenger RNA substrates for mouse T-STAR. Biochemistry (Mosc). 2009;74(11):1270–7. pmid:19916944
- 82. Ehrmann I, Dalgliesh C, Liu Y, Danilenko M, Crosier M, Overman L, et al. The tissue-specific RNA binding protein T-STAR controls regional splicing patterns of neurexin pre-mRNAs in the brain. PLoS Genet. 2013;9(4):e1003474. pmid:23637638
- 83. Healy TM, Schulte PM. Patterns of alternative splicing in response to cold acclimation in fish. J Exp Biol. 2019;222(Pt 5):jeb193516. pmid:30692167
- 84. Park JH, Woo YM, Youm EM, Hamad N, Won HH, Naka K, et al. HMGCLL1 is a predictive biomarker for deep molecular response to imatinib therapy in chronic myeloid leukemia. Leukemia. 2018;33(6):1439–50.
- 85. Yuzaki M. Synapse formation and maintenance by C1q family proteins: a new class of secreted synapse organizers. Eur J Neurosci. 2010;32(2):191–7. pmid:20646056
- 86. Lee AS, Rusch J, Lima AC, Usmani A, Huang N, Lepamets M, et al. Rare mutations in the complement regulatory gene CSMD1 are associated with male and female infertility. Nat Commun. 2019;10(1):4626. pmid:31604923
- 87. Hannigan MM, Hoffman AM, Thompson JW, Zheng T, Nicchitta CV. Quantitative Proteomics Links the LRRC59 Interactome to mRNA Translation on the ER Membrane. Mol Cell Proteomics. 2020;19(11):1826–49. pmid:32788342
- 88. Mueller SB, Arora S, Mattocks N, Gurley SB, Kontos CD. Abstract 14: Caskin2 is a Novel Regulator of Endothelial Cell Quiescence. ATVB. 2014;34(suppl_1).
- 89. Gao J, Liu S, Zhang Y, Yang Y, Yuan C, Chen S, et al. Effects of 17 α-methyltestosterone on transcriptome, gonadal histology and sex steroid hormones in rare minnow Gobiocypris rarus. Comp Biochem Physiol Part D Genomics Proteomics. 2015;15:20–7. pmid:26070167
- 90. Monson C, Forsgren K, Goetz G, Harding L, Swanson P, Young G. A teleost androgen promotes development of primary ovarian follicles in coho salmon and rapidly alters the ovarian transcriptome. Biol Reprod. 2017;97(5):731–45. pmid:29045593
- 91. Baron D, Cocquet J, Xia X, Fellous M, Guiguen Y, Veitia RA. An evolutionary and functional analysis of FoxL2 in rainbow trout gonad differentiation. J Mol Endocrinol. 2004;33(3):705–15. pmid:15591029
- 92. von Schalburg KR, Yasuike M, Davidson WS, Koop BF. Regulation, expression and characterization of aromatase (cyp19b1) transcripts in ovary and testis of rainbow trout (Oncorhynchus mykiss). Comp Biochem Physiol B Biochem Mol Biol. 2010;155(2):118–25. pmid:19895900
- 93. von Schalburg KR, Yasuike M, Yazawa R, de Boer JG, Reid L, So S, et al. Regulation and expression of sexual differentiation factors in embryonic and extragonadal tissues of Atlantic salmon. BMC Genomics. 2011;12:31. pmid:21232142
- 94. Nishimura T, Sato T, Yamamoto Y, Watakabe I, Ohkawa Y, Suyama M, et al. Sex determination. foxl3 is a germ cell-intrinsic factor involved in sperm-egg fate decision in medaka. Science. 2015;349(6245):328–31. pmid:26067255
- 95. Kikuchi M, Nishimura T, Ishishita S, Matsuda Y, Tanaka M. foxl3, a sexual switch in germ cells, initiates two independent molecular pathways for commitment to oogenesis in medaka. Proc Natl Acad Sci U S A. 2020;117(22):12174–81. pmid:32409601
- 96. Gao Y, Jia D, Hu Q, Li D. Foxl3, a Target of miR-9, stimulates spermatogenesis in spermatogonia during natural sex change in monopterus albus. Endocrinology. 2016;157(11):4388–99. pmid:27583788
- 97. Hattori RS, Castañeda-Cortés DC, Arias Padilla LF, Strobl-Mazzulla PH, Fernandino JI. Activation of stress response axis as a key process in environment-induced sex plasticity in fish. Cell Mol Life Sci. 2020;77(21):4223–36. pmid:32367192
- 98. Magerhans A, Müller-Belecke A, Hörstgen-Schwark G. Effect of rearing temperatures post hatching on sex ratios of rainbow trout (Oncorhynchus mykiss) populations. Aquaculture. 2009;294(1–2):25–9.
- 99. Valdivia K, Jouanno E, Volff J-N, Galiana-Arnoux D, Guyomard R, Helary L, et al. High temperature increases the masculinization rate of the all-female (XX) rainbow trout “Mal” population. PLoS One. 2014;9(12):e113355. pmid:25501353
- 100. Valdivieso A, Wilson CA, Amores A, da Silva Rodrigues M, Nóbrega RH, Ribas L, et al. Environmentally-induced sex reversal in fish with chromosomal vs. polygenic sex determination. Environ Res. 2022;213:113549. pmid:35618011
- 101. Cavileer TD, Hunter SS, Olsen J, Wenburg J, Nagler JJ. A Sex‐Determining Gene (sdY) assay shows discordance between phenotypic and genotypic sex in wild populations of chinook salmon. Trans Am Fish Soc. 2015;144(2):423–30.
- 102. Bezault E, Clota F, Derivaz M, Chevassus B, Baroiller J-F. Sex determination and temperature-induced sex differentiation in three natural populations of Nile tilapia (Oreochromis niloticus) adapted to extreme temperature conditions. Aquaculture. 2007;272:S3–16.
- 103. Shinomiya A, Otake H, Hamaguchi S, Sakaizumi M. Inherited XX sex reversal originating from wild medaka populations. Heredity (Edinb). 2010;105(5):443–8. pmid:20424640