Into the Wild: A novel wild-derived inbred strain resource expands the genomic and phenotypic diversity of laboratory mouse models

The laboratory mouse has served as the premier animal model system for both basic and preclinical investigations for over a century. However, laboratory mice capture only a subset of the genetic variation found in wild mouse populations, ultimately limiting the potential of classical inbred strains to uncover phenotype-associated variants and pathways. Wild mouse populations are reservoirs of genetic diversity that could facilitate the discovery of new functional and disease-associated alleles, but the scarcity of commercially available, well-characterized wild mouse strains limits their broader adoption in biomedical research. To overcome this barrier, we have recently developed, sequenced, and phenotyped a set of 11 inbred strains derived from wild-caught Mus musculus domesticus. Each of these “Nachman strains” immortalizes a unique wild haplotype sampled from one of five environmentally distinct locations across North and South America. Whole genome sequence analysis reveals that each strain carries between 4.73–6.54 million single nucleotide differences relative to the GRCm39 mouse reference, with 42.5% of variants in the Nachman strain genomes absent from current classical inbred mouse strain panels. We phenotyped the Nachman strains on a customized pipeline to assess the scope of disease-relevant neurobehavioral, biochemical, physiological, metabolic, and morphological trait variation. The Nachman strains exhibit significant inter-strain variation in >90% of 1119 surveyed traits and expand the range of phenotypic diversity captured in classical inbred strain panels. These novel wild-derived inbred mouse strain resources are set to empower new discoveries in both basic and preclinical research.


Introduction
Inbred mouse strains have served as the workhorses of mammalian genetics for over a century [1].Standardized inbred strain backgrounds ensure experimental reproducibility across labs and experiments, provide the backbone for mechanistic investigations into gene and pathway function, and enable testing of genetically identical cohorts across different treatments, exposures, and perturbations.Inbred strains also provide platforms for community resource development, including comprehensive gene knockout panels [2,3] and phenome resources [4].
The classical inbred (CI) mouse strains were developed from a small number of founder mice purpose-bred by mouse fanciers for traits of interest in the early 1900s [5,6].As a result, the CI strains capture a limited subset of the genetic variation found in wild mouse populations [7,8].Many CI strains have inherited large stretches of their genome identical by descent, such that pairwise strain comparisons yield numerous genomic regions where segregating variation is reduced to or near zero [8,9].Furthermore, due to their unique origins and history of selective breeding, the complex architecture of trait variation in inbred strains may not faithfully model the genetic architecture of phenotypic variation in human populations [10].
Wild mouse populations harbor many predicted functional and disease-associated alleles, the majority of which are not present in CI mouse strains and have therefore never been experimentally tested in the laboratory [8,11].Thus, wild mice present an untapped opportunity for advancing new biomedical research discoveries [7,8,11].In contrast to the history of intense artificial selection and subspecies admixture that has molded the genomes of classical laboratory strains, the genetic diversity observed in wild mice reflects the interplay of selection, genetic drift, mutation, and migration, mirroring the natural population genetic processes that have sculpted the contemporary landscape of human genomic diversity.Wild mice may therefore better approximate the diversity and organization of functional genetic variation in human populations than CI strains, including genetic variation influencing responses to dietary challenges, pharmaceutical interventions, and toxin exposures.
Despite these potential advantages, multiple challenges stand in the way of using wildcaught mice in biomedical research.Trapping wild house mice is laborious, especially if one is interested in assembling a large sample of unrelated individuals.Further, wild mice are genetically unique, preventing experimental studies that require reproducible or controlled genetic backgrounds.In addition, phenotypic variation in wild mice is influenced by differences in age, reproductive history, health status, and other, typically unknown, environmental exposures.Finally, wild mice are also vectors for numerous pathogens that pose a threat to human health and the health status of laboratory mouse colonies.
Wild-derived inbred mouse strains (WDIS) present a powerful intermediary between CI and wild mice.WDIS are developed from wild-caught mice that are brother-sister mated in a laboratory environment for >20 generations, thereby immortalizing a single haplotype from the wild in an inbred state.Thus, WDIS combine the reproducibility and fixed genetic background of inbred mouse models with the natural diversity present in wild mouse populations.WDIS have been strategically utilized in gene mapping studies to introduce increased diversity [12][13][14][15][16], profiled in immunological [17], metabolic [18], and reproductive studies [19][20][21], and featured as founders to current mouse diversity populations such as the Collaborative Cross (CC) and Diversity Outbred (DO; [22,23]).Indeed, the majority of quantitative trait loci (QTL) identified in mapping studies in the CC and DO are attributable to the allelic effects of one or more of the three wild-derived founder haplotypes [24][25][26][27].
Despite their realized power, only a modest number of wild-derived inbred strains are commercially available (S1 Table ).Of these, many are poor breeders, are maintained at low (or undocumented) health status, or have few associated genomic resources.These considerations present notable obstacles to their widespread use in biomedical research.Furthermore, whereas the genomes of CI strains are largely derived from one house mouse subspecies (M.m. domesticus) [6], many WDIS are representatives of alternative house mouse subspecies that exhibit variable degrees of reproductive isolation from M. m. domesticus.Crosses between these WDIS and CI strains often expose multilocus incompatibilities linked to hybrid sterility [28,29], an outcome that further limits their practical utility in genetic studies.
Recently, we developed a panel of ~25 WDIS from wild-caught M. m. domesticus from five locations across North and South America: Saratoga Springs, New York, USA; Gainesville, Florida, USA; Manaus, Brazil; Tucson, Arizona, USA; and Edmonton, Alberta, Canada (Fig 1).These sampling locations are defined by distinct ecosystems and climates, including tropical rainforest (Manaus), desert (Tucson), temperate forest (Saratoga Springs), prairie (Edmonton), and wetland (Gainesville) habitats.As a result, wild mice from these regions have been subject to distinct selective pressures and have evolved unique morphological and physiological adaptations to their environments [30][31][32][33].Recent work has also uncovered dramatic transcriptomic differences between populations, revealing divergence at the functional genomic level [32].Taken together, data from wild mice sampled from these five geographic regions portend the rich potential for their descendant inbred strains to serve as powerful new mouse models of susceptibility and resilience to numerous diseases, phenotypes, and conditions relevant to human health.
From 2019-2022, we imported a representative subset of 11 WDIS from the broader parent Nachman panel to The Jackson Laboratory (JAX), including at least one inbred line per geographic location (S2 Table ).Here, we introduce this novel diverse mouse strain resource, including the extent of genomic and phenotypic diversity across these strains and their relationship to CI strains.Our strain survey emphasizes the collective promise of these Nachman strains to advance biomedical discoveries into multiple trait domains, systems genetics analysis, and fundamental principles of evolutionary biology.

Generating the Nachman Panel
Wild house mice were caught in 2012-2013 from five geographic locations (Fig 1).Animals were transported to UC Berkeley, and mice from each geographic region were randomly paired to create inbred lines which were propagated through brother-sister mating for at least 10 generations.Initially, ~10 independent lines were established from each location.No Map of sample locations and summary of inbred strains imported to the JAX Repository.Inbred strains were developed from wild-caught mice trapped at 5 sample locations across North and South America: Manaus, Brazil (MANB/NachJ, MANE/NachJ, MANF/NachJ); Saratoga Springs, New York, USA (SARA/NachJ, SARB/NachJ, SARC/NachJ); Gainesville, Florida, USA (GAIC/NachJ); Tucson, Arizona, USA (TUCA/NachJ, TUCB/ NachJ); and Edmonton, Alberta, Canada (EDMB/NachJ).Strain GAIA/NachJ was successfully imported but has been since discontinued from the JAX Repository due to poor breeding and is not listed.Inbreeding generation numbers are presented in the following format: number of inbreeding generations in Nachman Colony prior to importation at JAX + number of inbreeding generations in JAX's Importation Facility + number of inbreeding generations in the JAX Repository.Inbreeding generation numbers are imprecise owing to the use of inter-generational crosses in JAX's importation facility to expedite colony expansion for oocyte harvests.This ambiguity is denoted by a "?" in the inbreeding generation supplied in the strain table.The map base layer was made with Natural Earth, which maintains all raster and vector map data in the public domain (https://www.naturalearthdata.com/about/terms-of-use/). Specifically, the base layer was produced using the rnaturalearth package for R by invoking the command ne_countries(continent = c("North America", "South America"), scale = "medium", returnclass = "sf").
https://doi.org/10.1371/journal.pgen.1011228.g001attempts were made to rescue lines that exhibited infertility due to inbreeding depression and half of the initiated lines eventually became extinct.At the time of writing, 25 of the initiated lines remain in Dr. Nachman's strain holdings at UC Berkeley.
A subset of 20 strains were selected for importation to the JAX Repository (S2 Table ).Of these strains, 11 were successfully rederived via in vitro fertilization (IVF) and embryo transfer to a pseudopregnant dam and integrated into production breeding colonies (Fig 1).Reasons for rederivation failure were complex and variable, ranging from poor breeding, failure to recover sufficient numbers of oocytes for IVF, no live births following multiple embryo transfers, and accidental strain contamination (S2 Table ).One successfully imported strain (GAIA/ NachJ) was subsequently terminated due to poor breeding performance.

Breeding performance
Many wild-derived inbred strains breed poorly, a consideration that limits their practical utility in biomedical research.Our strategic propagation of only the best performing inbreeding lineages derived from each wild-caught founder pair should ensure that resulting inbred strains are vigorous breeders.Indeed, the inbred Nachman lines breed reliably across two independent animal facilities (Fig 2).Fewer than 25% of established matings are non-productive and average weaned litter sizes for most strains are between 3-6 pups.
While comprehensive diallele crosses have not yet been performed, there are no signs of F1 sterility or reduced fertility in the few F1 hybrids between independent Nachman strains that have been generated to date (S3 Table).To the contrary, testis weight and sperm density measurements in F1 hybrids exceed fertility metrics quantified in the inbred parental lines and classical inbred strains, revealing hybrid vigor (S1 Fig and S3 Table ).Efforts to more comprehensively profile reproductive traits in the inbred Nachman strains and their derivative F1 hybrids are on-going, and we expect our future results to strengthen empirical support for the early trends reported here.We note that these preliminary findings fall in contrast to observations from diallele crosses between the inbred founder strains of the DO and CC [34], which include strains from reproductively isolated subspecies.Many incipient inbred CC strains were lost due to male infertility [35] and many surviving strains exhibit poor breeding performance, male infertility, and extreme sex ratio distortion [36].Thus, strains in the Nachman WDIS panel comprise a shelf-stable mouse diversity resource with limited potential to uncover genetic incompatibilities in experimental crosses between Nachman strains or between Nachman strains and CI strains.

Cytogenetic characterization of Nachman Strain Karyotypes
Wild M. m. domesticus populations harbor frequent Robertsonian chromosomal translocations that give rise to considerable karyotypic diversity across Europe [37].Hybrid mice from crosses between different karyotypic races exhibit reduced fitness owing to altered chromosome dynamics and inefficient chromosome segregation during meiosis [37][38][39][40].To confirm the absence of largescale karyotypic alterations or changes in chromosome number among strains, we generated spermatoctye cell spreads from each inbred Nachman line and assessed karyotype and meiotic chromosome pairing by fluorescent microscopy (S2 Fig) .These cytogenetic analyses are on-going, but all Nachman strains evaluated to date exhibit the standard 2n = 40 all-acrocentric karyotype (7 tested strains: EDMB, GAIC, SARA, SARB, MANB, MANF, and TUCB).

Genomic diversity in the Nachman Panel
To assess the extent of genomic diversity in this new strain resource, we sequenced the whole genomes of a representative male from each strain to moderate coverage using PacBio HiFi sequencing technology (~10x coverage per strain; S4 Table ).Average read lengths exceeded 10kb for all strains (range: 10.217-14.957kb), with >96.67% of sequenced bases exceeding a quality score of 30 (S4 Table ).Sequenced reads were mapped to the GRCm39 reference genome assembly [41] and subject to single nucleotide variant (SNV) calling using DeepVariant [42,43].Each strain harbors between 4.73-6.54 million fixed SNV differences relative to the C57BL/6J-based reference, with >2.75M SNVs distinguishing any pair of Nachman strains ( We performed joint SNV calling with the Nachman lines and 51 CI mouse strains previously sequenced by the Mouse Genomes Project (44).Of the 16,071,877 autosomal SNVs observed in the Nachman strains, 6,836,536 variants (42.5%) are absent from CI strains [44].We constructed a maximum likelihood phylogenetic tree from SNPs on chr19 to assess relationships between existing mouse strains and strains in this new wild-derived inbred strain resource.The CI strains present as a single clade nested within the diversity sampled in the Nachman panel (Fig 3C).Branch lengths are notably longer for the WDIS than for the CI strains, reflecting the greater diversity in the former.These findings are confirmed by a principal component (PC) analysis, which reveals a single cluster of points corresponding to the CI strains (Fig 3B), with the Nachman strains and other WDIS well-separated in PC coordinate space.However, we acknowledge that differences in the technologies used to sequence the Nachman strains (PacBio HiFi) and other strains (Illumina) could partially contribute to the isolation of the Nachman strains along these PC axes.Nachman strains derived from a common locale are more genetically similar than strains from divergent locations, although strains from Gainesville, Edmonton, and Saratoga Springs are minimally separated along PC1-PC5 (41.53% of the variance).PC dimensions 6-8 provide separation across these geographic sample locations (S4 Fig).
We next annotated variants according to likely functional impact.Variant counts for different functional categories are provided in S5 Table .Overall, the Nachman strain panel harbors 1,976 SNPs predicted to be highly deleterious, with 1,319 of these variants not observed in classical inbred laboratory strains.These highly deleterious variants are enriched in genes with biological roles in sensory perception and G protein-coupled receptor signaling (S5 Table ).The Nachman strains also harbor predicted loss-of-function alleles at genes implicated in human diseases.For example, the TUCA/NachJ and TUCB/NachJ lines harbor a premature stop variant in Eif2b1 that is a predicted target of nonsense-mediated decay (NMD; chr5:124716942).Eif2b1 encodes a subunit of the eukaryotic translation initiation factor eIF2B, which is essential for protein synthesis.Mutations in this gene have been linked to leukoencephalopathy with vanishing white matter and ovarian failure in humans [45,46].Similarly, a predicted NMD variant in Yeats2 (chr16:20028820, stop-gain mutation) is present in GAIC/NachJ, TUCA/NachJ, and TUCB/NachJ.Mutations in this gene are causally associated with myoclonic epilepsy in humans [47].Strains MANB/NachJ, MANF/NachJ, TUCA/NachJ, and TUCB/NachJ carry a predicted loss-of-function splice-donor variant in Ifnar1 (chr16:91302893).Humans with mutations in this gene have immunodeficiency-106 and exhibit hyperinflammatory responses to some vaccines [48].
Taken together, our analyses indicate that the Nachman strains harbor considerable genetic diversity that is not captured in existing inbred mouse strain panels.A subset of this variation is likely functional, establishing the prediction of broad phenotypic diversity among these strains and their phenotypic divergence from CI mouse strains.

Structural diversity in the Nachman Panel
Structural variants (SVs) are important contributors to phenotypic diversity, including disease risk and incidence.Prior work has suggested that house mouse genomes are burdened by higher rates of structural mutation than human genomes [49,50], leading us to posit the presence of abundant, potentially functional SVs within the Nachman strain genomes.
Using an ensemble approach to minimize false positive SV calls (see Methods), we identified 274,177 autosomal structural variants across the 11 sequenced Nachman strain genomes (139,931 deletions, 133,058 insertions, 710 duplications, and 478 inversions).Of the 139,931 deletions discovered in the Nachman strains, 80,222 were not previously detected in a diverse panel of inbred strains (57.3%; requiring 75% reciprocal overlap [49]).Similarly, 60.6% of insertions ascertained in the Nachman panel are not present in the current mouse SV catalog [49] (n = 80,568 unique insertions in the Nachman lines).Nachman strain genomes each contain 13.5 Mb-18.3Mb of sequence that is absent in the mm39 C57BL/6J-based reference genome (Fig 4A), with 23.5 Mb-30.6 Mb of sequence in the reference genome absent from any given Nachman strain (Fig 4B).The modest genome coverage of the Nachman lines may lead to an appreciable number of missed SVs (i.e., false negatives) in these genomes, implying that SVs potentially have a greater collective impact on these genomes than suggested by the numbers presented here.
Many SVs in the Nachman strain genomes are potentially functional.Approximately 46.2% of SVs overlap RefSeq genes (126,741 SVs overlapping 18,504 unique genes), with 89,757 SVs overlapping the annotated coding regions of 12,209 unique genes.An additional 933 SVs lead to the ablation of whole transcripts (S6 Table ).The majority of these transcript ablating SVs impact predicted genes or pseudogenes, although several olfactory receptors, vomeronasal receptors, and immunoglobulins harbor loss-of-function structural mutations.
Recent work has suggested that a majority of SVs in mouse genomes are due to transposable element (TE) activity [49].We theorized that many SVs in the Nachman strains are likewise mediated by TE-activity in the genome.Indeed, the size distribution of insertion and deletion calls in the Nachman strain call set is consistent with key contributions from TEs (S5 Fig) , with peaks corresponding to the average length of SINEs and LINEs.To more formally assess this possibility, we annotated all insertion and deletion SV calls using repeatMasker (see Methods).Overall, we identify 113.7 Mb of SV-associated sequence in the Nachman strains comprised of TE-derived sequence (57.01Mbdeletion, 56.70Mb insertion), corresponding to 78.1% of SV-impacted bases (145.76Mb).SINE B2 MM1A repeats are the most abundant TEs within SVs (15,169 polymorphic MM1A elements in the Nachman panel; S7 Table ), consistent with prior reports of active SINE B2 transposition in house mouse genomes [49,51].RLTR10, IAPEz, and L1MdGf elements are also highly abundant in SVs identified in the Nachman lines (S7 Table ), consistent with the young age of these elements and their active mobilization in mouse genomes [49,51,52].Overall, our results suggest that TEs have substantially contributed to the landscape of SVs in the inbred Nachman strains and, by extension, the wild mouse populations from which these strains derive.

Strain subspecies ancestry
An earlier investigation of wild-caught mice from Tucson, Arizona uncovered evidence of introgression from M. m. castaneus [30].We used a sample of 134 wild-caught mouse genomes from M. m castaneus, M. m. domesticus, and the outgroup M. spretus to evaluate genomic evidence of possible M. m. castaneus introgression in the TUCA/NachJ and TUCB/NachJ lines (S8 Table).Patterson's D is significantly non-zero for both strains (D TUCA = 0.213 and D TUCB = 0.192; P < 2.3 x 10 −16 for both strains), with estimated M. m. castaneus admixture proportions (f 4 ratio) of 13.4% and 11.3%, respectively.D is also significantly greater than zero for SARB/NachJ and SARC/NachJ (P < 0.05; S9 Table ).However, the estimated admixture proportion from M. m. castaneus is <1% in these strains and may be attributable to the incomplete sampling of ancestral wild mouse diversity.We conclude that the genomes of both Tucson lines, but not strains from other locations, harbor significant M. m. castaneus ancestry.We also investigated possible introgression from M. m. musculus and found that all Nachman strains have <1% admixture from M. m. musculus, indicating no significant or recent introgression from this subspecies (S9 Table ).
We next estimated admixture statistics in windows of 5000 informative SNPs (2500 slide) across the genomes of TUCA/NachJ and TUCB/NachJ, focusing on the 5% of windows with the highest f dM values to identify regions of likely M. m. castaneus introgression.We utilize f dM , rather than the related D statistic, as the variance in D can be quite large when applied to small windows [53,54].Introgressed regions are overwhelmingly unique to either TUCA/ NachJ or TUCB/NachJ (S10 and S11 Tables and S6 Fig), consistent with the high genomic divergence between these lines (Fig 3).For strain TUCA/NachJ, genes in introgressed regions are enriched for biological processes related to regulation of lipopolysaccharide-mediated signaling, zymogen activation, and sensory perception of taste and smell (S12 Table ).For strain TUCB/NachJ, genes in regions of M. m. castaneus introgression are enriched for biological processes related to testosterone biosynthesis, keratinization, and multiple aspects of immunity (S13 Table ).Overall, introgressed regions are short, implying that events are not recent (S6 Fig).

Relationship to wild mouse diversity
To contextualize the variation present in the inbred Nachman strains with that observed in wild M. m. domesticus, we created a joint variant callset featuring the 11 inbred Nachman strains and 97 publicly available wild M. m. domesticus samples from multiple populations (Iran, France, Germany, and the Eastern United States).PC analysis on this Nachman-wild mouse call set reveals genetic clustering of the Nachman lines with samples from Europe and the US, with samples from Iran isolated along PC1 (13.9% of the variance; Fig 5A).These trends are recapitulated by a maximum likelihood phylogenetic tree of these samples, which places the Iranian samples as ancestral to the Nachman strains and wild-caught mice from Europe and the United States (Fig 5C).Our findings are consistent with previous population genetic analyses of wild mice, which suggest that M. m. domesticus from the Indo-Iranian valley harbor elevated genetic diversity [11,55] and point to the European origin of North American house mice [56].
We excluded the Iranian samples and repeated the PCA to evaluate how the genetic diversity captured in the Nachman samples compares to the genetic variation in contemporary M. m. domesticus from Europe and the Eastern US (Fig 5B).PC1 (12.7% of the variance) stratifies mice from wild-caught M. m. domesticus populations, with Nachman lines falling at intermediate positions along this PC.PC2 (10.2% of the variance) isolates the two Nachman strains from Tucson, likely reflecting the presence of M. m. castaneus admixture in these lines.While the Nachman lines introduce significant new variation into laboratory strain collections, the 11 inbred strains in this panel sample only a subset of the genetic variation present in wild M. m. domesticus populations.

Nachman wild-derived inbred strains capture extensive phenotypic diversity
We subjected mice from the majority of JAX imported Nachman lines to a 19-week phenotyping pipeline to profile strain variation in multiple metabolic, neurobehavioral, physiological, morphological and biochemical traits (Fig 6).Males and females from 9 strains were phenotyped across 16 cohorts of age-matched animals (+/-6 days).Cohorts were comprised of mice from multiple strains, and the majority of phenotyping cohorts included C57BL/6J control mice to permit post-hoc detection of potential batch effects.The strain composition of each cohort is provided in S14 Table .Overall, we collected 1119 phenotype measures from each animal, although many trait values are highly correlated and therefore not independent (S15 Table ).
More than 90% of the 1119 surveyed phenotypes differ among Nachman strains, with 86.7% and 80.4% of phenotypes differing significantly in comparisons involving only females or males, respectively (Kruskal-Wallis test, P<0.05; S16 Table ).We obtain qualitatively identical results using both one-way ANOVA (S17 Table ) and a linear modelling approach with strain treated as a random factor (73.8% of models including strain as a random effect variable provide significantly better model fit than a reduced model excluding strain; see Methods; S18 and S19 Tables).Approximately 25% (n = 288) of phenotypes exhibit significant differences between males and females and a significant strain-by-sex effect is observed for 21.5% (n = 235) of surveyed measures (two-way fixed effects ANOVA, P < 0.05; S20 Table).Similar trends are again observed using a linear mixed effects model comparison strategy: 26.6% and 8.5% of models including sex and interaction terms, respectively, provide improved fit compared to simpler models that exclude these effects (S18 and S19 Tables).Thus, phenotypic variation is ubiquitous across the Nachman strain panel, and many traits vary between sexes in a strain-dependent manner.
The wild-caught mice used to establish the Nachman lines were subject to unique, locationdependent selective pressures in their native environments.Consistent with this legacy of environmental adaptation, we observe systematic phenotype differences across strains as a function of geographic origin (S21 and S22 Tables).Location is a significant factor accounting for variation in 85.16% of phenotypes (Kruskal-Wallis test, P<0.05; n = 930/1092; S21 Table ).We obtain similar results when fitting linear mixed effect models: 61.2% of models that include geographic sample origin as a random effect provide a better fit compared to models excluding this term (S23 Table ).For the majority of surveyed traits, strains derived from a common location are more phenotypically similar to each other than strains from distinct locations, echoing trends in sequence-level similarity among strains (Fig 3).
To visualize the extent of phenotypic diversity across the Nachman panel, we performed a PC analysis of all surveyed phenotypes.The spatial distribution of strains along the first two PCs (26.4% of variance) affirms model-based findings above, revealing loose clusters based on strain geography and differences between the sexes (S7 Fig) .While strains are more tightly clustered in the genotypic PCA than the phenotypic PCA, there is significant similarity between the two PC matrices (Similarity of Matrices Index = 0.709; P < 0.001; S7 Fig) .This correspondence with the genotypic PC matrix would seem to suggest that the surveyed phenotypes are subject to polygenic control in the Nachman strains.
Below, we highlight results from each phenotype testing paradigm, present estimates of broad sense heritability, and compare the phenotypic variance among the Nachman samples to that present across classical inbred mouse strain panels.We then present results from correlation analyses that cut across multiple phenotype domains.Comprehensive phenotype data are available in S24  Body composition analysis by NMR.We assessed multiple metrics of body composition (body weight, fat mass, lean mass, water mass) via NMR at 5 timepoints between 4 and 19 weeks of age.As expected, all mice gained weight during this period, although the percentage increase in body mass differed among strains across this 15-week interval (range: 28.1% SARC males to 95.92% in GAIC/NachJ males).Males trended toward larger body mass than females (average sex dimorphism across strains at 19 weeks: 4.45 g; Fig 7), and all Nachman lines are smaller than C57BL/6J control mice.The broad sense heritability of body weight at 19 weeks is ~0.8 in both males and females (S17 Table ), indicating that significant proportion of strain variation in body mass is attributable to underlying genetic differences between strains.
We similarly document significant strain and sex differences in body fat mass, lean mass, and water mass (S16, S17, and S20 Tables and Fig 7).Body composition phenotypes are highly correlated over time within strains (average Spearman's rho = 0.669; S26 Table ), implying that increased body mass is driven by coordinated changes in fat, lean, and water mass rather than isolated changes in one compositional component.Body composition measures Frailty assessment of overall body condition.Overall health and body condition were assessed using a 29-dimension frailty index score [57].All strains have low mean frailty index scores (values ranging from 1.75-3.0out of a possible score range of 0-28; S25 Table ), although the qualitatively small strain differences observed nearly exceed chance expectations (Kruskal-Wallis P = 0.055; S16 Table ).Body temperature differed significantly across strains, with GAIC/NachJ and the three strains from Saratoga Springs, NY exhibiting the highest values (Kruskal-Wallis; P = 0.00077; S16 and S25 Tables).
Light-dark test.The light-dark test is premised on the natural aversion of rodents to being in brightly lit spaces; the proportion of time mice spend in the brightly lit versus dark zones of the testing chamber yields quantifiable metrics of anxiety-like behaviors.Strain TUCB/NachJ shows the lowest overall ambulatory time (24.5 ± 3.54 s; range of other strain means: 30.2-35.8 s), fewest crossovers from one zone to the other (30.9 ± 12.2; range for other strain means: 31.8-61.7),and the greatest proportion of time in the dark chamber zone (67.1%; range for other strain means: 48.0-59.5%;S25 Table ), consistent with higher overall levels of anxiety-like behavior in this strain.MANE/NachJ spent the greatest proportion of time in the light zone (52%), whereas GAIA/NachJ traveled the greatest total distance in the light zone (819 ± 159 cm; range of other strain means: 436-745 cm).Overall, females are more active in the dark than males, although this trend appears to be largely driven by an especially pronounced sex dimorphism and high levels of activity in SARA/NachJ females (S9 Fig) .Importantly, the time spent at rest and number of bouts of movement in the dark are moderately heritable (H 2 > 0.2; S17 Table ), revealing a genetic component to observed strain variation.
Open field assay.Many measures of locomotion and exploration exhibit high broad sense heritability in the Nachman lines (S17 Table ).Strains vary more than 3-fold in total distance traveled within the open field arena over the 60-minute test period, ranging from a low of 5158 ± 803 cm in GAIC/NachJ to a high of 17141 ± 4135 cm in SARC/NachJ (F 8 = 24.14;P = 6.05x10 -24 ).We observe significant sex effects on several open field metrics, including the number of independent movement episodes and vertical activity time (S20 Table ).Females exhibited more bouts of movement (females: 661 ± 86.6 episodes; males 619 ± 112 movement episodes; F 1 = 7.19; P = 0.0082), whereas males spent more time moving in the vertical plane (males: 343 ± 209 s; females: 290 ± 161 s; F 1 = 4.12; P = 0.044).We report a significant interaction between sex and strain for the total distance traveled in the center of the arena (F 8 = 2.41; P = 0.018; S20 Table ), with SARA/NachJ, SARB/NachJ, and SARC/NachJ females traversing significantly greater distances than their male counterparts (Wilcoxon Rank Sum Exact Test; P < 0.05).Taken together, these analyses reveal striking strain and sex differences in locomotion and exploration, and reinforce population-level differences in activity levels in the wild [32].
Spontaneous alternation in the Y-maze.The spontaneous alternation assay capitalizes on the natural tendency of mice to explore novel environments and serves as a test of spatial working memory.Nachman strains vary in the total number of arm entries and number of spontaneous alternations in the Y-maze, with both phenotypes also exhibiting significant sex and strain × sex effects (two-way ANOVA, P < 0.05; S20 Table ).The percent alternation for most strains exceeds that for C57BL/6J, suggesting that the Nachman lines have stronger working memories and/or a more intense innate desire to explore novel spaces than C57BL/6J (S25 Table ).Overall, strains derived from Saratoga Springs, NY exhibit increased percent alternation compared to mice from Manaus, Brazil (58.33 vs. 49.01;Wilcoxon rank sum test, P = 1.25 x 10 −6 ; S21 Table), a finding consistent with the discovery of increased exploratory behavior in mice from the former location compared to the latter in the open field assay.
Home cage wheel running.At week 13, mice were subjected to a 3-day home cage wheel running assay to assess overall activity and circadian rhythms.Strains vary 6.8-fold in total distance run over the 60h trial, from a low of 9,112 m in MANE/NachJ to 62,019 m in SARC/ NachJ (S25 Table ).Mice are nocturnal and, as expected, activity was highest during nighttime hours for all strains (Wilcoxon Rank Sum Test, P < 0.0005; Fig 8A ).
The Nachman strains derive from wild-caught mice exposed to distinct photoperiods in the wild.We sought to determine whether strains differ in the timing of their peak wheel running as a function of geographic origin.To this end, we computed the change in the number of wheel revolutions per hour across the full trial, defining the transition between day and night as the point exhibiting the most extreme difference in activity level.All strains commence nighttime levels of activity at ~18:00h, but the onset of daytime rest periods is considerably more variable, with strain differences in the duration of nighttime exercise and daytime activity.For example, mice from Manaus, Brazil exhibit a shorter duration and reduced levels of nighttime activity compared to strains from Saratoga Springs, New York, with mice from the latter strain remaining active during much of the day (Fig 8B).
Indirect calorimetry.Mice were subject to continuous, high-definition respiratory monitoring in Promethion Core cages for 5 days, allowing estimation of energy expenditure, activity levels, and food consumption.We observe significant strain effects on all surveyed phenotypes (P < 0.01), with approximately half of the metabolic traits captured in this assay also exhibiting significant differences between the sexes (S16, S17, and S20 Tables).Both sexes of all strains were most active, exhibited highest energy expenditure, and consumed the most food during the dark cycle (Fig 9A -9C; Paired Wilcoxon Signed Rank Test, all P < 10 −10 ).Most metabolic traits are highly heritable (S17 Table ), nominating the Nachman lines as excellent models for genetic studies of metabolism.Intriguingly, despite their close genetic affinity, the two strains from Gainesville, Florida exhibited the lowest (GAIC/NachJ; 7.53 kcal/24h) and highest (GAIA/NachJ; 9.98 kcal/24h) total energy expenditure, with C57BL/6J controls presenting an intermediate phenotype (9.exhibit significant variation in all surveyed EKG metrics, including heart rate, peak amplitudes, wave durations, and peak intervals (S16 Table ; Kruskal-Wallis P < 0.005).Nachman lines have lower heart rates, smaller PR intervals, and reduced Q amplitudes than C57BL/6J control animals (S25 Table ), suggesting clinically relevant differences in cardiac function between laboratory and wild mice.
Gross morphology and organ weights.Nachman strains exhibit significant, heritable variation in body length and tail length (Kruskal-Wallis P < 10 −8 ; H 2 body ¼ 0:71; H 2 tail ¼ 0:31; S16 and S17 Tables and S10 Fig) .Body length exhibits a significant sex effect, with males having longer bodies than females (S20 Table ).The magnitude of the sex dimorphism for this morphology trait varies by strain, with SARA/NachJ and SARB/NachJ males exhibiting much longer body lengths than their female conspecifics and only a modest length dimorphism between MANB/NachJ males and females (S10 Fig) .Similarly, we observe significant heritable variation in organ weights among strains and between the sexes (S17 Table ).Brain size exhibits exceptionally high heritability (H 2 = 0.73) and variability among strains, even after standardizing by total body weight (Kruskal-Wallis P = 2.94x10 -9 ), indicating that observed brain size differences among strains are not simply proportional to overall body size.
Clinical chemistry.We assessed levels of numerous blood-based clinical markers after subjecting mice to a 4 hr fast at 19 weeks (Fig 11).Nachman strains vary in immune cell populations, blood lipid profiles, measures of liver function, and both platelet and red blood cell composition (S16 and S17 Tables).For example, total cholesterol levels range 2.7-fold among strains, with TUCB/NachJ and GAIC/NachJ mice defining the extremes (71.2 mg/dL-195 mg/dL; S25 Table).Males from all strains have higher total cholesterol values than their female conspecifics (average dimorphism = 34.3mg/dL), a trend echoed in levels of HDL cholesterol (Fig 11A and 11B).Nachman lines show exceptionally high variability in platelet counts  ), observations that may underlie recent reports of strain differences in pathogen response [58].

Integrated analysis of Nachman and classical inbred strain phenotypes expands the variance observed in classical inbred strains
We intersected phenotype data from the Nachman lines with existing strain survey datasets deposited on the Mouse Phenome Database (see Methods; [59]).The Nachman animals expand the range of phenotype values observed across inbred strain panels for several phenotypes, including total platelet counts, percent fat mass, and hematocrit levels (S11 Fig).The inclusion of C57BL/6J mice as controls in our phenotyping cohorts provides a common touchpoint between our data and previously published datasets.While we find that C57BL/6J trait values are generally stable across our phenotyped cohorts (S28 Table), C57BL/ 6J trait values differ significantly between our data and legacy datasets (S29 and S30 Tables).The inconsistency in trait values could owe to differences in animal age at the time of phenotyping, differences in housing conditions, animal handling by different technicians and research staff, differences in phenotyping protocols or equipment, or other vagaries of the experimental environment.Regardless of the source of this phenotypic variance, it poses major obstacle to phenotype data integration with the Nachman strains and underscores the need for caution in the interpretation of absolute differences in trait values across independently collected datasets.

Strain-level trait correlations across phenotype assays
Significant strain-level phenotype correlations may reveal traits regulated through shared genetic pathways.As expected, measures of body composition, body size, and organ weights are positively correlated, suggesting a general pattern of isometric growth (S8 Fig) .Additionally, measures of total activity in indirect calorimetry and open field assays are positively correlated (S15 Table ).Total serum bilirubin levels are negatively correlated with several phenotypes assessed from the spontaneous alternation assay, including total number of arm entries (rho = -0.97;P = 0.00017) and the number of spontaneous alternations (rho = -0.83, P = 0.0083), but suggestively positively correlated with the percent alternation (rho = 0.62; P = 0.086).This latter result supports published reports of a negative association between serum bilirubin concentration and cognitive impairment in schizophrenia in humans [60], tentatively extending this correlation to mice.Leptin levels are positively correlated with fat mass at each timepoint assessed by NMR (4, 8, 12, 16, and 19 weeks; rho > 0.66; P < 0.06) and gonadal fat pad weight (rho = 0.63; P = 0.076), and are negatively correlated with food consumption per unit body weight (rho = -0.85;P = 0.0061).These results reinforce the wellestablished link between leptin secretion and satiety sensing.The duration of the QRS interval on an unconscious EKG is negatively correlated with percent lean mass, but positively correlated with fat mass across strains (S15 Table ).These findings lend independent support to a recently published association between QRS duration and BMI in humans with no overt cardiac disease [61].A comprehensive summary of all pairwise trait correlations is provided in S15 Table .Sex-specific trait correlations are provided in S31 and S32 Tables.

Discussion
Here, we introduce a new inbred mouse strain resource: the Nachman wild-derived inbred strain panel.Strains in this panel derive from wild-caught mice from unique environments across North and South America.Integrated analysis of the genomes of these new strains with CI strains reveals millions of novel genetic variants segregating in this panel, including predicted deleterious alleles and gene-spanning structural variants.Paralleling this genetic diversity, Nachman strains capture considerable phenotypic variation across biochemical, neurobehavioral, physiological, morphological, and metabolic trait domains.
We integrated our phenotype data with publicly available phenotype datasets from prior laboratory mouse strain surveys to assess phenotypic variability across the Nachman panel within the context of that observed in CI strains.Importantly, the C57BL/6J controls included in our phenotyping cohorts provide a common data point to anchor our data to previously published phenotyping efforts in mouse.However, due to differences in animal ages, experimental methodology, and environmental housing conditions, the vast majority of phenotypes present inconsistent trait measurements across C57BL/6J controls, limiting the extent to which we can make reliable comparisons across independently surveyed strain panels.These challenges underscore well-known difficulties with data integration and emphasize the importance of standardized phenotyping protocols and detailed metadata reporting [59].While we acknowledge these caveats, our analyses nonetheless place high certainty on the conclusion that the Nachman strains extend the range of trait values realized using CI strains alone (S11 Fig).
The Nachman strain panel complements existing diverse mouse platforms, such as the BXD [62], CC [22], HMDP [63], and DO [23], offering a new community resource for profiling phenotypic variation and assessing responses to exposures, interventions, or treatments.While the 11-member panel is too small to power genetic mapping studies, experimental crosses between Nachman strains with extreme phenotypes can be employed to generate F2, backcross, or advanced intercross lines for experimental mapping.Similarly, crossing knockout alleles into the Nachman strains could enable the identification of naturally segregating modifiers of disease penetrance, severity, or onset.
At the same time, a key feature that distinguishes the Nachman panel from other mouse diversity resources is its exclusive profile of the natural genetic and phenotypic variability in M. musculus.During the creation of these WDIS, we aimed to minimize the impacts of artificial selection by randomly selecting breeders at each generation, ensuring that the resultant fixed strain genomes immortalize haplotypes that resemble those found in the wild.In contrast, CI strains are products of strong selection for morphological traits of interest, increased reproductive output, and behavioral phenotypes that promote ease of handling.Such strong artificial selection has almost certainly had broad genomic consequences on unrelated phenotypes through pleiotropy and linkage [64].This recognition implies that the multigenic architecture of complex trait variation in CI strains may not accurately model that found in natural populations, a consideration that may limit their translational relevance to humans.
Instead, we assert that the Nachman strain genomes mirror multiallelic patterns of diversity that closely approximate the polygenic architecture of traits in nature and, therefore, more accurately model the complex genetic basis of human disease-related phenotypes than CI strains.Beyond the absence of overt artificial selection during strain development, two additional lines of evidence support this assertion.First, our phenotyping efforts highlight the legacy of local adaptive pressures on the organization of phenotype diversity across the Nachman strains.Indeed, we find that geographic origin explains a significant proportion of the variance across strains for the majority of surveyed phenotypes in the Nachman strain panel.For example, we find that SAR mice are typically bigger, more active, and have higher energy expenditure than MAN mice, consistent with predictions of Bergmann's Rule [65] and earlier reports of adaptative phenotypic evolution to colder environments in wild mouse populations [32].These observations broadly parallel findings from humans, where local adaptive evolution has acted as a major driver of phenotypic divergence between populations [66].Thus, natural selection has played an important role in sculpting phenotypic diversity in both humans and the Nachman strains.Second, due to their origins from fancy mouse populations purposebred for traits of interest, CI strain genomes are mosaics derived from three principal house mouse subspecies [6].This subspecies composition falls in striking contrast to the structure of wild mouse genomes, which typically bear subspecies ancestry contributions from one, and more rarely two, subspecies [56,67].Our analyses of subspecies affiliation and introgression reveal that Nachman strains from four of the five sample locations (Manaus, Brazil; Saratoga Springs, New York; Gainesville, Florida; and Edmonton, Alberta, Canada) are of pure M. m. domesticus ancestry, but point to moderate levels of introgression from M. m. castaneus in mice from Tucson.This adds to prior observations of M. m. castaneus introgression into mouse populations from the West coast of North America [68] and earlier SNP-based genetic investigation of mice from the American southwest [69].Overall, Nachman strain genomes encapsulate subspecies diversity in a more naturalistic way than current CI strains, maximizing the biological relevance of the phenotypic diversity encoded by their genomes.
Our initial phenotypic and genetic characterization of the Nachman strains provides the basis for many new findings and establishes a powerful suite of resources to spur future research efforts.However, our work presents an initial survey that only scratches the surface of possible biological discovery in this strain panel.For one, all phenotyping was performed "at baseline" using animals fed a standard rodent diet, housed under standard conditions, and in the absence of any experimental treatments.Experimental designs that employ environmental perturbations, different exposure or treatment regimes, aged animals, or other deliberate manipulations may uncover novel phenotypic responses or resilience/susceptibility traits.Second, our phenotyping pipeline was purposefully designed to survey a broad range of biomedically relevant traits and is far from comprehensive.For example, surveyed phenotypes exclude sensory perception phenotypes, bone density, histopathology assessments, and microbiome composition.Deeper, more exhaustive phenotyping of specific trait domains could unlock strain-level differences that fail to manifest using coarser phenotyping assays.Third, due to COVID-related impacts, mice from several strains were poorly represented (GAIC/NachJ, TUCA/NachJ, TUCB/Nach) or altogether absent (EDMB/NachJ) from our phenotyped cohorts (S14 Table ).As a result, the extent of phenotypic variation across Nachman strains presented here is potentially underestimated-a possibility that future investigations could readily assess.Fourth, we sequenced each strain to modest coverage and relied on comparisons to the GRCm39 reference genome for variant discovery.We obtained only low-coverage of the X and Y chromosomes in our sequenced males (~5x), leading us to exclude these chromosomes from our genomic analyses.Ultimately, we endeavor to perform additional long-and ultra-long read sequencing, generate high-quality de novo sequence assemblies, and perform assembly-based variant calling to more comprehensively catalog variation in these strain genomes.
We also acknowledge practical caveats to the use of mice in this strain panel.Our colony breeding records demonstrate that the Nachman strains are robust breeders, but breeding performance falls short of what is typically observed for many CI strains that have been subject to intense artificial selection for productivity (Fig 2).Further, like many other WDIS, breeder dams from the Nachman lines do not possess externally visible mating plugs, potentially due to their rapid loss or re-absorption following mating.This may pose challenges for experiments that hinge on the ability to precisely time matings or obtain fetuses at specific gestational ages.Males from many strains also exhibit high levels of aggression toward cage mates and may require single housing or additional enrichment to minimize aggressive encounters.Anecdotally, we have found that supplying mice with wooden blocks for gnawing minimizes tail biting among cage mates.Finally, strains in the Nachman panel retain very high levels of wildness, imposing challenges to routine handling.Of special note, juvenile mice from the TUCB/NachJ strain are remarkable jumpers, easily capable of escaping cages topped with cage extenders (total height ~14").While working with wild mice may be daunting at first, it is our experience that competency and confidence build quickly.Specialized workstations can aid in containing mice during routine handling, and innovations in animal housing could eventually allow for hands-off cage changes (e.g., via attachment of open plastic tubing to closeable ports on dirty and clean cages, allowing mice to move into new cages on their own).Further, continuous home-cage monitoring coupled with machine-learning based quantification of phenotypes from video footage could obviate many traditional phenotyping paradigms that rely on animal handling and restraint, while also capturing animal behavior in a more ethologically relevant setting [70,71].
The Nachman panel is currently part of the NIH-funded Special Mouse Strain Resource at JAX (P40 OD011102), which provides the framework and setting for its long-term maintenance and external distribution.Several additional strains are maintained in Dr. Nachman's private colony and we endeavor to import these additional strains to grow current holdings at JAX. Recognizing that the scientific value of this strain resource is directly tied to the availability of genomic and phenotypic resources, we have made all raw (e.g., fastq files, raw phenotype measures) and derived data (e.g., vcf files, strain phenotype means) presented in this manuscript available in public repositories and as supplemental material (S1-S33 Tables and S1-S12 Figs).Our team is currently engaged in efforts to expand the collection of tools and resources presented here, including de novo genome reference assemblies, embryonic stem cell lines, gene expression datasets, and the initiation of an outbred population founded from a subset of 4 Nachman strains.Together, the inbred Nachman strain panel, its planned derived populations, and accompanying 'omics resources are poised to enable new discoveries in the basic, biomedical, and preclinical research spheres.

Ethics statement
All animal procedures were conducted in compliance with animal care protocols approved by The Jackson Laboratory's Animal Care and Use Committee (Animal Use Summary # 18070) and the University of California Berkeley Animal Care and Use Committee (AUP R361-0514 and AUP-2016-03-8548).

Inbred strain development
Wild house mice were caught in 2012-2013 from five geographic locations (Fig 1) using Sherman live traps baited with oats.To avoid catching closely related individuals, each mouse was trapped at least 500m from every other mouse.Mice were transported to UC Berkeley and held in quarantine during pathogen testing.Mice from each geographic region were randomly paired to create inbred lines which were propagated through brother-sister mating for at least 10 generations.Initially, 10 independent lines were established from each location.No attempts were made to rescue lines that exhibited infertility due to inbreeding depression, ensuring that surviving lines likely harbor low deleterious mutation loads.The founders of these strains were prepared as museum specimens and have been deposited in the collections of the UC Berkeley Museum of Vertebrate Zoology.

Mouse breeding, rederivation, and husbandry
Mice from 20 incipient inbred strains were imported from the Nachman Laboratory mouse colony at UC Berkeley to the Jackson Laboratory Importation Facility (S2 Table ).Strain colonies were expansion bred to obtain cohorts of >20 breeding-aged females for oocyte collection.To expedite breeding, strains were mated with some allowance for breeding between generations, a departure from strict sib-sib inbreeding.
Females were superovulated according to standard procedures [72] and harvested oocytes were in vitro fertilized with sperm from conspecific males [73].Embryos were cultured to the 2-cell or 8-cell stage before implantation into pseudopregnant QSi5/IanmTaftJ x C57BL/6J (JAX strains 027001 x 000664) F1 recipient dams of high health status.Live-born pups were transferred to the Dumont Laboratory colony where they were maintained at intermediate health status by strict sib-sib mating.A representative male and female from each successfully imported strain was genotyped on JAX's standard 54-SNP panel to assure expected strain identity and safeguard against contamination with known laboratory stocks.
Mice were housed under SPF conditions in sterile plastic caging and subject to biweekly cage changes.All animals were fed 6% sterilized rodent chow (Lab Diet 1 Formulation 5K0G) and provided with acidified water ad libitium.Cages were supplied with aspen bedding material and enriched with nestlets, crinkle paper nesting material, red igloos, and cedar blocks for gnawing.

Cytogenetic chromosome analysis
Spermatocyte cell spreads were prepared from testis tissue of males aged >8 weeks and immunostained as previously described [74].Antibodies used were a polyclonal antibody against mouse SYCP3 (1:100 dilution; Novus Biologicals, cat.# NB300-231) and human anti-centromere protein (1:100 dilution; Antibodies Incorporated, cat.# 15-235).Cells were then imaged on a Leica DM6B upright fluorescent microscope equipped with fluorescent filters, LED illumination, and a cooled monochrome Leica DFC7000 GT 2.8 megapixel digital camera.A minimum of 20 cells were captured per strain.Fluorescent intensity and background signal were manually adjusted, and the number of chromosomes counted using ImageJ (v 1.53k).

Whole genome sequencing
DNA extraction, library preparation, quality control, and sequencing were performed by the Genome Technologies Scientific Service at The Jackson Laboratory.An initial survey of shortread Illumina whole genome sequences from three strains (SARA/NachJ, SARB/NachJ, MANB/NachJ) pointed to high levels of structural variation relative to the mm39 reference genome (S12 Fig) .This discovery motivated our use of long-read PacBio HiFi technology for all subsequent sequencing.
High molecular weight DNA was isolated from spleen tissue of a single male from each of the 11 imported Nachman wild-derived inbred strains using the Wizard DNA Purification Kit (Promega) or the Monarch HMW DNA kit (NEB) according to manufacturer's instructions.DNA concentration and quality were assessed using the Nanodrop 2000 spectrophotometer (Thermo Scientific), the Qubit 3.0 dsDNA BR Assay (Thermo Scientific), and the Genomic DNA ScreenTape Analysis Assay (Agilent Technologies).DNA quality from all samples was assessed to be high (260/280 > 1.79 and 260/280 < 1.86, 260/230 > 1.99) and suitable for input for PacBio HiFi library construction.
PacBio HiFi libraries were constructed for each sample using the SMRTbell Express Template Prep Kit 2.0 (Pacific Biosciences) according to the manufacturer's protocols.Briefly, the protocol entails shearing DNA using a g-TUBE device (Covaris), ligating PacBio specific barcoded adapters, and size selection on the Blue Pippin (Sage Science).The quality and concentration of the library were assessed using the Femto Pulse Genomic DNA 165 kb Kit (Agilent Technologies) and Qubit dsDNA HS Assay (ThermoFisher), respectively, according to the manufacturers' instructions.The resultant library for each strain was sequenced on a single SMRT cell on the Sequel II platform (Pacific Biosciences) using a 30-hour movie time.

Read mapping and variant calling
Read quality and library coverage were assessed using fastp [75].Individual SMRT cells yielded between 22.04 and 38.04 Gb of unique sequence data, with an average read length of 12.9kb (S4 Table ).Reads were then mapped to the GRCm39 reference sequence using minimap2 invoking the "HIFI" preset.Per sample single nucleotide calling was performed using Deep-Variant (v1.2.0) under the "PACBIO" model [43].Per sample gVCF files were then merged using glnexus (v1.2.7) under the DeepVariantWGS configuration to produce a joint call set [42].Sites with missing data, genotype quality <30, and indels were subsequently filtered using bcftools (v 0.1.19;[76]).We further eliminated sites with heterozygous calls as these sites are potentially enriched for false positives given our modest sequencing coverage.We refer to this call set as the "Nachman only" call set.
Variants in the Nachman strain panel were compared to those previously discovered in laboratory inbred strains in order to assess the extent to which this new resource captures novel genetic diversity.We downloaded GRCm39-mapped bam files for 51 inbred strains from the European Nucleotide Archive (PRJEB47108).Per sample variant calling was performed using the "WGS" model in DeepVariant (v1.2.0).Joint variant calling of laboratory strain genomes and the 11 inbred Nachman strains was carried out using glnexus (1.2.7) in DeepVariantWGS mode.As above, indels and variants with genotype quality <30 were removed using bcftools (v.0.1.19).Sites with >10% missing data and heterozygous calls were also excluded.We refer to this callset as the "Sanger-Nachman" call set.
Identical methodology was adopted to generate a joint call set for the Nachman strains and 97 wild-caught M. m. domesticus, 22 M. m. musculus, 30 M. m. castaneus, and 7 M. spretus.Sample accession numbers for the wild mouse genome sequences used are provided in S8 Table .We refer to this call set as the "wild-Nachman" call set.Again, sites with >10% missing data, genotype quality <30, and indels were excluded using bcftools (v 0.1.19).
Basic statistics were computed on each call set using bcftools stats.Call sets were partitioned and intersected using bcftools view and and bcftools isec, respectively.Variant effects were predicted using the Variant Effect Predictor (Ensembl release 109.3)using the GRCm39 Mus musculus assembly [77].GO Enrichment Analyses were performed using the enrichment analysis tool on the GO Consortium Website (http://geneontology.org/), with the entire set of annotated genes in Mus musculus as background [78,79].Enrichment was determined by a Fisher's Exact test with False Discovery Rate of P < 0.05.

Genome sequence analysis
SNP variation in the Sanger-Nachman call set was summarized via principal component analysis.We first eliminated six strains of non-M.m. domesticus ancestry from the call set (CAST/ EiJ, CZECHII/EiJ, JF1/MsJ, MOLF/EiJ, PWK/PhJ, and SPRET/EiJ) and removed fixed variants using the view command in bcftools (v 0.1.19),invoking the flags -q 0.01 -s ^CAST_EiJ,CZE-CHII_EiJ,JF1_MsJ,MOLF_EiJ,PWK_PhJ,SPRET_EiJ.Variants were greedily thinned to include only those with r 2 > 0.2 using PLINK (v1.90b6.18;[80]): plink --vcf $VCF \ --double-id --allow-extra-chr \ --set-missing-var-ids @:# --vcf-half-call missing \ --indep-pairwise 50 50 0.2 --out $THINNED_VARIANTS Principle component analysis was performed on the pruned set of variants using the-pca command in plink: plink --vcf $VCF \ --double-id --allow-extra-chr \ --set-missing-var-ids @:# --vcf-half-call missing \ --extract $THINNED_VARIANTS --make-bed --pca --out $PCA_RESULTS A maximum likelihood phylogenetic tree was constructed from the LD-thinned SNPs on chr19 in the Sanger-Nachman call set using phyml (version 3.3; [81]).We focus on chr19 for the sake of computational efficiency.We first created a fasta format file from the thinned chr19 SNPs using a custom perl script.The output file was then converted to phylip format using the SeqIO.writefunction in the BioPython (v1.43) library.A maximum likelihood tree was constructed under a GTR model of nucleotide evolution, with nucleotide frequencies computed from the empirical data, and with the transition/transversion ratio, proportion of invariant sites, and gamma distribution of rate classes estimated via maximum likelihood.The executed command was: phyml --input $ALN \ --datatype nt \ --bootstrap -1 --model GTR \ -f e -t e \ --pinv e --alpha e \ --r_seed 12345 Identical methodology was used to perform PC analysis and phylogeny construction on the Nachman-wild call set.Prior to analysis, we first filtered the vcf file to include only wild M. m. domesticus samples, recognizing that PCA results would be dominated by subspecies-level diversity if samples from M. m. musculus and M. m. castaneus were retained.We also further subset the Nachman-wild call set to exclude M. m. domesticus samples from Iran.

Introgression analysis
Nachman strain genomes were scanned for potential signatures of introgression from M. m. castaneus and M. m. musculus using the allele sharing methods implemented in Dsuite (v.0.5) [82].Wild M. m. domesticus and M. m. castaneus (or M. m. musculus) genomes in the wild-Nachman call set were utilized as P1 and P3, respectively, with M. spretus genomes specified as the outgroup taxon (S8 Table ).Dsuite Dtrio was run separately on each chromosome and with each inbred Nachman strain individually profiled as P2.Chromosome-level statistics were integrated into genome-wide estimates using Dcombine.To pinpoint specific sites of introgression between M. m. castanus and each focal strain, a windowed analysis was performed using the Dinvestigate command in Dsuite with a window size of 5000 informative sites and 2500 site slide.Output statistics were plotted in RStudio (2022.02.0,Build 443).We focus on sites with an excess of derived alleles shared between each of our profiled Nachman lines and M. m. castaneus and extracted the 5% of windows with the highest f dM estimates consistent with potential introgression from M. m. castaneus.These outlier windows were then intersected with Ensembl genes (v. 109; Mus musculus GRCm39) using bedtools isec (v.2.28.0; [83]) and subjected to a GO Enrichment Analysis using the online enrichment analysis tool on the GO Consortium Website (http://geneontology.org/) [78,79].We specified the entire set of annotated genes in Mus musculus as background to identify specific biological processes enriched in putative regions of M. m. castaneus introgression.Enrichment was determined by a Fisher's Exact test with False Discovery Rate of P < 0.05.

Structural variant discovery and calling
We identified SVs in the 11 Nachman wild-derived inbred strain genomes using both pbsv (https://github.com/PacificBiosciences/pbsv)and sniffles2 (v.2.0.7;[84]).pbsv was first run on each sample in discover mode to identify read signatures consistent with possible SVs.SVs where then called and samples jointly genotyped by executing pbsv in call mode.Tandem repeats in the GRCm39 assembly were identified using the findTandemRepeats.pyscript (https://github.com/PacificBiosciences/pbsv/commit/bcec7d382f3ea40158ed9cca3c5fef9686a76641) and supplied when executing sniffles2 to improve the accuracy of calls in repetitive regions.Per sample SV calls generated by sniffles2 were merged and filtered to include only autosomal calls using bcftools merge (v.0.1.19).Calls with close or overlapping breakpoints across samples were collapsed using truvari (v4.0.0; [85]), with the following parameters specified: -pctsize 0.75 -pctovl 0.5 -pctseq 0.7 -s 20 -S 10000000 -k common-chain.We then intersected pbsv and sniffles2 SV calls using truvari bench to produce a higher confidence call set.We used the pbsv callset as the "truth" set and invoked the following command line parameters: -pctsize 0.75 -pctovl 0.5 -pctseq 0.7 -dupto-ins-passonly -sizemin 20.SVs were annotated using the Ensembl Variant Effect Predictor (release 109.3) and gene model annotations from the GRCm39 assembly.
Lastly, SV calls were intersected with a previously published SV call set for a diverse set of inbred mouse strains [49].Shared SVs (>75% reciprocal overlap) were identified using truvari bench, with the Nachman callset featured as the "truth" set and invoking the same command line parameters as above.

Animal phenotyping
We developed a phenotyping pipeline loosely modeled on that used by the KOMP2 Project (https://www.mousephenotype.org/impress/index),with modifications to account for the overall health and wildness of the Nachman strains (Fig 6).Phenotypes broadly profile disease-relevant neurobehavioral, physiological, metabolic, biochemical, and morphological trait variation.
Mice were organized into 16 phenotyping cohorts ranging in size from 6-16 animals (mean = 13.4;S14 Table ).With the exception of cohort 13, each cohort was composed of animals from multiple strains (N = 2-5 strains) and mice of both sexes.Cohort 13 included only animals from strain GAIA/NachJ.Ten of the 16 cohorts included C57BL/6J mice as controls (n = 2-5 mice per cohort), providing a means for ensuring stability of phenotype measurements across cohorts.A total of 215 mice (108 female, 107 male) were phenotyped from 9 of the 11 Nachman lines in two waves.The first wave included 5 cohorts that were phenotyped between August 2020 and November 2020.The second wave included the remaining 11 cohorts which were phenotyped between June 2021 and October 2021.Strain EDMB/NachJ was not imported in time for inclusion in this phenotyping effort.
All live animal phenotyping was performed by trained staff in the Center for Biometric Analysis at The Jackson Laboratory (JAX).Animals were ~4 weeks (± 6 days) at study intake and exited the phenotyping pipeline at 19 weeks (± 6 days).Terminal collections and tissue harvests were performed by the Necropsy Scientific Service at JAX, with samples subsequently transferred to the Clinical Chemistry and Histopathological Sciences Scientific Services at JAX.
Body composition assessment by nuclear magnetic resonance.Body composition analysis was carried out on mice at weeks 4, 8, 12, 16, and 19.Conscious, unrestrained mice were individually placed in an acrylic tube that was inserted into a EchoMRI-3-in-1 Body Composition Analyzer (EchoMRI LLC) to yield non-invasive estimates of mass and percentage estimates of fat, lean, and water mass.
Frailty assessment (9 weeks).Mice were gently restrained for visual assessment of 29 metrics of physical condition using a modified version of the protocol described in [57].Observational assessment of most parameters was qualitatively categorized as 0 if normal, 1 if highly abnormal, and 0.5 if intermediate.Exceptions include body temperature and body weight, which were measured in standard, qualitative units (Celsius and grams, respectively).A general frailty index score was computed by summing the scores for individual parameters (excluding body weight and body temperature).
Light-dark test (10 weeks).The light-dark test is a classic behavioral paradigm for quantifying anxiety-related behaviors in rodents [86].Following ~60 minute acclimation to the testing room, mice were individually placed in a square, plexiglass arena (40 x 40 x 40 cm) divided into two compartments separated by a doorway.One side of the chamber was illuminated by an external light to ~400-450 Lux, while the other side remained darkened.Mouse movements were tracked over a span of 10 minutes via an infrared photobeam 3-dimensional grid system invisible to the animals.Beam breaks were computationally decoded to provide information about latency to enter the lighted side of the chamber, total number of transitions between the light and dark sides, total distance traveled, and time spent in the light versus dark sides.
Open field test (11 weeks).Mice were first acclimated to the testing room for ~60 minutes.Animals were individually placed into the center of a square plexiglass arena (40 x 40 x 40 cm) overlaid by a sensitive infrared photobeam three-dimensional grid.Beam breaks were computationally decoded into quantitative measurements of distance traveled, rearing, time spent within certain zones of the arena, and repetitive behaviors over the span of 60 minutes.
Spontaneous alternation with Y-maze (12 weeks).Following a 60-minute room acclimation, a single mouse was placed into the center of an opaque Y-shaped testing arena composed of three equally sized arms each measuring 25-35 cm long, 5-6 cm wide, and with walls extending 12-18cm high.Mouse movements were recorded by a camera interfaced with tracking software (Noldus Ethnovision) to quantify distance traveled, arm entries, time spent in each arm, and the number of correct alternations (i.e., a visit to each of the three arms before returning to any arms).
Home cage wheel running (13 weeks).Singly housed mice were provided with low profile running wheels (Med Associates) equipped with wireless transmitters for ~3 days.Transmitters recorded the number of wheel revolutions and time on one-minute intervals.Data were forced onto a common timeline delimited by 18:00:00 on Day 1 to 6:00:00 on Day 3, converted to distances based on running wheel diameter (15.5cm), and aggregated over 5 minute, 30 minute, 1h, 6h, and 12 h intervals for analysis.
Indirect calorimetry (16 weeks).Mice were acclimated to single-housing for 5-7 days prior to transfer to Promethion Core cages for continuous high-definition respiratory monitoring over a 5-day span.Rates of oxygen consumption and carbon dioxide production were assessed using a built-in respirometry system, allowing calculation of overall energy expenditure using manufacturer software.Food and water intake were continuously monitored via high precision weight sensors mounted to the cage lid.Cages were overlaid with a grid of high sensitivity infrared beams to track animal activity and locomotion in continuous time.
Glucose tolerance testing.Animals were fasted for 4 hours, weighed, and restrained to make an angled incision at the tail for repeat blood collection.An initial blood drop was placed on a glucose test strip and a baseline blood glucose recorded using a handheld glucometer.Animals were then administered sterile glucose solution at a dose of 2g/kg of body weight via intra-peritoneal injection.Blood glucose levels were monitored at 15, 30, 60, 90, and 120 minutes post injection using single drops of blood obtained from the tail incision.
Unconscious electrocardiogram and gross morphology.Mice were anesthetized by exposure to isoflurane gas in an enclosed chamber.Once unconscious, each animal was removed from the chamber and placed ventral side up on an ECG platform, with continued gas delivery administered through a nose cone.Paralube ophthalmic ointment was placed on the eyes to prevent drying during testing and recovery.Single-use pre-gelled self-adhesive silver/silver chloride ring recording electrodes (MVAP Medical Supplies, EMG electrodes) were adhered to the palmer surface of both front feet and the plantar surface of the left hind food.Leads were connected to each electrode and attached to a FE132 BIoAmp (AD Instruments) electrical signal amplifier which was fed into a PowerLab 4/35 digital-to-analog converter system and 4-channel recorder (AD Instruments) connected to a laptop computer with LabChart software.Core temperature was monitored and regulated via rectal probe physiology monitoring system and automatic heat pad.The ECG tracing was recorded until 30 seconds of waveform signal was obtained.
While animals remained unconscious, overall length, body length, and tail length were measured using handheld calipers.
Terminal collections.Approximately ~400ul whole blood was collected from the submental vein of 4-hour fasted mice.Animals were then euthanized via CO 2 asphyxiation in accordance with recommendations from the American Veterinary Medical Association.Blood samples were transferred to the Clinical Chemistry Scientific Service at The Jackson Laboratory for measurement of the following blood-based clinical traits: albumin, alkaline phosphatase, alanine transaminase, aspartate transaminase, blood urea nitrogen, enzymatic creatinine, glucose, total cholesterol, HDLD cholesterol, triglycerides, insulin, leptin, iron, total bilirubin, total protein, and complete blood count with differential.The following organs were carefully dissected from each animal by skilled staff in the Jackson Laboratory Necropsy Scientific Service: skeletal muscle, gonadal fat pad, brown adipose from between the shoulder blades, tail, skin (with hair), femur, spleen, kidneys, liver, brain, gonads, heart, lungs, eye.Tissues were weighed, frozen, and fixed in 10% NBF and paraffin embedded according to the protocol presented in S33 Table.

Phenotype data integration, statistical analysis, and broad sense heritability estimation
All phenotype data were wrangled into a single file in RStudio (v.2022.02.0 Build 443) using the dplyr (v.1.1.1),tidyverse (v.2.0.0), data.table(v.1.14.8),readxl (v.1.4.2), and lubridate (v.1.9.2) packages (S24 Table ).Categorical traits collected as part of the frailty analysis (n = 27) showed little variation among strains (S25 Table) and were excluded from statistical analyses.All remaining phenotypes were continuous.We tested for overall strain effects on each phenotype using non-parametric Kruskall Wallis (S16 Table) and one-way ANOVA tests (phenotype value ~strain; S17 Table ).Additionally, we tested for an effect of geographic origin on variation in each phenotype using Kruskall-Wallis (S21 Table ) and one-way ANOVA tests (phenotype value ~sample locale; S22 Table).C57BL/6J controls were excluded from these analyses to allow specific focus on variation across the Nachman strains.Although the ANOVA assumption of normality does not hold for many analyzed traits, there is strong agreement of P-values estimated by these two statistical methods (Spearman's Rho = 0.929, P < 2.2 x 10 −16 ).Given that our objective is to flag phenotypes that likely vary across the Nachman panel (rather than make robust claims about specific phenotypes), reported P-values are not corrected for multiple testing.
PC analysis was performed on the set of traits from each phenotype assay using the pca command within the pcaMethods R package [87].We distilled trait values into per strain, per sex means and normalized the resulting values to have unit variance centered on zero.We then identified the number of PCs required to explain 90% of the variance associated with each phenotype assay and combined these into a single matrix.PC analysis was performed on this combined matrix to summarize multidimensional trait variation across strains and sexes (S7 Fig) .To compare the phenotypic PC matrix with the genotypic PC matrix, phenotypic PC values were averaged across both sexes within a strain.Matrix similarity was assessed by the similarity of matrices index using the SMI function call within the MatrixCorrelation R package [88].Significance of the resulting SMI value was determined by 1000 permutations.
Broad sense heritability was estimated from one-way ANOVA output by the intraclass correlation method [89]: where MSB and MSW are the mean squares between and within strains, respectively.Heritability estimates were computed from one-way ANOVA tests run on each sex separately, as well as both sexes combined (S17 Table ).Two-way ANOVA was also performed to test the impact of strain, sex, and their interaction on the variability observed in each trait (phenotype value strain * sex; S20 Table).Spearman's rank correlate was used to quantify trait correlations at the strain, strain × sex, and individual levels.
In parallel to this approach, we also fit a series of nested linear mixed effect models to assess the impact of strain, sex, and strain × sex interaction on each phenotype.Specifically, for each trait t, we fit the following series of models using the lmer command within the lme4 (version 1.1-32) R package: Sex was treated as a fixed effect, with strain, cohort, and the interaction between strain and sex treated as random effects.Nested models were compared by a log likelihood test using the anova function call in R. Specifically, models 1 and 2 were compared to assess whether inclusion of strain as a random effect significantly improved model fit.Models 1 and 3 were compared to assess whether inclusion of sex significantly improved model fit, and models 1 and 4 were compared to determine whether the interaction between strain and sex meaningfully improved model fit.Owing to the modest amount of data and the complexity of these models, many models were singular, necessitating caution in the interpretation of results from this model fitting approach.Of the 1092 traits analyzed under this mixed model framework, only 358 provided non-singular model fits (S18 Table ).
Many phenotypes are highly correlated with body weight (S15 Table ), motivating us to also consider body weight-adjusted trait values in our models.We fit the residuals from a simple linear regression of each trait value on body weight at 18 weeks in each of the models above.This had a similar impact on the number of singular model fits (1083 analyzed traits, with 462 with non-singular fits across models 1-4; S19 Table ).We excluded body weight traits at all assessed timepoints, accounting for the reduced number of phenotypes analyzed under this framework.
To explore the role of geographic sample location on observed variation across strains, we fit linear mixed effect models analogous to those in Eqs ( 1) and ( 2), exchanging the random factor "Strain" for "Location".
Model comparisons were performed via likelihood ratio test to determine whether inclusion of location as a random effect significantly improved model fit.Phenotype values were first regressed against body weight at 18 weeks to eliminate the correlation with this trait.As above, the majority of phenotypes provided singular fits (S23 Table ).

Comparisons with legacy phenotype data
We accessed the CGDpheno1 and CGDpheno3 datasets from the Mouse Phenome Database [4].These strain survey datasets were prioritized because: (1) they share multiple phenotypes in common with those measured in the Nachman strains, (2) they were generated within the CBA at JAX, providing methodological consistency in data collection and similarity in housing conditions, and (3) animals were not subject to any experimental treatments.Public datasets were combined with the Nachman data using base R commands and dplyr (v.1.1.1),with attention paid to consistency of measurement units.Where needed, values were converted onto common unit scales using the appropriate conversion factors or mathematical manipulations.

Fig 1 .
Fig 1.Map of sample locations and summary of inbred strains imported to the JAX Repository.Inbred strains were developed from wild-caught mice trapped at 5 sample locations across North and South America: Manaus, Brazil (MANB/NachJ, MANE/NachJ, MANF/NachJ); Saratoga Springs, New York, USA (SARA/NachJ, SARB/NachJ, SARC/NachJ); Gainesville, Florida, USA (GAIC/NachJ); Tucson, Arizona, USA (TUCA/NachJ, TUCB/ NachJ); and Edmonton, Alberta, Canada (EDMB/NachJ).Strain GAIA/NachJ was successfully imported but has been since discontinued from the JAX Repository due to poor breeding and is not listed.Inbreeding generation numbers are presented in the following format: number of inbreeding generations in Nachman Colony prior to importation at JAX + number of inbreeding generations in JAX's Importation Facility + number of inbreeding generations in the JAX Repository.Inbreeding generation numbers are imprecise owing to the use of inter-generational crosses in JAX's importation facility to expedite colony expansion for oocyte harvests.This ambiguity is denoted by a "?" in the inbreeding generation supplied in the strain table.The map base layer was made with Natural Earth, which maintains all raster and vector map data in the public domain (https://www.naturalearthdata.com/about/terms-of-use/). Specifically, the base layer was produced using the rnaturalearth package for R by invoking the command ne_countries(continent = c("North America", "South America"), scale = "medium", returnclass = "sf").
Fig 3A; corresponding to a minimum of ~1 SNP every ~1 kb).Although sequenced mice have undergone a modest number of inbreeding generations, within strain heterozygosity is low (π < 0.0006 versus ~0.0017 in wild-caught M. m. domesticus) and aligns with theoretical expectations for the number of inbreeding generations (S3 Fig).

Fig 2 .
Fig 2. Distribution of weaned litter sizes across 10 strains in the Nachman WDIS panel.Sizes of litters born at JAX and in the Nachman Lab at UC Berkeley are presented as standard box plots, with box width defining the inter-quartile range and the thick line bisecting each box denoting the median.Litter sizes are jittered for ease of visualization.https://doi.org/10.1371/journal.pgen.1011228.g002

Fig 3 .
Fig 3. Genetic diversity in the Nachman strains.(A) Heatmap displaying the number of pairwise SNP differences between Nachman lines.Counts derive from the autosomal genome fraction only and exclude unplaced contigs.(B) PC analysis of autosomal genetic diversity partitions the Nachman lines by sample location and isolates Nachman strains from the CI mouse strains.(C) Maximum likelihood phylogenetic tree constructed from SNPs on chr19.The CI strains form a single clade nested within the diversity sampled by the Nachman strains.https://doi.org/10.1371/journal.pgen.1011228.g003

Fig 5 .
Fig 5. Genetic diversity in Nachman strains and wild-caught M. m. domesticus.(A) Principal component analysis performed on autosomal variants segregating in wild-caught M. m. domesticus mice from multiple populations and inbred Nachman strains.PC1 separates wild-caught mice from Iran and all other mice.PC2 stratifies mice from within Iran.(B) Excluding mice from Iran provides increased granularity to detect differences across other M. m. domesticus populations.PC2 isolates the Nachman strains from Tucson, Arizona from other strains and populations.(C) Maximum likelihood tree constructed from biallelic chr19 SNPs.Strains are color-coded according to the legends in A and B. https://doi.org/10.1371/journal.pgen.1011228.g005

Fig 6 .
Fig 6.Schematic of the phenotyping pipeline.Mice were transferred to the Center for Biometric Analysis (CBA) at JAX at 4 weeks of age.Nuclear magnetic resonance (NMR) was used to assess body composition at 4, 8, 12, 16, and 19 weeks.From weeks 9-13, animals were subject to a series of neurobehavioral testing paradigms, including light-dark test, open field, spontaneous alternation with Y-maze, and voluntary wheel running.From weeks 16-17, animals were subject to 5-day indirect calorimetry trials and intraperitoneal glucose tolerance testing (GTT).At week 18, mice underwent an unconscious EKG to assess cardiac rhythm and function.Mice exited the phenotyping pipeline at 19 weeks.Blood and plasma samples were used to assess multiple biochemical traits and multiple organs were harvested and weighed.A subset of dissected tissues were frozen for future molecular analysis and others were paraffin embedded for future histological study.https://doi.org/10.1371/journal.pgen.1011228.g006

Fig 7 .
Fig 7. Changes in body composition over time as assessed by NMR.Points correspond to strain-level means and data are partitioned by sex to highlight the degree of sex-dimorphism in body composition.Strains are color-coded by geographic origin, with strains from a common location distinguished by line type.https://doi.org/10.1371/journal.pgen.1011228.g007 67 kcal/24h (Fig 9D).Glucose tolerance test.All Nachman lines show an attenuated response to intraperitoneal injection of a controlled concentration of glucose relative to C57BL/6J (Fig 10).Peak blood glucose concentrations are higher in males than females for all strains, with exceptionally

Fig 8 .
Fig 8. Strain variation in voluntary exercise and circadian rhythm.(A) The average number of wheel revolutions per strain over a 60-hour trial period.(B) Hour-to-hour change in wheel running activity (number of wheel revolutions).Abrupt changes in activity correspond to transitions between sleep-wake cycles.In both figures, strains are color-coded by geographic origin, with strains from the same location further distinguished by line type.https://doi.org/10.1371/journal.pgen.1011228.g008

Fig 9 .
Fig 9. Strain differences in metabolic phenotypes quantified by continuous respiratory monitoring.(A) All strains exhibit higher activity during the night, although the extent of the nighttime activity bias varies across strains.(B) Similarly, all strains expend more energy and (C) consume more food at night, but the ratio of day:night energy expenditure and food intake varies across strains.(D) Total energy expenditure varies significantly across strains, with the two strains from Gainesville, Florida (GAIC/NachJ and GAIA/NachJ) delimiting the extremes.https://doi.org/10.1371/journal.pgen.1011228.g009

Fig 10 .
Fig 10.Glucose response curves over a 2-hour window.Females and males were analyzed separately.Mice were fasted for 4 hours prior to administration of a fixed (w/v) amount of glucose by intraperitoneal injection.Blood glucose levels were assessed at 15, 30, 60, 90, and 120 minutes post injection.In both panels, strains are color-coded by geographic origin, with strains from a common location further distinguished by line type.Vertical bars denote +/-1 SD around the strain mean.https://doi.org/10.1371/journal.pgen.1011228.g010

Table .
Strain-level and Strain x Sex-level phenotype means are provided in S25 Table.Broad-sense heritability estimates are provided in S17 Table.Results from nonparametric Kruskal-Wallis tests of strain effects on each phenotype are provided in S16 Table.
Two-way ANOVA results (Phenotype ~Strain * Sex) are presented in S20 Table.Results from