Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Variability of test parameters from mice of different age groups in published data sets

Abstract

The use of mice as animal models in biomedical research allows the standardization of genetic background, housing conditions as well as experimental protocols, which all affect phenotypic variability. In this study, the phenotypic variability of test parameters was analyzed in genetically identical mice of different age groups, i.e., early adults versus late adults. Therefore, published data sets of genetically identical mice of different age groups collected from the same investigator/ project were retrospectively analyzed. Morphological parameters, blood parameters and behavioral tests were analyzed which are predominantly used in biomedical research. The JaxKOMP project examined C57BL/6NJ mice with an age of 7–20 weeks and 66–81 weeks. Further substrains of C57BL/6N mice with an age of 8–16 weeks and 49–63 weeks were examined as wild-type controls from various investigators of the International Mouse Phenotyping Consortium (IMPC). Additional data sets of young and old groups of genetically identical mice were derived from the Mouse Phenome Database (MPD) and the RIKEN BioResource Research Center (RBRC). The phenotypic variability of the chosen traits and parameters was measured by calculating the coefficient of variation (CV = standard deviation/ mean) of the animals with the same sex of a given mouse strain. Subsequently, the CVs of the young and the old mouse group were compared. The comparison of the phenotypic variability of the late adults versus early adults revealed the appearance of unpredictable interactions between genotype, environment and experiment. Overall, a higher phenotypic variability of the late adults appeared almost consistently for body weight including lean mass and fat mass for females as well as for hematology and immunology parameters, particularly for females. Clinical chemistry often appeared inconspicuous. No noticeable differences were detected for the traits echocardiography and electrocardiogram, whereas late adults also often showed a higher phenotypic variability for behavioral tests.

Introduction

Phenotypic variability of genetically identical individuals is caused by known or unrecognized unstandardized abiotic and biotic environmental conditions as well as by stochastic cellular noise leading to phenotypic variability in individual mice which gives rise to an intrinsic variance that cannot be reduced [1]. Inbred mouse strains offer the opportunity to address the question if phenotypic variability increases in older mice as a consequence of the prolonged impact of these factors. The phenotypic variability within the experimental unit determines the choice of the group size which is necessary for achieving valid and reproducible results. Literature search for publications dealing with age-related phenotypic variability as primary study goal and by using sufficient animal numbers resulted in the absence of such studies up-to-date.

In biomedical research, usually inbred mice are used as young adults with an age of 6–12 weeks [2]. Morphological parameters, blood parameters and behavioral tests of young adult mice as well as of older mouse groups were examined to detect age-specific differences for chosen phenotypic parameters. The data were collected from the same investigator/ project by using standardized experimental protocols, ideally in the same mouse cohort after further carrying out husbandry in the same standardized environment until the recurrent analysis of the chosen parameters. Such data were also delivered to public databases to serve as reference values for wild-type mouse strains at a higher age.

In this study, the phenotypic variability of test parameters in genetically identical mice of different age groups (early adults versus late adults) was analyzed in data sets of laboratory mouse strains which have been submitted to the Mouse Phenome Database (https://phenome.jax.org/centers/JaxKOMP; https://phenome.jax.org), the International Mouse Phenotyping Consortium (IMPC) (https://www.mousephenotype.org/data/late-adult-data), and the RIKEN BioResource Research Center (RBRC) (https://ja.brc.riken.jp/lab/jmc/mouse_clinic/Aged_JMC/group_Condition-Aged_mice/index.html). Parameters with sufficient data submitted to the databases were selected for this study which may indicate the frequency of the use of the respective parameters in biomedical research with mice.

Materials and methods

Public mouse databases were searched for data sets of genetically identical groups of different age periods which have been collected from the same investigator/ project by using standardized experimental protocols, ideally in the same animal cohort. Morphological parameters, blood parameters and behavioral tests were analyzed. In this study, body weight as a test parameter turned out to show a different variability in different age groups (see “Results” section), therefore, further test parameters were compared between the age groups as absolute values without relation to body weight or body composition parameters.

The JaxKOMP project (https://phenome.jax.org/centers/JaxKOMP) analyzed C57BL/6NJ mice with an age of 7–20 weeks (“early adult”) and 66–81 weeks (“late adult”). A great number of test parameters were examined in a great number of mice, and all results of the individual mice have been submitted online. Therefore, in the current study the 95% data range (by excluding 2.5% each of the highest and lowest values) was chosen for each parameter as physiological range to exclude technical outliers. Beside phenotypic traits analyzed as single parameter, the other traits included following parameters (VT, vertebrate trait ontologies): bone (n = 3; bone mineral content (BMC), bone mineral density (BMD), bone area); clinical chemistry (n = 13; albumin, VT:0000199; bilirubin, VT:0001569; cholesterol, VT:0000180; creatinine, VT:0005328; glucose, VT:0000188; total protein, VT:0005567; triglycerides, VT:0002644; urea, VT:0005265; calcium, VT:0001562; phosphorus, VT:0001565; alanine aminotransferase (ALT, EC 2.6.1.2), VT:0001573; aspartate aminotransferase (AST, EC 2.6.1.1), VT:0000203; alkaline phosphatase (AP, EC 3.1.3.1)); glucose tolerance test (n = 5; plasma glucose 0 min, 15 min, 30 min, 60 min, 120 min); hematology (n = 7; hemoglobin (MHEM), VT:0001588; mean corpuscular volume (MCV), no VT applied; red blood cell count (RBC), VT:0001586; white blood cell count (WBC), VT:0000217; platelets (PLT), VT:0003179; RBC corpuscular distribution width (RDW), no VT applied; mean platelet volume (MPV), no VT applied. The parameters hematocrit, mean corpuscular hemoglobin (MCH), and mean corpuscular hemoglobin concentration (MCHC) were not included in the study as they are subsequently calculated by using parameters which are directly measured.); electrocardiogram (n = 12; heart rate, heart rate variability, CV, RR interval, PQ interval, PR interval, QRS complex duration, QT interval, ST segment, QTc dispersion, mean SR amplitude, mean R amplitude); grip strength (n = 2; forelimb grip strength mean, fore- and hindlimb grip strength mean); light-dark box (n = 8; reaction time, side changes, left side time spent, left side mobile time spent, right time side spent, right side mobile time spent, percent time in light, percent time in dark); and open field (n = 19; time spent mobile, distance traveled total, periphery distance traveled, center distance traveled, number of rears first five minutes, corners permanence time, percent time center, whole arena resting time, whole arena average speed, periphery resting time, periphery permanence time, periphery average speed, center resting time, center permanence time, center average speed, number of center entries, vertical time, number of rears total, percent time corners).

Further C57BL/6N substrains (C57BL/6N, C57BL/6NCrl, C57BL/6NJcl, C57BL/6NTac) with an age of 8–16 weeks (“early adult”) and 49–63 weeks (“late adult”) were examined as wild-type controls from various investigators (Baylor College of Medicine (BCM), Helmholtz-Zentrum Muenchen (HMGU GMC; two projects with two substrains), Institut Clinique de la Souris, France (ICS), Korea Mouse Phenotyping Center (KMPC), Mary Lyon Centre at MRC Harwell, RIKEN BioResource Research Center (RBRC), The Centre for Phenogenomics Canada (TCP), UC Davis) of the International Mouse Phenotyping Consortium (IMPC) following standardized phenotyping pipelines, as described in IMPReSS (https://www.mousephenotype.org/data/late-adult-data). Only the data of the wild-type controls but no data of genetically modified mouse lines were used. Data of the Jackson Laboratory (JAX) in this database were not used as data of the JaxKOMP project was already analyzed in this study (see above). Number of mice tested, mean, and standard deviation (SD) are published online for the parameters tested. Data sets were included with at least 50 mice of a given sex examined as “late adult” group for a test parameter. The test parameters, which were selected from the database and included in Tables 2 and S1, are listed in S2 Table.

thumbnail
Table 1. Variability of test parameters from C57BL/6NJ mice of two different age groups of the project JaxKOMP-EAP/ JaxKOMP-LAP (95% range of the data sets).

https://doi.org/10.1371/journal.pone.0329357.t001

thumbnail
Table 2. Variability of test parameters from C57BL/6N mice (n ≥ 50 per sex for the late adult group) of two different age groups of the database IMPC LATE ADULT.

https://doi.org/10.1371/journal.pone.0329357.t002

Additional data sets of young and old groups of genetically identical mice were derived from the Mouse Phenome Database (MPD, https://phenome.jax.org). For each phenotypic trait/ parameter, strain data sets were chosen for the study by using the following selection criteria: inbred strains (including those derived from the Collaborative Cross (CC)), F1 hybrids; no lines with newly generated alleles; no treatment; group size: n ≥ 5 of a given sex for a chosen strain in both age groups; data of at least five strains available. Data for the following comparisons of different age groups were found: “6 months old” vs. “18-20 months old”; and “6 months old” vs. “24 months old”. More than one test parameter was analyzed for the phenotypic traits “bone” (n = 3 parameters), “clinical chemistry” (n = 11), “hematology” (n = 6), “immunology” (n = 8–10), “electrocardiogram”(n = 6), and “gait analysis”(n = 16; including data of only three strains). The information about the selected parameters is accessible online in the Mouse Phenome Database (https://phenome.jax.org) under the name of the phenotypic trait and the project name (usually the name of the investigator) given in Table 3.

thumbnail
Table 3. Variability of test parameters from mouse strains of two different age groups in data sets (n ≥ 5 per sex) of the Mouse Phenome Database.

https://doi.org/10.1371/journal.pone.0329357.t003

The RIKEN BioResource Research Center (RBRC) (https://ja.brc.riken.jp/lab/jmc/mouse_clinic/Aged_JMC/group_Condition-Aged_mice/index.html) additionally submitted online a comprehensive phenotypic screen of a small number of C57BL/6NJcl mice (n = 7–14 per sex) at an early (9–16 weeks) and late (49–56 weeks) age period.

The phenotypic variability of test parameters and traits was measured by calculating the coefficient of variation (CV = standard deviation/ mean) of animals with the same sex of a specific mouse strain. The CV of the early adult mouse group and the CV of the late adult mouse group were compared subsequently by the calculation of the ratio r = CVlate adult/ (CVlate adult + CVearly adult) for a given parameter/ trait. Data analysis was carried out using the software program Microsoft Excel 2016 (Microsoft Corp., Redmond, WA). The chi-squared test was used for the statistical analysis of the data.

Results

In this study, the phenotypic variability of test parameters and traits was measured by calculating the coefficient of variation (CV = standard deviation/ mean) of animals with the same sex of a specific mouse strain. The CV of the early adult mouse group and the CV of the late adult mouse group were compared subsequently by the calculation of the ratio r = CVlate adult/ (CVlate adult + CVearly adult) for a given parameter/ trait. r = 0.5 if the late adults show the identical phenotypic variability as the early adults. Increase or decrease of the phenotypic variability of the late adults vs. the early adults leads to r values > 0.5 or < 0.5, respectively. In the case that the late adults show the identical phenotypic variability as the early adults for traits with more than one test parameter measured, 50% of the parameters of a given trait are expected to deliver r values > 0.5 (and the other 50% of the parameters are expected to deliver r values < 0.5).

For the following values of r = CVlate adult/ (CVlate adult + CVearly adult) the respective relative change of the phenotypic variability of the late adults vs. the early adults is given in parentheses: r = 0.25 (0.33 × phenotypic variability of the late adults compared to the early adults); 0.33 (0.5×); 0.4 (0.66×) 0.43 (0.75×); 0.47 (0.9×); 0.5 (1×); 0.52 (1.1×); 0.55 (1.25×); 0.6 (1.5×); 0.66 (2×); and 0.75 (3×). Thus, an increase of the CV of the late adults vs. the early adults of 50% or more results in r values of ≥ 0.6.

During the analysis, body weight as well as lean mass and fat mass as test parameters often turned out to a show a higher variability in the late age group vs. the young age group (see below). Therefore, further test parameters were compared between the age groups as absolute values without relation to body weight or body composition parameters. Data of four different databases were analyzed.

The JaxKOMP project (https://phenome.jax.org/centers/JaxKOMP) analyzed a high number of C57BL/6NJ mice with an age of 7–20 weeks (“early adult”) and 66–81 weeks (“late adult”; n between > 100 and > 500 per test parameter) with a great number of test parameters, and all results of the individual mice have been submitted online. Therefore, in the current study the 95% data range (by excluding 2.5% each of the highest and lowest values) was chosen for each parameter as physiological range to exclude technical outliers. For traits with more than one test parameter measured, the mean of the r’s of all test parameters of a given trait was calculated (Table 1).

Analogous results were found for most traits in both sexes. The tests resulted in following r values: body weight: > 0.6, with higher values for females; lean mass and fat mass: partly clearly over 0.5, with higher values for females; bone: 0.5 for females, and < 0.5 for males; clinical chemistry: nearly 0.5 (with 6 and 7 of the 13 test parameters showing values > 0.5 for females and males, respectively); glucose tolerance test: > 0.5; hematology: distinctly > 0.5 (with 6 of 7 test parameters showing values > 0.5 in both sexes); electrocardiogram: nearly 0.5; grip strength: < 0.5; behavioral tests (hole board, light-dark box, open field): 0.55 to 0.6 (see columns “mean” of Table 1). Between the traits, no relation was detected for the level of the CV value (see columns “cv (range)” of Table 1) and the r value (see columns “mean” of Table 1) of a given trait.

As there is no phenotypic parameter described as a control for the analysis of the variability in different mouse age groups, no substantial variability of different mouse age groups for the parameter “body length” was assumed as hypothesis, particularly as “body length” showed a low CV in examined mouse groups in published studies [3] as well as in this study (CV = 0.03 for all four mouse groups “female early adult”, “female late adult”, “male early adult”, and “male late adult”; see columns “cv (range)” of Table 1). The analysis of the JaxKOMP project data delivered r values of 0.52 and 0.49 for females and males, respectively (see columns “mean” of Table 1).

The trait “hematology” included seven parameters (see Materials and Methods section). The two parameters “WBC” and “platelets” showed much higher CVs within a given mouse group in both the JaxKOMP project and the IMPC LATE ADULT project (see below) which corresponds to published studies [3]. Therefore, subsequently separate r values were analyzed for the group of the five remaining hematology parameters usually showing a low CV (MCV, MHEM, MPV, RBC, RDW), as well as for the group “WBC, platelets”. “WBC, platelets” showed a higher r value, however, the r value of the group of the remaining five hematology parameters was still > 0.5 (see columns “mean” of Table 1).

The trait “clinical chemistry” included 13 test parameters and delivered a r value of nearly 0.5 both for females and males. This was confirmed by the fact that half of the parameters showed r values < 0.5 and > 0.5, respectively (see columns “r > 0.5 (n)” of Table 1). The separate analysis of the 13 parameters resulted in r values within a high range (see columns “range” of Table 1). As described in published studies [3] as well as in this study (see columns “cv (range)” of Table 1), the separate clinical chemistry parameters showed different CV values in examined mouse groups. Therefore, subsequently the clinical chemistry parameters were classified in the four subgroups “electrolytes” (calcium with low CV, phosphorus with higher CV), “plasma enzyme activities” (ALT, AST, and AP; with high CVs), “substrates low” (albumin and total protein; with lower CVs), and “substrates high” (bilirubin, cholesterol, creatinine, glucose, triglycerides, urea; with higher CVs). For both the JaxKOMP project and the IMPC LATE ADULT project (see below), no relation was detected for the level of the CV value and the r value of the four subgroups within the trait “clinical chemistry”. This was also true for the analysis of the 12 test parameters of the trait “electrocardiogram” after subsequently classifying them in two subgroups with low CVs and high CVs, respectively.

The behavioral tests (hole board, light-dark box, open field) resulted in r values between 0.55 and 0.6 (see columns “mean” of Table 1). All test parameters of the behavioral tests showed high to very high CVs (> 0.1 to > 1.0; see columns “cv (range)” of Table 1), therefore, no subsequent analyses with subgroups of parameters with lower CVs and higher CVs was carried out.

Different projects submitted data to the database IMPC LATE ADULT (https://www.mousephenotype.org/data/late-adult-data). Data of up to nine projects using wild-type animals of C57BL/6N substrains with an age of 8–16 weeks (“early adult”) and 49–63 weeks (“late adult”) were analyzed (Table 2). Data sets were included with at least 50 mice of a given sex examined as “late adult” group for a test parameter. The test parameters chosen from the database are listed in S2 Table. For traits with more than one test parameter measured, the mean and the median of the r’s of all test parameters within a given trait of a project was calculated. Subsequently, the mean was calculated from the project means (see columns “mean” of Table 2).

Following r values were found in traits with data from at least three projects: body weight, lean mass and fat mass: > 0.5 for females, and around 0.5 for males; bone: > 0.5; clinical chemistry: slightly over 0.5; intraperitoneal glucose tolerance test: > 0.5; hematology: > 0.5 for females and 0.5 for males; immunology (n = 2 projects): > 0.5; echocardiography and electrocardiogram: around 0.5; grip strength: < 0.5; fear conditioning (n = 2 projects): < 0.5; light-dark box (n = 1 project): < 0.5 for females and 0.5 for males; open field, rotarod (n = 1 project) and Y-maze (n = 1 project): > 0.5 (see columns “mean” of Table 2). Individual phenotypic trait data of all projects including n ≥ 100 mice per sex for the late adult group of the database IMPC LATE ADULT are presented in further detail in S1 Table.

4-5 and 2 projects analyzed the parameters “body length” and “tibia length”, respectively. As already stated above for the analysis of the data sets of the JaxKOMP project, “body length” was hypothetically chosen to serve as potential “control” test parameter. The CVs of both parameters “body length” and “tibia length” for all four mouse groups “female early adult”, “female late adult”, “male early adult”, and “male late adult” of all projects were ≤ 0.06 (see also columns “CV” of S1 Table). The mean r values were 0.48 and 0.44 for “body length”, and 0.49 and 0.51 for “tibia length” in females and males, respectively (see columns “mean” of Table 2). However, the r values of individual projects for “body length” remarkably deviated to a high extent from the hypothetized r value of 0.5 (see columns “range (projects)” of Table 2), also in projects including n ≥ 100 mice per sex for the late adult group (S1 Table). The review of the downloaded values of mean and standard deviation (SD), which are published online, gave hints for the assumption that a rounding error due to only one decimal digit for the SD value being specified can not be excluded in these cases. In total, the retrospective analysis does not support the choice of “body length” to serve as “control” test parameter for all projects where data sets were examined for this study.

As already described above for the JaxKOMP project, the subsequent analyses of subgroups for particular phenotypic traits was also carried out with the data of the IMPC LATE ADULT project. In the trait “hematology”, the separate analysis of “WBC, platelets” resulted in r values > 0.5, whereas the analysis of the group of the remaining parameters with lower CVs obtained r values around 0.5 for both sexes (see columns “mean” of Table 2). For the trait “clinical chemistry”, 12–20 test parameters were chosen from five different projects. Analogous to the results in the JaxKOMP project, the subsequent analyses of the four subgroups “electrolytes” (with low CVs, except of phosphorus and iron), “plasma enzyme activities” (with high CVs), “substrates low” (with lower CVs), and “substrates high” (with higher CVs) obtained no relation for the level of the CV value and the r value of the four subgroups of the trait “clinical chemistry” within a given project. This was also true for the analysis of the test parameters of the trait “electrocardiogram” after subsequently classifying them in two subgroups with low CVs and high CVs, respectively. For the five behavioral tests (fear conditioning, light-dark box, open field, rotarod, Y-maze), data from at least three different projects were collected only for the trait “open field” (n = 4 projects). Almost all test parameters of the behavioral tests showed high to very high CVs (> 0.1 to > 1.0; see columns “CV” of S1 Table), therefore, no subsequent analyses with subgroups of parameters with lower CVs and higher CVs was carried out.

Furthermore, data of the Mouse Phenome Database (MPD, https://phenome.jax.org) were used for the analysis in the case that the identical project examined the same mouse strains at two different ages (“6 months old” vs. “18-20 months old”; and “6 months old” vs. “24 months old”) (Table 3). Data sets with group sizes of n ≥ 5 of a given sex for a chosen strain in both age group were used. All data collected were derived from inbred strains. For each test parameter, the mean and the median of the r’s of all mouse strains were calculated. Subsequently, for phenotypic traits with more than one test parameter, the mean of the parameter means (see columns “mean” of Table 3) and the mean of the parameter medians (see columns “median” of Table 3) were calculated for the given trait. The number of the mouse strains involved greatly varied for the different traits (n = 6–27; see columns “n strains” of Table 3).

For the comparison “6 months old” vs. “18-20 months old” following r values were found: body weight, lean mass and fat mass: > 0.5, with higher values for females; bone: > 0.5 for females and 0.5 for males; clinical chemistry, hematology, and immunology: clearly > 0.5; electrocardiogram: slightly > 0.5; grip strength: slightly > 0.5; gait analysis: > 0.5. For the comparison “6 months old” vs. “24 months old”, the traits hematology and immunology showed an even higher increase of the variability when compared to the analogous results of the comparison “6 months old” vs. “18-20 months old” (see columns “mean” of Table 3).

All individual parameters – irrespective of their CV values – generally showed a very high range of the r values of the various mouse strains tested (see columns “range (strains)” of Table 3). It is assumed that this does not or only to a minor extent reflect the genetical differences of the mouse strains, but is predominantly caused by the low average numbers of mice per experimental unit (n = 5–9). Thus, only a high number of mouse strains tested for a given parameter may lead to representative mean r values in the MPD project. The singular results of each strain with the low number of tested mice clearly showed the appearance of unpredictable interactions between genotype, environment and experiment.

The “control” test parameter “body length” was analyzed in one project of the comparison “6 months old” vs. “20 months old”, and showed mean r values of 0.61 and 0.55 for females (23 strains examined) and males (19 strains examined), respectively (see columns “mean” of Table 3). The same investigator also carried out the comparison “6 months old” vs. “12 months old” for the parameter “body length”; use of these data resulted in mean r values of 0.51 and 0.54 for females (24 strains examined) and males (18 strains examined), respectively. The generally described very high range of the r value for the various mouse strains tested was also seen for “body length” (see columns “range (strains)” of Table 3).

Analysis of data of a relatively small number of C57BL/6NJcl mice with an age of 9–16 weeks (“early adult”) and 49–56 weeks (“late adult”) from the RIKEN BioResource Research Center (RBRC) (https://ja.brc.riken.jp/lab/jmc/mouse_clinic/Aged_JMC/group_Condition-Aged_mice/index.html) is shown in S3 Table. For each test parameter, the CV of the early adult group and the CV of the late adult group, and subsequently the ratio r = CVlate adult/ (CVlate adult + CVearly adult) was calculated. For traits with more than one test parameter measured, the mean and the median of the r’s of all parameters within a given trait was calculated (see columns “mean” of S3 Table).

Following r values were found: body weight, analyzed in various phenotypic analyses: mostly clearly > 0.5, with higher values for females; lean mass: > 0.5; fat mass: clearly > 0.5 for females, and clearly < 0.5 for males; clinical chemistry: around 0.5 for females and 0.53 for males; hematology: > 0.5; grip strength, open field: inconsistent results for both sexes (see columns “mean” of S3 Table). As observed in the outcome of the analysis of the phenotypic variability from data of the Mouse Phenome Database (see above), in respect of the phenotypic variability caution should be taken in the interpretation of results derived from singular mouse strains with only low numbers of animals examined.

Discussion

The retrospective analysis of the phenotypic variability of test parameters was carried out with data sets of early adult mouse groups versus late adult mouse groups from projects which determined the age ranges for early adult groups and late adult groups for their own. Thus, the chosen data sets have not been produced especially for the analysis of the age-related phenotypic variability which is an inherent limitation of the study. Data from four databases were analyzed, where C57BL/6N substrain mice were used except of project MPD. The projects JaxKOMP, IMPC LATE ADULT, and RBRC delivered data from early adults (7–20 weeks old) vs. late adults (around 1.5 years old (JaxKOMP), and around 1 year old (IMPC LATE ADULT, and RBRC)). MPD data were collected for the comparison of the age groups “6 months old” vs. “18-20 months old”, and “6 months old” vs. “24 months old”. For the old mice of the MPD project, it is assumed, that only mice with no apparent symptoms of the pre-mortal geriatric stage were chosen for the analyses. Attempts have been made to relate different mouse age periods to human age [4]. The projects JaxKOMP and IMPC LATE ADULT examined high numbers (n ≥ 50) of C57BL/6N substrain mice. On the other hand, the small group sizes used in the projects MPD and RBRC may represent the relatively low numbers of animals which are usually used as group size at least in fundamental biomedical research.

Taking into account particularly the projects with high numbers of mice examined, in summary the comparison of late adults vs. early adults resulted in following r values: body weight including lean mass and fat mass for females: consistently > 0.5; body weight including fat mass for males: around 0.5 or > 0.5; clinical chemistry: often around 0.5, or > 0.5; hematology and immunology: often > 0.5, with higher values for females; echocardiography and electrocardiogram: around 0.5; grip strength: < 0.5 or around 0.5; behavioral tests: often > 0.5. The results also showed the appearance of unpredictable interactions between genotype, environment and experiment regarding the variability of the parameters and traits in the different projects analyzed.

Unsatisfactory validity and reproducibility of results obtained from biomedical research in mice are of major and constant scientific concern (e.g., see [57], and refs. therein). Thus, despite of elaborate efforts to standardize main known abiotic and biotic experimental factors, non-reproducible and inconsistent results still occur in present studies. In addition, the data sets chosen for this age-related analysis were inherently derived from long-term projects which harbour an increased risk of change of abiotic and biotic factors during the project (e,g. personnel interacting with the experimental animals) compared to short-term experiments. Inconsistent results of the age-related phenotypic variability between the projects observed in this retrospective study also hinder or negatively impact the study of the causes underlying the results of the analysis about age-related phenotypic variation. Thus, preceding experiments of the impact of main abiotic and biotic factors on the outcome of the analysis of age-related phenotypic variation would be beneficial.

The four projects differed in the choice of the specific age of the early adult and late adult mice; in addition, the chosen age ranges of the early adult and late adult mice often covered a large time interval in the projects except of project MPD. The high variation of the age in the early adult groups (JaxKOMP: 7–20 weeks; IMPC LATE ADULT; 8–16 weeks; RBRC; 9–16 weeks) may negatively interfere with the aim of this study to compare the phenotypic variability in early adult mice and late adult mice as the first part of the chosen time range may still cover the phase of the late adolescence of the animals [1,2]. Thus, these early adult groups may include putative differences in the phenotypic variability of late adolescent mice and adult mice. However, even in cases of parameters with different mean values between age subgroups within a chosen age group, this does not indicate for differences of the phenotypic variability between these age subgroups.

The use and separate analysis of both sexes in biomedical research is strongly recommended because of possible differences in the outcome [810]. It is assumed that the data sets have been achieved by using the housing method which is usually carried out when working with mice in biomedical research, i.e., both sexes are group housed, and females in their reproductive lifespan are used without regard to the stage of the estrous cycle. A meta-analysis revealed that group housing of mice increased the variability in both males and females by 37% [11]. Therefore, no sex is expected to take advantage of this housing method in respect to the extent of the variability compared to the other sex. The similar increase of the variability in group housed males and females is also expected to cover the consequences of the Lee Boot effect which leads to the suppression of the estrous cycle in group housed female mice [12]. In addition, the behavior of female mice was described to reflect individual identity far more than estrous state [13]. Thus, the appearance of a higher age-related phenotypic variability of females for some parameters found in this study can not be explained by well-reproduced results of the published research. On the other hand, it is assumed, that single housing of mice may have been necessary in the late adult groups of the four projects more frequently for males than for females due to the appearance of social incompatibilities in the group. This would lead to inconsistencies of the impact of the housing method in early adult groups and late adult groups especially for males.

At several time points across the lifespan of Mus musculus, linear and nonlinear shifts in gene expression during ageing were revealed [14,15]. In a retrospective analysis of a high number of humans, individual patient-specific set points of hematological parameters were identified which persisted for at least 20 years [16]. On the other hand, the analysis of epigenetic, transcriptomic, and metabolomic levels in mice indicated the occurrence of rapid changes of the biological age in both directions at least in various models of severe stress [17].

The phenotypic variability within the experimental unit determines the choice of the group size which is necessary for achieving valid and reproducible results. Thus, the results may also be considered for the calculation of the group size when using older mice. Depending on the difference which is determined as relevant for the test group vs. the control group, an increase of the phenotypic variability CV (= standard deviation/ mean) of 50% (which results in a ratio r = CVlate adult/ (CVlate adult + CVearly adult) = 0.6 for late adult mice) within the experimental unit leads to the need of using roughly about twice the animal number of late adult mice for gaining the identical statistical strength when compared to early adult mice which are usually used in biomedical research [3].

Conclusions

Overall, the comparison of the late adults vs. early adults resulted in a higher phenotypic variability almost consistently for body weight including lean mass and fat mass for females as well as for hematology and immunology parameters, particularly for females. Clinical chemistry often appeared inconspicuous. No noticeable differences were detected for the traits echocardiography and electrocardiogram, whereas late adults also often showed a higher phenotypic variability for behavioral tests. The results also clearly showed the appearance of unpredictable interactions between genotype, environment and experiment regarding the variability of the parameters and traits in the different projects analyzed.

Supporting information

S1 Table. Variability of test parameters from projects including n ≥ 100 mice per sex for the late adult group of the database IMPC LATE ADULT.

https://doi.org/10.1371/journal.pone.0329357.s001

(XLSX)

S2 Table. Test parameters chosen from the nine projects (n ≥ 50 mice per sex of the old age group analyzed) of the database IMPC LATE ADULT shown in Table 2.

https://doi.org/10.1371/journal.pone.0329357.s002

(XLSX)

S3 Table. Variability of test parameters from C57BL/6NJcl mice (n = 7–14 per sex) of two different age groups of the RBRC project.

https://doi.org/10.1371/journal.pone.0329357.s003

(XLSX)

References

  1. 1. Brust V, Schindler PM, Lewejohann L. Lifetime development of behavioural phenotype in the house mouse (Mus musculus). Front Zool. 2015;12(S1).
  2. 2. Jackson SJ, Andrews N, Ball D, Bellantuono I, Gray J, Hachoumi L, et al. Does age matter? The impact of rodent age on study outcomes. Lab Anim. 2016;51(2):160–9.
  3. 3. Aigner B, Heumann C. Variability of clinical chemical and hematological parameters, immunological parameters, and behavioral tests in data sets of the Mouse Phenome Database. PLoS ONE. 2023;18(7):e0288209.
  4. 4. Dutta S, Sengupta P. Men and mice: Relating their ages. Life Sciences. 2016;152:244–8.
  5. 5. Crabbe JC, Wahlsten D, Dudek BC. Genetics of Mouse Behavior: Interactions with Laboratory Environment. Science. 1999;284(5420):1670–2.
  6. 6. Jaric I, Voelkl B, Amrein I, Wolfer DP, Novak J, Detotto C, et al. Using mice from different breeding sites fails to improve replicability of results from single-laboratory studies. Lab Anim. 2023;53(1):18–22.
  7. 7. von Kortzfleisch VT, Ambrée O, Karp NA, Meyer N, Novak J, Palme R, et al. Do multiple experimenters improve the reproducibility of animal studies? PLoS Biol. 2022;20(5):e3001564.
  8. 8. Clayton JA, Collins FS. Policy: NIH to balance sex in cell and animal studies. Nature. 2014;509(7500):282–3.
  9. 9. Rodríguez-Montes L, Ovchinnikova S, Yuan X, Studer T, Sarropoulos I, Anders S, et al. Sex-biased gene expression across mammalian organ development and evolution. Science. 2023;382(6670).
  10. 10. Sandberg K, Umans JG. Recommendations concerning the new U.S. National Institutes of Health initiative to balance the sex of cells and animals in preclinical research. FASEB J. 2015;29(5):1646–52.
  11. 11. Prendergast BJ, Onishi KG, Zucker I. Female mice liberated for inclusion in neuroscience and biomedical research. Neuroscience & Biobehavioral Reviews. 2014;40:1–5.
  12. 12. Bind RH, Minney SM, Rosenfeld S, Hallock RM. The role of pheromonal responses in rodent behavior: future directions for the development of laboratory protocols. J Am Assoc Lab Anim Sci. 2013;52(2):124–9. pmid:23562094
  13. 13. Levy DR, Hunter N, Lin S, Robinson EM, Gillis W, Conlin EB, et al. Mouse spontaneous behavior reflects individual variation rather than estrous state. Current Biology. 2023;33(7):1358-1364.e4.
  14. 14. Schaum N, Lehallier B, Hahn O, Pálovics R, Hosseinzadeh S, Lee SE, et al. Ageing hallmarks exhibit organ-specific temporal signatures. Nature. 2020;583(7817):596–602.
  15. 15. Almanzar N, Antony J, Baghel AS, Bakerman I, Bansal I, Barres BA, et al. A single-cell transcriptomic atlas characterizes ageing tissues in the mouse. Nature. 2020;583(7817):590–5.
  16. 16. Foy BH, Petherbridge R, Roth MT, Zhang C, De Souza DC, Mow C, et al. Haematological setpoints are a stable and patient-specific deep phenotype. Nature. 2024;637(8045):430–8.
  17. 17. Poganik JR, Zhang B, Baht GS, Tyshkovskiy A, Deik A, Kerepesi C, et al. Biological age is increased by stress and restored upon recovery. Cell Metabolism. 2023;35(5):807-820.e5.