Separated or joint models of repeated multivariate data to estimate individuals’ disease trajectories with application to scleroderma

Ji Soo Kim; Ami A. Shah; Laura K. Hummers; Scott L. Zeger

doi:10.1371/journal.pone.0320414

Abstract

Estimating a patient’s disease trajectory as defined by clinical measures is an essential task in medicine. Given multiple biomarkers, there is a practical choice of whether to estimate the joint distribution of all biomarkers in a single model or to model the univariate marginal distribution of each marker separately ignoring the covariance structure among measures. To fully utilize all trajectory-relevant information in multiple longitudinal markers, a joint model is required, but its complexity and computational burden may only be warranted when joint estimates of trajectories are substantially more efficient than separate estimates. This paper derives general expressions for the inefficiency of univariate or “separated" estimates of population-average trajectories and individual’s random effects as compared to the fully efficient multivariate or “combined" estimates. Then, in two settings: (1) a general bivariate case; and (2) our motivating clinical case study with 5 measures, we find that separated estimates of fixed effects are nearly fully efficient. However, joint estimates of random effects can be meaningfully more efficient for measures with substantial missing data when other strongly correlated measures are observed more frequently. This increased efficiency of the joint model derives more from joint shrinkage of random effects in multivariate space than from improved estimates of the subject-specific trajectories obtained when accounting for correlations in measurements. These findings have application to a diverse array of chronic diseases where biomarkers’ trajectories guide clinical decisions.

Citation: Kim JS, Shah AA, Hummers LK, Zeger SL (2025) Separated or joint models of repeated multivariate data to estimate individuals’ disease trajectories with application to scleroderma. PLoS ONE 20(4): e0320414. https://doi.org/10.1371/journal.pone.0320414

Editor: Pinaki Sarder, State University of New York at Buffalo, UNITED STATES OF AMERICA

Received: September 2, 2024; Accepted: February 18, 2025; Published: April 21, 2025

Copyright: © 2025 Kim et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Data cannot be shared publicly because they contain potentially identifying or sensitive patient information. Data are available upon reasonable request from the Johns Hopkins Scleroderma Center Research Registry (contact Center Research Data Manager Adrianne Woods at awoods9@jhmi.edu) and can be shared with appropriate IRB approvals and institutional data sharing agreement.

Funding: This work was supported in part by the Jerome L. Greene Foundation, the Johns Hopkins inHealth initiative, the Scleroderma Research Foundation, the Nancy and Joachim Bechtle Precision Medicine Fund for Scleroderma, the Manugian Family Scholar, the Donald B. and Dorothy L. Stabler Foundation, the Chresanthe Staurulakis Memorial Fund, and NIH/NIAMS (P30AR070254, R01AR073208, K24AR080217).

Competing interests: The authors have declared that no competing interests exist.

Introduction

Estimating a patient’s trajectory in a space defined by multiple clinical measures is an essential task in medicine. In some problems, a goal is to find a lower-dimensional summary of measures that parsimoniously represents the trajectory. But in the autoimmune disease application that motivates this work, each of the original measures is important to clinical decisions and must be retained.

Systemic sclerosis (scleroderma) is an autoimmune disease characterized by dysregulation of the immune system and damage to multiple organ systems, including the skin, heart, lungs, kidneys, gastrointestinal tract, and blood vessels [1]. Although relatively rare, scleroderma is a one of 80 related autoimmune diseases that, in aggregate, comprise the third most prevalent set of chronic diseases after cancer and heart disease [2]. All organs must be monitored to determine appropriate treatment for an individual [3,4]. There is heterogeneity among patients in their clinical manifestations, response to treatment, rate of disease progression, and survival [5]. Therefore, clinicians seek to accurately measure each patient’s current disease state and rate of progression or “trajectory" for each organ.

From a statistical perspective, we obtain multivariate longitudinal measures at irregularly-observed times for a cohort of patients. Some measures are easily obtained; others demand considerable resources. Joint and univariate, marker-specific models are widely used to estimate disease trajectories [6,7]. This paper focuses on quantifying the efficiency in estimating fixed and random effects of the joint or “combined (C)" model in comparison to marker-specific or “separated (S)" models. We ask under what circumstances the additional computational and statistical burden of the combined model add substantial value when estimating individual patients’ trajectories for all biomarkers. Although the combined models can result in slower convergence and greater challenge in obtaining reliable parameter estimates, it estimates across-measure correlations which is a key scientific question in some applications and cannot be estimated by fitting the separated models. We derive general formulae measuring the relative efficiency for fixed effects, random effects and for predicted values. We quantify the inefficiency of separated models for the individual patient and describe its association with patient’s pattern of observations.

The scientific focus on biomarker trajectories implies that the main predictors for both the fixed and random effects are smooth basis functions of time and/or their interactions with baseline patient characteristics. As is well known in other efficiency studies, sharing predictors across regressions has important implications for the efficiency of separated relative to combined models as discussed below.

Statistical models of trajectory

The linear mixed model (LMM) is widely used to describe changes in a single approximately-Gaussian longitudinal outcome over time. LMMs yield valid inferences about trends by accounting for the autocorrelation among repeated measures of the same subject; they support estimation of subject-specific random effects while naturally handling irregularly spaced or/and unbalanced data [7,8]. Harville (1976) and Harville (1977) [9,10] first applied the Gauss-Markov theorem [11] to the statistical framework for the LMM and showed that its random effects estimators are the best unbiased linear predictors (BLUP) when the covariance parameters are known. The multivariate linear mixed model (MLMM) is an extension of the LMM for the analysis of multiple outcomes [12–15]. Given multivariate longitudinal observations measured for individuals, fitting a separate LMM for each outcome or fitting a single MLMM are both common [16–19]. The separated LMMs approach estimates the population and individual trajectories of each outcome independently of the others, while the MLMM additionally captures the between-measure correlations induced by correlated random effects and random error terms.

Inefficiency when ignoring correlation

In the time series context, Bloomfield and Watson [20] derived expressions for the maximum inefficiency of ordinary compared to general least squares as a function of the design matrix and residual variance matrix. A similar idea was explored much earlier by Tukey [21] who quantified the maximum inefficiency caused by using a misweighted mean as compared to the optimally weighted mean.

The gain in efficiency from using a joint model has also been studied for cross-sectional data under the “seemingly unrelated regressions" (SUR) framework [22]. A SUR comprises a set of linear regressions where each equation describes the relationship between a different outcome and its associated predictor variables. Zellner showed that joint coefficient estimation by general least squares (GLS) is asymptotically more efficient compared to separate regressions by ordinary least squares (OLS) and that the efficiency increases as the error terms from different equations become more cross-correlated and as the predictor variables in different equations become less correlated. An important special case is that estimates from separate regressions by OLS are fully efficient when the predictors for each outcome are the same, regardless of the degree of correlation among the outcomes [22]. Oliveira and Teixeira-Pinto further investigated the case in which some predictors are shared across the outcomes while others are outcome-specific and showed that the estimates for the regression parameters of the shared predictors are fully efficient while those of outcome-specific predictors have greater efficiency when a joint model is fit [23].

Objectives

In this paper, we study the inefficiency of separated LMMs relative to the combined MLMM in the longitudinal data setting. As in previous work on multivariate regression and on time series, we consider the inefficiency in estimating regression coefficients of separated models. However, motivated by our clinical application, an additional focus of this paper is on individual’s trajectories as represented by the random effects in the mixed models. We work under the assumption that missing data are missing at random (MAR) [24]. For estimation of the fixed effects parameters, we first consider whether the SUR conditions under which OLS is fully efficient can be satisfied by a MLMM. We then derive expressions for the inefficiency of the outcome-specific LMM estimates relative to the MLMM ones. Focusing on the random slopes (trajectories), we examine the cause and degree of imprecision in two cases: (1) a general two-biomarker problem and (2) our motivating clinical case study of scleroderma trajectories with five biomarkers.

Efficiencies of separated versus combined models

Notation

Let be the observed value for the kth measure for person i = 1 , … , m at the jth visit , at time since onset . Let be the vector of for , and are () and () known matrices of full rank, and and are and measure-specific vector of parameters for the fixed and random effects. Let be the total number of observations for person i and let be the measure-specific, within-subject error term.

With these definitions, the multivariate linear mixed effects model is written as where , , , and ⊕ ⁡ denotes the Kronecker sum. We assume , . Letting and , we can write the above model more compactly in the standard linear mixed model form Y = Xβ + Zb + e, where .

Defining combined (joint) and separated models

In the specification above, D and are (Kq × Kq) and () positive definite matrices, respectively. The K (q × q) and () measure-specific block diagonal matrices for D and represent within-measure covariance of random effects and random errors, respectively. The off-block diagonals of D and represent the covariances of random effects and random errors across measures. If the off-diagonal submatrices are set equal to zero, then the mixed effects model of K measures reduces to K univariate mixed effects models. We call this the “separated" model in contrast with the model with the unrestricted D and that is called the “combined" model.

For the separated model, where , so that , and .

For the combined model, where , , and . To simplify the notation, let , and , in following sections.

Separated models and seemingly unrelated regressions (SUR)

The fixed effects estimates from the combined model are generalized least squares (GLS) estimates first described in Aitken [25]. They are therefore the best linear unbiased estimator (BLUE) so that the variance of is greater than or equal to the variance of . There are, however, two situations where the separated models’ fixed effects estimates are fully efficient as originally discussed in Zellner [22], summarized in S1 Supporting materials. The first is a trivial case when the cross-measure covariances of error terms are zero, where the combined model is equivalent to the separated models. The other case is when the measure-specific design matrices are the same across all k = 1 , . . . K measures.

The question is whether, for multivariate linear mixed effects models, the separated models can be fully efficient as occurs on the SUR case? In Supporting Materials S1, we show that the separated models always lose efficiency relative to the multivariate model except when: (1) the cross-measure covariances of error terms and random effects are all zero; (2) are the same for all k and , a sub-matrix of , are the same for all k.

So, the question remains, how inefficient are the separated models for multivariate cross-sectional responses? Is the inefficiency sufficient to warrant the burden of jointly modeling the outcomes in situations like tracking disease progression where the separated models meet the clinical objectives?

Comparing estimates of combined and separated models

Our interest lies in quantifying the improvement in efficiency of the combined model relative to the separated model when both can provide valid inferences that address the clinical question in estimating: (1) fixed effects coefficients that represent population average trajectories ; (2) an individual’s estimated random effects that represent his estimated deviations from the average trajectories; and (3) an individual patient’s estimated trajectories that are a linear combination of and .

We compute the following ratios of mean squared error (MSE) for each of , , and from the combined and separated model.

(1)

(2)

(3)

Formulae for the MSE, variance, and squared bias are presented in Supporting Materials S2.

Case studies

In two case studies, we examine the inefficiencies resulting from fitting separate LMMs rather than a single MLMM, derived from the general expressions. As detailed below, the first is the general bivariate case with fixed predictors, covariance matrices, degrees of missing data, and simulated missing data patterns in which we can examine the entire space of correlations between the two measures. In this first case, we focus on the inefficiency of the random effects because the fixed effects estimates are close to fully efficient. The second is the motivating scleroderma study in which there are 5 distinct measures where we consider the inefficiency of both the fixed and random effects.

Bivariate case study

Consider two measures of length and of length for subject i. Let and be the measure-specific vectors of random effects for and . With little loss of generality, we simplify the problem by assuming that population regression coefficients β are known for the following reasons. First, the separated model is fully efficient in estimating β if is a part of and if is the same across measures and subjects (see S1 Supporting materials). In other cases, we observe that the degree of inefficiency in estimating random effects are numerically similar when estimating β as compared to known β.

When both measures are fully observed, we expect little benefit from fitting the combined model. The real value of fitting the combined model is when one of the variables is poorly determined, either due to missing data or noise in the measurements. We let be fully observed and be missing in various degrees to measure efficiency gains for the random effects by fitting the combined model. Our primary interest in this case study is estimating , as jointly modeling would only have marginals effects on , which is already well determined. We calculate MSE Ratio of , MSE ratio that corresponds to the random effects of the second measure, by taking sub-matrices of MSEs in the equation 2. MSEs for the separated and combined models with known β are derived in S3 Supporting materials.

Using the formulae in S2 Supporting materials, we investigate the relative contributions to improved efficiency of (1) the degree of missingness, (2) measurement error, (3) heterogeneity in individual trajectories, and (4) across-measure correlations.

Degree of missingness.

With complete data, the random effects design matrix comprises the constant vector for the intercept and a vector of equally spaced times, scaled to range from -1 to 1, for the trajectory. We assume is fully observed, while a portion of is randomly missing. When , , and ; when , . In applications, such a pattern is observed when one measure is more frequently collected than the other. We also consider the case where is missing in a drop-out pattern, such that we only observe the first observations of the second measure and the rest is lost to follow up.

Degree of measurement error and heterogeneity in trajectories.

To investigate under which scenario the combined model borrows most strength in estimating , we consider three cases based on relative sizes of the variance components.

The random effects covariance matrix D and measurement error covariance matrix R are defined by four correlation variables. Let

(4)

where .

Case A: equal-sized random effect and measurement error variances; .
Case B: unequal measurement errors; variance components of the random effects and measurement errors are as in Case A except that , representing greater measurement error for .
Case C: heteroskedastic random effects and equal measurement error variances; variance components of the random effects and measurement errors are as in Case A except that . Greater variances are assumed for the random slope variance for to produce substantial heterskedasticity in across time.

We assess the degree to which greater measurement error in (Case B) or increased heterogeneity in trajectories of the second measure (Case C) results in greater efficiency gains for the joint model compared to our reference case of equal-sized random effect and measurement error variances (Case A).

Evaluation of efficiency at different combinations of within and across-measurecorrelations.

For each case, we explore the entire range of across-measure correlation between the random effects , across-measure correlation of measurement errors , and within-measure correlations of random intercept and slope for the two measures and . For simplicity, we report results only for the case . The general formulae in the Supporting Materials S3 can be used when .

When exploring the range of correlations and variances summarized above, we ensure the resulting random effects covariance matrix D is positive-semi-definite using a slightly modified version of the spectral decomposition method introduced in Rebonato and Jäckel (2001) [26].

Download:

Fig 1. MSE Ratio of

by varying

,

and

under scenarios A, B, and C when

for randomly missing case.

Cells representing unlikely combinations of and are colored in grey.

https://doi.org/10.1371/journal.pone.0320414.g001

Gains in efficiencies.

Fig 1 shows the relative efficiencies for for the combined versus separated models. We see that the greater the absolute values of and , the higher the efficiency gain for the combined model. We observe greater gains when and are less similar, that is when the two sources of variability introduce correlations of opposing signs. However, in practice, large values with opposite signs for are highly unlikely. Hence, the efficiencies are close to one in most practical situations.

In the complete case (first row of Fig 1), the separated model is nearly full efficient when is similar to regardless of their magnitude, a result reminiscent of the SUR case. With no missing data, , the situation in the SUR model where OLS is fully efficient. The inefficiency of the separated model increases as , the fraction of missing data for , increases. In fact, affects the efficiency more than the correlation parameters over their realistic ranges. This pattern is consistent across different combinations of , , .

Download:

Fig 2. MSE ratio of

evaluated at

,

under cases A, B, and C for randomly missing case.

Only the likely combinations of illustrated in Fig 1 are used to create boxplots. Different combinations of yield negligible difference in outcome, so we only present the case of common that takes three values.

https://doi.org/10.1371/journal.pone.0320414.g002

The effects of varying on efficiency gain by , and case are presented in Fig 2. The result illustrates that fitting the combined model is particularly advantageous when missingness in is large. For those with complete data, the average gains across individuals are minimal, especially for case A. On the individual level, however, we observe combinations of and with decreased MSE ratios. Assuming greater variance for random measurement error for , 25th percentiles of ratios have greater than 15% decrease in MSE by fitting the combined model even with 0% missingness. Additionally, in a special case of the model specification in which we assume random effects are the only source of across-measure correlations, we can still expect greater efficiency gains when is high.

The gains are much larger when we increase to 33% and 66%. There is heterogeneity in MSE ratios ranging from 0.3 to 1. The variability is more closely related to values of and than , the absolute number of observations. When between-measure correlation is weak, the separated model can be fully efficient even with large . The gains are slightly greater in the drop-out missing pattern, but the findings are qualitatively similar (see S4 Supporting materials, S1 Fig, and S2 Fig).

From these results, we conclude that the available information in the measure itself and other correlated measures together determines the benefit from fitting the combined model to estimate individual trajectories. Individuals with rich data can obtain reasonable trajectory estimates by only modeling , while individuals with sparse data can achieve substantial reduction in MSE by fitting the combined model if the two measures are highly correlated.

Scleroderma case study

Sclerodema data.

The Johns Hopkins Scleroderma Center Cohort comprises over 4,000 patients, providing a unique opportunity to study trajectory-focused prediction tools relevant to any chronic disease that manifests in many biomarkers. In scleroderma, clinicians track: pulmonary function measured by the standardized percent predicted forced vital capacity (pFVC) and standardized percent predicted diffusing capacity for carbon monoxide (pDLCO); cardiac function measured by left ventricular ejection fraction (EF) and right ventricular systolic pressure (RVSP); and skin measured by the modified Rodnan skin score (mRSS). Disease onset is defined by the earlier of the onset of Raynaud’s phenomenon, reduced blood flow to the fingers, and first non-Raynaud’s symptom. In this paper, we study 581 patients who have at least 4 repeated observations for each of the 5 measurements within 40 years since disease onset.

Some measures are collected more frequently than other measures. On average, we see greater numbers of skin and pulmonary measures compared to those of cardiac measures. This is because pFVC and pDLCO from pulmonary function tests and mRSS from routine clinical visits are more easily collected compared to EF and RVSP from echocardiograms. The summary statistics of the number of observations by measure are shown in Table 1.

Download:

Table 1. Summary statistics of 581 patients’ number of observations

https://doi.org/10.1371/journal.pone.0320414.t001

All five measures were quantile normalized by mapping their empirical marginal distributions to the Gaussian distribution. Let be a vector of the observed values from measure k = 1 , . . . , 5. The quantile-normalized vector is obtained by , where and is the inverse of the standard Gaussian distribution. RVSP and mRSS are transformed by multiplying them by -1 so that an increase in all five measures indicates improved disease status.

Estimating the separated and combined models.

Because the clinical focus is on patient trajectories, the fixed effects of our model included natural splines of time with 3 degrees of freedom, age of onset, race, sex, skin type, presence of three common autoantibodies, and the interactions of each of the baseline covariates listed above with the natural spline of time. Patient specific intercept and linear time are included as random effects. Standard linear mixed model software including R packages lme4 [27] and nlme [28] can easily fit the separated models. However, in this case-study, the algorithms failed to converge when fitting the combined model despite substantial efforts to tailor the starting values and convergence tuning constants. The combined model with saturated random effects and residual covariances requires estimation of 40 + 10 additional parameters in the random effects and residual covariance matrices, respectively, compared to those of the separated model.

We therefore fit the combined model using the R package MCMCglmm[29]. For the fixed effects of both models, we used a diffuse independent Gaussian prior centered around zero with a large variance (). Weakly informative inverse-Wishart priors are placed on random effects and residual covariance matrices. Specifically, we set the prior distribution of the random intercepts to have a mode of one and those of random slopes to have the mode of 0.005, with 10 degrees of freedom. The prior distribution of the residual covariance matrix also had mode of one for each measure with 5 degrees of freedom. The degrees of freedom are chosen to make the distributions as diffuse as possible while guaranteeing they are valid inverse-Wishart distributions.

An alternative to fitting the combined model is to estimate the cross-measure covariance parameters using within-measure variance estimates obtained by fitting separated models. Jackson et al. [30] and Chen et al. [31] extended DerSimonian and Laird’s univariate method of moments estimator [32] to a multivariate setting to estimate the cross-measure covariance matrix in the random effect model from the measure-specific models. Using such methods and our formulae, we can compute the inefficiency of fitting the separated models without directly fitting the combined model and evaluate whether the combined model should be fit.

Scleroderma covariances within and across-biomarker.

Download:

Fig 3. Pairwise correlations of observations from all patients for 11 years (years 0,...,10 since the disease onset) are calculated and plotted (left) using range of colors from red, white, and blue each representing correlation of 1, 0, and − 1, respectively.

The 11 by 11 block matrices on the diagonals shows the degree of correlation in patients’ repeated observations over time for each of the five measures. Looking along the block-diagonal, one observes that the two lung measurements and mRSS are highly correlated with their respective past observations, while observations of the two heart measures have less serial correlation. The empirical correlation matrices of the combined and separated models (middle and right) are plotted using the covariance estimates from the two models. The combined model allows correlation among the five measures, while the separated model does not.

https://doi.org/10.1371/journal.pone.0320414.g003

We compare the assumed covariance structures of the combined and separated models with the estimated covariance matrices. In Fig 3, we show that the combined model captures the within and across-measure correlation patterns quite well; the separated model only captures within-measure correlations. We display the correlations of yearly average values for the first 10 years of follow-up within and among the five measures.

We observe high positive correlation for the two lung measures which suggests that there could be gains in efficiency when modeling the measures jointly. We observe positive correlation between RVSP, mRSS, and the two lung measures; the EF observations appear to be uncorrelated with any other measure including RVSP, the other cardiac measure.

Comparing bias and efficiency.

Using the formulas derived in Supporting Materials S2, we compare MSE, bias and variance of: (1) the fixed effects estimates and ; (2) random effects estimates and ; and (3) the predicted values and . All three estimands of interest are functions of the design matrices (X and Z ) and covariance matrices , , , and ). We construct design matrices for each individual using observed times at which the five measurements are taken based on the model described above. From the model, we also estimate the population covariance of the random effects and population residual covariance . In this section, we use the finite sample posterior estimates of the variances obtained by taking the posterior mean of the MCMC estimates of and . The variances and for the separated model are constructed by setting the off– diagonal terms of and to be zero.

Population average trajectory estimation.

In Table 2, we present overall and measure-wise MSE Ratio of β obtained from Equation 1. Assuming known variance parameters, the overall MSE in estimating fixed effects is reduced by only 3% when using the combined model compared to fitting the separated model. Since both fixed effect estimates for the separated and combined models ( and ) are unbiased (see S2 Supporting materials), the reduction in MSE solely comes from variance reduction.

Estimating random effects and individual patients’ prediction.

Patients’ deviations in the level and trend from the average population trajectory is captured by the random intercept and slope estimates. Table 3 presents subject-average MSE ratios from Equation 2 for random effects (MSE Ratio of ), random intercepts only (MSE Ratio of ), and random slopes only (MSE Ratio of ). Equation 3 is a similar expression for predicted values (MSE Ratio of ). Estimating random effects and predicted values from the combined model is most advantageous for RVSP. Overall, the mean gains in MSEs are minimal.

Download:

Table 2. Ratio of MSE of overall and measure-wise fixed effects of the combined model to the separated model

https://doi.org/10.1371/journal.pone.0320414.t002

Heterogeneity in bias and efficiency gains by patient.

Random effects estimates are a linear combination of patient-specific level and trajectory estimates and their population analogues. Hence, depending on the amount and characteristics of individuals’ data, we can expect variation among patients in the MSEs. In Fig 4, we show the measure-specific MSE for estimators of individual patient’s fixed effects, random effects, random slope only, and predicted values. The MSEs are transformed onto the log scale; a positive value indicates that the separated model has smaller errors and a negative value indicates that the combined model does.

In panel (a), the five points marking the measure-specific log ratios of MSE are equivalent to the MSE Ratio of β in Table 2 transformed to the log scale. The box plots in panels (b)–(d) show the patient-level log ratios for the random effects, random slope estimates, and predicted values for the 581 patients.

The most notable result is that there is sizable heterogeneity for the patient-specific log ratios, especially for pDLCO and RVSP. For RVSP, most patients benefit from fitting the combined model. The gains in pDLCO are substantial for only 25 percent of the patients. The stretched out left tails of the pDLCO MSEs indicate that a small fraction of patients are estimated to have over 20% efficiency gains.

Download:

Table 3. Average MSE ratios of random effects, random intercept only, random slope only, and predicted values of the combined model to the separated model

https://doi.org/10.1371/journal.pone.0320414.t003

Ethics approval and consent to participate.

Data analyzed in this study were obtained from consenting participants in the Johns Hopkins Scleroderma Center Research Registry accessed on February 10th, 2020. This study was approved by the Johns Hopkins Medicine Institutional Review Board (IRB00251593 and IRB00226995). Participants provided written informed consents, and the authors have permission to identify patients during and after data collection for additional data collection. All methods were performed in accordance with the relevant guidelines and regulations.

Download:

Fig 4. Log ratio of MSEs of the combined model to the separated model for estimators of individual patient’s fixed effects, random effects, random slope only, and predicted values.

https://doi.org/10.1371/journal.pone.0320414.g004

Discussion

In our application, a patient’s disease state is reflected in multiple irregularly spaced longitudinal measures. By selecting and estimating multivariate Bayesian hierarchical models, we estimated smooth individual and population trajectories for each measure/organ systems using noisy and, for many individuals, sparse data. This analysis can further clinicians’ understanding of the disease by representing disease progression in multiple dimensions for clinically-defined subpopulations and by quantifying the correlations across measures and time.

We addressed the statistical question of whether, in a simple bivariate case and in our motivating scleroderma example with five biomarkers, fitting a more complex multivariate hierarchical model (“combined model") produces substantially more efficient estimates compared to fitting a set of “separated models," one for each measure. In regression analysis, this question was raised by Zellner [22]. He showed that the coefficient estimation using the GLS [25] is asymptotically more efficient compared to the OLS, and that the efficiency increases as the error terms from different equations become more cross-correlated and as the predictor variables in different equations become less correlated. The OLS estimates are fully efficient when either there is no cross-measure correlation or when the predictors are the same in the regression model for each measure. We show that multivariate linear mixed models are not separable into individual equation systems without efficiency loss except for restrictive special cases. We derived equations to quantify this loss of efficiency for fixed effects, and in one general and one specific case study showed the inefficiency is negligible.

With mixed effect models, there is an additional question of how inefficient are estimates of random effects using separated as compared to combined models. We derived a set of generalized formulae to compare the relative efficiency of individual-level estimates from the fully efficient combined model and the simpler separated models. We defined the MSE for the estimated random effects as the average conditional MSE over the distribution of . The conditional MSE is defined as the conditional expected squared difference between the predicted values above and the true value of the random effect.

There is little advantage of the combined model for estimating the fixed effects, when multiple outcome measures are observed at similar or common times. Such patterns are frequently observed in longitudinal studies, including our own case study. The five measures are captured at the same or similar times, making the fixed and random effects design matrices similar, resulting in separated models being almost fully efficient. For the random effects estimates, however, the degree of advantage of the combined model can be substantial for some individuals. There are sizable gains for those individuals for whom the relative number of observations in the measure of interest is smaller than those in other correlated measures. The degree of efficiency gain increases with the degree of cross-measure correlations.

The increased efficiency of the combined model estimates of individual trajectories derive mainly from multidimensional shrinkage toward the population mean trajectory instead of shrinking in each dimension separately as occurs in separated models. The population average trajectories for the five measures are estimated with near full-efficiency with the separated model. However, the combined model is advantageous as it shrinks the subject-specific measurements towards the population average trajectories in a multivariate space, whereas the separated model shrink within each univariate space. The rate of shrinkage depends on cross-measure correlations and missingness, producing different efficiency loss across subjects.

In terms of bias and variance, the efficiency gain, or the reduction in MSE for the random effects mostly results from reduced bias. For individuals who have only a few data points for a given measure, the data for the measure alone cannot accurately reflect the underlying disease state of the individual. Hence, fitting the separated models results in greater shrinkage towards the measure-specific mean and results in larger bias. The bias is reduced when fitting the combined model, where the random effects estimator borrows strength from data-rich measures.

This framework for comparing the performances of the combined and separated models for the population and individual level estimates can be applied to any setting where the individuals’ and population trajectories in higher dimension space need to be estimated. However, it should be noted that the results are drawn assuming Gaussian responses after transformation and missingness at random. The effects of non-Gaussian and non-ignorable missingness on the results are topics for further studies. The results in this paper describe the efficiency costs of misspecifying the covariance structure among the random effects and/or residual errors. Another form of misspecification is by omitting key predictors. In our particular application, this might involve assuming a smooth trajectory for a biomarker when the changes are more acute or immediate. The efficiency results presented in this paper assumed that linear predictors are correctly specified. When the models are misspecified, there is no a priori reason to believe that the effects of misspecification would be more or less for the combined versus separated models.

Software

Software in the form of R code, together with a sample input data set and complete documentation is available on request from the corresponding author.

Supporting information

Supporting materials.

includes four sections: S1 Efficiency of fixed effect estimates and seemingly unrelated regressions S2 Mean squared error and bias-variance decomposition S3 Mean squared error and bias-variance decomposition of random effect estimates with known population parameters S4 Efficiency gains for the random effects in the case of drop-out missing pattern.

https://doi.org/10.1371/journal.pone.0320414.s001

(PDF)

S1 Fig. MSE Ratio of , under scenarios A, B, and C when for drop-out missing case. Cells representing unlikely combinations of are colored in grey.

https://doi.org/10.1371/journal.pone.0320414.s002

(TIF)

S2 Fig. MSE Ratio of , , , , under cases A, B, and C for drop-out missing pattern. Only the “likely combinations" of illustrated are used to create boxplots. Different combinations of yield negligible difference in outcome, so we only present the case of common that takes three values.

https://doi.org/10.1371/journal.pone.0320414.s003

(TIF)

Acknowledgments

The authors thank Professor Antony Rosen, director of the Johns Hopkins inHealth Precision Medicine program, Fred Wigley director of the Johns Hopkins Scleroderma Center, and Aalok Shah for supporting our use of the JH Precision Medicine Analytics Platform.

References

1. Pattanaik D, Brown M, Postlethwaite AE. Vascular involvement in systemic sclerosis (scleroderma). J Inflam Res. 2011;4:105–25. pmid:22096374
- View Article
- PubMed/NCBI
- Google Scholar
2. Fairweather D, Frisancho-Kiss S, Rose NR. Sex differences in autoimmune disease from a pathological perspective. Am J Pathol 2008;173(3):600–9. pmid:18688037
- View Article
- PubMed/NCBI
- Google Scholar
3. Steen VD. The many faces of scleroderma. Rheum Dis Clin North Am. 2008;34(1):1–15;v.
- View Article
- Google Scholar
4. Shah AA, Wigley FM. My approach to the treatment of scleroderma. Mayo Clin Proc 2013;88(4):377–93. pmid:23541012
- View Article
- PubMed/NCBI
- Google Scholar
5. Allanore Y, Simms R, Distler O, Trojanowska M, Pope J, Denton CP, et al. Systemic sclerosis. Nat Rev Dis Primers 2015;1(1):1–21. pmid:27189141
- View Article
- PubMed/NCBI
- Google Scholar
6. Verbeke G, Fieuws S, Molenberghs G, Davidian M. The analysis of multivariate longitudinal data: a review. Stat Methods Med Res 2014;23(1):42–59. pmid:22523185
- View Article
- PubMed/NCBI
- Google Scholar
7. Diggle P, Heagerty P, Liang KY, Zeger S, Zeger. Analysis of longitudinal data. Oxford: OUP; 2002.
8. Brown H, Prescott R. Applied mixed models in medicine. Wiley; 1999.
9. Harville DA. Extension of the Gauss-Markov theorem to include the estimation of random effects. Ann. Statist 1976;4(2):384–95.
- View Article
- Google Scholar
10. Harville DA. Maximum likelihood approaches to variance component estimation and to related problems. Journal of the American Statistical Association. 1977;72(358):320–338.
- View Article
- Google Scholar
11. Graybill FA. Theory and application of the linear model. North Scituate, MA: Duxbury Press; 1976.
12. Reinsel G. Estimation and prediction in a multivariate random effects generalized linear model. J Am Stat Assoc 1984;79(386):406–14.
- View Article
- Google Scholar
13. Sammel M, Lin X, Ryan L. Multivariate linear mixed models for multiple outcomes. Stat Med 1999;18(17):2479–92.
- View Article
- Google Scholar
14. Fieuws S, Verbeke G. Joint modelling of multivariate longitudinal profiles: pitfalls of the random-effects approach. Stat Med 2004;23(20):3093–104. pmid:15449333
- View Article
- PubMed/NCBI
- Google Scholar
15. Wang WL, Fan TH. Bayesian analysis of multivariate t linear mixed models using a combination of IBF and Gibbs samplers. J Multivar Anal 2012;105(1):300–10.
- View Article
- Google Scholar
16. Shah A, Laird N, Schoenfeld D. A random-effects model for multiple characteristics with possibly missing data. Journal of the American Statistical Association 1997;92(438):775–9.
- View Article
- Google Scholar
17. Verbeke G. Linear mixed models for longitudinal data. In: Linear mixed models in practice. Springer; 1997. p. 63–153.
18. Sammel M, Lin X, Ryan L. Multivariate linear mixed models for multiple outcomes. Stat Med. 1999;18(17–18):2479–92.
- View Article
- Google Scholar
19. Pantazis N, Touloumi G, Walker A, Babiker A. Bivariate modelling of longitudinal measurements of two human immunodeficiency type 1 disease progression markers in the presence of informative drop-outs. J R Stat Soc Ser C Appl Stat 2005;54(2):405–23.
- View Article
- Google Scholar
20. Bloomfield P, Watson GS. The inefficiency of least squares. Biometrika 1975;62(1):121–8.
- View Article
- Google Scholar
21. Tukey JW. Approximate weights. Ann Math Stat. 1948;19(1):91–92.
- View Article
- Google Scholar
22. Zellner A. An efficient method of estimating seemingly unrelated regressions and tests for aggregation bias. J Am Stat Assoc 1962;57(298):348–68.
- View Article
- Google Scholar
23. Oliveira R, Teixeira-Pinto A. Analyzing multiple outcomes: is it really worth the use of multivariate linear regression? J Biom Biostat. 2015;6(4).
- View Article
- Google Scholar
24. Rubin DB. Inference and missing data. Biometrika. 1976;63(3):581–92.
- View Article
- Google Scholar
25. Aitken AC. On least squares and linear combination of observations. Proc R Soc Edinb. 1934;55:42–48.
- View Article
- Google Scholar
26. Rebonato R, Jäckel P. The most general methodology to create a valid correlation matrix for risk management and option pricing purposes. J Risk. 2001;2.
- View Article
- Google Scholar
27. Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. 2015;67(1):1–48.
- View Article
- Google Scholar
28. Pinheiro J, Bates D, DebRoy S, Sarkar D. The Nlme package: linear and nonlinear mixed effects models. R Core Team; 2019.
29. Hadfield JD. MCMC methods for multi-response generalized linear mixed models: the MCMCglmm R package. J Stat Softw. 2010;33(2):1–22.
- View Article
- Google Scholar
30. Jackson D, White IR, Thompson SG. Extending DerSimonian and Laird’s methodology to perform multivariate random effects meta-analyses. Stat Med 2010;29(12):1282–97. pmid:19408255
- View Article
- PubMed/NCBI
- Google Scholar
31. Chen H, Manning AK, Dupuis J. A method of moments estimator for random effect multivariate meta-analysis. Biometrics 2012;68(4):1278–84. pmid:22551393
- View Article
- PubMed/NCBI
- Google Scholar
32. DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin Trials 1986;7(3):177–88. pmid:3802833
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Pattanaik D, Brown M, Postlethwaite AE. Vascular involvement in systemic sclerosis (scleroderma). J Inflam Res. 2011;4:105–25. pmid:22096374
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Fairweather D, Frisancho-Kiss S, Rose NR. Sex differences in autoimmune disease from a pathological perspective. Am J Pathol 2008;173(3):600–9. pmid:18688037
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Steen VD. The many faces of scleroderma. Rheum Dis Clin North Am. 2008;34(1):1–15;v.
View Article
Google Scholar

[10] View Article

[11] Google Scholar

[ref4] 4. Shah AA, Wigley FM. My approach to the treatment of scleroderma. Mayo Clin Proc 2013;88(4):377–93. pmid:23541012
View Article
PubMed/NCBI
Google Scholar

[13] View Article

[14] PubMed/NCBI

[15] Google Scholar

[ref5] 5. Allanore Y, Simms R, Distler O, Trojanowska M, Pope J, Denton CP, et al. Systemic sclerosis. Nat Rev Dis Primers 2015;1(1):1–21. pmid:27189141
View Article
PubMed/NCBI
Google Scholar

[17] View Article

[18] PubMed/NCBI

[19] Google Scholar

[ref6] 6. Verbeke G, Fieuws S, Molenberghs G, Davidian M. The analysis of multivariate longitudinal data: a review. Stat Methods Med Res 2014;23(1):42–59. pmid:22523185
View Article
PubMed/NCBI
Google Scholar

[21] View Article

[22] PubMed/NCBI

[23] Google Scholar

[ref7] 7. Diggle P, Heagerty P, Liang KY, Zeger S, Zeger. Analysis of longitudinal data. Oxford: OUP; 2002.

[ref8] 8. Brown H, Prescott R. Applied mixed models in medicine. Wiley; 1999.

[ref9] 9. Harville DA. Extension of the Gauss-Markov theorem to include the estimation of random effects. Ann. Statist 1976;4(2):384–95.
View Article
Google Scholar

[27] View Article

[28] Google Scholar

[ref10] 10. Harville DA. Maximum likelihood approaches to variance component estimation and to related problems. Journal of the American Statistical Association. 1977;72(358):320–338.
View Article
Google Scholar

[30] View Article

[31] Google Scholar

[ref11] 11. Graybill FA. Theory and application of the linear model. North Scituate, MA: Duxbury Press; 1976.

[ref12] 12. Reinsel G. Estimation and prediction in a multivariate random effects generalized linear model. J Am Stat Assoc 1984;79(386):406–14.
View Article
Google Scholar

[34] View Article

[35] Google Scholar

[ref13] 13. Sammel M, Lin X, Ryan L. Multivariate linear mixed models for multiple outcomes. Stat Med 1999;18(17):2479–92.
View Article
Google Scholar

[37] View Article

[38] Google Scholar

[ref14] 14. Fieuws S, Verbeke G. Joint modelling of multivariate longitudinal profiles: pitfalls of the random-effects approach. Stat Med 2004;23(20):3093–104. pmid:15449333
View Article
PubMed/NCBI
Google Scholar

[40] View Article

[41] PubMed/NCBI

[42] Google Scholar

[ref15] 15. Wang WL, Fan TH. Bayesian analysis of multivariate t linear mixed models using a combination of IBF and Gibbs samplers. J Multivar Anal 2012;105(1):300–10.
View Article
Google Scholar

[44] View Article

[45] Google Scholar

[ref16] 16. Shah A, Laird N, Schoenfeld D. A random-effects model for multiple characteristics with possibly missing data. Journal of the American Statistical Association 1997;92(438):775–9.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref17] 17. Verbeke G. Linear mixed models for longitudinal data. In: Linear mixed models in practice. Springer; 1997. p. 63–153.

[ref18] 18. Sammel M, Lin X, Ryan L. Multivariate linear mixed models for multiple outcomes. Stat Med. 1999;18(17–18):2479–92.
View Article
Google Scholar

[51] View Article

[52] Google Scholar

[ref19] 19. Pantazis N, Touloumi G, Walker A, Babiker A. Bivariate modelling of longitudinal measurements of two human immunodeficiency type 1 disease progression markers in the presence of informative drop-outs. J R Stat Soc Ser C Appl Stat 2005;54(2):405–23.
View Article
Google Scholar

[54] View Article

[55] Google Scholar

[ref20] 20. Bloomfield P, Watson GS. The inefficiency of least squares. Biometrika 1975;62(1):121–8.
View Article
Google Scholar

[57] View Article

[58] Google Scholar

[ref21] 21. Tukey JW. Approximate weights. Ann Math Stat. 1948;19(1):91–92.
View Article
Google Scholar

[60] View Article

[61] Google Scholar

[ref22] 22. Zellner A. An efficient method of estimating seemingly unrelated regressions and tests for aggregation bias. J Am Stat Assoc 1962;57(298):348–68.
View Article
Google Scholar

[63] View Article

[64] Google Scholar

[ref23] 23. Oliveira R, Teixeira-Pinto A. Analyzing multiple outcomes: is it really worth the use of multivariate linear regression? J Biom Biostat. 2015;6(4).
View Article
Google Scholar

[66] View Article

[67] Google Scholar

[ref24] 24. Rubin DB. Inference and missing data. Biometrika. 1976;63(3):581–92.
View Article
Google Scholar

[69] View Article

[70] Google Scholar

[ref25] 25. Aitken AC. On least squares and linear combination of observations. Proc R Soc Edinb. 1934;55:42–48.
View Article
Google Scholar

[72] View Article

[73] Google Scholar

[ref26] 26. Rebonato R, Jäckel P. The most general methodology to create a valid correlation matrix for risk management and option pricing purposes. J Risk. 2001;2.
View Article
Google Scholar

[75] View Article

[76] Google Scholar

[ref27] 27. Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. 2015;67(1):1–48.
View Article
Google Scholar

[78] View Article

[79] Google Scholar

[ref28] 28. Pinheiro J, Bates D, DebRoy S, Sarkar D. The Nlme package: linear and nonlinear mixed effects models. R Core Team; 2019.

[ref29] 29. Hadfield JD. MCMC methods for multi-response generalized linear mixed models: the MCMCglmm R package. J Stat Softw. 2010;33(2):1–22.
View Article
Google Scholar

[82] View Article

[83] Google Scholar

[ref30] 30. Jackson D, White IR, Thompson SG. Extending DerSimonian and Laird’s methodology to perform multivariate random effects meta-analyses. Stat Med 2010;29(12):1282–97. pmid:19408255
View Article
PubMed/NCBI
Google Scholar

[85] View Article

[86] PubMed/NCBI

[87] Google Scholar

[ref31] 31. Chen H, Manning AK, Dupuis J. A method of moments estimator for random effect multivariate meta-analysis. Biometrics 2012;68(4):1278–84. pmid:22551393
View Article
PubMed/NCBI
Google Scholar

[89] View Article

[90] PubMed/NCBI

[91] Google Scholar

[ref32] 32. DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin Trials 1986;7(3):177–88. pmid:3802833
View Article
PubMed/NCBI
Google Scholar

[93] View Article

[94] PubMed/NCBI

[95] Google Scholar

Figures

Abstract

Introduction

Statistical models of trajectory

Inefficiency when ignoring correlation

Objectives

Efficiencies of separated versus combined models

Notation

Defining combined (joint) and separated models

Separated models and seemingly unrelated regressions (SUR)

Comparing estimates of combined and separated models

Case studies

Bivariate case study

Degree of missingness.

Degree of measurement error and heterogeneity in trajectories.

Evaluation of efficiency at different combinations of within and across-measurecorrelations.

Gains in efficiencies.

Scleroderma case study

Sclerodema data.

Estimating the separated and combined models.

Scleroderma covariances within and across-biomarker.

Comparing bias and efficiency.

Population average trajectory estimation.

Estimating random effects and individual patients’ prediction.

Heterogeneity in bias and efficiency gains by patient.

Ethics approval and consent to participate.

Discussion

Software

Supporting information

Supporting materials.

S1 Fig. MSE Ratio of , under scenarios A, B, and C when for drop-out missing case. Cells representing unlikely combinations of are colored in grey.

S2 Fig. MSE Ratio of , , , , under cases A, B, and C for drop-out missing pattern. Only the “likely combinations" of illustrated are used to create boxplots. Different combinations of yield negligible difference in outcome, so we only present the case of common that takes three values.

Acknowledgments

References