A multilevel hierarchical framework for quantification of experimental heterogeneity in population snapshot data

David J. Warne; Xiangrun Zhu; Thomas P. Steele; Stuart T. Johnston; Scott A. Sisson; Matthew Faria; Ryan J. Murphy; Alexander P. Browning

doi:10.1371/journal.pcbi.1014379

Abstract

Biological systems exhibit substantial heterogeneity: that is, variation in specific characteristics of individuals within a population. As a result, it is of critical importance to appropriately account for biological heterogeneity when calibrating mathematical models to infer cellular processes and predict behaviour. Recent approaches consider ordinary differential equations with random parameters to quantify heterogeneity in dynamical processes of cells. In this setting, statistical inference is performed to characterise the distribution of these random parameters within a cell population. One significant limitation of this approach is the tacit assumption that there are no substantial deviations in these distributions across experimental replicates. In this work, we propose a flexible Bayesian hierarchical differential equation modelling framework that quantifies and distinguishes both inter-experimental heterogeneity (heterogeneity between experimental replicates) and intra-experimental heterogeneity (biological heterogeneity within replicate populations). We consider two recent studies that employ mathematical models to interpret flow cytometry snap-shot data and quantify heterogeneity in nano-particle cell interactions and cell internalisation processes. Using simulation data, we demonstrate that substantial inaccuracy in the inferred dynamics can arise when experimental heterogeneity is not accounted for. By contrast, our hierarchical approach is robust to variability in inter-experimental and intra-experimental heterogeneity and our method simplifies to previous methods when inter-experimental heterogeneity is negligible. Our approach is flexible and widely applicable to applications involving replicate populations and snapshot data. We provide open-source implementations of our methods on GitHub.

Author summary

In the design of personalised and targeted drugs and therapeutics it is essential to quatify the variability in the response of cellular processes involved in drug uptake. In practice, the experimental instruments and protocols used to collect cell population data for the study of such cellular processes are complex. As a result, it is important for statistical analyses that use these data to correctly account for variability between experiments in addition to the variability of cellular behaviour within the same experimental replicate. We present a flexible model of cellular dynamic that accounts for variability in biological, technical, and experimental replicate. We show that this leads to more accurate and robust results over standard approaches that pool replicates together.

Citation: Warne DJ, Zhu X, Steele TP, Johnston ST, Sisson SA, Faria M, et al. (2026) A multilevel hierarchical framework for quantification of experimental heterogeneity in population snapshot data. PLoS Comput Biol 22(6): e1014379. https://doi.org/10.1371/journal.pcbi.1014379

Editor: William Majoros, Duke University School of Medicine, UNITED STATES OF AMERICA

Received: January 28, 2026; Accepted: May 28, 2026; Published: June 15, 2026

Copyright: © 2026 Warne et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Code and relevant data to reproduce the results in this work are available on Git Hub at https://github.com/davidwarne/multilevelHierarchicalODEs.

Funding: This project was supported by the Australian Research Council (ARC). DJW is supported by an ARC Discovery Early Career Researcher Award (DE250100396). STJ and MF are supported by an ARC Discovery Project (DP230100380). STJ is also supported by an ARC Discovery Project (DP250100477). SAS is supported by an ARC Discovery Project (DP260101134). DJW, APB, and STJ acknowledge support from the ARC Centre of Excellence for the Mathematical Analysis of Cellular Systems (MACSYS; CE230100001). APB thanks the Mathematical Institute, University of Oxford for a Hooke Research Fellowship. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Accounting for biological heterogeneity is crucial in the biomedical sciences [1]. Heterogeneity plays a key role in the development and maintenance of healthy living organisms [2,3] through complex interactions between various types of cells driven by intracellular biochemical processes [4]. In pathological situations, such as cancer or viral infections [5–8], heterogeneity can seriously impact the efficacy of pharmaceutical interventions and eventual patient outcomes [9–11]. As a result, appropriately quantifying biological heterogeneity within a population of cells is of tremendous importance for understanding natural processes and for the design of effective drugs, especially in the areas of precision medicine such as targeted therapeutics [12–15].

A standard experimental approach to quantify properties of cell populations is flow cytometry [16–18]. In such an experiment, populations of cells (Fig 1(a)), are tagged or stained with fluorescent markers. The cells then individually flow through the cytometer nozzle and pass through a laser from which scattered light is amplified and converted to a digital signal representing a measurement of fluorescent intensity related to each cell in the population (Fig 1(b)). This results in a snapshot in time of the fluorescent intensity distribution across multiple colour channels, providing a rich source of information about potential variability in the cell populations at a single point in time. (Fig 1(c)). Typically, thousands to tens of thousands of cells are analysed per time point.

Download:

Fig 1. Schematic of flow cytometry data acquisition.

(a) Cells are seeded into cell culture plates, e.g., 12-well plates, in a growth media (e.g., 1 mL) and incubated with various treatments for some time period (typically between 1 to 48 hours). (b) Cells are then removed from plates and placed in suspension, and passed through the flow cytometer nozzle to a laser beam where light scattering from the cell is measured at detectors. (c) For each fluorescent channel, the intensity distribution is obtained, leading to distributions showing cellular variability.

https://doi.org/10.1371/journal.pcbi.1014379.g001

Data obtained from flow cytometry experiments exhibit substantial levels of heterogeneity. While some of this variation can be attributed to measurement noise, it is well established that some of this variation arises due to intrinsic biological variation even between genetically identical cells [19–23]. However, there are also extrinsic factors that drive heterogeneity in flow cytometry experiments that can adversely affect the statistical analysis of the biological heterogeneity of interest if not properly accounted for [24–26]. For example, subtle variation in sample preparation, instrument configuration for data acquisition, and manual user operation can have a substantial effect on the signal-to-noise ratio and statistical bias in the data, potentially impacting the validity of conclusions drawn from the analysis [24,25,27,28].

In many studies, it is typically assumed that experimental replicates of identically prepared cell populations represent independent and identically distributed replicates. For example, in the work of Faria et al. [29], human leukaemia cells (THP-1 cell line) are seeded into a 12-well plate and nano-particle-cell interactions are observed via flow cytometry at six observation times (between 1 hr and 24 hrs), resulting in two replicates per observation time. In practice, statistical analysis is performed on the pooled sample of replicates [30,31], or each replicate is individually analysed [32,33]. Neither approach is ideal. Pooling ignores heterogeneity between experimental replicates and may lead to incorrect conclusions. Conversely, analysing replicates independently can lead to difficulties in assessing global trends [34]. A potential solution to analyse flow cytometry data is to use mathematical or statistical models that structure relationships between snapshots, and between time points data [34–38].

In this work, we introduce an approximate Bayesian multilevel hierarchical modelling framework for quantifying biological heterogeneity in flow cytometry data in situations where the parameters of interest are related to a mathematical model of cell dynamics based on differential equation models. The class of models we consider here is more general than previous work on non-linear mixed-effects models [35,36], as we allow for all cells to interact with a shared dynamic environment. To demonstrate our approach, we consider simulation studies based on two recent studies that quantify biological heterogeneity in cell behaviour using pooled flow cytometry data [39,40]. We highlight the dramatic impact that variation between replicates can have on pooled sample analysis results and show how a hierarchical approach can account for this effect. In addition, our approach reduces to the pooled case in situations where variation between replicates is minimal. Our approach is therefore generally applicable for applications in which controlling variation across flow cytometry replicates is infeasible.

Materials and methods

We develop a general Bayesian hierarchical modelling framework for the analysis of heterogeneity in cell behaviour using flow cytometry data. This heterogeneity is variability that can be intra-experimental heterogeneity within a replicate, or inter-experimental heterogeneity between replicates. This inter-experimental heterogeneity can be variation between either technical replicates or experimental replicates. For clarity, we define a technical replicate to be an identically prepared replicate performed at the same time and the same person, and define an experimental replicate to be a replicate replicate performed on a different day, and/or by a different person (See Fig 2). For example, if an experiment was performed with a 12-well plate where two-wells were assigned to each of six time points then this would represent two technical replicates. However, if the entire 12-well plate experiment was performed again at a different lab or with a different instrument then this would be two experimental replicates, each with two technical replicates. For most experimental protocols, we expect technical replicates to be closely related. As a result, we focus our attention to experimental replicates in this manuscript. However, we show how this can be extended to account for both technical and experimental replicates in S1 Appendix.

Download:

Fig 2. A schematic representation of the hierarchical statistical framework.

Experimental replicates are indicated by different colours, with each experiment consisting of two identical technical replicates. Inter-experimental heterogeneity is captured through the hyper-parameters that characterise the distribution of the M replicate specific hyper-parameters, . Here, characterises the intra-experimental heterogeneity between cells with being the dynamic parameters for the ith cell within the jth replicate.

https://doi.org/10.1371/journal.pcbi.1014379.g002

Flow cytometry data

We use simulated flow cytometry data that is a realistic characterisation of real flow cytometry data. Specifically, we build our data simulation process based on recent studies that investigate heterogeneity in interactions between cells and nano-particles [29,30,40] and cell internalisation processes [39,41,42].

Nano-engineered particle-cell interaction data.

The flow cytometry data published by Faria et al., [29] and subsequently analysed for heterogeneity in particle-cell interaction by Murphy et al., [40] are for a human leukaemia monocytic suspension cell line, THP-1. Here, THP-1 cells are seeded in a 12-well plate and incubated with nano-particles (specifically, we focus on the 150 nm core-shell data) for 1, 2, 4, 8, 16, and 24 [hr]. After incubation, nano-particles that are not associated with cells are removed via washing. The nano-particle fluorescence across the cell population from each well plate is collected via flow cytometry. This results in snapshot data of associated nano-particle fluorescence for 20,000 cells for each incubation time point.

Dual-labelled probe internalisation data.

Browning et al., [39] published flow cytometry data for a human B cell lymphoblast cell line (C1R) and analysed the heterogeneity in endocytosis of anti-transferrin (anti-TFR) antibodies. The measurement technique is based upon programmable sensors using DNA quenching probes [43,44]. Antibodies are dual-labelled with fluorescent dyes: FIP-Cy5 and BODIPY FL. Of these two probes, FIP-Cy5, is quenchable, that is, the fluorescence is effectively disabled on the cell surface using a quencher dye. Here, C1R cells are incubated with dual-labelled antibodies with incubation times of 5, 10, 20, 30, 60, 120, and 180 [min]. After incubation, cells are washed and resuspended both with and without the quencher [44]. The fluorescent signals for both FIP-Cy5 and and BODIPY FL are measured for the cell populations in both quenched and unquenched samples via flow cytometry. This leads to snapshots of the jointly distributed FIP-Cy5 and BODIPY FL fluorescence for N = 1,000 cells over the incubation times and qenched/unquenched samples.

Mathematical modelling

We consider applications for which flow cytometry data is analysed using deterministic modelling based on ordinary differential equations (ODEs) as is commonplace in the mathematical and computational biology communities. The goal of such analysis is to quantify biological heterogeneity in parameters related to the dynamics of some cellular process. In this sense, each individual cell in the population is treated as having its own set of parameters and the goal is to find a suitable distribution for the parameters leading to cell dynamics matching the data distribution over time [39,40,45]. Here we describe the general ODE framework with random parameters used in the setting of pooled replicates, and then connect the framework to the particle-cell interaction model of Murphy et al., [40] and the cell internalisation model of Browning et al., [39]. These previous works focus exclusively on intra-experimental heterogeneity. We then show how to extend the analysis framework also account for inter-experimental heterogeneity between replicates and present a Bayesian multilevel hierarchical approach to quantify both intra- and inter-experimental heterogeneity together.

Cell population models with random parameters.

We consider a representative and heterogeneous population of N cells. Let be a vector of dynamic properties of interest related to the ith cell (e.g., the average number of nano-particles associated with cell, or concentrations of various biochemicals). This property vector evolves over time according to

(1)

for , where is a vector of parameters with values specific to the ith cell, is a matrix of the populations parameters, is a matrix representing the state of the cell population, and is a vector of shared environmental variables that interacts with cells in the population (e.g., a shared nutrient source). The functions and define the dynamics of cells and the environment, respectively. In this setting, cells only indirectly interact via the shared environment. Direct interaction between cells could be incorporated in our framework by extending the cell dynamics to be a function of instead of only. For t₀ representing the start of the initial incubation time, we have initial conditions for , and . Importantly, we do not assume that these initial conditions are known quantities, since such assumptions are known to cause problems in study reproducibility [46]. Instead, initial conditions are included as model parameters that we will estimate. This estimation need not be independent of the other parameters, but rather, initial conditions could be functionally dependent on the model parameters. For example, one may assume the environment or cell states may be assumed to be in equilibrium that is determined based on a function of rate parameters.

The measured fluorescence of each cell is subject to external noise and thus treated as a random variable, that is,

(2)

where are the recorded fluorescent intensities for the ith cell given the cell state , dynamics parameters , and observation process specific parameters, . The functional from of the observation process, , will typically be determined based on calibration data for flow cytometry. The dimension of will be the number of fluorescence channels and will generally not be the same as the dimension of . Thus a single cell population snapshot obtained at time t > t₀ using flow cytometry is given by . Typically, we do not observe the initial condition at t = t₀, and therefor it is treated as a latent variable to infer with the model parameters. Finally, for n observation times, , we represent the entire flow cytometry dataset as .

Since snapshot data as obtained through flow cytometry does not track individual cells, using such data to perform parameter inference on is not meaningful. Instead, one can consider to be independent identically distributed (i.i.d.) samples from a parametric distribution with probability density, , then perform parameter inference on the hyper-parameters , where is the hyper-parameter space. In a Bayesian setting, this leads to

where

This is the essence of the approach taken by Browning et al., [39] and Murphy et al., [40] and it relies on pooled snapshot data that requires the assumption of i.i.d. parameters across all cells and replicates. Our main contribution, that we present later, is an extension to this framework that only requires i.i.d. parameters within each experimental replicate.

Example 1: Particle-cell interaction model.

The particle-cell interaction model of Murphy et al., [40], describes the association of free nano-particles to a population of cells. Given N cells the number of particles associated with the ith cell at time t > t₀, , is governed by the system,

(3)

for , where and are, respectively, the association rate and carrying capacity of particles for the ith cell, c is the fractional cell surface coverage, s is the surface area of the cell boundary, v is the volume of the well-mixed media, and u(t) is the total free particle density with initial condition . In the context of our framework given by Eqs. (1) and (2), and (i.e., ), and (i.e., ).

The measured fluorescence, , for cell i is modelled by

where is the autofluorescence of a cell, drawn at random from the empirical distribution cell-only calibration dataset, and are individual particle fluorescences drawn at random from the empirical distribution particle-only calibration dataset. Here are parameters associated with normalising voltages that are used to obtain flow cytometry measurements (See Murphy et al., [40] for details). Since there are N = 20,000 cells, this leads to a computationally challenging system of ODEs as they are fully coupled through the environment (Eq. (3)). However, a tractable approximation with a semi-closed form solution can be obtained under the assumption that throughout the simulation time (See Murphy et al., [40] for details). This assumption is quite common in the field and represents a high nano-particle dose such that the cells are not expected to meaningfully deplete it. Importantly, it does not lead to a completely decoupled system free from any environmental coupling and changes in u(t) are still captured.

Example 2: Internalisation model.

Browning et al., [39] consider a model describing the internalisation of transferrin anti-bodies that accounts for receptor recycling. Due to the experimental protocol (See Dual-labelled probe internalisation data), the concentration of free transferrin antibodies can be considered sufficiently high that unbound surface receptors are assumed to bind instantaneously to a free antibody. Furthermore, the high free antibody concentration enables the internalisation dynamics of all N = 2,000 cells to be considered independent of each other since cells will never compete for free antibodies. For the ith cell at time t > t₀, Browning et al., [39] model the dynamics of the density of unbound internal receptors, , bound surface receptors, , internalised bound receptors, , and interalised free antibodies, , according to,

(4)

for . Here, the initial total density of receptors for the ith cell is given by and it is assumed no antibodies have initially been internalised with . Further, and are, respectively, the internalisation rate and recycling rate for the the ith cell. Finally, is the disassociation probability that captures the probability that an internalised antibody is absorbed into the cell versus being recycled back to the surface. Following Browning et al., [39], p is associated with a purely chemical process and considered constant across cells, whereas , and are assumed to be heterogeneous across cells . Relating this to our framework (Eqs. (1) and (2)), (i.e., ), and (i.e., ). In this model we do not have a dependent shared environment so . In this case an analytic solution to Eq. (4) can be obtained using the matrix exponential (See Browning et al., [39] for details).

The observation process is more complex in this model than the particle-cell interaction model. Here, the experimental protocol involves dual-labelled antibodies with one of the labels (FIP-Cy5) being “quenchable”, leading to a loss in FIP-Cy5 fluorescence for free or surface bound antibodies, and the other (BODIPY FL) being “unquenchable” with fluorescence unaffected by the quenching process. The N = 2,000 cells consist of two equal-sized technical replicates of size 1,000 with one replicate undergoing the quenching process before being processed through the flow cytometer. This leads to four measured fluorescence signal values for the ith pair of cells,

where (resp. ) are the quenchable FIP-Cy5 (resp. unquenchable BODIPY FL) fluorescence signals measured from the unquenched sample group, and (resp. ) are the quenchable FIP-Cy5 (resp. unquenchable BODIPY FL) fluorescence signals measured from the quenched sample group. The resulting observation process is

(5)

for , where and (resp. and ) are intensity and noise the fluorescence measurement for FIP-Cy5 (resp. BODIPY FL). Further, (resp. ) corresponds to the average autofluorescence of FIP-Cy5 (resp. BODIPY FL), and q is the quenching efficiency. While , and q are pre-estimated from data directly, the parameters must be inferred with the model parameters.

Statistical framework

In the previous sections, we describe the typical setting in which biological heterogeneity has been treated in the literature. That is, replicates are pooled and treated as a single snapshot population [29–31,39,40]. In this work, we propose an extension in which each replicate is treated as its own population with variation within replicates (i.e., intra-experimental heterogeneity) being distinguished from variation between replicates (i.e., inter-experimental heterogeneity). To simplify the exposition of our approach we present a two-level Bayesian hierarchical model, however, the approach can be extended to more levels as shown in the S1 Appendix.

Inter-experimental and intra-experimental heterogeneity.

Within the random ODE framework (Eqs. (1) and (2)), we consider M experimental replicates with the jth replicate analysing cells in total across identical technical replicates (Fig 2). The cell population dynamic model (Eq. (1)) becomes,

(6)

for , and . Here, is the state of the ith cell in the jth replicate having parameters . For the jth replicate we have the subpopulation state, , the shared environment, , and the subpopulation parameters .

Similarly, the observation process (Eq. (2)) becomes,

(7)

where is the fluorescence of the ith cell in the jth replicate. The snapshot data for the jth replicate at time t > t₀ is . We assume that the groups of M replicates are observed at the same n observations times , and denote for the series of snapshots nominally labelled as the jth replicate, as in reality there are replicates as the process of taking the snapshot typically destroys the sample. This leads to the complete flow cytometry dataset, including the set of snapshots in replicate subgroups, to be represented by .

Here, we assume that are i.i.d. from a parametric distribution with subgroup hyper-parameter, , representing the distribution parameters for the jth replicate, this subgroup hyper-parameter may also include the number of observed cells it this varies between replicates. Across all replicates, these hyper-parameters , are distributed (i.i.d.) according to another parametric distribution, where are the between group hyper-parameters (Fig 2).

Computational inference.

To quantify both inter-experimental and intra-experimental heterogeneity, characterised by and , respectively, we aim to infer the hyper-parameters, . That is,

(8)

where the joint prior that enforces the hierarchical structure of the model is given by

(9)

and the likelihood is

(10)

with . In Eq. (10) we integrate out the individual cell parameters, , within the likelihood function. This leads to a complex likelihood evaluation while reducing the dimension of the inference problem substantially to the space of hyper-parameters .

To maintain the lower dimensional inference problem from Equations (8)–(10) while avoiding the the direct likelihood calculation (10), we adopt an Approximate Bayesian computation (ABC) approach [47–49]. That is, we approximate Eq. (8) using the ABC posterior,

(11)

where is simulated data and is a distribution matching discrepancy metric based on the Anderson-Darling distance,

(12)

where is the real data snapshot for replicate j at time , and is the simulated data snapshot for replicate j at time . Here, the function is the Anderson-Darling distance,

(13)

where and are two sample sets and is the empirical cumulative distribution for . In this setting, we have that as . We use an adaptive ABC scheme based on sequential Monte Carlo methods of Drovandi and Pettitt [50] that refines through sequential importance resampling (S2 Appendix).

Results

To demonstrate the advantages of our approach, we perform simulations that represent in silico repeats of the experiments described in the Flow cytometry data Section based on published data [29,39,40]. The simulation setting enables experimental heterogeneity to be directly controlled and results to be compared to a known ground truth.

Nano-engineered particle-cell interaction experiments

Murphy et al., [40] consider an ODE model of particle-cell interactions (See Example 1: Particle-cell interaction model) with each cell, , having its own particle association rate, ,and carrying capacity, . In the pooled sample setting, they consider these parameters to be log-Normally distributed,

(14)

For ease of interpretation, Eq (14) is re-parameterised such that , , , and . For inference, this leads to a vector of hyper-parameters of interest .

Synthetic data scenarios.

Using the particle-cell interation model (See Example 1: Particle-cell interaction model) and the parameter distributions (Eq. (14)) for and , we generate in silico data based on the experimental data collected by Faria et al., [29]. We consider M = 3 experimental replicates each with N = 20,000 cells with simulated flow cytometry snapshots taken at n = 6 time points that are considered identically prepared technical replicates (t₁ = 1 [hr], t₂ = 2 [hr], t₃ = 4 [hr], t₄ = 8 [hr], t₅ = 16 [hr], and t₆ = 24 [hr]). Within this setting, we generate synthetic flow cytometry data for the following three scenarios:

Scenario 1: All M experimental replicates are i.i.d. with hyper-parameters , , , and .
Scenario 2: Cell parameters in experimental replicate 1 are generated with a larger mean association rate of . Experimental replicates 2 and 3 are identical to Scenario 1.
Scenario 3: Cell parameters in experimental replicate 1 are generated with a larger mean association rate and cell parameters in experimental replicate 2 are generated with a larger mean association rate of and a larger mean carrying capacity . Experimental replicate 3 is identical to Scenario 1.

In all three scenarios we use, c = 1, , , and . The specific values for these parameters are based on prior work of Murphy et al., [40].

We can consider Scenario 1 as the ideal experimental conditions for pooled sample analysis, whereas Scenarios 2 and 3 represent different ways the pooled assumptions may not hold. We note that the variation we have introduced to these parameters is fairly modest. Fig 3 provides an example of the synthetic flow cytometry data generated for Scenario 3. The larger mean association rate of replicate 1 (Fig 3(a)) leads to a subtle increase in the location of the peak fluorescence at early times, , compared with replicate 3 (Fig 3(c), the basis of Scenario 1), though at later times, t > 8 [hr], replicates 1 and 3 come into alignment. Replicate 2 generally is more diffuse than replicates 1 and 3 for all times, with the peak fluorescence approaching a higher steady state due to the higher mean carrying capacity (Fig 3(b)).

Download:

Fig 3. Synthetic flow cytometry population snapshot data example simulating nano-particle cell interaction data.

Each histogram represents the simulated fluorescence S [AU] (that is Arbitrary Units) distribution for 20,000 observed cells interacting with nano-particles at one of six observation times (t₁ = 1 [hr], t₂ = 2 [hr], t₃ = 4 [hr], t₄ = 8 [hr], t₅ = 16 [hr], and t₆ = 24 [hr]). Replicate 1 (row (a), blue) is generated with a mean particle association rate of and a mean carrying capacity of . Replicate 2 (row (b), orange) is generated with a mean particle association rate of and a mean carrying capacity of . Replicate 3 (row (c), green) is generated with a mean particle association rate of and a mean carrying capacity of . All replicates have the same standard deviation parameters for the association rate and carrying capacity of and , respectively. Here, Replicate 2 (row (b), orange) represents an experimental outlier with higher carrying capacity, resulting in a distinctly larger fluorescence distribution mode at later times than Replicates 1 (row (a), blue) and 3 (row (c), green). Replicate 3 (row (c), green) represents an experimental outlier with a lower association rate, resulting in heavier skewness to the right than Replicates 1 (row (a), blue) and 2 (row (b), orange).

https://doi.org/10.1371/journal.pcbi.1014379.g003

The effect of pooling on quantification of heterogeneity.

Before presenting details of the pooled and hierarchical analysis, we focus on estimated distributions of the individual cell parameters resulting from this analysis. In particular, we consider the heterogeneity in the particle association rate (Fig 4(a)) and carrying capacity (Fig 4(b)) that results from the analysis of synthetic data Scenario 3 (See Synthetic data scenarios and Fig 3). This scenario captures a reasonable experimental setting for which the pooled i.i.d. assumptions are invalid due to experimental heterogeneity that is a confounding factor for the biological heterogeneity.

Download:

Fig 4. Heterogeneity in posterior cell parameter distributions resulting from pooled and hierarchical analysis of the particle-cell interaction model.

Marginal distributions are shown for: (a) the particle association rate, r, and (b) the carrying capacity, K. Data Scenario 3 is used for this analysis (See Synthetic data scenarios and Fig 3). Here, Replicate 2 (orange) represents an experimental outlier with higher mean carrying capacity and Replicate 3 (green) represents an experimental outlier with a lower mean association rate. In this setting the pooled sample (black) does not adequately characterise the combined biological heterogeneity across experimental replicates.

https://doi.org/10.1371/journal.pcbi.1014379.g004

The pooled analysis adequately captures the variability in the cell parameter distributions with support that covers the bulk of each individual replicate distribution (Fig 4). However, the hierarchical analysis highlights that the actual biological heterogeneity is more complex with replicate 3 having a lower particle association rate (Fig 4(a)) and replicate 2 having larger carrying capacity (Fig 4(b)). These replicate specific distributions are not estimated independently, but rather, the hierarchical modelling structure allows for information sharing between replicates to provide quantification of experimental heterogeneity.

Pooled sample analysis.

We investigate the sensitivity of a pooled sample analysis to the i.i.d. assumptions. Following the pooled sample analysis protocol of Murphy et al., [40], we estimate the hyper-parameters, , for each of the three synthetic data scenarios. These hyper-parameters characterise the heterogeneity (Eq. (14)) in particle association rates and carrying capacities throughout the cell population, given the cell population dynamics under the model of Murphy et al., [40] (See Example 1: Particle-cell interaction model). The Bayesian analysis follows the approach described in the Statistical framework Section, using independent uniform priors, , , , and . These priors are consistent with the analysis of Murphy et al., [40], and cover a range of physically viable values.

The results, as shown in Fig 5, indicate that the estimated hyper-parameters are sensitive to the synthetic data scenario. Firstly, the analysis of Scenario 1 results in posterior distributions that accurately recover the true hyper-parameters of (Fig 5(a)), (Fig 5(b)), (Fig 5(c)), and (Fig 5(d)). This is expected, since Scenario 1 represents our well-specified setting in which each replicate contains identical populations of cells. While this simulation validates the use of the pooled approach when when experimental replicates are genuinely the same [39,40], the results for Scenarios 2 and 3 demonstrate substantial sensitivity of the posteriors given deviations in one of the replicates.

Download:

Fig 5. Pooled sample analysis of the nano-engineered particle-cell interaction experiment for the three synthetic data scenarios.

As described in Synthetic data scenarios, Scenario 1 (blue) is the i.i.d. case with no outliers, Scenario 2 (orange) has a single outlier replicate with a larger mean association rate, and Scenario 3 (green) has an additional outlier with a larger mean carrying capacity. Marginal posterior distributions for the hyper-parameters: (a) mean association rate, ; (b) mean carrying capacity, ; (c) association rate standard deviation, ; and (d) carrying capacity standard deviation, .

https://doi.org/10.1371/journal.pcbi.1014379.g005

In Scenario 2, only one replicate was modified to have a mean association rate of resulting in potential outlier in one of the replicates (Fig 3(a)) compared with the unmodified replicates (Fig 3(c)). This outlier replicate strongly affects the estimates for both the pooled associate rate distributions with inflated means (Fig 5(a)) and standard deviations (Fig 5(c)). While the results represent the best pooled sample inference given the outlier replicate, the results may not be meaningful for prediction in biological applications since the inferred heterogeneity does not represent any of the replicates accurately. The results are even more concerning for Scenario 3, where an addition replicate is modified to be an outlier association rate, , and in carrying capacity , since a shift in all four hyper-parameters occurs (Fig 5). This means that the heterogeneity estimates for the pooled population are not representative of any of the three replicate populations. These results motivate the adoption of our extended hierarchical model in experimental settings for which variation is expected between replicates.

Hierarchical analysis.

We now demonstrate the advantages of our hierarchical framework (See Statistical framework) as a method for capturing heterogeneity in flow cytometry data when variation between replicates is present. We consider the same synthetic data scenarios (See Synthetic data scenarios) and apply our hierarchical framework to the particle-cell interaction model (See Example 1: Particle-cell interaction model) of Murphy [40]. Specifically, we consider the setting where the ith cell in the jth replicate population has its own associated rate, , and carrying capacity, . Following the pooled sample setting, we assume these parameters are log-Normally distributed, however, we deviate from the pooled assumption by allowing each replicate population to have its own mean association rate, , and mean carrying capacity, ,

Note that the pooled sample model is recovered for the special case where and . We further assign a Normal distribution to the replicate population means,

where (resp. ) and (resp. ) are the mean and standard deviation of the replicate population association rate (resp. carrying capacity) means. Thus we infer intra-experimental heterogeneity for replicate population j through inference of the hyper-parameters and inter-experimental heterogeneity though the population level hyper-parameters, .

We perform Bayesian inference for the full joint posterior for the two levels of hyper-parameters (Eqs. (8) and (11)). For the population level parameters, we apply independent uniform priors for the means , , and weakly informative half Cauchy priors for the standard deviations and . Note the shape parameters for the half Cauchy priors are determined based on the guidelines of Gelman et al., [51] with regard to selection of weakly informative priors for standard deviation terms in Bayesian hierarchical models with small numbers of groups.

The results, shown in Fig 6, highlight the value of our hierarchical approach. In Scenario 1 we accurately identify replicates are i.i.d. with (Fig 6(a)–6(c)), and (Fig 6(e)–6(g)). Furthermore the outlier replicates in Scenario 2 (replicate 1 with larger ; Fig 6(a)) and Scenario 3 (replicate 1 with larger , and replicate 2 with larger ; Fig 6(a) and 6(f)), are directly captured with only minor inflation in the variance of the posterior distributions for the remaining replicate populations (Fig 6(b) and 6(c) for Scenario 2, and Fig 6(b), 6(c), 6(e) and 6(g) for Scenario 3).

Download:

Fig 6. Hierarchical analysis of the nano-engineered particle-cell interaction experiment for the three synthetic data scenarios.

As described in Synthetic data scenarios, Scenario 1 (blue) is the i.i.d. case with no outliers, Scenario 2 (orange) has a single outlier replicate with a larger mean association rate, and Scenario 3 (green) has an additional outlier with a larger mean carrying capacity. Marginal posterior distributions are shown for the replicate hyper-parameters: (a)–(c) mean association rates, for replicates j = 1,2,3; (d) association rate standard deviation, ; (e)–(g) mean carrying capacity, for replicates j = 1,2,3; and (h) carrying capacity standard deviation, . Marginal posterior distribution are shown for the population level hyper-parameters: (i) mean of association rate means, ; (j) mean of carrying capacity means, ; (k) standard deviation of association rate means, ; and (l) standard deviation of carrying capacity means, . Outlier mean association rates ((a), orange and green) and mean carrying capacity ((f), green) are identified, whereas information sharing lead to robust population level heterogeneity estimates (i)–(l).

https://doi.org/10.1371/journal.pcbi.1014379.g006

In addition to reliable characterisation of heterogeneity for each replicate population, we also obtain estimates of the inter-experimental heterogeneity that are insensitive to variation in a single replicate. This is evidenced by substantial overlap in the posterior probability densities around the true population level means (Fig 6(i) and 6(j)) and standard deviations (Fig 6(k) and 6(l)). In almost all cases, the true population level hyper-parameter is within the bulk density region of the posterior distribution. The exception, is the population standard deviation parameter for the carrying capacity mean in Scenario 3 (Fig 6(l)), however, the deviation in the posterior distributions for across scenarios is substantially less than that of the equivalent pooled sample analysis (Fig 5(d)). This robustness in population level inferences arises from the between-replicate information sharing inherent to the hierarchical model.

Dual-labelled probe internalisation experiments

To demonstrate the generality of or framework, we perform a similar simulation study based on the study of heterogeneity in cell internalisation processes by Browning et al., [39]. The internalisation process of a cell is modelled according an ODE system (See Example 2: Internalisation model) with each cell having its own internalisation rate, , recycling rate, , and initial receptor density . The heterogeneity in these parameters is modelled, in the pooled sample setting, according to

where , , and (resp. , , and ) are the mean, standard deviation, and skewness of the internalisation rate (resp. recycling rate). Here, , are the standard log-Normal parameters and the distribution of R_0,i is shifted such that (See Browning et al., [39] for further details). For inference, these distribution hyper-parameters are of interest along with the disassociation rate p, that is, . As per Eq. (5), parameters for to the observation process, , are also inferred, however, we do not focus on these here.