Identifiability of phenotypic adaptation from low-cell-count experiments and a stochastic model

Alexander P. Browning; Rebecca M. Crossley; Chiara Villa; Philip K. Maini; Adrianne L. Jenner; Tyler Cassidy; Sara Hamis

doi:10.1371/journal.pcbi.1013202

Abstract

Phenotypic plasticity contributes significantly to treatment failure in many cancers. Despite the increased prevalence of experimental studies that interrogate this phenomenon, there remains a lack of applicable quantitative tools to characterise data, and importantly to distinguish between resistance as a discrete phenotype and a continuous distribution of phenotypes. To address this, we develop a stochastic individual-based model of plastic phenotype adaptation through a continuously-structured phenotype space in low-cell-count proliferation assays. That our model corresponds probabilistically to common partial differential equation models of resistance allows us to formulate a likelihood that captures the intrinsic noise ubiquitous to such experiments. We apply our framework to assess the identifiability of key model parameters in several population-level data collection regimes; in particular, parameters relating to the adaptation velocity and cell-to-cell heterogeneity. Significantly, we find that cell-to-cell heterogeneity is practically non-identifiable from both cell count and proliferation marker data, implying that population-level behaviours may be well characterised by homogeneous ordinary differential equation models. Additionally, we demonstrate that population-level data are insufficient to distinguish resistance as a discrete phenotype from a continuous distribution of phenotypes. Our results inform the design of both future experiments and future quantitative analyses that probe phenotypic plasticity in cancer.

Author summary

Many cancers adaptively and reversibly develop resistance to treatment, adding complexity to predictive model development and, by extension, treatment design. While so-called drug challenge experiments are now commonly employed to interrogate phenotypic plasticity, there are very few quantitative tools available to interpret the biological data that arises. In particular, it remains unclear what is needed from drug challenge experiments in order to identify the phenotypic structure of a population that responds adaptively to treatment. In this work, we develop a new individual-level mathematical model of phenotypic plasticity in parallel with a structured model calibration process. Applying our framework to various existing and potential experimental designs reveals that experiments that yield only population-level data cannot distinguish between drug resistance as a distinct cell state, or drug resistance as a continuum of cell states. Consequentially, at the population-level, we demonstrate that common mathematical models that assume a set of distinct cell states can characterise the behaviour of cell populations that, in actuality, respond through a continuum of states. Importantly, our results shed light on both the mathematical models and experiments required to capture phenotypic plasticity in cancer.

Citation: Browning AP, Crossley RM, Villa C, Maini PK, Jenner AL, Cassidy T, et al. (2025) Identifiability of phenotypic adaptation from low-cell-count experiments and a stochastic model. PLoS Comput Biol 21(6): e1013202. https://doi.org/10.1371/journal.pcbi.1013202

Editor: Guillermo Lorenzo, University of A Coruna: Universidade da Coruna, SPAIN

Received: January 20, 2025; Accepted: June 4, 2025; Published: June 24, 2025

Copyright: © 2025 Browning et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Code used to produce the results is available at https://github.com/ap-browning/phenotypic_heterogeneity_ibm.

Funding: APB thanks the Mathematical Institute, University of Oxford, for a Hooke Research Fellowship. SH was funded by Wenner-Gren Stiftelserna/the Wenner-Gren Foundations (WGF2022-0044) and the Kjell och Märta Beijer Foundation. RMC would like to thank the Engineering and Physical Sciences Research Council (EP/T517811/1) and the Oxford-Wolfson-Marriott scholarship at Wolfson College, University of Oxford (SFF2122-OWM-1091340) for funding. ALJ thanks the London Mathematical Society. CV is a Fellow of the Paris Region Fellowship Programme. This work was partially supported by a Heilbronn Institute for Mathematical Research Small Maths Grant to TC. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

1. Introduction

Phenotypic plasticity is widely acknowledged as a significant factor in the eventual failure in the treatment of many cancers [1–4]. Such short-term phenotypic adaptation arises in isogenic populations through epigenetics such that cells quickly manifest a reversible drug-tolerant phenotype when exposed to sufficiently high doses of a therapeutic drug [5, 6]. Both experimental [6, 7] and theoretical [8, 9] studies have proposed adaptive therapy and the intermittent delivery of drug to overcome this phenomenon. Mathematical models, in particular, have been proposed to characterise this behaviour; interpret experimental studies of phenotypic adaptation; and to develop treatment schedules robust to resistance [10–16].

Complicating the characterisation of tumour-level plasticity within a given cancer are contrasting observations in the literature that resistance corresponds to a well-defined discrete phenotype [17–20] and to a continuous spectrum of phenotypes [21, 22]. Indeed, many mathematical models of resistance describe a heterogeneous population comprising cells that are either firmly drug-sensitive or drug-resistant [18, 23–25]. While mathematical models that capture continuous phenotype adaptation have been proposed [26–28] and are in fact well studied in the partial differential equation (PDE) literature [21, 29], they remain largely unvalidated with experimental data.

Despite an increased prevalence of experimental studies that interrogate adaptive plasticity, there remains a lack of quantitative tools to calibrate models of phenotypic plasticity to experimental data. Thus key questions—such as the data requisite to identify the mechanisms behind adaptive plasticity, and the ability to distinguish between resistance as a discrete phenotype and a continuous distribution of phenotypes —remain unanswered. Mathematically, the question of whether model parameters can be estimated from experimental data is broadly referred to as parameter identifiability [30]. More specifically, if distinct parameter sets always lead to distinct model outputs (i.e., the parameter to output map is bijective), a model is said to be structurally identifiable [31]. In many cases, however, models may be structurally but not practically identifiable: that is, model parameters cannot be accurately estimated from a finite amount of noisy experimental data. It is this more pragmatic question, which relates directly to the experimental design required to reliably estimate parameters of interest, that we are primarily concerned with in this paper. Thus, from hereon we use the term identifiability to refer to practical identifiability. Issues relating to the identifiability of mechanisms behind adaptive plasticity from these models are likely to be further exacerbated by other sources of cell-to-cell variability present in even isogenic cell populations [32], and by potential model misspecification.

We are motivated by a recent study of intermittent therapy of mutant melanoma cells by Kavran et al. [6], in which the authors provide genetic evidence for the presence of at least two (reversible) phenotype states: a drug-resistant phenotype and a drug-sensitive phenotype arising within a seven-day period of drug exposure and drug removal, respectively. From reported cell fold-change data from each phenotype, we have previously quantified a dose and phenotype dependent difference in net growth rate (Fig 1a); a characteristic of high interest in the context of the eventual development of treatments robust to adaptation [33]. Notably, Kavran et al. [6] provide compelling evidence for a continuous distribution of phenotypes present in the period between days 7 and 14 as the cells resensitise through observations of the cell-adhesion marker L1CAM (reproduced in Fig 1b), a protein well-known as a marker of the epithelial-to-mesenchymal transition and drug resistance in melanoma [34]. While sequence and protein data provide qualitative insight into the adaptive dynamics, their link to cell growth rate is unlikely to be direct. We must, therefore, resort to using cell count data arising from proliferation assay experiments (Fig 1c) to quantify adaptive dynamics and the corresponding, possibly heterogeneous, net cellular growth rate.

Download:

Fig 1. Experimental data of phenotype adaptation.

An example suite of experimental data of phenotype adaptation. (a) Kavran et al. [6] expose WM239A melanoma cells to either a continuous treatment, or an intermittent treatment, as shown in (e). Net growth rate of cells calculated for various drug dose levels [33]. Cells that have been exposed to drug in the 7 days prior to measurement are classified as drug-resistant; cells that have not as drug-sensitive. (b) Phenotype characterised experimentally by the expression of L1CAM, a marker for cell adhesion. Day 14 intermittent data (i.e., cells that have not been exposed to drug between days 7 and 14) show a similar profile to day 0. Reprinted from [6] with permission from the author. (c) Schematic of a cell proliferation assay; cells grow on the substrate of 9 mm wells, and a central region is imaged at various time points. (d) Example suite of cell proliferation assay data; experiments conducted with a low density of 3T3 Fibroblast cells (reprinted from [35] under a CC-BY license and further analysed in [36]). (e) Cells are subject to either continuous treatment, in which a drug concentration is maintained, or to intermittent treatment, in which treatment alternates between 7-day periods of drug exposure and drug removal. (f,g) Schematics of continuous and discrete models of phenotypic heterogeneity, respectively (see text for details).

https://doi.org/10.1371/journal.pcbi.1013202.g001

To capture the stochasticity intrinsic to low-cell count experiments such as proliferation assays, we develop an individual-based model (IBM) of drug-based adaptation [27]. We build on the IBM of Hamis et al. [33] in a stochastic differential equation (SDE) framework to present a model in a continuous phenotypic space where a population of cells tend reversibly toward either a drug-sensitive state (mathematically, denoted by x_i = 0 where x_i denotes the phenotype of cell i) or a drug-resistant state (denoted by x_i = 1). Changes in cellular phenotype are driven by two key mechanisms. First, deterministic drug-responsive movement described by a function of potentially unknown analytical form. Second, by a random diffusive process that induces cell-to-cell heterogeneity. For simplicity, all cells are otherwise statistically identical. The choice to work in an SDE framework means that our IBM corresponds precisely in a probabilistic sense to common PDE models of phenotypic adaptation [37].

Exploiting the tractability of the SDE and analogous PDE model, we build an inference framework that captures intrinsic noise in low-cell-count proliferation assay experiments without the pervasive, but often unjustified, assumption that experimental observations of cell count are subject to additive Gaussian noise. To do this, we derive and present a chemical master equation (CME) that describes the time-evolution of cell count, with which we construct a likelihood function [38]. We then perform inference and identifiability analysis under three data collection scenarios. First, we consider a suite of cell proliferation experiments for cells that are initially either resistant or sensitive and are exposed (or not) to a drug over a seven day period. Second, we consider a hypothetical scenario in which proliferation assays are observed continuously such that the time of cell-proliferation and cell-death events are directly observed. Lastly, we consider another hypothetical scenario in which a cell proliferation marker (i.e., L1CAM) correlates weakly, but linearly, with cell proliferation. All analysis is initially conducted in an idealised scenario where the functional form of the phenotype adaptation mechanism is correctly specified. We later relax this assumption and perform model selection.

Together, the data collection regimes we study establish the identifiability of individual model parameters and, more importantly, our ability to distinguish the phenotypic heterogeneity induced by random changes in phenotype from possible model misspecification. Given that the entrenched model of reversible phenotypic adaptation is that of two well-defined discrete phenotypes, we conclude our study by investigating whether such discrete heterogeneity can be distinguished, using cell count data alone, from a model comprising a continuous phenotypic space. As we are primarily interested in establishing the theoretical identifiability of model parameters and mechanisms, in the main text we focus our analysis on regimes where cell counts are extracted from images precisely; we relax this assumption in the supplementary material (S5 File) by investigating where identifiability is lost if only imprecise measurements can be made.

2. Mathematical methods

2.1. Individual-based model

We assume that individuals undergo a biased random walk in phenotype space, such that the phenotype of a cell i, denoted x_i, is given by

(1)

where indicates whether a drug is present or not present, respectively; describes the adaptation velocity; describes the magnitude of diffusive movement throughout the phenotype space, and W_i is a Wiener process. We further assume that, for , the system has a stable steady state at x_i = 0 for (this is referred to as the sensitive state), and likewise at x_i = 1 for (referred to as the resistant state).

We assume that the net cellular growth rate is phenotype-dependent, modelled as a linear function of x_i [33], parameterised as

(2)

as shown in Fig 1a. Provided that the growth rate is monotonic in x_i, the functional form of is arbitrary since we could, in theory, rescale the phenotypic space in Eq (1) and thus equivalently the functional form of v. Furthermore, we follow [33], and assume that corresponds solely to net death (apoptosis or necrosis), and corresponds solely to net proliferation. We further assume that both proliferation and death events occur according to a Poisson process. Upon death, a cell is removed from the population. Upon proliferation, a cell is replicated such that daughter cells are created with an (initially) identical phenotype index to the parent.

While we focus on analysis of synthetic data, we choose biologically realistic parameters based upon analysis on the emergence of reversible resistance to the BRAF-inhibitor LGX818 in -mutant melanoma cells [6, 33]. The growth rate parameters are chosen to be , , to approximately match the mean growth rate of sensitive and resistance cells under drug and no drug conditions (see dose-response curve in Fig 1a). Very little information is available regarding the adaptation dynamics through and diffusivity , other than the qualitative observation that cells move between drug-sensitive and drug-resistant states within a seven day window. We set

(3)

with such that x_i is an Ornstein-Uhlenbeck process. We revisit this assumption with a more general form in Sect 3.4. Finally, we set such that the stationary distribution of sensitive cells has a standard deviation of approximately 0.05. Implicit in our model is an assumption that the mechanisms behind drug-sensitisation and the reverse are identical. However, this need not be the case as we later exposit: it is sufficient to study identifiability in a single direction.

We set the initial condition in the model to a probabilistic representation of a spatially uniform low-cell count proliferation assay experiment; specifically, a cell proliferation assay conducted in a standard 9 mm well initialised with approximately 1000 cells (this is slightly larger than the initial population in [6]). The field-of-view of the imaged proliferation assay in Fig 1c–1d is , and so each cell has probability of presenting in the field-of-view. The initial condition is thus set to , corresponding to a mean initial cell count of approximately 7.9 per image.

In Fig 2a–2d, we simulate a set of synthetic cell proliferation assay experiments with our IBM under both continuous and intermittent treatments; the latter is defined as alternating 7-day periods of drug and no drug (Fig 1d). Results in Fig 2a, 2b highlight emergent isogenic heterogeneity due to white-noise driven fluctuations in the phenotype index. Results in Fig 2c, 2d show high levels of stochasticity in cell count. Since the simulations are discrete, there is a non-zero probability of extinction as our model does not, in its standard formulation, consider migration into and out of the cell proliferation assay field-of-view (Fig 1c).

Download:

Fig 2. Model comparison.

We compare realisations of the SDE-based IBM to the solution of both the corresponding Fokker-Planck PDE (Eq 4) and the CME (Eq 12) for the population size. (a–b) A single realisation of an IBM initiated with drug sensitive cells. The mean phenotype is zero in the equilibrium drug-sensitive state, and unity in the equilibrium drug-resistant state. Also shown are the timings of birth and death events. Treatment applied is (a) continuous; and (b) intermittent (Fig 1e). (c–d) Cell count observations from 10 realisations of the IBM (blue) under both (c) continuous and (d) intermittent treatment. Also shown is the expected population computed from a numerical solution of the PDE (black), and both a 50% and 95% credible region computed from a numerical solution of the CME (grey). A full comparison between the solution of the CME and the IBM is provided as supplementary material (S1 File). (e–f) Comparison between the phenotypic distribution computed empirically using an IBM initiated with 500 cells (coloured) and from the PDE (black dashed). Results in (e) show the phenotype distribution for both continuous and intermittent treatment for (in which both regimes are identical) and in (f) for intermittent treatment from .

https://doi.org/10.1371/journal.pcbi.1013202.g002

2.2. Partial differential equation model

We now define u(x,t) as the density of cells with phenotype x at time t, such that the dynamics of u(x,t) are governed by the Fokker-Planck equation

(4)

subject to the usual set of no-flux and vanishing far-field boundary conditions [27, 37, 39].

Experiments are initiated with a sample of cells from a zero-net-growth stationary distribution for either a sensitive or resistant population. For the Ornstein-Uhlenbeck formulation of , this corresponds approximately to

(5)

which we set as the initial condition in the model.

We denote by the expected cell count and by p(x,t) probability density function (PDF), given by

(6)

and

(7)

respectively.

In Fig 2e–2f, we compare a finite-difference approximation to the PDE to a set of realisations of the IBM initialised with a large (n₀ = 500) number of initial cells. We remind the reader that we expect a close match (that converges as ), as the PDE is an exact probabilistic representation the IBM.

2.3. Chemical master equation

We now derive an approximate master equation for the time-evolution of the probability mass function for the cell count, defined as

(8)

We consider that

(9)

Note that we can also include terms in the above that explicitly capture migration into and out of the field of view. Generally, however, we would expect these to vanish if we assume that the assay as a whole is sufficiently homogeneous such that migration out of the window occurs at the same rate as migration into the window (i.e., periodic boundary conditions).

To make progress, we assume that the phenotypic states of cells are independent. While not strictly true for very high proliferation and death rates (since cells inherit their phenotype from a parent), this is appropriate for the range of growth rates we observe (Fig 1a). Under these assumptions, the per-capita instantaneous proliferation and death rates are given by

(10)

and

(11)

respectively, where p(x,t) is governed by the PDE (Eqs 4 and 7).

For sufficiently small, we can consider a Taylor expansion of the exact Poisson probability to obtain an asymptotic expression for the event probabilities in Eq 9. These are given by

and

Substituting into the difference equation (Eq 9) and taking , we arrive at the CME

(12)

subject to absorbing boundaries such that for n<0.

In Fig 2c–2d, we compare the solution of the CME to realisations of the IBM, showing that the CME captures both the average and variance of the cell count. A more detailed comparison is provided in S1 File.

2.4. Likelihood-based inference

We take a Bayesian approach to parameter estimation and identifiability analysis and apply the CME (Eq 12) to construct a likelihood for cell count data reported from proliferation assays. The advantage of this approach, compared to a more standard approach that considers an average cell count subject to additive Gaussian noise, is that we account directly for the stochasticity intrinsic to the proliferation death process. As we are primarily interested in the identifiability of model parameters, we assume that all cell counts are exact. In the supplementary material (S5 File), we investigate identifiability in the case that experimental observations are potentially subject to miscounting.

Experiments are conducted for t days, at the conclusion of which a cell count observation is taken. We denote by a cell count taken from the kth replication of an experiment terminated at time t, conducted entirely with () or without () drug, using an initial population of sensitive (denoted ) or resistant (denoted ) cells, and denote by the complete set of data. Further denoting the solution to the CME with conditions and parameter values by , the log-likelihood is given by

(13)

Here, the summation is taken over all experimental conditions, all time points, and all experimental replicates. Note that we have assumed that cell count observations are independent between time points; effectively assuming that measurements are taken at the termination of an experiment and not as a time-series. While our approach could be trivially extended to account for time-series data, this would add significant computational cost by potentially requiring a numerical solution to the CME for each individual observation. While we focus our results on inference using cell count data, we also consider log-likelihood functions constructed for two other data types: event timing data (i.e., the exact time of proliferation or cell death observed from temporal data) and from a cell proliferation marker that may linearly correlate with the net growth rate.

Following the construction of the log-likelihood function, we can either take a frequentist approach and find the maximum likelihood estimate (MLE), or apply a Bayesian approach to quantify identifiability and parameter uncertainty. While unusual to consider both approaches, we do so in this work as the former is advantageous as it allows us to perform model selection using frequentist hypothesis tests [40].

For the latter, we assume that knowledge about model parameters is initially encoded in a prior distribution, . We choose to be independent uniform over a sufficiently wide range of parameter magnitudes (full details are given in S1 File). This choice also ensures that the maximum a posteriori estimate (MAP) corresponds to the MLE. Following a set of observations, denoted by , arising from cell proliferation assay measurements, or otherwise, we update our knowledge about the model parameters using the relevant likelihood denoted to obtain the posterior distribution, given by

(14)

When applying the Bayesian approach, we sample from the posterior using the adaptive scaling within adaptive Metropolis Markov-Chain Monte Carlo algorithm implemented by [41] in AdaptiveMCMC.jl with 10,000 iterations. To obtain MLEs we apply the DIRECT global search algorithm implemented in NLopt for Julia [42] to the likelihood function. Similarly, for MAPs we apply the same algorithm to the posterior density function. As we are primarily interested in parameter identifiability, which relates to whether the likelihood is flat in the vicinity of either the “true” or best fitting parameter values, for simplicity we initiated each chain using the “true” set of parameter values that are used to generate the synthetic data.

3. Results

3.1. Phenotypic heterogeneity is poorly identified from cell count data

We begin our analysis by considering a suite of synthetic cell proliferation assays conducted within a seven day period (specifically, a set of assays that terminate at t = 1, 3, 5, and ). For each termination time, we conduct a set of four experiments: with or without drug and initiated with either a population of fully sensitive or resistant cells. We devote two 96-well plates to each termination time, such that the sample size for each condition is M = 48. The duration is chosen based on the observation that the population adapts or resensitises within a seven day interval [6] (in S2 File, we consider a variety of termination time sets).

Applying the CME-based Bayesian inference procedure reveals that all growth rate parameters are practically identifiable. The results in Fig 3a–3b show how model predictions produced at the MAP align with synthetic cell count data observations. Furthermore, results in Fig 3c show that the adaptation speed parameter, , is identifiable. However, we see from results in Fig 3d that the diffusion parameter , which corresponds to the variance in the phenotype variable x within an adapting population, is only one-sided identifiable: we can establish an upper bound, but no lower bound. In the supplementary material, we show this to also be the case if temporally correlated cell count observations are made (S6 File). The parameter is, however, structurally identifiable: we show this in the supplementary material (S4 File) using a significantly larger (M = 768) data set, however the parameter becomes again non-identifiable when imprecise cell-count observations are made (S5 File). Thus, from cell count data alone, we expect that models with a phenotypic heterogeneity (i.e., models with a random component to phenotype changes) to be indistinguishable from models with deterministic adaptation (the scenario).

Download:

Fig 3. Proliferation assay inference.

We perform Bayesian inference on a set of synthetic cell proliferation assay data using the CME as a likelihood. Independent cell count observations (M = 48 replicates per condition) are collected from experiments conducted with fully sensitive or fully resistant cells, with and without drug, and terminated at . (a–b) Synthetic proliferation assay cell count data (box plots), the chemical master equation predicted median cell count at the MAP (solid lines), and the model predicted first and third quantiles (dashed semi-transparent lines). (c–d) Posterior distributions for the logarithms of v, the adaptation speed, and , the diffusivity. Shown also is the uniform prior (blue), the true value (black dashed), and the MAP (red dashed). While the adaptation speed is identifiable (as are all other parameters; see S2 File), the diffusivity is only one-sided identifiable; the model cannot be distinguished from that with purely deterministic adaptation (i.e., no heterogeneity).

https://doi.org/10.1371/journal.pcbi.1013202.g003

To investigate the identifiability of further, we recall that the phenotype distribution, p(x,t), affects overall cell count dynamics only indirectly. Specifically, cell proliferation and death is governed at the population-level by the overall proliferation and death rates, given by Eqs 10 and 11. For , p(x,t) tends to a degenerate distribution such that , where is the mean phenotype (in the case of homogeneity, the only phenotype). Following from our assumption that a positive net growth rate corresponds solely to proliferation, the most obvious consequence of the parameter regime is that proliferation and death cannot occur simultaneously: thus, we expect a sharp proliferation-death transition at as the population switches between death and proliferation events, depending on the presence of drug and the mean phenotype. In contrast, the transition at will be diffuse for non-zero . In Fig 4, we compare the event rates for various values of . Clearly, aside from minor differences at the proliferation-death transition, rate curves are visually indistinguishable for decreasing values of . For large , which has very little or no posterior mass (see Fig 3d), the proliferation rate curve becomes distinguishable.

Download:

Fig 4. Practical non-identifiability from cell count data.

Practical non-identifiability of the diffusion parameter (corresponding to a measure of the heterogeneity), seen through differences in the total expected proliferation and death rate functions, and , respectively. All other parameters are fixed at their true values. Clear differences are seen in the proliferation rate between and a proliferation rate constructed where ; we have seen previously that is one-sided identifiable. However, reducing shows (visually) very minor differences between the proliferation and death rate functions as both tend to the deterministic limit (in this case, corresponding to exponential decay from the negative maximum death rate through to the maximum proliferation rate). In the absence of heterogeneity (i.e., for ), proliferation and death events cannot occur simultaneously in a population.

https://doi.org/10.1371/journal.pcbi.1013202.g004

3.2. Phenotype heterogeneity is identifiable from event-timing data

Under the current model formulation, in which heterogeneity is driven solely by diffusion through the phenotypic space, it is only in the regime where that we will ever see proliferation and death events occur simultaneously. Thus, in the constraints of our model formulation, we expect to be able to more precisely identify if we observe the precise timings of cellular proliferation and death events from, for example, live cell imaging.

We therefore investigate a hypothetical scenario where we have access to noise-free event timings from a set of proliferation assays that are initiated with a total of 10,000 cells. Without loss of generality, for the rest of the study we focus only on adaptation in the forward direction (i.e., from drug-sensitive to drug-resistant), since an analogous analysis could be conducted in the reverse direction. A log-likelihood function can be constructed by discretising the resultant Poisson process such that the number of proliferation and death events occurring in the interval , denoted and , respectively, are distributed according to

(15)

where N(t) is the (observed) cell population at time t. Under the well-mixed phenotype assumption for which the CME applies, Eq 15 is exact as . We choose such that the seven-day experiment is subdivided into 200 observation intervals (as a consequence, one could also consider event-timing data that is not exact, but accurate to intervals of width that correspond to a finite imaging frequency). As the intervals are non-overlapping, the observed number of birth and death events within each interval are statistically independent, and the log-likelihood is given simply through the probability mass function for the Poisson distribution in each interval.

The synthetic data set is shown in Fig 5a, along with an estimate for the instantaneous event rate constructed using a moving average. Visually, heterogeneity can be detected through the transition from primarily cell proliferation to primarily cell death. That is, for the homogeneous population of cells will exclusively either proliferate or die, but not both. We proceed to perform inference on this synthetic data set using the Poisson likelihood, with the posterior shown for in Fig 5b (all other relevant parameters remain identifiable). Clearly, heterogeneity is now identifiable; estimates of can be drawn precisely.

Download:

Fig 5. Identifiability of heterogeneity from event timing data.

We generate a synthetic data set from an experiment (or set of experiments) that are initiated with a total of 10,000 cells that are under continuous treatment. The exact event timings (i.e., time of cell proliferation, and time of cell death) are recorded and used for inference. (a) Synthetic event timing data. Shown is a rug plot of a sample of 500 each of proliferation and death events, and a local regression (LOESS) of the observed proliferation and death rate. (b) Posterior distribution for , previously non-identifiable, constructed using a Poisson likelihood for the exact timing data. Shown also is the uniform prior (blue), the true value (black dashed), and the MAP (red dashed).

https://doi.org/10.1371/journal.pcbi.1013202.g005

3.3. Phenotype heterogeneity is not identifiable from proliferation marker data

Our study is in part motivated by Kavran et al. [6] who provide compelling evidence for a continuous transition from a sensitive to resistant state through the cell-adhesion marker L1CAM. Such data are difficult to interpret directly due to uncertainty in the precise link between the net growth rate and the expected marker expression and the resultant flow cytometry measurement. Challenges aside, we now consider identifiability of in the case that the measured marker expression correlates linearly with the proliferation rate (and effectively, since the link between the net growth rate and phenotype index is also linear, the phenotype index).

We assume that the observed marker expression for cell i, denoted by M_i, is given by

(16)

where is independent of x_i. We consider both that is normally distributed with zero mean and unknown standard deviation , and a scenario where the shape of is additionally unknown such that is given by a translated Gamma distribution with zero mean, unknown standard deviation, and unknown skewness (this distribution becomes normal as ) [43]. By convoluting the distribution of with that for x, we can construct an exact log-likelihood for a set of marker data. In Fig 6a, we show the resultant (weak) linear correlation between phenotype index and marker measurement.

Download:

Fig 6. Identifiability of heterogeneity from noisy marker data.

We generate a synthetic data set comprising noisy measurements of the phenotype state of each cell using a hypothetical marker for cell proliferation (i.e., L1CAM). (a) Measurements are normalised such that the mean of fully sensitive cells is approximately zero, and that of fully resistant cells (which arise in the limit as ) is approximately unity. The marker is assumed to weakly linearly correlate with growth rate (and hence, the phenotype index); we model this by a measurement noise process that is normally distributed with variance . (b) We perform Bayesian inference on a dataset generated from cell proliferation assays with fully sensitive cells, exposed to drug continuously, with independent measurements taken at (M = 48 replicates per measurement time). All other parameters, identifiable from cell count data, are fixed, and and are estimated, with the joint posterior (grey discs) shown alongside the true value (blue diamond). (c) We repeat the analysis in the case that the shape (skewness, quantified by ) of the measurement noise distribution is additionally unknown.

https://doi.org/10.1371/journal.pcbi.1013202.g006

We fix all other mechanistic parameters, which we previously established to be identifiable from cell count data, at the corresponding true values. We then consider a synthetic data set in which marker data is taken from a set of proliferation assays terminated at . Results in Fig 6b show samples from the joint posterior distribution for and in the case that . In both the case where the marker error shape is known () and unknown, we are unable to place a lower bound on . Furthermore, the shape of the posterior in Fig 6b indicates that, even if we had knowledge of , wouldremain only one-sided identifiable. We conclude that, from a marker that does not correlate perfectly with growth rate, heterogeneity in the proliferation rate is indistinguishable from marker noise.

3.4. Model selection and misspecification for cell-count data

We have made two significant observations thus far: first, that the regime is indistinguishable from the regime from cell count data; and second, that all other model parameters are identifiable given a correctly specified model. As a consequence of the first observation, we perform all remaining analysis using what we term the “homogeneous continuous model”: an ordinary differential equation (ODE) model given by Eq 1 with . Therefore, all cells carry the same phenotype, denoted now by x(t). Our goal now is assess whether we can not only identify model parameters, but also the functional form of the adaptation velocity (without loss of generality in the case that such that ).

We consider a relatively general functional form for u(x), given by

(17)

where is the sign function. This form of u(x) allows choices of increasing complexity to be recovered by setting parameters to zero. As before, we consider a set of synthetic cell proliferation assays conducted with drug sensitive cells under continuous treatment and terminated at (M = 48 per condition). The true model (Eq 3 is recovered by setting a = c = d = 0. We can recover a variety of velocity models using the functional form given by Eq 17, including for b = c = d = 0 the constant adaptation presented in our previous work [33]. As the growth rate parameters for the drug-on experiment, and , were found to be identifiable (and can be established by conducting drug-off and drug-on experiments with sensitive and resistant cells, respectively) we fix each to their corresponding true value.

We perform model selection using the frequentist likelihood ratio test (equivalent to profile likelihood). For example, to test whether a = 0, we compare the likelihood at the MLE (equivalently, the MAP) where we fix a = 0 to that for the model where all parameters in Eq 17 are non-zero, denoted by . Fig 7a shows the resultant set of log-likelihoods, translated such that . From the likelihood ratio test [40], we can construct a threshold based on a 95% confidence interval outside of which we reject a null hypothesis that the parameter set, i.e., [a] is equal to zero.

Download:

Fig 7. Model selection and misspecification.

We perform inference and model selection on a general adaptation velocity function of form given by Eq 17. The true model corresponds to b = 0.4 and a = c = d = 0 (i.e., the combination ). (a) Results from a likelihood ratio test where the null hypothesis in each column is that the stated parameter combination is zero. Relative log-likelihood values below the relevant threshold (colours correspond to different dimensionalities) indicate that the null hypothesis can be rejected at the level of a 95% confidence interval. Arrows indicate that observed statistics are below the plotted region. (b,c) Identified possible adaptation velocities and phenotype transitions respectively.

https://doi.org/10.1371/journal.pcbi.1013202.g007

Results in Fig 7a show that any individual parameter can be set to zero. Furthermore, any pair of parameters can be set to zero except a and b simultaneously. Finally, only the parameter triples that do not contain both a and b can be set to zero. If the goal was to identify a single model, one would use an information criterion [40] (or similar) to penalise differences in log-likelihood by the dimensionality of the non-zero parameter set; in our case, we expect a model where only one of a or b is non-zero as the most parsimonious.

Our analysis has identified a family of possible adaptation velocity functions, given by the MLE for each combination for which the relative log-likelihood in Fig 7a is above the corresponding threshold. In Fig 7b we compare the identified adaptation velocities for the true model (b non-zero) to the full model (no non-zero parameters) and a model where only a is non-zero. Clearly, there remains large uncertainty as to the functional form of u(x) throughout the phenotype space. Results in Fig 7c, however, demonstrate why these differences do not manifest in statistically different cell count observations: while u(x) varies significantly, the possible paths for x(t) are similar.

3.5. Continuous and discrete-binary heterogeneity may be indistinguishable

Arguably the standard model of plasticity describes a drug-dependent switch between two discrete phenotypes: sensitive and resistant. Such an analogue of our model is

(18)

where sensitive cells, X₀, have net growth rate dependent on the drug concentration d, and resistant cells, X₁, have net growth rate (Fig 1g). We assume that r₁₀(d) and r₀₁(d) are also drug-dependent.

As Eq 18 is linear, the mean cell count in each subpopulation, denoted by n₀(t) and n₁(t), is given by

(19)

To draw a correspondence to the continuous model, we consider now the mean , which we expect to correspond with x(t) in the continuous model (although not exactly, as in general at equilibrium ). The dynamics of are governed by

(20)

Thus, we expect the average cell count in the discrete model to correspond exactly to the average cell count in a continuous model with a quadratic and drug-dependent adaptation velocity. We cannot make an equivalent statement for higher order moments, however we can define an exact CME for the evolution of the joint density and hence the probability mass in the discrete model (S3 File).

For a given set of discrete model parameters, we compute a rescaled velocity function and set of continuous model net growth rates such that both models have equivalent initial and fully adapted net growth rates. In Fig 8a, we demonstrate under continuous application of the drug that the mean cell counts are identical between models. Therefore, from average cell count data, and by extension large-cell-count proliferation assays, we cannot distinguish a discrete model from a continuous model with quadratic adaptation velocity. Results in Fig 8b–8d demonstrate (subtle) differences in higher-order moments and the mass function for each model. We conclude, therefore, that within our modelling framework it may be possible to distinguish between the discrete and continuous models using higher order moments in low-cell-count proliferation assays; however this is unlikely to be the case if only imprecise cell count observations are available. Provided that the adaptation velocity is drug-dependent (i.e., cells sensitise at a rate different to that at which they develop resistance), these findings also apply for so-called intermittent treatment [33, 44]. In Fig 8e–8g, we demonstrate that this equivalence between the discrete and continuous model holds for a variety of different treatment schedules.

Download:

Fig 8. Continuous and discrete-binary models are only distinguishable from higher order moments.

We compare the solutions of the CME for a discrete-binary model and a continuous model with quadratic adaptation velocity under (a–d) continuous application of a drug, and (e–g) intermittent application of a drug. In (e–f), blue regions indicate time periods during which the drug is present, white regions indicate time periods when the drug is removed. (a,e,f,g) Exact correspondence in the mean cell count for each model; the regimes are non-identifiable. (b–d) Solution to the complete CME under continuous drug treatment at various time points, showing higher-order differences between the models. Discrete model parameters are give by r₀₁(1) = 1, r₁₀(1) = 0.01, , when the drug is applied (i.e., d = 1) and r₀₁(0) = 0.02, r₁₀(0) = 0.5, , when the drug is removed (i.e., d = 0).

https://doi.org/10.1371/journal.pcbi.1013202.g008

4. Discussion and conclusion

Phenotypic plasticity and the rapid adaptation of cells upon the application of treatment are widely recognised as a significant factor in the failure of many anti-cancer treatments [45]. Complicating a comprehensive characterisation of phenotypic plasticity is a lack of consensus as to whether adaptation occurs between a set of well-defined discrete cell states or across a continuous spectrum of phenotypes. While both hypotheses are associated with mature subsets of the mathematical modelling literature, there remains—particularly for the latter—a dearth of statistical methodology to parameterise such models. Indeed, key questions relating to the identifiability of adaptation mechanisms and the within-population heterogeneity arising out of random diffusive phenotypic changes, the ability of practitioners to distinguish between discrete and continuous adaptation, and the experimental design requirements to parameterise models, remain unanswered.

Our most significant result is that we are unable to detect heterogeneity arising from random phenotypic changes from population-level (i.e., cell count or proliferation marker) data. While we find that the relevant model parameter, , is theoretically identifiable given a sufficiently large number of experimental observations, this identifiability is lost for imprecise cell-counts. Indeed, the difficulty in distinguishing between the functional form of the adaptation velocity (Fig 7), combined with the narrow time window in which the proliferation and death rates are distinguishable (Fig 4) suggests that heterogeneity may be indistinguishable from misspecification of other model terms. It is only if cell-level information (i.e., timings of proliferation and death events in the population) are available that we are, in theory, able to establish heterogeneity; although, the timescale of adaptation compared to the cell doubling time (less than compared to 2– for melanoma [6]) may arise as a practical limitation if individual cells are not observed to proliferate sufficiently many times during the adaptation phase.

A consequence of the non-identifiability of the parameter is that we cannot distinguish between a heterogeneous and a homogeneous model of continuous adaptation. Mathematically, this offers a practical benefit as it implies that population-level behaviours are well characterised by a simple, and in many cases analytically tractable, ODE model. All other model parameters: the on- and off-drug growth rates in each fully adapted state, and the adaptation velocity, are practically identifiable. In the supplementary material (S2 File), we explore a number of experimental designs in which various combinations of termination times are considered for a fixed total number of proliferation assays. Even if all experiments are terminated after , all relevant parameters remain identifiable; albeit estimates are drawn with reduced precision. The indistinguishability of the heterogeneous and homogeneous continuous transition model motivates us to explore the model selection question using an ODE-based homogeneous model. We are unable to distinguish the functionally correct adaptation velocity, however we do identify a class of models that manifest similar trajectories through phenotype space (Fig 7c).

The theoretical identifiability of heterogeneity from event-timing data using our stochastic formulation highlights two potential (and rarely considered) sources of potential misspecification in our model. First, that proliferation and death events are mutually exclusive: often it is only in a stochastic modelling framework that the two can be distinguished [46]. A more realistic (and correspondingly, further parameterised) model would consider individual and phenotype-dependent proliferation and death rates. Depending on the action of the drug and the metabolic cost of resistance, it may be appropriate for one of these rates to be phenotype-independent. For experiments initiated with a large number of cells, where cell competition may play a role, it may also be appropriate to consider a stochastic analogue of logistic, rather than exponential, growth. A second source of misspecification, the effects of which are, to the best of our knowledge, largely unknown in the context of population-level behaviour in IBMs, is that proliferation occurs according to a Markov process. Clearly, this is a strong assumption that, while routine in the mathematical literature, may be inappropriate. Given that adaptation occurs on a similar timescale as proliferation and that the proliferation rate varies, future application of established alternative models, for example those based on Erlang distributions [47], is not straightforward. Furthermore, any move away from a Markovian formulation in the IBM would render intractable the ODE, PDE, and CME formulations that we rely upon for inference.

Following the vast majority of the PDE literature, the heterogeneity in our model only manifests through random diffusive changes in phenotype [27]. Consequentially, all cells are statistically identical, and the phenotypic state of each cell is constantly evolving, even within a system that appears static at the population-level. It is only this formulation of heterogeneity that we find to be non-identifiable. A potentially more realistic model of heterogeneity is one that also considers inherent heterogeneity between individual cells; for example, variation in the growth rates of cells that are otherwise fully adapted (i.e., variation across cells where x_i = 1), or variation in the rate of adaptation for each cell. The question of identifiability of these population-level distributions from population-level statistics, such as cell count, remains open, although there is a fast-growing set of statistical tools that could be adapted to answer these questions [43, 48]. Given the difficulty faced within our framework identifying variability in the instantaneous proliferation rate, we hypothesise that other sources of heterogeneity are likely to be non-identifiable (or at least, indistinguishable from measurement error or model misspecification) from population-level data.

Our final result is to demonstrate that the discrete-binary model of heterogeneous phenotypic adaptation is indistinguishable at the population level from an appropriately formulated continuous model. Our results do show very minor differences in high-order behaviours (cell count variance), although we still expect both models to remain indistinguishable upon consideration of potential model misspecification and measurement noise. We do expect, however, the binary strategy to be distinguishable from marker data that correlate well with proliferation: the distribution of net growth rates in the binary model will always be bimodal throughout the adaptation phase, contrasting with the continuous transition we see both experimentally (Fig 1b), and in our model (Fig 6a). The equivalence we derive in Sect 3.5 also suggests at a hybrid discrete-continuous model that could be studied in future. Namely, a model in which drug-sensitive cells switch to an intermediate transition state with some drug-dependent propensity, in which the phenotypic state varies continuously until the cell reaches the drug-resistant state.

From a practical perspective, our work provides a statistical framework sufficient to characterise a population-level continuous phenotypic transition in response to a drug within relatively simple experiments. Key model parameters relating to the net growth rates and adaptation velocity were identifiable from an experimental design involving only eight standard 96-well plates (as few as two if images are taken as a time-series). To distinguish between a continuous phenotypic transition and the binary model of disparate sensitive and resistant phenotypes, or to establish the role of random diffusive phenotype changes (i.e., heterogeneity in the continuous framework), single-cell data are required. For example, marker data such as L1CAM (Fig 1b) are likely sufficient to distinguish between the binary and continuous models. In the continuous framework, however, heterogeneity can likely not be distinguished from either misspecification or marker noise. More sophisticated experiments, potentially based on microfluidics [49], may be necessary to accurately quantify heterogeneity in proliferation.

We establish the identifiability of reversible phenotype driven by both directed and random changes from commonly reported low-cell-count proliferation assay experiments. To achieve this, we develop a computationally efficient inference framework that captures potential information arising as intrinsic noise, without resorting to the study of a mean-field model subject to an additive Gaussian measurement process. That we find heterogeneity non-identifiable is significant to the mathematical modelling community, and implies that population-level behaviours (including, importantly, the response of systems to drugs and the design of adaptive therapies) are well characterised by homogeneous ODE models. For the experimental community, our methodology can be used to design and characterise experiments that probe continuous phenotypic adaptation in cancer.

Supporting information

S1 File. IBM/CME comparison.

https://doi.org/10.1371/journal.pcbi.1013202.s001

(PDF)

S2 File. MCMC priors and results.

https://doi.org/10.1371/journal.pcbi.1013202.s002

(PDF)

S3 File. CME for discrete model.

https://doi.org/10.1371/journal.pcbi.1013202.s003

(PDF)

S4 File. Structural identifiability of heterogeneity.

https://doi.org/10.1371/journal.pcbi.1013202.s004

(PDF)

S5 File. Large data set inference with noisy data.

https://doi.org/10.1371/journal.pcbi.1013202.s005

(PDF)

S6 File. Inference with correlated data.

https://doi.org/10.1371/journal.pcbi.1013202.s006

(PDF)

Acknowledgments

The authors thank Adriana Zanca for helpful discussions.

References

1. McGranahan N, Swanton C. Clonal heterogeneity and tumor evolution: past, present, and the future. Cell. 2017;168(4):613–28. pmid:28187284
- View Article
- PubMed/NCBI
- Google Scholar
2. Arozarena I, Wellbrock C. Phenotype plasticity as enabler of melanoma progression and therapy resistance. Nat Rev Cancer. 2019;19(7):377–91.
- View Article
- Google Scholar
3. Boumahdi S, de Sauvage FJ. The great escape: tumour cell plasticity in resistance to targeted therapy. Nat Rev Drug Discov. 2020;19(1):39–56. pmid:31601994
- View Article
- PubMed/NCBI
- Google Scholar
4. Labrie M, Brugge JS, Mills GB, Zervantonakis IK. Therapy resistance: opportunities created by adaptive responses to targeted therapies in cancer. Nat Rev Cancer. 2022;22(6):323–39. pmid:35264777
- View Article
- PubMed/NCBI
- Google Scholar
5. Shaffer SM, Dunagin MC, Torborg SR, Torre EA, Emert B, Krepler C, et al. Rare cell variability and drug-induced reprogramming as a mode of cancer drug resistance. Nature. 2017;546(7658):431–5. pmid:28607484
- View Article
- PubMed/NCBI
- Google Scholar
6. Kavran AJ, Stuart SA, Hayashi KR, Basken JM, Brandhuber BJ, Ahn NG. Intermittent treatment of BRAFV600E melanoma cells delays resistance by adaptive resensitization to drug rechallenge. Proc Natl Acad Sci U S A. 2022;119(12):e2113535119. pmid:35290123
- View Article
- PubMed/NCBI
- Google Scholar
7. Sharma SV, Lee DY, Li B, Quinlan MP, Takahashi F, Maheswaran S, et al. A chromatin-mediated reversible drug-tolerant state in cancer cell subpopulations. Cell. 2010;141(1):69–80. pmid:20371346
- View Article
- PubMed/NCBI
- Google Scholar
8. Altrock PM, Liu LL, Michor F. The mathematics of cancer: integrating quantitative models. Nat Rev Cancer. 2015;15(12):730–45. pmid:26597528
- View Article
- PubMed/NCBI
- Google Scholar
9. Rockne RC, Hawkins-Daarud A, Swanson KR, Sluka JP, Glazier JA, Macklin P, et al. The 2019 mathematical oncology roadmap. Phys Biol. 2019;16(4):041005. pmid:30991381
- View Article
- PubMed/NCBI
- Google Scholar
10. Anderson ARA, Weaver AM, Cummings PT, Quaranta V. Tumor morphology and phenotypic evolution driven by selective pressure from the microenvironment. Cell. 2006;127(5):905–15. pmid:17129778
- View Article
- PubMed/NCBI
- Google Scholar
11. Clairambault J, Pouchol C. A survey of adaptive cell population dynamics models of emergence of drug resistance in cancer, and open questions about evolution and cancer. BIOMATH. 2019;8(1).
- View Article
- Google Scholar
12. Marusyk A, Janiszewska M, Polyak K. Intratumor heterogeneity: the rosetta stone of therapy resistance. Cancer Cell. 2020;37(4):471–84. pmid:32289271
- View Article
- PubMed/NCBI
- Google Scholar
13. Gunnarsson EB, De S, Leder K, Foo J. Understanding the role of phenotypic switching in cancer drug resistance. J Theor Biol. 2020;490:110162. pmid:31953135
- View Article
- PubMed/NCBI
- Google Scholar
14. Stace REA, Stiehl T, Chaplain MAJ, Marciniak-Czochra A, Lorenzi T. Discrete and continuum phenotype-structured models for the evolution of cancer cell populations under chemotherapy. Math Model Nat Phenom. 2020;15:14.
- View Article
- Google Scholar
15. Cassidy T, Nichol D, Robertson-Tessi M, Craig M, Anderson ARA. The role of memory in non-genetic inheritance and its impact on cancer treatment resistance. PLoS Comput Biol. 2021;17(8):e1009348. pmid:34460809
- View Article
- PubMed/NCBI
- Google Scholar
16. Ardévol Martínez V, Salvioli M, Laleh NG, Thuijsman F, Brown JS, Cavill R, et al. Improving mathematical models of cancer through game-theoretic modelling: a study in non-small cell lung cancer. Cold Spring Harbor Laboratory. 2021. https://doi.org/10.1101/2021.10.29.466444
17. West J, Ma Y, Newton PK. Capitalizing on competition: an evolutionary model of competitive release in metastatic castration resistant prostate cancer treatment. J Theor Biol. 2018;455:249–60. pmid:30048718
- View Article
- PubMed/NCBI
- Google Scholar
18. Craig M, Kaveh K, Woosley A, Brown AS, Goldman D, Eton E, et al. Cooperative adaptation to therapy (CAT) confers resistance in heterogeneous non-small cell lung cancer. PLoS Comput Biol. 2019;15(8):e1007278. pmid:31449515
- View Article
- PubMed/NCBI
- Google Scholar
19. Kaznatcheev A, Peacock J, Basanta D, Marusyk A, Scott JG. Fibroblasts and alectinib switch the evolutionary games played by non-small cell lung cancer. Nat Ecol Evol. 2019;3(3):450–6. pmid:30778184
- View Article
- PubMed/NCBI
- Google Scholar
20. Kareva I. Different costs of therapeutic resistance in cancer: short- and long-term impact of population heterogeneity. Math Biosci. 2022;352:108891. pmid:35998834
- View Article
- PubMed/NCBI
- Google Scholar
21. Lorz A, Lorenzi T, Hochberg ME, Clairambault J, Perthame B. Populational adaptive evolution, chemotherapeutic resistance and multiple anti-cancer therapies. ESAIM: M2AN. 2013;47(2):377–99.
- View Article
- Google Scholar
22. França GS, Baron M, King BR, Bossowski JP, Bjornberg A, Pour M, et al. Cellular adaptation to cancer therapy along a resistance continuum. Nature. 2024;631(8022):876–83. pmid:38987605
- View Article
- PubMed/NCBI
- Google Scholar
23. Smalley I, Kim E, Li J, Spence P, Wyatt CJ, Eroglu Z, et al. Leveraging transcriptional dynamics to improve BRAF inhibitor responses in melanoma. EBioMedicine. 2019;48:178–90. pmid:31594749
- View Article
- PubMed/NCBI
- Google Scholar
24. Cassidy T, Craig M. Determinants of combination GM-CSF immunotherapy and oncolytic virotherapy success identified through in silico treatment personalization. PLoS Comput Biol. 2019;15(11):e1007495. pmid:31774808
- View Article
- PubMed/NCBI
- Google Scholar
25. Strobl MAR, West J, Viossat Y, Damaghi M, Robertson-Tessi M, Brown JS, et al. Turnover modulates the need for a cost of resistance in adaptive therapy. Cancer Res. 2021;81(4):1135–47. pmid:33172930
- View Article
- PubMed/NCBI
- Google Scholar
26. Dieckmann U, Law R. The dynamical theory of coevolution: a derivation from stochastic ecological processes. J Math Biol. 1996;34(5–6):579–612. pmid:8691086
- View Article
- PubMed/NCBI
- Google Scholar
27. Chisholm RH, Lorenzi T, Lorz A, Larsen AK, de Almeida LN, Escargueil A, et al. Emergence of drug tolerance in cancer cell populations: an evolutionary outcome of selection, nongenetic instability, and stress-induced adaptation. Cancer Res. 2015;75(6):930–9. pmid:25627977
- View Article
- PubMed/NCBI
- Google Scholar
28. Celora GL, Byrne HM, Zois CE, Kevrekidis PG. Phenotypic variation modulates the growth dynamics and response to radiotherapy of solid tumours under normoxia and hypoxia. J Theor Biol. 2021;527:110792. pmid:34087269
- View Article
- PubMed/NCBI
- Google Scholar
29. Lorenzi T, Chisholm RH, Desvillettes L, Hughes BD. Dissecting the dynamics of epigenetic changes in phenotype-structured populations exposed to fluctuating environments. J Theor Biol. 2015;386:166–76. pmid:26375370
- View Article
- PubMed/NCBI
- Google Scholar
30. Raue A, Kreutz C, Maiwald T, Bachmann J, Schilling M, Klingmüller U, et al. Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood. Bioinformatics. 2009;25(15):1923–9. pmid:19505944
- View Article
- PubMed/NCBI
- Google Scholar
31. Audoly S, Bellu G, D’Angiò L, Saccomani MP, Cobelli C. Global identifiability of nonlinear models of biological systems. IEEE Trans Biomed Eng. 2001;48(1):55–65. pmid:11235592
- View Article
- PubMed/NCBI
- Google Scholar
32. Elsasser WM. Outline of a theory of cellular heterogeneity. Proc Natl Acad Sci U S A. 1984;81(16):5126–9. pmid:6591183
- View Article
- PubMed/NCBI
- Google Scholar
33. Hamis S, Browning AP, Jenner AL, Villa C, Maini P, Cassidy T. Growth rate-driven modelling reveals how phenotypic adaptation drives drug resistance in BRAFV600E-mutant melanoma. bioRxiv. 2024.
34. Kiefel H, Bondong S, Pfeifer M, Schirmer U, Erbe-Hoffmann N, Schäfer H, et al. EMT-associated up-regulation of L1CAM provides insights into L1CAM-mediated integrin signalling and NF-κB activation. Carcinogenesis. 2012;33(10):1919–29. pmid:22764136
- View Article
- PubMed/NCBI
- Google Scholar
35. Markham DC, Simpson MJ, Baker RE. Choosing an appropriate modelling framework for analysing multispecies co-culture cell biology experiments. bioRxiv. 2014. https://doi.org/10.1101/008318
36. Markham DC, Simpson MJ, Baker RE. Choosing an appropriate modelling framework for analysing multispecies co-culture cell biology experiments. Bull Math Biol. 2015;77(4):713–34. pmid:25549623
- View Article
- PubMed/NCBI
- Google Scholar
37. Villa C, Maini PK, Browning AP, Jenner AL, Hamis S, Cassidy T. Reducing phenotype-structured PDE models of cancer evolution to systems of ODEs: a generalised moment dynamics approach. arXiv. 2024.
38. Warne DJ, Baker RE, Simpson MJ. Simulation and inference algorithms for stochastic biochemical reaction networks: from basic concepts to state-of-the-art. J R Soc Interface. 2019;16(151):20180943. pmid:30958205
- View Article
- PubMed/NCBI
- Google Scholar
39. Risken H, Frank T. The Fokker-Planck equation methods of solution and applications. 2nd ed. Berlin: Springer. 1996.
40. Pawitan Y. In all likelihood: statistical modelling and inference using likelihood. Oxford: Oxford University Press. 2013.
41. Vihola M. Ergonomic and reliable bayesian inference with adaptive Markov chain Monte Carlo. Wiley StatsRef: Statistics Reference Online. Wiley; 2020. p. 1–12. https://doi.org/10.1002/9781118445112.stat08286
42. Johnson SG. The NLopt module for Julia. 2021.
43. Browning AP, Drovandi C, Turner IW, Jenner AL, Simpson MJ. Efficient inference and identifiability analysis for differential equation models with random parameters. PLoS Comput Biol. 2022;18(11):e1010734. pmid:36441811
- View Article
- PubMed/NCBI
- Google Scholar
44. Corigliano M, Di Bernardo A, Cosentino Lagomarsino M, Pompei S. Optimal treatment for drug-induced cancer persisters involves release periods and intermediate drug doses. Cold Spring Harbor Laboratory. 2024. https://doi.org/10.1101/2024.11.29.626082
45. Hanahan D. Hallmarks of cancer: new dimensions. Cancer Discov. 2022;12(1):31–46. pmid:35022204
- View Article
- PubMed/NCBI
- Google Scholar
46. Huynh L, Scott JG, Thomas PJ. Inferring density-dependent population dynamics mechanisms through rate disambiguation for logistic birth-death processes. J Math Biol. 2023;86(4):50. pmid:36864131
- View Article
- PubMed/NCBI
- Google Scholar
47. Gavagnin E, Ford MJ, Mort RL, Rogers T, Yates CA. The invasion speed of cell migration models with realistic cell cycle time distributions. J Theor Biol. 2019;481:91–9. pmid:30219568
- View Article
- PubMed/NCBI
- Google Scholar
48. Hasenauer J, Waldherr S, Doszczak M, Radde N, Scheurich P, Allgöwer F. Identification of models of heterogeneous cell populations from population snapshot data. BMC Bioinformatics. 2011;12:125. pmid:21527025
- View Article
- PubMed/NCBI
- Google Scholar
49. Tavakoli H, Zhou W, Ma L, Perez S, Ibarra A, Xu F, et al. Recent advances in microfluidic platforms for single-cell analysis in cancer biology, diagnosis and therapy. Trends Analyt Chem. 2019;117:13–26. pmid:32831435
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. McGranahan N, Swanton C. Clonal heterogeneity and tumor evolution: past, present, and the future. Cell. 2017;168(4):613–28. pmid:28187284
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Arozarena I, Wellbrock C. Phenotype plasticity as enabler of melanoma progression and therapy resistance. Nat Rev Cancer. 2019;19(7):377–91.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref3] 3. Boumahdi S, de Sauvage FJ. The great escape: tumour cell plasticity in resistance to targeted therapy. Nat Rev Drug Discov. 2020;19(1):39–56. pmid:31601994
View Article
PubMed/NCBI
Google Scholar

[9] View Article

[10] PubMed/NCBI

[11] Google Scholar

[ref4] 4. Labrie M, Brugge JS, Mills GB, Zervantonakis IK. Therapy resistance: opportunities created by adaptive responses to targeted therapies in cancer. Nat Rev Cancer. 2022;22(6):323–39. pmid:35264777
View Article
PubMed/NCBI
Google Scholar

[13] View Article

[14] PubMed/NCBI

[15] Google Scholar

[ref5] 5. Shaffer SM, Dunagin MC, Torborg SR, Torre EA, Emert B, Krepler C, et al. Rare cell variability and drug-induced reprogramming as a mode of cancer drug resistance. Nature. 2017;546(7658):431–5. pmid:28607484
View Article
PubMed/NCBI
Google Scholar

[17] View Article

[18] PubMed/NCBI

[19] Google Scholar

[ref6] 6. Kavran AJ, Stuart SA, Hayashi KR, Basken JM, Brandhuber BJ, Ahn NG. Intermittent treatment of BRAFV600E melanoma cells delays resistance by adaptive resensitization to drug rechallenge. Proc Natl Acad Sci U S A. 2022;119(12):e2113535119. pmid:35290123
View Article
PubMed/NCBI
Google Scholar

[21] View Article

[22] PubMed/NCBI

[23] Google Scholar

[ref7] 7. Sharma SV, Lee DY, Li B, Quinlan MP, Takahashi F, Maheswaran S, et al. A chromatin-mediated reversible drug-tolerant state in cancer cell subpopulations. Cell. 2010;141(1):69–80. pmid:20371346
View Article
PubMed/NCBI
Google Scholar

[25] View Article

[26] PubMed/NCBI

[27] Google Scholar

[ref8] 8. Altrock PM, Liu LL, Michor F. The mathematics of cancer: integrating quantitative models. Nat Rev Cancer. 2015;15(12):730–45. pmid:26597528
View Article
PubMed/NCBI
Google Scholar

[29] View Article

[30] PubMed/NCBI

[31] Google Scholar

[ref9] 9. Rockne RC, Hawkins-Daarud A, Swanson KR, Sluka JP, Glazier JA, Macklin P, et al. The 2019 mathematical oncology roadmap. Phys Biol. 2019;16(4):041005. pmid:30991381
View Article
PubMed/NCBI
Google Scholar

[33] View Article

[34] PubMed/NCBI

[35] Google Scholar

[ref10] 10. Anderson ARA, Weaver AM, Cummings PT, Quaranta V. Tumor morphology and phenotypic evolution driven by selective pressure from the microenvironment. Cell. 2006;127(5):905–15. pmid:17129778
View Article
PubMed/NCBI
Google Scholar

[37] View Article

[38] PubMed/NCBI

[39] Google Scholar

[ref11] 11. Clairambault J, Pouchol C. A survey of adaptive cell population dynamics models of emergence of drug resistance in cancer, and open questions about evolution and cancer. BIOMATH. 2019;8(1).
View Article
Google Scholar

[41] View Article

[42] Google Scholar

[ref12] 12. Marusyk A, Janiszewska M, Polyak K. Intratumor heterogeneity: the rosetta stone of therapy resistance. Cancer Cell. 2020;37(4):471–84. pmid:32289271
View Article
PubMed/NCBI
Google Scholar

[44] View Article

[45] PubMed/NCBI

[46] Google Scholar

[ref13] 13. Gunnarsson EB, De S, Leder K, Foo J. Understanding the role of phenotypic switching in cancer drug resistance. J Theor Biol. 2020;490:110162. pmid:31953135
View Article
PubMed/NCBI
Google Scholar

[48] View Article

[49] PubMed/NCBI

[50] Google Scholar

[ref14] 14. Stace REA, Stiehl T, Chaplain MAJ, Marciniak-Czochra A, Lorenzi T. Discrete and continuum phenotype-structured models for the evolution of cancer cell populations under chemotherapy. Math Model Nat Phenom. 2020;15:14.
View Article
Google Scholar

[52] View Article

[53] Google Scholar

[ref15] 15. Cassidy T, Nichol D, Robertson-Tessi M, Craig M, Anderson ARA. The role of memory in non-genetic inheritance and its impact on cancer treatment resistance. PLoS Comput Biol. 2021;17(8):e1009348. pmid:34460809
View Article
PubMed/NCBI
Google Scholar

[55] View Article

[56] PubMed/NCBI

[57] Google Scholar

[ref16] 16. Ardévol Martínez V, Salvioli M, Laleh NG, Thuijsman F, Brown JS, Cavill R, et al. Improving mathematical models of cancer through game-theoretic modelling: a study in non-small cell lung cancer. Cold Spring Harbor Laboratory. 2021. https://doi.org/10.1101/2021.10.29.466444

[ref17] 17. West J, Ma Y, Newton PK. Capitalizing on competition: an evolutionary model of competitive release in metastatic castration resistant prostate cancer treatment. J Theor Biol. 2018;455:249–60. pmid:30048718
View Article
PubMed/NCBI
Google Scholar

[60] View Article

[61] PubMed/NCBI

[62] Google Scholar

[ref18] 18. Craig M, Kaveh K, Woosley A, Brown AS, Goldman D, Eton E, et al. Cooperative adaptation to therapy (CAT) confers resistance in heterogeneous non-small cell lung cancer. PLoS Comput Biol. 2019;15(8):e1007278. pmid:31449515
View Article
PubMed/NCBI
Google Scholar

[64] View Article

[65] PubMed/NCBI

[66] Google Scholar

[ref19] 19. Kaznatcheev A, Peacock J, Basanta D, Marusyk A, Scott JG. Fibroblasts and alectinib switch the evolutionary games played by non-small cell lung cancer. Nat Ecol Evol. 2019;3(3):450–6. pmid:30778184
View Article
PubMed/NCBI
Google Scholar

[68] View Article

[69] PubMed/NCBI

[70] Google Scholar

[ref20] 20. Kareva I. Different costs of therapeutic resistance in cancer: short- and long-term impact of population heterogeneity. Math Biosci. 2022;352:108891. pmid:35998834
View Article
PubMed/NCBI
Google Scholar

[72] View Article

[73] PubMed/NCBI

[74] Google Scholar

[ref21] 21. Lorz A, Lorenzi T, Hochberg ME, Clairambault J, Perthame B. Populational adaptive evolution, chemotherapeutic resistance and multiple anti-cancer therapies. ESAIM: M2AN. 2013;47(2):377–99.
View Article
Google Scholar

[76] View Article

[77] Google Scholar

[ref22] 22. França GS, Baron M, King BR, Bossowski JP, Bjornberg A, Pour M, et al. Cellular adaptation to cancer therapy along a resistance continuum. Nature. 2024;631(8022):876–83. pmid:38987605
View Article
PubMed/NCBI
Google Scholar

[79] View Article

[80] PubMed/NCBI

[81] Google Scholar

[ref23] 23. Smalley I, Kim E, Li J, Spence P, Wyatt CJ, Eroglu Z, et al. Leveraging transcriptional dynamics to improve BRAF inhibitor responses in melanoma. EBioMedicine. 2019;48:178–90. pmid:31594749
View Article
PubMed/NCBI
Google Scholar

[83] View Article

[84] PubMed/NCBI

[85] Google Scholar

[ref24] 24. Cassidy T, Craig M. Determinants of combination GM-CSF immunotherapy and oncolytic virotherapy success identified through in silico treatment personalization. PLoS Comput Biol. 2019;15(11):e1007495. pmid:31774808
View Article
PubMed/NCBI
Google Scholar

[87] View Article

[88] PubMed/NCBI

[89] Google Scholar

[ref25] 25. Strobl MAR, West J, Viossat Y, Damaghi M, Robertson-Tessi M, Brown JS, et al. Turnover modulates the need for a cost of resistance in adaptive therapy. Cancer Res. 2021;81(4):1135–47. pmid:33172930
View Article
PubMed/NCBI
Google Scholar

[91] View Article

[92] PubMed/NCBI

[93] Google Scholar

[ref26] 26. Dieckmann U, Law R. The dynamical theory of coevolution: a derivation from stochastic ecological processes. J Math Biol. 1996;34(5–6):579–612. pmid:8691086
View Article
PubMed/NCBI
Google Scholar

[95] View Article

[96] PubMed/NCBI

[97] Google Scholar

[ref27] 27. Chisholm RH, Lorenzi T, Lorz A, Larsen AK, de Almeida LN, Escargueil A, et al. Emergence of drug tolerance in cancer cell populations: an evolutionary outcome of selection, nongenetic instability, and stress-induced adaptation. Cancer Res. 2015;75(6):930–9. pmid:25627977
View Article
PubMed/NCBI
Google Scholar

[99] View Article

[100] PubMed/NCBI

[101] Google Scholar

[ref28] 28. Celora GL, Byrne HM, Zois CE, Kevrekidis PG. Phenotypic variation modulates the growth dynamics and response to radiotherapy of solid tumours under normoxia and hypoxia. J Theor Biol. 2021;527:110792. pmid:34087269
View Article
PubMed/NCBI
Google Scholar

[103] View Article

[104] PubMed/NCBI

[105] Google Scholar

[ref29] 29. Lorenzi T, Chisholm RH, Desvillettes L, Hughes BD. Dissecting the dynamics of epigenetic changes in phenotype-structured populations exposed to fluctuating environments. J Theor Biol. 2015;386:166–76. pmid:26375370
View Article
PubMed/NCBI
Google Scholar

[107] View Article

[108] PubMed/NCBI

[109] Google Scholar

[ref30] 30. Raue A, Kreutz C, Maiwald T, Bachmann J, Schilling M, Klingmüller U, et al. Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood. Bioinformatics. 2009;25(15):1923–9. pmid:19505944
View Article
PubMed/NCBI
Google Scholar

[111] View Article

[112] PubMed/NCBI

[113] Google Scholar

[ref31] 31. Audoly S, Bellu G, D’Angiò L, Saccomani MP, Cobelli C. Global identifiability of nonlinear models of biological systems. IEEE Trans Biomed Eng. 2001;48(1):55–65. pmid:11235592
View Article
PubMed/NCBI
Google Scholar

[115] View Article

[116] PubMed/NCBI

[117] Google Scholar

[ref32] 32. Elsasser WM. Outline of a theory of cellular heterogeneity. Proc Natl Acad Sci U S A. 1984;81(16):5126–9. pmid:6591183
View Article
PubMed/NCBI
Google Scholar

[119] View Article

[120] PubMed/NCBI

[121] Google Scholar

[ref33] 33. Hamis S, Browning AP, Jenner AL, Villa C, Maini P, Cassidy T. Growth rate-driven modelling reveals how phenotypic adaptation drives drug resistance in BRAFV600E-mutant melanoma. bioRxiv. 2024.

[ref34] 34. Kiefel H, Bondong S, Pfeifer M, Schirmer U, Erbe-Hoffmann N, Schäfer H, et al. EMT-associated up-regulation of L1CAM provides insights into L1CAM-mediated integrin signalling and NF-κB activation. Carcinogenesis. 2012;33(10):1919–29. pmid:22764136
View Article
PubMed/NCBI
Google Scholar

[124] View Article

[125] PubMed/NCBI

[126] Google Scholar

[ref35] 35. Markham DC, Simpson MJ, Baker RE. Choosing an appropriate modelling framework for analysing multispecies co-culture cell biology experiments. bioRxiv. 2014. https://doi.org/10.1101/008318

[ref36] 36. Markham DC, Simpson MJ, Baker RE. Choosing an appropriate modelling framework for analysing multispecies co-culture cell biology experiments. Bull Math Biol. 2015;77(4):713–34. pmid:25549623
View Article
PubMed/NCBI
Google Scholar

[129] View Article

[130] PubMed/NCBI

[131] Google Scholar

[ref37] 37. Villa C, Maini PK, Browning AP, Jenner AL, Hamis S, Cassidy T. Reducing phenotype-structured PDE models of cancer evolution to systems of ODEs: a generalised moment dynamics approach. arXiv. 2024.

[ref38] 38. Warne DJ, Baker RE, Simpson MJ. Simulation and inference algorithms for stochastic biochemical reaction networks: from basic concepts to state-of-the-art. J R Soc Interface. 2019;16(151):20180943. pmid:30958205
View Article
PubMed/NCBI
Google Scholar

[134] View Article

[135] PubMed/NCBI

[136] Google Scholar

[ref39] 39. Risken H, Frank T. The Fokker-Planck equation methods of solution and applications. 2nd ed. Berlin: Springer. 1996.

[ref40] 40. Pawitan Y. In all likelihood: statistical modelling and inference using likelihood. Oxford: Oxford University Press. 2013.

[ref41] 41. Vihola M. Ergonomic and reliable bayesian inference with adaptive Markov chain Monte Carlo. Wiley StatsRef: Statistics Reference Online. Wiley; 2020. p. 1–12. https://doi.org/10.1002/9781118445112.stat08286

[ref42] 42. Johnson SG. The NLopt module for Julia. 2021.

[ref43] 43. Browning AP, Drovandi C, Turner IW, Jenner AL, Simpson MJ. Efficient inference and identifiability analysis for differential equation models with random parameters. PLoS Comput Biol. 2022;18(11):e1010734. pmid:36441811
View Article
PubMed/NCBI
Google Scholar

[142] View Article

[143] PubMed/NCBI

[144] Google Scholar

[ref44] 44. Corigliano M, Di Bernardo A, Cosentino Lagomarsino M, Pompei S. Optimal treatment for drug-induced cancer persisters involves release periods and intermediate drug doses. Cold Spring Harbor Laboratory. 2024. https://doi.org/10.1101/2024.11.29.626082

[ref45] 45. Hanahan D. Hallmarks of cancer: new dimensions. Cancer Discov. 2022;12(1):31–46. pmid:35022204
View Article
PubMed/NCBI
Google Scholar

[147] View Article

[148] PubMed/NCBI

[149] Google Scholar

[ref46] 46. Huynh L, Scott JG, Thomas PJ. Inferring density-dependent population dynamics mechanisms through rate disambiguation for logistic birth-death processes. J Math Biol. 2023;86(4):50. pmid:36864131
View Article
PubMed/NCBI
Google Scholar

[151] View Article

[152] PubMed/NCBI

[153] Google Scholar

[ref47] 47. Gavagnin E, Ford MJ, Mort RL, Rogers T, Yates CA. The invasion speed of cell migration models with realistic cell cycle time distributions. J Theor Biol. 2019;481:91–9. pmid:30219568
View Article
PubMed/NCBI
Google Scholar

[155] View Article

[156] PubMed/NCBI

[157] Google Scholar

[ref48] 48. Hasenauer J, Waldherr S, Doszczak M, Radde N, Scheurich P, Allgöwer F. Identification of models of heterogeneous cell populations from population snapshot data. BMC Bioinformatics. 2011;12:125. pmid:21527025
View Article
PubMed/NCBI
Google Scholar

[159] View Article

[160] PubMed/NCBI

[161] Google Scholar

[ref49] 49. Tavakoli H, Zhou W, Ma L, Perez S, Ibarra A, Xu F, et al. Recent advances in microfluidic platforms for single-cell analysis in cancer biology, diagnosis and therapy. Trends Analyt Chem. 2019;117:13–26. pmid:32831435
View Article
PubMed/NCBI
Google Scholar

[163] View Article

[164] PubMed/NCBI

[165] Google Scholar

Figures

Abstract

Author summary

1. Introduction

2. Mathematical methods

2.1. Individual-based model

2.2. Partial differential equation model

2.3. Chemical master equation

2.4. Likelihood-based inference

3. Results

3.1. Phenotypic heterogeneity is poorly identified from cell count data

3.2. Phenotype heterogeneity is identifiable from event-timing data

3.3. Phenotype heterogeneity is not identifiable from proliferation marker data

3.4. Model selection and misspecification for cell-count data

3.5. Continuous and discrete-binary heterogeneity may be indistinguishable

4. Discussion and conclusion

Supporting information

S1 File. IBM/CME comparison.

S2 File. MCMC priors and results.

S3 File. CME for discrete model.

S4 File. Structural identifiability of heterogeneity.

S5 File. Large data set inference with noisy data.

S6 File. Inference with correlated data.

Acknowledgments

References