Figures
Abstract
The concept of coherence was proposed for single-agent phase I clinical trials to describe the property that a design never escalates the dose when the most recently treated patient has toxicity and never de-escalates the dose when the most recently treated patient has no toxicity. It provides a useful theoretical tool for investigating the properties of phase I trial designs. In this paper, we generalize the concept of coherence to drug combination trials, which are substantially different and more challenging than single-agent trials. For example, in the dose-combination matrix, each dose has up to 8 neighboring doses as candidates for dose escalation and de-escalation, and the toxicity orders of these doses are only partially known. We derive sufficient conditions for a model-based drug combination trial design to be coherent. Our results are more general and relaxed than the existing results and are applicable to both single-agent and drug combination trials. We illustrate the application of our theoretical results with a number of drug combination dose-finding designs in the literature.
Citation: Park Y, Liu S (2020) On the coherence of model-based dose-finding designs for drug combination trials. PLoS ONE 15(11): e0242561. https://doi.org/10.1371/journal.pone.0242561
Editor: Alan D. Hutson, Roswell Park Cancer Institute, UNITED STATES
Received: August 10, 2020; Accepted: November 4, 2020; Published: November 30, 2020
Copyright: © 2020 Park, Liu. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting information files.
Funding: The authors received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The objective of phase I dose-finding clinical trials is to evaluate the safety and maximum tolerated dose (MTD) of new drugs through a sequential process of dose escalation and de-escalation. Various dose-finding designs have been proposed to guide dose escalation and de-escalation, but all are based on the same principle: if the observed data suggest that the current dose is safe (i.e., below the MTD), we escalate the dose to avoid treating the next cohort of patients at potentially sub-therapeutic doses; and if the observed data suggest that the current dose is overly toxic (i.e., above the MTD), we de-escalate the dose to avoid exposing patients to overly toxic doses. Because a phase I trial is the first-in-human trial and at that stage little known about the toxicity profile of the drug, it is critically important to ensure that the dose escalation and de-escalation rules are safe and appropriate. For single-agent trials, Cheung [1] introduced the concept of coherence, which is that a dose-finding design is coherent if it never escalates the dose when the most recently treated patient experiences toxicity, and never de-escalates the dose when the most recently treated patient does not experience toxicity. From an ethical and practical viewpoint, it is desirable for a dose-finding design to be coherent. Cheung [1] established sufficient conditions for a single-agent dose-finding design to be consistent.
Drug combination trials have become a mainstream approach for treating cancer because of the ability to induce a synergistic treatment effect and overcome the drug resistance that is common with monotherapies. However, dose finding in drug combination trials is more challenging. In the dose-combination matrix, each drug combination has up to eight adjacent combinations as potential candidates for dose escalation and de-escalation. More important, these combinations are only partially ordered due to drug-drug interactions. For example, suppose one combination has a higher dose of drug 1 whereas the other combination has a higher dose of drug 2, a priori we do not know the toxicity order of these two combinations. As a result, when dose escalation or de-escalation is needed, it is not immediately clear which dose combination should be selected. In contrast, in single-agent trials, we are concerned with a string of doses that are associated with monotonically increasing toxicity. When dose escalation is needed, we simply move to the next higher dose level, and when dose de-escalation is needed, we move to the next lower dose level. Because of these fundamental differences between single-agent and multi-agent trials, the definition and results of coherence developed by Cheung [1] for single-agent trials cannot be directly applied to drug combination trials. In this article, we generalize the definition of coherence and establish sufficient conditions of coherence for drug combination trials. As we show, such generalization is not trivial and requires vastly different considerations. One important strength of our results is that they are still valid when the assumed drug-combination dose-toxicity model is misspecified.
Numerical dose-finding designs have been proposed for drug combination trials. The majority of them are model-based designs and make adaptive decisions of dose escalation/de-escalation using a strategy similar to that in the continuous reassessment method (CRM [2]). That strategy is to devise a parametric model to describe the dose-toxicity surface and then, based on the accumulating data, continuously update the model estimate to guide the dose selection and assignment. Thall et al. [3] proposed a six-parameter model-based design to find the MTD. Wang and Ivanova [4] developed a drug combination dose-finding design based on a log linear model. Yin and Yuan [5] and Yuan and Yin [6] proposed Bayesian dose-finding designs based on a copula-type regression model. Wages et al. [7] extended the CRM based on partial ordering of the dose combinations. Braun and Jia [8] proposed the generalized CRM model to guide dose finding. Riviere et al. [9] proposed a Bayesian dose-finding design based on the logistic model. Cai et al. [10] and Riviere et al. [11] adopted change-point models for drug combination trials involving molecularly targeted agents.
The remainder of the paper is organized as follows. In materials and methods section, we review coherence for single-agent trials and propose the generalized concept and theory of coherence that embraces both single-agent and drug combination trials. In application section, we use the proposed theory to study the coherence of a number of drug combination designs in the literature. In discussion section, we conclude with a discussion.
Materials and methods
Coherence for single-agent trials
We first review coherence for single-agent trials. Consider a single-agent phase I trial with K doses under investigation, where a higher dose presumably has a higher probability of causing toxicity. Patients are sequentially enrolled, and each patient is treated at a dose that is adaptively selected by a certain trial design (e.g., CRM) based on the interim data. For each n = 1, …, N, where N is the prespecified maximum sample size, let Xn denote the dose level assigned to the nth patient, and Yn be a toxicity indicator of the nth patient, with Yn = 1 denoting toxicity.
A design is called coherent in escalation if Pr(Xn+1 > Xn|Yn = 1) = 0 and coherent in de-escalation if Pr(Xn+1 < Xn|Yn = 0) = 0 for n = 1, …, N [1]. In other words, a coherent design never escalates the dose when the most recently treated patient experiences toxicity, and never de-escalates the dose when the most recently treated patient does not experience toxicity. Coherence provides a useful finite-sample–based metric to evaluate the safety and appropriateness of dose transition for phase I trial designs. Cheung [1] showed that the CRM is coherent when the prior estimate of the MTD is used as the starting dose.
Coherence for drug combination trials
Consider a trial combining J doses of agent 1, denoted as u1 < ⋯ < uJ, and K doses of agent 2, denoted as v1 < ⋯ < vK, where uj and vk are raw or standardized doses. Let (j, k) denote the combination of uj and vk, and pj,k = Pr{Yn = 1|Xn = (j, k)} denote the probability of dose-limiting toxicity (DLT) for (j, k). We assume that when the dose of one agent is fixed, the toxicity of the combination increases as the dose of the other agent increases, i.e., pj,k < pj′,k for j < j′ and pj,k < pj,k′ for k < k′. However, no toxicity order is assumed between other dose pairs. For example, the toxicity order between (j, k) and (j − 1, k + 1) and (j + 1, k − 1) is unknown a priori. This partial order in toxicity makes drug combination trials fundamentally different from single-agent trials.
Consider a model-based dose-finding design that assumes a dose-toxicity model
(1)
where F(⋅) is a parametric model indexed by unknown parameters θ = (θ1, …, θp). We assume that θ is appropriately constrained such that the partial order is conformed, i.e., Fj,k(θ) < Fj′,k(θ) for j < j′ and Fj,k(θ) < Fj,k′(θ) for k < k′. For example, for logistic model logit{Fj,k(θ) = θ1 + θ2 uj + θ3 vk + θ4 uj vk, the partial order is imposed by the constraint θ2 + θ4 vk > 0 for all k and θ3 + θ4 uj > 0 for all j. As the assumed dose-toxicity model may be misspecified, it is important to distinguish Fj,k(θ), the toxicity probability ascribed by the model, from the true toxicity probability pj,k. Throughout the paper, we do not require F(uj, vk, θ) to be correctly specified.
Suppose that at an interim time, a total of n patients have been enrolled and treated. Let Hn = {(Xi, Yi), i = 1, …, n} denote the accumulative data from the enrolled patients. Unlike single-agent trials, where there is only a single candidate dose level Xn + 1 for escalation and Xn − 1 for de-escalation, in drug combination trials, there are multiple candidate doses are available for escalation or de-escalation. For example, in the case of dose escalation, we can escalate either the dose level of drug 1 or the dose level of drug 2. Therefore, it is imperative to generalize the definition of coherence.
Specifically, given Xn, let and
denote the set of candidate dose levels for escalation and de-escalation, respectively, for the (n + 1)th patient, and define
to present all possible doses assignments for the (n + 1)th patient. A generalized definition of coherence that is applicable to both single-agent and drug combination trials follows.
Definition 1 A design is coherent in dose escalation if
for n = 1, …, N, and is coherent in dose de-escalation if
for n = 1, …, N. A design is coherent if its dose escalation and de-escalation are both coherent.
The definition of coherence for the single-agent trial proposed by Cheung [1] is a special case of the above, with and
.
Given the observed interim data Hn and a specific definition of and
, Xn+1 is typically determined as
(2)
where pT is the target toxicity probability,
is the model-based toxicity estimate, and
is the posterior mean of θ. That is, the next patient will be treated at the dose level that belongs to
and has estimated toxicity probability closest to the target pT. For the first patient, Hn is empty and
is obtained based on the prior distribution of θ. In the case that it is desirable to start the trial from the lowest dose (1, 1), we can set the prior estimate of the MTD as (1, 1).
In drug combination trials, the choice of and
has a profound impact on the coherence of a design, which does not arise in single-agent trials. To be consistent with practice, we herein assume the no-dose-skipping rule that restricts dose escalation and de-escalation to doses that are adjacent to the current dose. More precisely, given Xn = (j, k),
. We do not consider (j + 1, k + 1) because dose escalation from (j, k) to (j + 1, k + 1) is equivalent to skipping doses (j + 1, k) and (j, k + 1), noting that pj,k < pj+1,k, pj,k+1 < pj+1,k+1. For a similar reason, we do not consider dose de-escalation from (j, k) to (j − 1, k − 1). Nevertheless, all the results presented below are valid if (j + 1, k + 1) and (j − 1, k − 1) are included. In the literature, there are two common ways to specify
and
, as illustrated in Fig 1. The first approach, referred to as the ND-design, does not allow for dose movement along the diagonals:
and
. The second choice, the D-design, allows for dose movement along the diagonals, where
contain doses along the diagonal, i.e., (j − 1, k + 1) and (j + 1, k − 1).
The filled circle indicates the current dose combination; the empty circles denote the candidate doses for escalation and de-escalation under the given design.
ND-design.
We first investigate the coherence of the ND-design and refer to the following condition.
- Condition A. For any θ = (θ1, …, θp) and for any n with Xn = (j, k),
(3) where
denotes the tth element of
.
Theorem 1 Under condition A, the ND-design is coherent.
Condition A requires the determination of the sign of , for t = 1, …, p, which may not be straightforward. A special case of condition A that is easier to evaluate is uniform monotonicity.
Definition 2 (uniform monotonicity) F(u, v, θ) is uniformly nondecreasing (or nonincreasing) if for any value of (u, v), F(u, v, θ) is nondecreasing (or nonincreasing) in θt for all t = 1, …, p.
Lemma 2 If F is uniformly monotonic, then condition A holds.
Theorem 3 The ND-design is coherent if F(u, v, θ) is uniformly monotonic.
The proof of Lemma 2 is provided in the Appendix A of Supplementary material. Theorem 3 is established directly from Lemma 2 and Theorem 1. Although uniform monotonicity is easier to check, many dose-toxicity models do not satisfy that condition, as shown later. In these cases, Theorem 1 can be used to examine the coherence of the ND-design.
D-design.
The D-design allows for dose escalation and de-escalation to the adjacent doses on the diagonals, i.e., (j + 1, k − 1), (j − 1, k + 1), assuming that (j, k) is the current dose level. The complication is that a priori we do not know the toxicity order between (j, k) and (j + 1, k − 1) and (j − 1, k + 1), thus it is not clear whether (j + 1, k − 1) and (j − 1, k + 1) belong to or
. Different definitions of
and
lead to different definitions of coherence. A straightforward approach is to define
and
based on the true toxicity probability of these doses. We refer to the resulting coherence as strong coherence, which indicates that the design will never escalate to a dose that is truly more toxic than the current dose if the most recently treated patient has toxicity; and will never de-escalate to a dose that is truly less toxic than the current dose if the most recently treated patient does not have toxicity.
To establish sufficient conditions for a design to be strongly coherent, we define conditions B1 and B2 as follows.
- Condition B1. For any θ, F(u, v, θ) is increasing in both u and v.
- Condition B2. For any j and k and for any θ,
and
where the signum function is defined by sgn(x) = − 1, 0 or 1 for x < 0, x = 0 or x > 0, respectively.
Theorem 4 Under conditions B1 and B2,
- If condition A holds, the D-design is strongly coherent.
- If the condition of uniform monotonicity holds, the D-design is strongly coherent.
The proof of Theorem 4 is provided in the Appendix A of Supplementary material.
As the true toxicity probability is unknown in practice, a more practical approach is to define or
based on the model estimates. Recall that to determine dose level Xn+1 for the (n + 1)th patient, the model-based design fits the model F(uj, vk, θ) using interim data Hn, and then makes the decision of dose escalation or de-escalation for Xn+1 based on
, as specified by Eq (2). Based on
, the toxicity order of (j + 1, k − 1) and (j − 1, k + 1), with respect to (j, k), can be determined and used to define
and
. We refer to the resulting coherence as weak coherence. Accordingly, weak coherence indicates that the design will never escalate to a dose for which the estimated toxicity probability is higher than the current dose if the most recently treated patient has toxicity; and will never de-escalate to a dose for which the estimated toxicity probability is lower than the current dose if the most recently treated patient does not have toxicity. Because of potential model misspecification, a weakly coherent design is not necessarily strongly coherent. For example, suppose Xn = (j, k), Yn = 0 and Xn+1 = (j + 1, k − 1). If
, but the truth is pj+1,k−1 < pj,k, then the dose assignment for the (n + 1)th patient is weakly coherent, but not strongly coherent. This phenomenon is unique in drug combination trial designs because of the unknown toxicity order between some drug combinations. This phenomenon does not exist for single-agent trial designs, where the toxicity order is completely known among all doses. Despite this issue, weak coherence is still useful because the true toxicity probability is unknown in practice. From the user’s perspective, it is certainly concerning if a trial design has a high likelihood of escalating to a dose that is expected to have higher toxicity after the most recently treated patient experienced toxicity at a lower dose. It can be shown that the D-design is weakly coherent under the same condition as the ND-design (see the Appendix A of Supplemental material for the proof).
Theorem 5 If condition A or the condition of uniform monotonicity holds, the D-design is weakly coherent.
Comparing Theorem 5 with Theorem 4, it is clear that weak coherence is more lenient than strong coherence. With the extra requirements specified by conditions B1 and B2, a weakly coherent D-design becomes strongly coherent.
A two-stage design.
The above results are established under the assumption that the design starts by treating the first patient at the prior estimate of the MTD and the dose for each subsequent patient is selected based on the model estimates according to Eq (2). In practice, however, for patient safety, the trial often starts with the lowest dose, which is not necessarily the prior estimate of the MTD. In addition, as the dose-toxicity model for drug combination trials is relatively complicated, to improve estimation and design reliability, the two-stage design is often used. In the first stage (or the initial design), we make the decision of dose escalation/de-escalation based on a set of simple prespecified rules, without using the model, to collect some preliminary data. We then switch to the second stage (or the model-based design), in which we base the decision of dose escalation/de-escalation on the model estimates, as described previously. A typical example of the initial design is the “titration” design, under which we pre-select a string of dose combinations with monotonically increasing toxicity; see Fig 2. We treat the first patient at the lowest combination (1, 1) and then escalate the dose for treating the next patient if no toxicity is observed for the current patient. We continue this dose escalation (or titration) process until we encounter the first toxicity, and then we switch to the model-based design. Cheung [1] studied the condition of coherence for two-stage single-agent trials. In what follows, we provide the coherence condition for two-stage combination trials.
Let X1n and X2n denote the dose assignment according to the initial and model-based designs, respectively. Let R denote a prespecified rule that triggers the switch from the initial design to the model-based design. Then, the dose assignment for the two-stage design, denoted as , can be formally defined by
The theorem below provides a sufficient condition for the two-stage design to be coherent.
Theorem 6 Assume that the initial design and the model-based design are coherent. Let M denote the first patient whose dose is selected based on the model-based design, i.e., the patient enrolled at the moment of transition from the initial design to the model-based design. If almost surely, then the design with
is coherent.
Application
We apply our results to examine the coherence of some model-based drug combination designs in the literature.
Example 1. Logistic and scaled logistic regression models Riviere et al. [9] considered a standard logistic regression model with F(uj, vk, θ) = logit−1(θ1 + θ2 uj + θ3 vk + θ4 uj vk), where , with θ2 > 0 and θ3 > 0 ensuring that the toxicity probability is increasing with the increasing dose level of each agent alone, and θ3 + θ4 uj > 0 for all j and θ2 + θ4 vk > 0 for all k ensuring that the toxicity probability is increasing with the increasing dose levels of both agents together.
Depending on how the doses uj and vk are specified, Theorem 1 or Theorem 3 can be used to examine the coherence of the logistic model-based designs. When the raw dosages of the drugs are used, uj > 0 and vk > 0 and thus g(uj, vk, θ) ≡ θ1 + θ2 uj + θ3 vk + θ4 uj vk is increasing in θt for t = 1, …, 4. Because logit−1(⋅) is monotonically increasing, it follows that F(u, v, θ) is monotonically increasing in θt for t = 1, …, 4, and thus the uniform monotonicity condition holds. By Theorem 3, the ND-design with the logistic regression model is coherent. Also, by Theorem 5, the D-design with the logistic regression model is weakly coherent. For the D-design to be strongly coherent, we need to examine conditions B1 and B2. Condition B1 holds because of θ2 > 0 and θ3 > 0, but it is difficult to verify condition B2 because it involves the true dose-toxicity relationship that is unknown.
In many cases, standardized doses, rather than the raw dosages, are often used in the logistic model to improve the numerical stability or interpretation. For example, Riviere et al. [9] suggested using the standardized doses uj = logit(aj) and vj = logit(bk), where aj and bk are estimates of the toxicity probabilities of the jth dose level of agent 1 and the kth dose level of agent 2, respectively, when they are administered individually as a single agent. In addition, logarithmic transformation or centering is often employed in practice to standardize the dose. As a result, the standardized doses uj’s and vk’s are not necessarily all positive. Thus, the uniform monotonicity condition does not hold and Theorem 3 cannot be used. In these cases, Theorem 1 provides a more general tool to use for studying the coherence by examining condition A, described briefly as follows. Because of the symmetric role of uj and vk, without loss of generality, we assume uj ≥ 0 and vk ≤ 0. Suppose that Yn = 0, define and
, where ϕ(t) = ψ(t) except for the tth element, t = 1, …, 4. Let
be the posterior mean of the tth element of θ based on the first n observations. By the approach used in the proof of Lemma 2, for t = 1, 2, we have
implying
, and for t = 3, 4, we have
, implying that
. So,
for t = 1, 2, 3, 4. Similarly, we obtain
for t = 1, 2, 3, 4 when Yn = 1. Therefore, condition A holds, and we obtain a more general result: the ND-design with the logistic regression model is coherent, and the D-design with the logistic regression model is weakly coherent, no matter how uj and vk are specified.
Braun and Jia [8] proposed different coding for the doses, specifying dose 1 as a categorical dummy variable and dose 2 as a continuous variable. They called the resulting model the generalized CRM model, given by
where −∞ < αk < ∞, k = 1, …, K and β > 0. Cai et al. [10] proposed a modification of the logistic model, namely the scaled logistic model, for drug combination trials that involve molecularly targeted agents, with the form
where 0 < ρ < 1. The scaled logistic model plateaus at ρ, rather than 1 as in the standard logistic model. It can be shown that the ND-design with the generalized CRM model or scaled logistic regression model is coherent, and the D-design with the generalized CRM or scaled logistic regression model is weakly coherent. The details are provided in the Appendix B of Supplemental material.
Example 2. Change-point model For some targeted agent, the dose-toxicity curve may initially increase at low doses and then plateau at high doses. Cai et al. [10], Riviere et al. [11] and Sato et al. [12] proposed using the change-point model for some drug combination trials, as given by
where I(⋅) denotes an indicator function, θ = (α, β, γ, w) with −∞ < α < ∞, β > 0, γ > 0 and −∞ < w < ∞. The curve of the model initially increases with the dose level but flattens once it passes the threshold defined by α + βuj + γvk = w. Let η(x) = logit(x) for 0 < x < 1. Suppose that Yn = 0, in what follows, we use α as an example to show that condition A holds. Let
denote a posterior mean of α based on the first n patients. For fixed values of β, γ and w, we take ϕ = (α1, β, γ, w) and ψ = (α2, β, γ, w) for any α1 and α2. By the mean value theorem, we have
, where
lies between Fj,k(ϕ), and Fj,k(ψ) and η′(F) denotes the derivative of η with respect to F. For α1 ≤ α2, there are three possible cases: (1) α1 + βuj + γvk ≤ w and α2 + βuj + γvk ≤ w; (2) α1 + βuj + γvk ≤ w and α2 + βuj + γvk > w; and (3) α1 + βuj + γvk > w. In the first case, η{Fj,k(ϕ)} − η{Fj,k(ψ)} = α1 − α2 ≤ 0. The second case yields η{Fj,k(ϕ)} − η{Fj,k(ψ)} = α1 + βuj + γvk − w ≤ 0, and the third case induces η{Fj,k(ϕ)} − η{Fj,k(ψ)} = w − w = 0. That is, F is nondecreasing in α, noting that η′(x) = 1/{x(1 − x)} ≥ 0 for 0 < x < 1. Thus, we have (α1 − α2){Fj,k(ψ) − Fj,k(ϕ)} ≤ 0 for any α1 and α2. By the approach used in the proof of Lemma 2, we have
and
. Along a similar line, it can be shown that a similar inequality holds for β, γ and w (see the Appendix B of Supplemental material for details). Similarly, we obtain the inequality for all parameters when Yn = 1, and thus condition A holds. Applying Theorem 1 and Theorem 5, we conclude that the ND-design with the change-point model is coherent and the D-design with the change-point model is weakly coherent, respectively.
Example 3. Copula-type regression models Yin and Yuan [5] proposed a drug combination design based on the Clayton copula regression model
where pj and qk are prior estimates of the toxicity probability for level j of agent 1 and level k of agent 2, respectively, when they are used as monotherapy, and θ = (α, β, γ) with α, β, γ > 0. Let G(x) = (1 − x)−γ. Then,
. Suppose that Yn = 0. To apply Theorem 1, we check condition A with respect to α. For fixed values of β and γ, we take ϕ = (α1, β, γ) and ψ = (α2, β, γ) for any α1 and α2. By the mean value theorem, we have
for some
that lies between Fj,k(ϕ) and Fj,k(ψ), where G′(F) denotes the derivative of G with respect to F. Since G(x) is increasing and f(x) = (1 − px)−γ for some p ∈ (0, 1) is decreasing, Fj,k(ϕ) − Fj,k(ψ) ≥ 0 for α1 ≤ α2, i.e., F is nonincreasing in α. Let
be a posterior mean of α based on the first n patients. By using the approach used in the proof of Lemma 2, we have
and thus
. We can show that a similar inequality holds with respect to β and γ (see the Appendix B of Supplemental material). Thus, condition A holds. Therefore, the drug combination ND-design based on the Clayton copula regression model is coherent and the D-design with the Clayton copula regression model is weakly coherent.
For drug combination trials, Yin and Yuan [5] proposed an alternative copula-type regression model, i.e., the Gumbel model, as given by where θ = (α, β, γ) with α, β, γ > 0. As shown in the Appendix B of Supplemental material, the ND-design based on the Gumbel model is also coherent, and the D-design based on the Gumbel model is weakly coherent.
Example 4. Log-linear model Wang and Ivanova [4] considered a toxicity model given by
with θ = (α, β, γ), where α > 0, β > 0 and γ < 0. Let Gj,k(θ) = α log(1 − uj) + β log(1 − vk) + γ log(1 − uj) log(1 − vk). Then, Fj,k(θ) = 1 − exp{Gj,k(θ)}. Suppose that Yn = 0. We claim that condition A holds for the design with the log-linear model with respect to α. Let
denote a posterior mean of α based on the first n patients. For fixed values of β and γ, we take ϕ = (α1, β, γ) and ψ = (α2, β, γ) for any α1 and α2. Then,
for some
that lies between Gj,k(ϕ) and Gj,k(ψ). If uj < 0, then Gj,k(ψ) − Gj,k(ϕ) ≥ 0 for α1 ≤ α2, implying that Fj,k(ϕ) − Fj,k(ψ) ≥ 0 for α1 ≤ α2, i.e., F is nondecreasing in α. By the approach used in the proof of Lemma 2,
. So,
. Likewise,
is obtained even if 0 ≤ uj(< 1). In other words, regardless of the sign of the dose for the first agent uj, we obtain
. Similarly, we can show that the inequality, Eq (3), holds for β and γ (see the Appendix B of Supplemental material for details), and thus condition A holds. So, the ND-design is coherent and the D-design is weakly coherent.
Example 5. Six-parameter model Thall et al. [3] considered a six-parameter toxicity model given by
where 0 ≤ uj ≤ 1 and 0 ≤ vk ≤ 1 are standardized doses, and θ = (α1, α2, α3, β1, β2, β3) with α1 > 0, α2 > 0,
and
, to ensure that toxicity monotonically increases with each of the doses. Let
. Then, Fj,k(θ) = Gj,k(θ)/{1 + Gj,k(θ)}. Suppose that Yn = 0. It is easy to see that
and F is nondecreasing in α1, because f(x) = x/(1 + x) is nondecreasing in x. Let
denote the posterior mean of α1 based on the first n patients. By the approach used in the proof of Lemma 2,
and
. We can show that a similar inequality, Eq (3), holds for α2, α3, β1, β2 and β3. Therefore, condition A holds and, under the six-parameter model, the ND-design is coherent and the D-design is weakly coherent.
Discussion
Drug combination trials are more challenging than single-agent trials because of the higher dimensions of the dose search space and partial ordering among the drug combinations. We have proposed the concept of coherence for drug combination trials and distinguished two types of drug combination designs: the ND-design, which forbids diagonal dose movement, and the D-design, which allows for diagonal dose movement. To account for the possibility of model misspecification when using the D-design, we further defined weak coherence and strong coherence based on the model estimates and true toxicity probabilities of the doses, respectively. We provided sufficient conditions to study the coherence of the model-based drug combination designs. We investigated a number of drug combination models in the literature and showed that under these models, the ND-design is coherent and the D-design is weakly coherent. In general, it is difficult to establish strong coherence of a D-design because it involves the knowledge of the unknown, true dose-toxicity relationship. From a practical viewpoint, if strong coherence is desirable, we recommend adopting the ND-design by forbidding diagonal dose movement.
We have shown that some model-based designs are coherent for ND-designs and weakly coherent for D-designs, however not all model-based designs are coherent. One example is partial-order CRM [7], which fits multiple (single-agent) CRM models, each assuming a different toxicity order for the combinations, and then uses Bayesian model averaging (BMA) to summarize the estimates over different models and make the decision of dose transition. Although each CRM model is coherent, averaging over them would lead to incoherent decisions. This is because each of the CRM models may recommend a different dose-transition decision, and the final decision based on BMA would be consistent with some, but inconsistent with others, leading to incoherence.
We assumed that the toxicity outcomes of previous patients have been fully observed before accruing the next cohort of patients. This however may not be true when the toxicity is late-onset. In this case, we can employ the Bayesian data augmentation method to handle the delayed toxicity and facilitate real-time decision making of dose escalation and de-escalation [13, 14]. The coherence of drug combination designs in the presence of late-onset toxicity will be an interesting topic of future research.
Supporting information
S1 File. Appendix A and B of supplemental material are available with this paper at PLOS ONE website.
https://doi.org/10.1371/journal.pone.0242561.s001
(PDF)
References
- 1. Cheung YK. Coherence principles in dose-finding studies. Biometrika. 2005;92(4):863–873.
- 2. O’Quigley J, Pepe M, Fisher L. Continual reassessment method: a practical design for phase I clinical trials in cancer. Biometrics. 1990; p. 33–48.
- 3. Thall PF, Millikan RE, Mueller P, Lee SJ. Dose-Finding with Two Agents in Phase I Oncology Trials. Biometrics. 2003;59(3):487–496. pmid:14601749
- 4. Wang K, Ivanova A. Two-Dimensional Dose Finding in Discrete Dose Space. Biometrics. 2005;61(1):217–222. pmid:15737096
- 5. Yin G, Yuan Y. Bayesian dose finding in oncology for drug combinations by copula regression. Journal of the Royal Statistical Society: Series C (Applied Statistics). 2009;58(2):211–224.
- 6. Yuan Y, Yin G. A Bayesian Phase I/II Design for Oncology Clinical Trials of Combining Biological Agents. The Annals of Applied Statistics. 2011;5(2A):924–942.
- 7. Wages NA, Conaway MR, O’Quigley J. Continual reassessment method for partial ordering. Biometrics. 2011;67(4):1555–1563. pmid:21361888
- 8. Braun TM, Jia N. A generalized continual reassessment method for two-agent phase I trials. Statistics in biopharmaceutical research. 2013;5(2):105–115.
- 9. Riviere MK, Yuan Y, Dubois F, Zohar S. A Bayesian dose-finding design for drug combination clinical trials based on the logistic model. Pharmaceutical statistics. 2014;13(4):247–257. pmid:24828456
- 10. Cai C, Yuan Y, Ji Y. A Bayesian dose finding design for oncology clinical trials of combinational biological agents. Journal of the Royal Statistical Society: Series C (Applied Statistics). 2014;63(1):159–173.
- 11. Riviere MK, Yuan Y, Dubois F, Zohar S. A Bayesian dose finding design for clinical trials combining a cytotoxic agent with a molecularly targeted agent. Journal of the Royal Statistical Society: Series C (Applied Statistics). 2015;64(1):215–229.
- 12. Sato H, Hirakawa A, Hamada C. An adaptive dose-finding method using a change-point model for molecularly targeted agents in phase I trials. Statistics in medicine. 2016;35(23):4093–4109. pmid:27221807
- 13. Liu S, Ning J. A Bayesian dose-finding design for drug combination trials with delayed toxicities. Bayesian analysis. 2013;8(3):703.
- 14. Liu S, Yin G, Yuan Y. Bayesian data augmentation dose finding with continual reassessment method and delayed toxicity. The annals of applied statistics. 2013;7(4):1837.