On the coherence of model-based dose-finding designs for drug combination trials

Yeonhee Park; Suyu Liu

doi:10.1371/journal.pone.0242561

Abstract

The concept of coherence was proposed for single-agent phase I clinical trials to describe the property that a design never escalates the dose when the most recently treated patient has toxicity and never de-escalates the dose when the most recently treated patient has no toxicity. It provides a useful theoretical tool for investigating the properties of phase I trial designs. In this paper, we generalize the concept of coherence to drug combination trials, which are substantially different and more challenging than single-agent trials. For example, in the dose-combination matrix, each dose has up to 8 neighboring doses as candidates for dose escalation and de-escalation, and the toxicity orders of these doses are only partially known. We derive sufficient conditions for a model-based drug combination trial design to be coherent. Our results are more general and relaxed than the existing results and are applicable to both single-agent and drug combination trials. We illustrate the application of our theoretical results with a number of drug combination dose-finding designs in the literature.

Citation: Park Y, Liu S (2020) On the coherence of model-based dose-finding designs for drug combination trials. PLoS ONE 15(11): e0242561. https://doi.org/10.1371/journal.pone.0242561

Editor: Alan D. Hutson, Roswell Park Cancer Institute, UNITED STATES

Received: August 10, 2020; Accepted: November 4, 2020; Published: November 30, 2020

Copyright: © 2020 Park, Liu. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the manuscript and its Supporting information files.

Funding: The authors received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

Introduction

The objective of phase I dose-finding clinical trials is to evaluate the safety and maximum tolerated dose (MTD) of new drugs through a sequential process of dose escalation and de-escalation. Various dose-finding designs have been proposed to guide dose escalation and de-escalation, but all are based on the same principle: if the observed data suggest that the current dose is safe (i.e., below the MTD), we escalate the dose to avoid treating the next cohort of patients at potentially sub-therapeutic doses; and if the observed data suggest that the current dose is overly toxic (i.e., above the MTD), we de-escalate the dose to avoid exposing patients to overly toxic doses. Because a phase I trial is the first-in-human trial and at that stage little known about the toxicity profile of the drug, it is critically important to ensure that the dose escalation and de-escalation rules are safe and appropriate. For single-agent trials, Cheung [1] introduced the concept of coherence, which is that a dose-finding design is coherent if it never escalates the dose when the most recently treated patient experiences toxicity, and never de-escalates the dose when the most recently treated patient does not experience toxicity. From an ethical and practical viewpoint, it is desirable for a dose-finding design to be coherent. Cheung [1] established sufficient conditions for a single-agent dose-finding design to be consistent.

Drug combination trials have become a mainstream approach for treating cancer because of the ability to induce a synergistic treatment effect and overcome the drug resistance that is common with monotherapies. However, dose finding in drug combination trials is more challenging. In the dose-combination matrix, each drug combination has up to eight adjacent combinations as potential candidates for dose escalation and de-escalation. More important, these combinations are only partially ordered due to drug-drug interactions. For example, suppose one combination has a higher dose of drug 1 whereas the other combination has a higher dose of drug 2, a priori we do not know the toxicity order of these two combinations. As a result, when dose escalation or de-escalation is needed, it is not immediately clear which dose combination should be selected. In contrast, in single-agent trials, we are concerned with a string of doses that are associated with monotonically increasing toxicity. When dose escalation is needed, we simply move to the next higher dose level, and when dose de-escalation is needed, we move to the next lower dose level. Because of these fundamental differences between single-agent and multi-agent trials, the definition and results of coherence developed by Cheung [1] for single-agent trials cannot be directly applied to drug combination trials. In this article, we generalize the definition of coherence and establish sufficient conditions of coherence for drug combination trials. As we show, such generalization is not trivial and requires vastly different considerations. One important strength of our results is that they are still valid when the assumed drug-combination dose-toxicity model is misspecified.

Numerical dose-finding designs have been proposed for drug combination trials. The majority of them are model-based designs and make adaptive decisions of dose escalation/de-escalation using a strategy similar to that in the continuous reassessment method (CRM [2]). That strategy is to devise a parametric model to describe the dose-toxicity surface and then, based on the accumulating data, continuously update the model estimate to guide the dose selection and assignment. Thall et al. [3] proposed a six-parameter model-based design to find the MTD. Wang and Ivanova [4] developed a drug combination dose-finding design based on a log linear model. Yin and Yuan [5] and Yuan and Yin [6] proposed Bayesian dose-finding designs based on a copula-type regression model. Wages et al. [7] extended the CRM based on partial ordering of the dose combinations. Braun and Jia [8] proposed the generalized CRM model to guide dose finding. Riviere et al. [9] proposed a Bayesian dose-finding design based on the logistic model. Cai et al. [10] and Riviere et al. [11] adopted change-point models for drug combination trials involving molecularly targeted agents.

The remainder of the paper is organized as follows. In materials and methods section, we review coherence for single-agent trials and propose the generalized concept and theory of coherence that embraces both single-agent and drug combination trials. In application section, we use the proposed theory to study the coherence of a number of drug combination designs in the literature. In discussion section, we conclude with a discussion.

Materials and methods

Coherence for single-agent trials

We first review coherence for single-agent trials. Consider a single-agent phase I trial with K doses under investigation, where a higher dose presumably has a higher probability of causing toxicity. Patients are sequentially enrolled, and each patient is treated at a dose that is adaptively selected by a certain trial design (e.g., CRM) based on the interim data. For each n = 1, …, N, where N is the prespecified maximum sample size, let X_n denote the dose level assigned to the nth patient, and Y_n be a toxicity indicator of the nth patient, with Y_n = 1 denoting toxicity.

A design is called coherent in escalation if Pr(X_n+1 > X_n|Y_n = 1) = 0 and coherent in de-escalation if Pr(X_n+1 < X_n|Y_n = 0) = 0 for n = 1, …, N [1]. In other words, a coherent design never escalates the dose when the most recently treated patient experiences toxicity, and never de-escalates the dose when the most recently treated patient does not experience toxicity. Coherence provides a useful finite-sample–based metric to evaluate the safety and appropriateness of dose transition for phase I trial designs. Cheung [1] showed that the CRM is coherent when the prior estimate of the MTD is used as the starting dose.

Coherence for drug combination trials

Consider a trial combining J doses of agent 1, denoted as u₁ < ⋯ < u_J, and K doses of agent 2, denoted as v₁ < ⋯ < v_K, where u_j and v_k are raw or standardized doses. Let (j, k) denote the combination of u_j and v_k, and p_j,k = Pr{Y_n = 1|X_n = (j, k)} denote the probability of dose-limiting toxicity (DLT) for (j, k). We assume that when the dose of one agent is fixed, the toxicity of the combination increases as the dose of the other agent increases, i.e., p_j,k < p_j′,k for j < j′ and p_j,k < p_j,k′ for k < k′. However, no toxicity order is assumed between other dose pairs. For example, the toxicity order between (j, k) and (j − 1, k + 1) and (j + 1, k − 1) is unknown a priori. This partial order in toxicity makes drug combination trials fundamentally different from single-agent trials.

Consider a model-based dose-finding design that assumes a dose-toxicity model (1) where F(⋅) is a parametric model indexed by unknown parameters θ = (θ₁, …, θ_p). We assume that θ is appropriately constrained such that the partial order is conformed, i.e., F_j,k(θ) < F_j′,k(θ) for j < j′ and F_j,k(θ) < F_j,k′(θ) for k < k′. For example, for logistic model logit{F_j,k(θ) = θ₁ + θ₂ u_j + θ₃ v_k + θ₄ u_j v_k, the partial order is imposed by the constraint θ₂ + θ₄ v_k > 0 for all k and θ₃ + θ₄ u_j > 0 for all j. As the assumed dose-toxicity model may be misspecified, it is important to distinguish F_j,k(θ), the toxicity probability ascribed by the model, from the true toxicity probability p_j,k. Throughout the paper, we do not require F(u_j, v_k, θ) to be correctly specified.

Suppose that at an interim time, a total of n patients have been enrolled and treated. Let H_n = {(X_i, Y_i), i = 1, …, n} denote the accumulative data from the enrolled patients. Unlike single-agent trials, where there is only a single candidate dose level X_n + 1 for escalation and X_n − 1 for de-escalation, in drug combination trials, there are multiple candidate doses are available for escalation or de-escalation. For example, in the case of dose escalation, we can escalate either the dose level of drug 1 or the dose level of drug 2. Therefore, it is imperative to generalize the definition of coherence.

Specifically, given X_n, let and denote the set of candidate dose levels for escalation and de-escalation, respectively, for the (n + 1)th patient, and define to present all possible doses assignments for the (n + 1)th patient. A generalized definition of coherence that is applicable to both single-agent and drug combination trials follows.

Definition 1 A design is coherent in dose escalation if for n = 1, …, N, and is coherent in dose de-escalation if for n = 1, …, N. A design is coherent if its dose escalation and de-escalation are both coherent.

The definition of coherence for the single-agent trial proposed by Cheung [1] is a special case of the above, with and .

Given the observed interim data H_n and a specific definition of and , X_n+1 is typically determined as (2) where p_T is the target toxicity probability, is the model-based toxicity estimate, and is the posterior mean of θ. That is, the next patient will be treated at the dose level that belongs to and has estimated toxicity probability closest to the target p_T. For the first patient, H_n is empty and is obtained based on the prior distribution of θ. In the case that it is desirable to start the trial from the lowest dose (1, 1), we can set the prior estimate of the MTD as (1, 1).

In drug combination trials, the choice of and has a profound impact on the coherence of a design, which does not arise in single-agent trials. To be consistent with practice, we herein assume the no-dose-skipping rule that restricts dose escalation and de-escalation to doses that are adjacent to the current dose. More precisely, given X_n = (j, k), . We do not consider (j + 1, k + 1) because dose escalation from (j, k) to (j + 1, k + 1) is equivalent to skipping doses (j + 1, k) and (j, k + 1), noting that p_j,k < p_j+1,k, p_j,k+1 < p_j+1,k+1. For a similar reason, we do not consider dose de-escalation from (j, k) to (j − 1, k − 1). Nevertheless, all the results presented below are valid if (j + 1, k + 1) and (j − 1, k − 1) are included. In the literature, there are two common ways to specify and , as illustrated in Fig 1. The first approach, referred to as the ND-design, does not allow for dose movement along the diagonals: and . The second choice, the D-design, allows for dose movement along the diagonals, where contain doses along the diagonal, i.e., (j − 1, k + 1) and (j + 1, k − 1).

Download:

Fig 1. (a) ND-design, which does not allow for diagonal dose movement, and (b) D-design, which allows for diagonal dose movement.

The filled circle indicates the current dose combination; the empty circles denote the candidate doses for escalation and de-escalation under the given design.

https://doi.org/10.1371/journal.pone.0242561.g001

ND-design.

We first investigate the coherence of the ND-design and refer to the following condition.

Condition A. For any θ = (θ₁, …, θ_p) and for any n with X_n = (j, k), (3) where denotes the tth element of .

Theorem 1 Under condition A, the ND-design is coherent.

Condition A requires the determination of the sign of , for t = 1, …, p, which may not be straightforward. A special case of condition A that is easier to evaluate is uniform monotonicity.

Definition 2 (uniform monotonicity) F(u, v, θ) is uniformly nondecreasing (or nonincreasing) if for any value of (u, v), F(u, v, θ) is nondecreasing (or nonincreasing) in θ_t for all t = 1, …, p.

Lemma 2 If F is uniformly monotonic, then condition A holds.

Theorem 3 The ND-design is coherent if F(u, v, θ) is uniformly monotonic.

The proof of Lemma 2 is provided in the Appendix A of Supplementary material. Theorem 3 is established directly from Lemma 2 and Theorem 1. Although uniform monotonicity is easier to check, many dose-toxicity models do not satisfy that condition, as shown later. In these cases, Theorem 1 can be used to examine the coherence of the ND-design.

D-design.

The D-design allows for dose escalation and de-escalation to the adjacent doses on the diagonals, i.e., (j + 1, k − 1), (j − 1, k + 1), assuming that (j, k) is the current dose level. The complication is that a priori we do not know the toxicity order between (j, k) and (j + 1, k − 1) and (j − 1, k + 1), thus it is not clear whether (j + 1, k − 1) and (j − 1, k + 1) belong to or . Different definitions of and lead to different definitions of coherence. A straightforward approach is to define and based on the true toxicity probability of these doses. We refer to the resulting coherence as strong coherence, which indicates that the design will never escalate to a dose that is truly more toxic than the current dose if the most recently treated patient has toxicity; and will never de-escalate to a dose that is truly less toxic than the current dose if the most recently treated patient does not have toxicity.

To establish sufficient conditions for a design to be strongly coherent, we define conditions B1 and B2 as follows.

Condition B1. For any θ, F(u, v, θ) is increasing in both u and v.
Condition B2. For any j and k and for any θ, and where the signum function is defined by sgn(x) = − 1, 0 or 1 for x < 0, x = 0 or x > 0, respectively.

Theorem 4 Under conditions B1 and B2,

If condition A holds, the D-design is strongly coherent.
If the condition of uniform monotonicity holds, the D-design is strongly coherent.

The proof of Theorem 4 is provided in the Appendix A of Supplementary material.

As the true toxicity probability is unknown in practice, a more practical approach is to define or based on the model estimates. Recall that to determine dose level X_n+1 for the (n + 1)th patient, the model-based design fits the model F(u_j, v_k, θ) using interim data H_n, and then makes the decision of dose escalation or de-escalation for X_n+1 based on , as specified by Eq (2). Based on , the toxicity order of (j + 1, k − 1) and (j − 1, k + 1), with respect to (j, k), can be determined and used to define and . We refer to the resulting coherence as weak coherence. Accordingly, weak coherence indicates that the design will never escalate to a dose for which the estimated toxicity probability is higher than the current dose if the most recently treated patient has toxicity; and will never de-escalate to a dose for which the estimated toxicity probability is lower than the current dose if the most recently treated patient does not have toxicity. Because of potential model misspecification, a weakly coherent design is not necessarily strongly coherent. For example, suppose X_n = (j, k), Y_n = 0 and X_n+1 = (j + 1, k − 1). If , but the truth is p_j+1,k−1 < p_j,k, then the dose assignment for the (n + 1)th patient is weakly coherent, but not strongly coherent. This phenomenon is unique in drug combination trial designs because of the unknown toxicity order between some drug combinations. This phenomenon does not exist for single-agent trial designs, where the toxicity order is completely known among all doses. Despite this issue, weak coherence is still useful because the true toxicity probability is unknown in practice. From the user’s perspective, it is certainly concerning if a trial design has a high likelihood of escalating to a dose that is expected to have higher toxicity after the most recently treated patient experienced toxicity at a lower dose. It can be shown that the D-design is weakly coherent under the same condition as the ND-design (see the Appendix A of Supplemental material for the proof).

Theorem 5 If condition A or the condition of uniform monotonicity holds, the D-design is weakly coherent.

Comparing Theorem 5 with Theorem 4, it is clear that weak coherence is more lenient than strong coherence. With the extra requirements specified by conditions B1 and B2, a weakly coherent D-design becomes strongly coherent.

A two-stage design.

The above results are established under the assumption that the design starts by treating the first patient at the prior estimate of the MTD and the dose for each subsequent patient is selected based on the model estimates according to Eq (2). In practice, however, for patient safety, the trial often starts with the lowest dose, which is not necessarily the prior estimate of the MTD. In addition, as the dose-toxicity model for drug combination trials is relatively complicated, to improve estimation and design reliability, the two-stage design is often used. In the first stage (or the initial design), we make the decision of dose escalation/de-escalation based on a set of simple prespecified rules, without using the model, to collect some preliminary data. We then switch to the second stage (or the model-based design), in which we base the decision of dose escalation/de-escalation on the model estimates, as described previously. A typical example of the initial design is the “titration” design, under which we pre-select a string of dose combinations with monotonically increasing toxicity; see Fig 2. We treat the first patient at the lowest combination (1, 1) and then escalate the dose for treating the next patient if no toxicity is observed for the current patient. We continue this dose escalation (or titration) process until we encounter the first toxicity, and then we switch to the model-based design. Cheung [1] studied the condition of coherence for two-stage single-agent trials. In what follows, we provide the coherence condition for two-stage combination trials.

Download:

Fig 2. Different ways to pre-select a string of drug combinations with monotonically increasing toxicity for the initial titration design.

https://doi.org/10.1371/journal.pone.0242561.g002

Let X_1n and X_2n denote the dose assignment according to the initial and model-based designs, respectively. Let R denote a prespecified rule that triggers the switch from the initial design to the model-based design. Then, the dose assignment for the two-stage design, denoted as , can be formally defined by

The theorem below provides a sufficient condition for the two-stage design to be coherent.

Theorem 6 Assume that the initial design and the model-based design are coherent. Let M denote the first patient whose dose is selected based on the model-based design, i.e., the patient enrolled at the moment of transition from the initial design to the model-based design. If almost surely, then the design with is coherent.

Application

We apply our results to examine the coherence of some model-based drug combination designs in the literature.

Example 1. Logistic and scaled logistic regression models Riviere et al. [9] considered a standard logistic regression model with F(u_j, v_k, θ) = logit⁻¹(θ₁ + θ₂ u_j + θ₃ v_k + θ₄ u_j v_k), where , with θ₂ > 0 and θ₃ > 0 ensuring that the toxicity probability is increasing with the increasing dose level of each agent alone, and θ₃ + θ₄ u_j > 0 for all j and θ₂ + θ₄ v_k > 0 for all k ensuring that the toxicity probability is increasing with the increasing dose levels of both agents together.

Depending on how the doses u_j and v_k are specified, Theorem 1 or Theorem 3 can be used to examine the coherence of the logistic model-based designs. When the raw dosages of the drugs are used, u_j > 0 and v_k > 0 and thus g(u_j, v_k, θ) ≡ θ₁ + θ₂ u_j + θ₃ v_k + θ₄ u_j v_k is increasing in θ_t for t = 1, …, 4. Because logit⁻¹(⋅) is monotonically increasing, it follows that F(u, v, θ) is monotonically increasing in θ_t for t = 1, …, 4, and thus the uniform monotonicity condition holds. By Theorem 3, the ND-design with the logistic regression model is coherent. Also, by Theorem 5, the D-design with the logistic regression model is weakly coherent. For the D-design to be strongly coherent, we need to examine conditions B1 and B2. Condition B1 holds because of θ₂ > 0 and θ₃ > 0, but it is difficult to verify condition B2 because it involves the true dose-toxicity relationship that is unknown.

In many cases, standardized doses, rather than the raw dosages, are often used in the logistic model to improve the numerical stability or interpretation. For example, Riviere et al. [9] suggested using the standardized doses u_j = logit(a_j) and v_j = logit(b_k), where a_j and b_k are estimates of the toxicity probabilities of the jth dose level of agent 1 and the kth dose level of agent 2, respectively, when they are administered individually as a single agent. In addition, logarithmic transformation or centering is often employed in practice to standardize the dose. As a result, the standardized doses u_j’s and v_k’s are not necessarily all positive. Thus, the uniform monotonicity condition does not hold and Theorem 3 cannot be used. In these cases, Theorem 1 provides a more general tool to use for studying the coherence by examining condition A, described briefly as follows. Because of the symmetric role of u_j and v_k, without loss of generality, we assume u_j ≥ 0 and v_k ≤ 0. Suppose that Y_n = 0, define and , where ϕ^(t) = ψ^(t) except for the tth element, t = 1, …, 4. Let be the posterior mean of the tth element of θ based on the first n observations. By the approach used in the proof of Lemma 2, for t = 1, 2, we have implying , and for t = 3, 4, we have , implying that . So, for t = 1, 2, 3, 4. Similarly, we obtain for t = 1, 2, 3, 4 when Y_n = 1. Therefore, condition A holds, and we obtain a more general result: the ND-design with the logistic regression model is coherent, and the D-design with the logistic regression model is weakly coherent, no matter how u_j and v_k are specified.

Braun and Jia [8] proposed different coding for the doses, specifying dose 1 as a categorical dummy variable and dose 2 as a continuous variable. They called the resulting model the generalized CRM model, given by where −∞ < α_k < ∞, k = 1, …, K and β > 0. Cai et al. [10] proposed a modification of the logistic model, namely the scaled logistic model, for drug combination trials that involve molecularly targeted agents, with the form where 0 < ρ < 1. The scaled logistic model plateaus at ρ, rather than 1 as in the standard logistic model. It can be shown that the ND-design with the generalized CRM model or scaled logistic regression model is coherent, and the D-design with the generalized CRM or scaled logistic regression model is weakly coherent. The details are provided in the Appendix B of Supplemental material.

Example 2. Change-point model For some targeted agent, the dose-toxicity curve may initially increase at low doses and then plateau at high doses. Cai et al. [10], Riviere et al. [11] and Sato et al. [12] proposed using the change-point model for some drug combination trials, as given by where I(⋅) denotes an indicator function, θ = (α, β, γ, w) with −∞ < α < ∞, β > 0, γ > 0 and −∞ < w < ∞. The curve of the model initially increases with the dose level but flattens once it passes the threshold defined by α + βu_j + γv_k = w. Let η(x) = logit(x) for 0 < x < 1. Suppose that Y_n = 0, in what follows, we use α as an example to show that condition A holds. Let denote a posterior mean of α based on the first n patients. For fixed values of β, γ and w, we take ϕ = (α₁, β, γ, w) and ψ = (α₂, β, γ, w) for any α₁ and α₂. By the mean value theorem, we have , where lies between F_j,k(ϕ), and F_j,k(ψ) and η′(F) denotes the derivative of η with respect to F. For α₁ ≤ α₂, there are three possible cases: (1) α₁ + βu_j + γv_k ≤ w and α₂ + βu_j + γv_k ≤ w; (2) α₁ + βu_j + γv_k ≤ w and α₂ + βu_j + γv_k > w; and (3) α₁ + βu_j + γv_k > w. In the first case, η{F_j,k(ϕ)} − η{F_j,k(ψ)} = α₁ − α₂ ≤ 0. The second case yields η{F_j,k(ϕ)} − η{F_j,k(ψ)} = α₁ + βu_j + γv_k − w ≤ 0, and the third case induces η{F_j,k(ϕ)} − η{F_j,k(ψ)} = w − w = 0. That is, F is nondecreasing in α, noting that η′(x) = 1/{x(1 − x)} ≥ 0 for 0 < x < 1. Thus, we have (α₁ − α₂){F_j,k(ψ) − F_j,k(ϕ)} ≤ 0 for any α₁ and α₂. By the approach used in the proof of Lemma 2, we have and . Along a similar line, it can be shown that a similar inequality holds for β, γ and w (see the Appendix B of Supplemental material for details). Similarly, we obtain the inequality for all parameters when Y_n = 1, and thus condition A holds. Applying Theorem 1 and Theorem 5, we conclude that the ND-design with the change-point model is coherent and the D-design with the change-point model is weakly coherent, respectively.

Example 3. Copula-type regression models Yin and Yuan [5] proposed a drug combination design based on the Clayton copula regression model where p_j and q_k are prior estimates of the toxicity probability for level j of agent 1 and level k of agent 2, respectively, when they are used as monotherapy, and θ = (α, β, γ) with α, β, γ > 0. Let G(x) = (1 − x)^−γ. Then, . Suppose that Y_n = 0. To apply Theorem 1, we check condition A with respect to α. For fixed values of β and γ, we take ϕ = (α₁, β, γ) and ψ = (α₂, β, γ) for any α₁ and α₂. By the mean value theorem, we have for some that lies between F_j,k(ϕ) and F_j,k(ψ), where G′(F) denotes the derivative of G with respect to F. Since G(x) is increasing and f(x) = (1 − p^x)^−γ for some p ∈ (0, 1) is decreasing, F_j,k(ϕ) − F_j,k(ψ) ≥ 0 for α₁ ≤ α₂, i.e., F is nonincreasing in α. Let be a posterior mean of α based on the first n patients. By using the approach used in the proof of Lemma 2, we have and thus . We can show that a similar inequality holds with respect to β and γ (see the Appendix B of Supplemental material). Thus, condition A holds. Therefore, the drug combination ND-design based on the Clayton copula regression model is coherent and the D-design with the Clayton copula regression model is weakly coherent.

For drug combination trials, Yin and Yuan [5] proposed an alternative copula-type regression model, i.e., the Gumbel model, as given by where θ = (α, β, γ) with α, β, γ > 0. As shown in the Appendix B of Supplemental material, the ND-design based on the Gumbel model is also coherent, and the D-design based on the Gumbel model is weakly coherent.

Example 4. Log-linear model Wang and Ivanova [4] considered a toxicity model given by with θ = (α, β, γ), where α > 0, β > 0 and γ < 0. Let G_j,k(θ) = α log(1 − u_j) + β log(1 − v_k) + γ log(1 − u_j) log(1 − v_k). Then, F_j,k(θ) = 1 − exp{G_j,k(θ)}. Suppose that Y_n = 0. We claim that condition A holds for the design with the log-linear model with respect to α. Let denote a posterior mean of α based on the first n patients. For fixed values of β and γ, we take ϕ = (α₁, β, γ) and ψ = (α₂, β, γ) for any α₁ and α₂. Then, for some that lies between G_j,k(ϕ) and G_j,k(ψ). If u_j < 0, then G_j,k(ψ) − G_j,k(ϕ) ≥ 0 for α₁ ≤ α₂, implying that F_j,k(ϕ) − F_j,k(ψ) ≥ 0 for α₁ ≤ α₂, i.e., F is nondecreasing in α. By the approach used in the proof of Lemma 2, . So, . Likewise, is obtained even if 0 ≤ u_j(< 1). In other words, regardless of the sign of the dose for the first agent u_j, we obtain . Similarly, we can show that the inequality, Eq (3), holds for β and γ (see the Appendix B of Supplemental material for details), and thus condition A holds. So, the ND-design is coherent and the D-design is weakly coherent.

Example 5. Six-parameter model Thall et al. [3] considered a six-parameter toxicity model given by where 0 ≤ u_j ≤ 1 and 0 ≤ v_k ≤ 1 are standardized doses, and θ = (α₁, α₂, α₃, β₁, β₂, β₃) with α₁ > 0, α₂ > 0, and , to ensure that toxicity monotonically increases with each of the doses. Let . Then, F_j,k(θ) = G_j,k(θ)/{1 + G_j,k(θ)}. Suppose that Y_n = 0. It is easy to see that and F is nondecreasing in α₁, because f(x) = x/(1 + x) is nondecreasing in x. Let denote the posterior mean of α₁ based on the first n patients. By the approach used in the proof of Lemma 2, and . We can show that a similar inequality, Eq (3), holds for α₂, α₃, β₁, β₂ and β₃. Therefore, condition A holds and, under the six-parameter model, the ND-design is coherent and the D-design is weakly coherent.

Discussion

Drug combination trials are more challenging than single-agent trials because of the higher dimensions of the dose search space and partial ordering among the drug combinations. We have proposed the concept of coherence for drug combination trials and distinguished two types of drug combination designs: the ND-design, which forbids diagonal dose movement, and the D-design, which allows for diagonal dose movement. To account for the possibility of model misspecification when using the D-design, we further defined weak coherence and strong coherence based on the model estimates and true toxicity probabilities of the doses, respectively. We provided sufficient conditions to study the coherence of the model-based drug combination designs. We investigated a number of drug combination models in the literature and showed that under these models, the ND-design is coherent and the D-design is weakly coherent. In general, it is difficult to establish strong coherence of a D-design because it involves the knowledge of the unknown, true dose-toxicity relationship. From a practical viewpoint, if strong coherence is desirable, we recommend adopting the ND-design by forbidding diagonal dose movement.

We have shown that some model-based designs are coherent for ND-designs and weakly coherent for D-designs, however not all model-based designs are coherent. One example is partial-order CRM [7], which fits multiple (single-agent) CRM models, each assuming a different toxicity order for the combinations, and then uses Bayesian model averaging (BMA) to summarize the estimates over different models and make the decision of dose transition. Although each CRM model is coherent, averaging over them would lead to incoherent decisions. This is because each of the CRM models may recommend a different dose-transition decision, and the final decision based on BMA would be consistent with some, but inconsistent with others, leading to incoherence.

We assumed that the toxicity outcomes of previous patients have been fully observed before accruing the next cohort of patients. This however may not be true when the toxicity is late-onset. In this case, we can employ the Bayesian data augmentation method to handle the delayed toxicity and facilitate real-time decision making of dose escalation and de-escalation [13, 14]. The coherence of drug combination designs in the presence of late-onset toxicity will be an interesting topic of future research.

Supporting information

S1 File. Appendix A and B of supplemental material are available with this paper at PLOS ONE website.

https://doi.org/10.1371/journal.pone.0242561.s001

(PDF)

References

1. Cheung YK. Coherence principles in dose-finding studies. Biometrika. 2005;92(4):863–873.
- View Article
- Google Scholar
2. O’Quigley J, Pepe M, Fisher L. Continual reassessment method: a practical design for phase I clinical trials in cancer. Biometrics. 1990; p. 33–48.
- View Article
- Google Scholar
3. Thall PF, Millikan RE, Mueller P, Lee SJ. Dose-Finding with Two Agents in Phase I Oncology Trials. Biometrics. 2003;59(3):487–496. pmid:14601749
4. Wang K, Ivanova A. Two-Dimensional Dose Finding in Discrete Dose Space. Biometrics. 2005;61(1):217–222. pmid:15737096
5. Yin G, Yuan Y. Bayesian dose finding in oncology for drug combinations by copula regression. Journal of the Royal Statistical Society: Series C (Applied Statistics). 2009;58(2):211–224.
- View Article
- Google Scholar
6. Yuan Y, Yin G. A Bayesian Phase I/II Design for Oncology Clinical Trials of Combining Biological Agents. The Annals of Applied Statistics. 2011;5(2A):924–942.
- View Article
- Google Scholar
7. Wages NA, Conaway MR, O’Quigley J. Continual reassessment method for partial ordering. Biometrics. 2011;67(4):1555–1563. pmid:21361888
8. Braun TM, Jia N. A generalized continual reassessment method for two-agent phase I trials. Statistics in biopharmaceutical research. 2013;5(2):105–115.
- View Article
- Google Scholar
9. Riviere MK, Yuan Y, Dubois F, Zohar S. A Bayesian dose-finding design for drug combination clinical trials based on the logistic model. Pharmaceutical statistics. 2014;13(4):247–257. pmid:24828456
10. Cai C, Yuan Y, Ji Y. A Bayesian dose finding design for oncology clinical trials of combinational biological agents. Journal of the Royal Statistical Society: Series C (Applied Statistics). 2014;63(1):159–173.
- View Article
- Google Scholar
11. Riviere MK, Yuan Y, Dubois F, Zohar S. A Bayesian dose finding design for clinical trials combining a cytotoxic agent with a molecularly targeted agent. Journal of the Royal Statistical Society: Series C (Applied Statistics). 2015;64(1):215–229.
- View Article
- Google Scholar
12. Sato H, Hirakawa A, Hamada C. An adaptive dose-finding method using a change-point model for molecularly targeted agents in phase I trials. Statistics in medicine. 2016;35(23):4093–4109. pmid:27221807
13. Liu S, Ning J. A Bayesian dose-finding design for drug combination trials with delayed toxicities. Bayesian analysis. 2013;8(3):703.
- View Article
- Google Scholar
14. Liu S, Yin G, Yuan Y. Bayesian data augmentation dose finding with continual reassessment method and delayed toxicity. The annals of applied statistics. 2013;7(4):1837.
- View Article
- Google Scholar

[ref1] 1. Cheung YK. Coherence principles in dose-finding studies. Biometrika. 2005;92(4):863–873.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. O’Quigley J, Pepe M, Fisher L. Continual reassessment method: a practical design for phase I clinical trials in cancer. Biometrics. 1990; p. 33–48.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Thall PF, Millikan RE, Mueller P, Lee SJ. Dose-Finding with Two Agents in Phase I Oncology Trials. Biometrics. 2003;59(3):487–496. pmid:14601749
View Article
PubMed/NCBI
Google Scholar

[8] View Article

[9] PubMed/NCBI

[10] Google Scholar

[ref4] 4. Wang K, Ivanova A. Two-Dimensional Dose Finding in Discrete Dose Space. Biometrics. 2005;61(1):217–222. pmid:15737096
View Article
PubMed/NCBI
Google Scholar

[12] View Article

[13] PubMed/NCBI

[14] Google Scholar

[ref5] 5. Yin G, Yuan Y. Bayesian dose finding in oncology for drug combinations by copula regression. Journal of the Royal Statistical Society: Series C (Applied Statistics). 2009;58(2):211–224.
View Article
Google Scholar

[16] View Article

[17] Google Scholar

[ref6] 6. Yuan Y, Yin G. A Bayesian Phase I/II Design for Oncology Clinical Trials of Combining Biological Agents. The Annals of Applied Statistics. 2011;5(2A):924–942.
View Article
Google Scholar

[19] View Article

[20] Google Scholar

[ref7] 7. Wages NA, Conaway MR, O’Quigley J. Continual reassessment method for partial ordering. Biometrics. 2011;67(4):1555–1563. pmid:21361888
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref8] 8. Braun TM, Jia N. A generalized continual reassessment method for two-agent phase I trials. Statistics in biopharmaceutical research. 2013;5(2):105–115.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref9] 9. Riviere MK, Yuan Y, Dubois F, Zohar S. A Bayesian dose-finding design for drug combination clinical trials based on the logistic model. Pharmaceutical statistics. 2014;13(4):247–257. pmid:24828456
View Article
PubMed/NCBI
Google Scholar

[29] View Article

[30] PubMed/NCBI

[31] Google Scholar

[ref10] 10. Cai C, Yuan Y, Ji Y. A Bayesian dose finding design for oncology clinical trials of combinational biological agents. Journal of the Royal Statistical Society: Series C (Applied Statistics). 2014;63(1):159–173.
View Article
Google Scholar

[33] View Article

[34] Google Scholar

[ref11] 11. Riviere MK, Yuan Y, Dubois F, Zohar S. A Bayesian dose finding design for clinical trials combining a cytotoxic agent with a molecularly targeted agent. Journal of the Royal Statistical Society: Series C (Applied Statistics). 2015;64(1):215–229.
View Article
Google Scholar

[36] View Article

[37] Google Scholar

[ref12] 12. Sato H, Hirakawa A, Hamada C. An adaptive dose-finding method using a change-point model for molecularly targeted agents in phase I trials. Statistics in medicine. 2016;35(23):4093–4109. pmid:27221807
View Article
PubMed/NCBI
Google Scholar

[39] View Article

[40] PubMed/NCBI

[41] Google Scholar

[ref13] 13. Liu S, Ning J. A Bayesian dose-finding design for drug combination trials with delayed toxicities. Bayesian analysis. 2013;8(3):703.
View Article
Google Scholar

[43] View Article

[44] Google Scholar

[ref14] 14. Liu S, Yin G, Yuan Y. Bayesian data augmentation dose finding with continual reassessment method and delayed toxicity. The annals of applied statistics. 2013;7(4):1837.
View Article
Google Scholar

[46] View Article

[47] Google Scholar

Figures