Figures
Abstract
Cancer development is a dynamic and continuously evolving process, with the emergence of drug-resistant cancer cells being one of the primary reasons for the failure of traditional treatments. Adaptive therapy, as an emerging cancer treatment strategy, is increasingly being applied in oncology. In this study, we incorporate pharmacokinetics into a cancer evolutionary game theory model and propose an optimal control problem constrained by maximum drug concentration and maximum tumor burden. Firstly, we demonstrate the existence of an optimal control for this problem. Secondly, using Pontryagin’s minimum principle, we formulated the structure of the optimal control to design an optimal adaptive therapy strategy. Finally, through numerical simulations, we compare the optimal adaptive therapy strategy with other adaptive therapies and traditional treatments, and further develop personalized treatment plans for different patient groups. The results demonstrate that the optimized adaptive treatment strategy effectively preserves a high survival rate of healthy cells during treatment. By maintaining drug-sensitive and drug-resistant cell populations in a state of low-level competition, this approach prevents the proliferation of drug-resistant cells, reduces the tumor burden on patients, and extends overall survival.
Citation: Li Z, Tan X, Yu Y (2025) Optimal adaptive cancer therapy based on evolutionary game theory. PLoS ONE 20(4): e0320677. https://doi.org/10.1371/journal.pone.0320677
Editor: Nikos Kavallaris, Karlstad University: Karlstads Universitet, SWEDEN
Received: September 13, 2024; Accepted: February 21, 2025; Published: April 23, 2025
Copyright: © 2025 Li et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper.
Funding: This work was supported by the National Natural Science Foundation of China (Nos. 11361104, 12261104), the Youth Talent Program of Xingdian Talent Support Plan (XDYC-QNRC 2022-0514), the Yunnan Provincial Basic Research Program Project (No. 202301AT070016, No. 202401AT070036), the yunnan Province International Joint Laboratory for Intelligent Integration and Application of Ethnic Multilingualism (202403AP140014). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The heterogeneity of cancer [1] and the evolution of drug resistance are significant challenges in cancer treatment. Ecological and evolutionary dynamics are increasingly recognized as crucial factors in the emergence of tumor heterogeneity and drug resistance [2–4]. Currently, maximum tolerated dose therapy and metronomic therapy [5,6] are still widely used clinical cancer treatment methods. While these treatments often show initial success, the toxic side effects of chemotherapeutic drugs and the emergence of resistance lead to the growth of uncontrolled drug-resistant cancer cell populations, making tumor recurrence inevitable.
In recent years, adaptive therapy [7,8] has emerged as an effective approach for cancer treatment. Adaptive therapy strategies involve dynamically adjusting treatment regimens based on current estimates of cancer cell growth or anticipated tumor evolution, derived from biopsies, antibody tests, and other diagnostic methods. These strategies determine the appropriate drug dosages and timing for treatment initiation or cessation. Recently, adaptive therapy has shown promising results in several clinical cancer treatments [9,10]. It is widely acknowledged that tumor heterogeneity and the development of drug resistance result from natural selection among different cancer cell subpopulations within the tumor. Incorporating evolutionary game theory into mathematical models of cancer treatment offers a fresh viewpoint on comprehending and refining therapeutic approaches. Park and Newton [11] formulated a restricted cellular framework to explore tumor evolutionary dynamics, concentrating on random fluctuations throughout multiple adaptive chemotherapy sessions. Zhang et al. [12] proposed a pilot study where they used an evolutionary mathematical model to guide the dosage of abiraterone, aiming to postpone the emergence of resistance. Masud and Kim [13] introduced a time-dependent evolutionary dosing strategy formulated to control the persistent rivalry among drug-sensitive, resistant, and plastic cells. Bukkuri and Adler [14] developed an evolutionary game theory model to investigate the competitive interactions among cancer cells with differing biomarker production levels. They designed simulated treatment protocols aimed at driving cancer cells into evolutionary traps.
In this study, we build upon the evolutionary game theory model introduced by West et al. [15] by integrating the pharmacokinetics of chemotherapeutic drugs. Based on the constraints of the maximum drug tolerance concentration and the maximum tumor burden that the patient’s body can endure, an optimal control problem is proposed. The goal of the optimal control scenario is to minimize drug toxicity and the proportion of cancer cells by the conclusion of treatment. The aim is to devise a treatment strategy that ensures the patient’s body can tolerate the maximum drug concentration while simultaneously controlling and managing tumor growth so that it does not exceed the maximum tumor burden the patient can handle. Firstly, in the optimal control problem section, we show that an optimal control exists for the designed optimal control problem. Secondly, in the optimal control characterization section, We demonstrate that the state constraints for maximum drug concentration and tumor burden satisfy the regularity conditions, and we obtain the control variable values (drug dosage) at the boundary states. Using Pontryagin’s minimum principle, we deduce the necessary conditions for the optimal control problem and obtain the optimal control structure. Finally, in the result section, we propose an optimal adaptive therapy scheme, comparing it with the adaptive therapy scheme proposed by West et al. and two traditional treatment strategies (MTD therapy and metronomic therapy). Additionally, we design personalized optimal adaptive therapy schemes for two types of patients: those who can tolerate high tumor burdens and those who can tolerate low tumor burdens. In the discussion section, we provide a conclusive evaluation of the optimal adaptive therapy scheme designed using the improved evolutionary game theory model, highlighting existing limitations and suggesting directions for future improvements.
Mathematical model
West et al. [15] proposed a model of cancer progression under the strategy of harsh environment induction,which details the dynamic competitive relationships among healthy cells, drug-sensitive cancer cells, and drug-resistant cancer cells:(1)where
represent the proportions of healthy cells, drug-sensitive cells, and drug-resistant cells respectively (
). The function
characterizes the fitness function of each cell subpopulation, which varies according to its environment. The term ⟨ f ⟩ denotes the average fitness of the three cell subpopulations. The fitness functions and average fitness are defined as follows:
(2)
(3)where
,
represents the i-th row element of
,
represents the natural selection pressure experienced by each subpopulation in the tumor microenvironment, and
. The natural selection pressure parameters for each cell subpopulation are equal before the start of chemotherapy (
). A represents a matrix denoting the payoff matrix of three cell subpopulations in an evolutionary game.
(4)
The payoff matrix for the evolutionary game is expressed as follows:where healthy cells (H), drug-sensitive cells (S), and drug-resistant cells (R) compete in pairs. In the game between healthy cells and cancer cells, healthy cells act as cooperators, while drug-sensitive and drug-resistant cells act as defectors. The game involving drug-sensitive and drug-resistant cells sees the former as defectors and the latter as cooperators. The payoffs obtained by the three cell subpopulations in their pairwise games correspond to the values in the payoff matrix. To ensure Gompertzian growth of the tumor cell population in a drug-free environment [16], the parameters in the above payoff matrix must satisfy the following inequalities:
The inequalities for the evolutionary game between the cell subpopulations are as follows:
- Between healthy cells and drug-sensitive cancer cells: e > a > g > b.
- Between healthy cells and drug-resistant cancer cells: k > a > o > d.
- Between drug-sensitive cancer cells and drug-resistant cancer cells: h > o > g > l.
In cancer chemotherapy, the continuous administration of chemotherapeutic drugs involves a series of processes within the human body [17,18], including absorption, distribution, metabolism, and excretion. Drugs can be administered through intravenous injection, intramuscular injection, intraperitoneal injection, oral administration, and other methods, entering the bloodstream and reaching the tumor site either directly or indirectly. As drugs accumulate and are cleared in the body, the drug concentration dynamically changes according to the dosage. Based on the above model, we introduce pharmacokinetics into the evolutionary game model of cancer progression.(5)where c represents the drug concentration in the body, m represents the dosage, and p,q represent the pharmacokinetic rates of drug administration and clearance, respectively.
Optimal control problem
This section introduces an optimal control problem that includes state constraints and demonstrates the existence of the optimal control. Under the state constraint conditions, by controlling the dosage of the drug, we regulate the drug concentration in the body. This guarantees that the healthy cells remain alive and maintain a high level of health,while limiting the proliferation of cancer cells and the toxic effects of the drug.
Optimal control problem with constraints
The control function of system (5) satisfies:(6)
All control functions m(t) satisfying (6) are called admissible controls. The collection of all permissible controls forms the admissible control set, denoted as(7)
To avoid the competitive imbalance between two types of tumor cells caused by drug toxicity and to ensure that healthy cells survive at a high fitness level, we introduce the following state constraints:(8)
(9)where
is the maximum tolerable drug concentration for the patient,
is the overall fraction of cancer cells, and
is the maximum tolerable tumor burden[20].
We put forward the following goal function for the control issue:(10)where the state variable is represented as
, and
denotes the overall fraction of cancer cells upon the completion of the treatment cycle, λ and η denote the weight coefficients.
Existence proof of the optimal control
To facilitate understanding, we restate the optimal control problem:(11)
Control constraints:(16)where
.
Next, we establish that an optimal control exists [19,20]. The optimal control problem Eq (11)-(16) has an optimal solutionsuch that
To establish that an optimal solution (x*,m*) exists, these conditions must be satisfied:
- There exists a pair (x*,m*).
- G(x,U,t) is convex for each (x,t), where
(17)
- The set U is bounded.
- For all
and for all feasible solutions, there exists a positive number δ such that ∥ x ( t ) ∥ ≤ δ.
Now, to prove the satisfaction of the first condition: To demonstrate the existence of a feasible solution for the optimal control problem, the following conclusion needs to be established. references [17,21,22]:
- (i) f ( ⋅ , m ) is continuous for every m ∈ U.
- (ii) Positive constants
and
exist such that for every
:
(18)
(19)
- (iii) U is not empty.
Based on the state equation, it is clear that f ( ⋅ , m ) is continuous for all m ∈ U, satisfying (i). We further consider (ii):where
In summary, (ii) holds. Obviously, U is non-empty, (iii) holds. Therefore, the first condition optimal control problem exists and feasible solutions exist.
Next, we prove the second condition, refer to [23]:
To establish the convexity of G(x,U,t), it is necessary to satisfy the following conditions:
Thus, let . Clearly,
, and since
, we have
.
Therefore, , which proves that G(x,U,t) is convex for every (x,t).
Additionally, since the admissible control set is defined to be bounded and closed, we can conclude that U is bounded and closed. Since for all , the solutions to the optimal control problem are bounded, we can find a δ > 0 for which ∥ x ( t ) ∥ ≤ δ holds. Therefore, the third and fourth conditions are also satisfied. In summary, the existence proof of the optimal control problem is complete.
Optimal control characterization
In this section, we use the method of Tan et al. [24] to prove that the state constraints satisfy regularity conditions and separately obtain the boundary drug dosages under each state. To explore the requisite conditions for the optimal control problem, we utilized Pontryagin’s minimum principle and formulated the optimal control structure for the problem.
State constraints
For the state constraint , consider the state equation
The control variable m is explicitly included in the state equation, which adheres to the first-order regularity criterion
During continuous administration of the drug, the drug concentration reaches . At this boundary state, ċ = 0, and we obtain the boundary drug dosage
(20)
For the state constraint , define
(21)the boundary state
can be expressed as
Clearly, the control variable m ( t ) is not directly present in the preceding equation. Next, let’s examine the second-order derivative,where
From the previous equations, it is evident that the control variable m first emerges in the second derivative of E ( t ) . Hence, a second-order state constraint is present.
Since(22)the regularity condition is satisfied. Further, for the boundary control
, setting 0 = Ë ( t ) , we obtain the boundary drug dosage
(23)
Optimal control structure
We introduce the adjoint variable . According to Pontryagin’s minimum principle [25–27], the standard Hamiltonian function is characterized as:
(24)
Furthermore, we introduce the Lagrange multipliers to connect the state constraints with the Hamiltonian function, obtaining the augmented Hamiltonian function:
(25)
Should an optimal control solution be found, the subsequent adjoint equations and transversality conditions must hold:
1. Adjoint differential equations:
4. Complementary slackness condition:
Subsequently, we derive the system of adjoint equations:(26)
To facilitate the discussion, the terms ⟨ f ⟩ and in the adjoint equation system, where i = 1 , 2 , 3, will not be expanded, and
Based on Pontryagin’s minimum principle, the optimal control structure is derived as follows:
We get the switching function:(27)
Consequently, the optimal control structure is represented as follows:(28)
Based on the optimal control structure obtained from Pontryagin’s minimum principle, we propose the optimal adaptive therapy for the cancer evolutionary game model. For patients with different initial tumor cell proportions, the core idea of optimal adaptive therapy is to maintain a high survival proportion of normal cells, ensuring a healthy competition between drug-sensitive and drug-resistant cells within the patient or tumor burden (maximum tumor burden), and ensuring the patient’s overall fitness while maintaining the drug concentration below the maximum tolerable level. The timing of administration and the dosage of the drug are determined based on the real-time detected proportions of the two cancer cell subpopulations.
During the initial phase of treatment, we utilize the highest tolerable dose of the medication to ensure that healthy cells maintain a higher fitness and competitive edge over tumor cells. This approach leads to a swift reduction in drug-sensitive cancer cells. When the drug concentration within the patient’s system reaches the highest tolerable level, we cease administering the medication. At this point, the duration of drug administration is relatively short, and the competitive ability of the drug-resistant cells is not yet fully manifested. However, the drug-sensitive cells will rapidly grow once the drug is stopped, while the healthy cells maintain relatively high fitness. By sustaining this periodic method of drug administration, we progress to the next phase of treatment when the overall tumor cell proportion in the patient’s body hits the peak tumor burden. We then resume drug administration until its level in the patient’s system is back to the highest tolerable limit, ensuring an equilibrium between drug-sensitive and drug-resistant cells. This strategy prevents the complete eradication of drug-sensitive cells and maintains competitive pressure between the two cell types. When the total proportion of tumor cells reaches a relatively stable state, we continue to administer the drug at the lowest effective level. During the second treatment phase, according to the cyclic treatment method, the drug administration is cyclically paused or resumed at the optimal time intervals. This ensures that the normal cells maintain a high survival rate, and the two types of cancer cells compete effectively within the patient’s body. This strategy prevents the predominance of drug-resistant cells and minimizes the negative impacts of the medication on the patient’s health, thereby extending survival time and enhancing therapeutic outcomes.
In this paper, we consider four treatment cycles and provide the optimal control structure as follows:(29)
Result
In this section, we employ numerical simulations to quantitatively optimize the timing and dosage of the adaptive treatment scheme. To address the optimal control problems Eq (11)-(16), we apply a discretization method. For simulating other therapeutic schemes, we utilize the ode45 solver in MATLAB 2019. We use the parameters from reference [11] and made some adjustments to the parameters of the benefit matrix due to environmental factors for the numerical simulations. The parameters used for the numerical simulations are shown in Table 1.
Continuous drug treatment simulation
To understand how optimal adaptive therapy strategies sustain healthy competition among different cell subpopulations and achieve sustainable treatment, we first study the dynamic evolution of the prisoner’s dilemma cancer evolution game model both without treatment and under various conventional treatment schemes.
Fig 1 illustrates the dynamic changes in the proportions of each cell type under no treatment conditions for cases with Fig 1 (A) high tumor proportion, Fig 1 (B) medium tumor proportion, and Fig 1 (C) low tumor proportion. Regardless of the initial tumor proportion, in the absence of chemotherapeutic intervention, the three cell subpopulations naturally engage in a prisoner’s dilemma game. The growth or decline of each cell subpopulation entirely depends on the payoff parameters in the payoff matrix. In all pairwise games between cell subpopulations, drug-sensitive cells always act as defectors, thus showing a natural advantage in the evolutionary process until they dominate the entire cell population.
(A) Initial low cancer cell proportion. (B) Initial high cancer cell proportion. (C) Initial medium cancer cell proportion.
Fig 2 describes the Maximum Tolerated Dose (MTD) treatment scheme. After 25 days of continuous treatment with the maximum drug dose, Fig 2 (A) the healthy cell population emerges as the dominant group upon completion of the treatment. However, the substantial drug dosage eradicates a considerable amount of drug-sensitive cells during therapy, causing them to forfeit their competitive advantage over drug-resistant cells. As a result, drug-resistant cells rapidly become dominant once the treatment is halted. Subsequently, as the drug concentration in the patient’s body gradually diminishes, drug-sensitive cells regain their competitive advantage and reestablish dominance in the cell population. However, regardless of which cancer cell population becomes dominant, it signifies a failure of the treatment, as evidenced by the changes in the total cancer cell proportion depicted in Fig 2 (B). Fig 3 depicts the approach of ongoing low-dose therapy. It becomes evident that continuously administering a low drug dose to avert resistance development is impractical. In Fig 3 (A), after a brief period where healthy cells and drug-resistant cells gain a competitive advantage, they are eventually replaced by drug-sensitive cells. In Fig 3 (B), the overall proportion of cancer cells shows a slight reduction initially but gradually dominates the entire cell population again. This is because continuous low-dose treatment fails to suppress the competitive advantage of drug-sensitive cells, highlighting the need for an appropriate drug dosage in cancer treatment. Fig 4 illustrates the metronomic therapy strategy. In Fig 4(A), due to the fixed drug dosage and fixed treatment on/off times, this strategy fails to maintain the competitive balance among the three cell subpopulations. Healthy cells initially maintain a high proportion but are soon replaced by drug-sensitive cells. In Fig 4 (B), the overall proportion of cancer cells fluctuates under intermittent drug administration, ultimately dominating the entire cell population. The metronomic therapy strategy highlights that appropriate timing for treatment on/off cycles is crucial for successful therapy.
(A) Temporal evolution of various cell subpopulation proportions during 25 days of continuous treatment at the maximum tolerated dose and following cessation of treatment. (B) Temporal evolution of the overall proportion of cancer cells.
(A) Temporal evolution of the proportions of various cell subpopulations. (B) Temporal evolution of the overall proportion of cancer cells.
(A) Temporal evolution of the proportions of various cell subpopulations. (B) Temporal evolution of the overall proportion of cancer cells.
The adaptive therapy strategy developed by Newton and Ma [28] involves selecting appropriate drug dosages to enable the three cell subpopulations to cycle through a fixed-ratio competition. Essentially, the drug dosage, treatment duration, and treatment intervals are fixed. Considering the importance of drug dosages and treatment timing, we propose the optimal adaptive therapy strategy.
Optimize treatment simulation
In Figs 5 and 6, we describe cases of high tumor proportion treated with the optimal adaptive therapy strategy. The proportions of healthy cells, drug-resistant cells, and drug-sensitive cells are ,
, and
. Fig 5 (A) shows the drug dosages and the treatment on/off periods during the optimal adaptive therapy process.Fig 5 (B) illustrates the changes in drug concentration within the patient over time. Because of the pharmacokinetics of drug clearance and accumulation, the drug concentration varies periodically. Through numerical simulation, we obtained the optimal drug dosages
,
and the treatment periods
. For cases with a high proportion of cancer cells, treatment starts with a short-term maximum drug dosage to eliminate some drug-sensitive cells. The therapy then switches to a different dosage until the drug concentration within the patient attains the maximum tolerable level(
), at which point drug administration is halted. At this point, healthy cells transition from a low to a high survival proportion, maintaining a balanced competition with drug-sensitive and drug-resistant cells. When the tumor burden in the patient’s body increases to the maximum tolerable level (
), treatment is resumed until the drug concentration returns to its peak level, initiating the second treatment period. According to the treatment steps, Fig 6 (A) illustrates the cyclic treatment based on the calculated optimal dosage and best on/off times, which intervenes in the competitive interactions among the three subpopulations. Healthy cells increase to a high proportion and then maintain this level with some fluctuations. Both drug-sensitive and drug-resistant cells retain their competitive abilities under the influence of chemotherapy, ensuring that both cancer cell subpopulations remain within the patient’s acceptable tumor threshold. Fig 6 (B) shows that the overall level of cancer cells fluctuates at a low proportion throughout the treatment process.
(A) Temporal evolution of the proportions of various cell subpopulations, Pale yellow and white backgrounds respectively indicate the treatment phase and the treatment interval phase (This will not be repeated further in the text.). (B) Temporal evolution of the overall proportion of cancer cells.
Compared to the adaptive therapy strategy proposed by Newton and Ma [28], the optimal adaptive therapy strategy presented here takes into account the patient’s drug toxicity tolerance and the tolerable tumor burden range, adjusting drug dosages and treatment on/off times accordingly. Due to the growth dynamics of cancer cells, the proliferation period and drug effect window vary. By precisely controlling drug dosages and treatment times, it is possible to intervene during the most drug-sensitive phases of tumor cell growth, thereby enhancing treatment efficacy and avoiding resistance development. For cases with a high initial tumor burden, the adaptive therapy strategy proposed by West et al. disregards the patient’s well-being throughout the treatment. As the tumor cell levels in the patient’s body continuously cycle back to the initial tumor state due to the closed-loop design of the treatment plan, the patient’s condition deteriorates repeatedly. Over time, the patient’s ability to endure high tumor burdens may diminish due to repeated stress, ultimately resulting in treatment failure. In the aforementioned simulation results, the optimal adaptive therapy strategy successfully treated cases with a high initial tumor burden. Healthy cells became the dominant population, while drug-sensitive and drug-resistant cells maintained stable competition at low threshold levels. This approach prevented patients from repeatedly enduring high tumor burdens during treatment, minimized the toxic side effects of the drugs, and ensured a high quality of life for patients throughout the treatment process.
Comparison of treatment options
Next, we compare the treatment effectiveness of the optimal adaptive therapy strategy with some traditional cancer treatment methods under the same initial proportions of cancer cells.
We consider the treatment simulation for cases with initial proportions of healthy cells and cancer cells ,
,
using three different treatment strategies with the same total drug dosage. We consider treatments over a limited time period, t ∈ [ 0 , 50 ] . In Fig 7 (A),(B) and (C) represent the changes in cell proportions over time for MTD treatment, metronomic therapy, and optimal adaptive therapy, respectively. MTD treatment results in the rapid reduction of drug-sensitive cells to near extinction. Throughout the post-treatment phase, drug-resistant cells acquire a notable edge over drug-sensitive cells, resulting in a competitive proliferation of drug-resistant cells. Consequently, by the conclusion of treatment, the quantity of healthy cells in the patient’s body tends to be completely surpassed by drug-resistant cells. While the metronomic therapy strategy effectively manages drug-resistant cells, during the advanced stages of treatment, drug-sensitive cells gradually obtain an advantage over healthy cells due to the failure to treat them effectively at the optimal moment. Consequently, healthy cells are gradually replaced by drug-sensitive cells. Towards the conclusion of treatment, drug-sensitive cells almost completely dominate the entire cell population. The optimal adaptive therapy persistently fine-tunes drug dosages to sustain a balanced interaction between drug-sensitive and drug-resistant cells. This method guarantees that the patient can endure the drug toxicity and tumor burden while hindering the emergence of drug resistance. Throughout the entire treatment period up to the conclusion of therapy, healthy cells maintain a high proportion, while the overall cancer cell population remains in a low-value
(A) MTD treatment scheme:Temporal evolution of the proportions of various cell subpopulations. (B) MTD treatment scheme: Temporal evolution of the overall proportion of cancer cells. (C) Metronomic therapy scheme: Temporal evolution of the proportions of various cell subpopulations. (D) Metronomic therapy scheme: Temporal evolution of the overall proportion of cancer cells. (E) Optimal adaptive therapy scheme: Temporal evolution of the proportions of various cell subpopulations. (F) Optimal adaptive therapy scheme: Temporal evolution of the overall proportion of cancer cells.
oscillation state. Fig 7(D),(E),(F) compares the changes in cancer cell proportions during the three treatment strategies, illustrating that optimal adaptive therapy maintains healthy cells as the dominant population throughout the treatment process.
Fig 8 describes the changes in fitness functions for each cell subpopulation across the three treatment strategies. The fitness function represents the survival competitiveness of each cell subpopulation. Fig 8 (A) In the MTD treatment scheme, the fitness function of healthy cells shows a higher level compared to the fitness function of tumor cells during the treatment period. However, at the conclusion of the drug administration, healthy cells consistently have lower fitness compared to the two types of cancer cells. This indicates that, after the treatment ends, the survival ability of healthy cells is lower than that of the two types of cancer cells. Fig 8 (B) In the metronomic therapy scheme, while the fitness level of healthy cells is higher than that of drug-sensitive and drug-resistant cells during the phases of treatment, the fitness level of drug-sensitive cells significantly surpasses that of the other two cell subpopulations during the intervals between treatments. This period’s fitness level of drug-sensitive cells is also higher than the fitness level of healthy cells during the treatment periods. This is why, by the end of the treatment, healthy cells are gradually replaced by drug-sensitive cells. Fig 8 (C) The optimal adaptive therapy avoids the competitive release between drug-sensitive and drug-resistant cells. Unlike in the metronomic therapy scheme, the fitness level of healthy cells during treatment periods is higher than that of drug-sensitive cells during the intervals between treatments. Thus, the average fitness function level of healthy cells is higher than that of tumor cells throughout the entire treatment period. This ensures that healthy cells have good competitive survival ability under the optimal adaptive therapy strategy.
(A) Fitness functions under the MTD treatment scheme. (B) Fitness functions under the metronomic therapy scheme. (C) Fitness functions under the optimal adaptive therapy scheme.
Personalized treatment
In actual clinical cancer treatment, different cancer patients and various cancer types often have complex and diverse conditions. Therefore, we can tailor treatment plans according to the needs of different patient groups, developing suitable therapies. Here, we introduce the optimal adaptive therapy strategies for two types of patient groups:
Low tumor burden group.
For patients with relatively low resistance, drug toxicity, and tumor burden tolerance, a low drug dosage treatment is adopted. Keeping the tumor population proportion within a relatively low tumor burden range is key to prolonging patient survival and maintaining a relatively high quality of life during treatment. Considering an initial proportion of healthy cells and tumor cells of ,
,
, we set the patient’s maximum drug tolerance
and the maximum tumor burden
. Through numerical simulation, we obtained the optimal drug dosages
and the optimal treatment on/off periods
. Fig 9 (A) and (B) show that during the treatment, the proportions of healthy cells and tumor cells remain in a low-value oscillation state, proving the stability and effectiveness of this strategy. Figs 9(C) and 9(D) depict the changes in drug dosage and concentration within the patient’s system during the optimal on/off treatment periods. These figures further validate the effectiveness of this personalized optimal adaptive therapy strategy. Based on the above drug dosage and tumor burden adjustments, this personalized optimal adaptive therapy strategy not only reduces the toxic effects of chemotherapy to a certain extent but also improves the patient’s quality of life during treatment.
(A)Temporal evolution of the proportions of various cell subpopulations. (B) Temporal evolution of the overall proportion of cancer cells. (C) Drug dosage. (D) Changes in drug concentration.
High tumor burden group.
We design personalized treatment plans for patients with the same initial conditions as those in the personalized treatment group with low tumor burden. Given that patients with a high tumor burden have better tolerance to drug toxicity, we set the maximum drug tolerance concentration for patients to and the maximum tumor burden to
. Since these patients exhibit good tolerance to drug toxicity, we use the maximum drug dosage for treatment. Through numerical simulations, we obtained the optimal treatment on/off times
. Fig 10 (A) and 10 (B) illustrate that under the conditions of high drug tolerance and high tumor burden, healthy cells fluctuate within the range of 50% to 75%. Due to the patient’s mechanism of high drug toxicity tolerance and high tumor burden capacity, tumor cell have more room for maneuver. Fig 10 (C) is based on the patient’s drug tolerance and tumor burden capacity to implement the drug administration plan. Fig 10 (D) shows the changes in drug concentration in the patient’s body during treatment.
(A)Temporal evolution of the proportions of various cell subpopulations. (B) Temporal evolution of the overall proportion of cancer cells. (C) Drug dosage. (D) Changes in drug concentration.
Discussion
Despite the significant role of evolutionary research in cancer treatment, there are very few studies that have incorporated evolutionary models into clinical cancer treatment. The emergence of adaptive therapy [29,30], a new treatment approach, has been applied to various cancer studies. However, accurately determining the dosage and duration of a fixed treatment period remains a major therapeutic challenge.
In this work, we integrated pharmacokinetics into evolutionary game theory to better characterize the accumulation and clearance of chemotherapy drugs in the human body during treatment, thus enabling the implementation of more precise drug treatment plans. Building on this model, we proposed an optimal control problem constrained by the highest permissible drug concentration and the peak tumor burden state, with the goal of minimizing drug toxicity and the proportion of tumor cells by the conclusion of treatment. Using the Pontryagin minimum principle, we derived the optimal control structure to design the optimal adaptive therapy strategy, optimizing drug dosage and treatment on/off times.
In the numerical simulation phase, we systematically compared the performance differences between the optimal adaptive therapy scheme and other therapy schemes (as shown in Figs 7 and 8) and conducted a detailed analysis from multiple perspectives. First, the maximum tolerated dose (MTD) therapy scheme exhibited significant limitations. This scheme failed to adequately consider the competitive balance mechanism among different subpopulations of cancer cells, leading to the disruption of the dynamic equilibrium between drug-sensitive and drug-resistant cells during treatment. Specifically, under MTD therapy, drug-sensitive cells were largely eliminated due to the high drug dosage, while drug-resistant cells, which were previously suppressed by drug-sensitive cells, lost their competitive constraints. As a result, these drug-resistant cells proliferated rapidly and eventually dominated the tumor population (as shown in Fig 7(A) and (B)). Although this "one-size-fits-all" high-intensity treatment approach may yield short-term therapeutic benefits, its neglect of tumor heterogeneity makes it unsuitable for achieving long-term control objectives.
Second, the rhythmic therapy scheme, while introducing temporal factors by administering drugs at fixed intervals to mimic tumor proliferation patterns, still had notable limitations. This scheme assumed that both tumor proliferation cycles and drug efficacy time windows were fixed parameters, an assumption that deviates significantly from clinical reality. In actuality, tumor growth rates, intercellular competition dynamics, and pharmacokinetic characteristics can vary dynamically due to individual differences and disease progression stages. Therefore, implementing drug administration at fixed time intervals often results in a mismatch between drug delivery timing and the tumor’s actual state (as shown in Fig 7(C) and (D)). This imprecise treatment approach not only fails to effectively control tumor growth but may also lead to suboptimal therapeutic outcomes or accelerated resistance development due to over-treatment or under-treatment.
Furthermore, although West’s proposed adaptive therapy scheme theoretically achieved closed-loop tumor control by maintaining tumor volume within a stable range (i.e., fluctuating between the initial tumor volume and a preset threshold), it still had critical shortcomings in clinical application. Specifically, West’s scheme focused excessively on quantitative tumor volume control while neglecting patients’ quality of life, a crucial consideration. In real clinical scenarios, patients’ systemic conditions are not static: prolonged exposure to high tumor burdens can lead to immune dysfunction, nutritional deterioration, and organ damage. Thus, a treatment approach that prioritizes tumor volume control without considering patients’ overall health status does not align with the ultimate goals of clinical practice.
In contrast, the optimal adaptive therapy scheme demonstrated significant advantages in several aspects: first, by dynamically optimizing treatment timing and drug dosages, this scheme effectively suppressed tumor growth while minimizing damage to healthy tissues. This precise treatment strategy helped maintain a high proportion of surviving healthy cell populations. Second, the optimal adaptive scheme maintained a robust competitive relationship between drug-sensitive and drug-resistant cells. By rationally regulating their population ratios, it effectively delayed the explosive growth of drug-resistant cell populations (as shown in Fig 7(E) and (F)). Third, this scheme fully considered patient-specific differences and tolerance levels. On one hand, it ensured patients’ tolerance to treatment-related toxicities; on the other hand, it also accounted for patients’ capacity to tolerate tumor burdens. Lastly, unlike West’s scheme, the optimal adaptive scheme not only focused on achieving tumor control objectives but also placed special emphasis on patients’ quality of life. By avoiding overtreatment and precisely controlling tumor burden, this scheme could sustain patients’ healthy states and quality of life over extended periods. Additionally, building upon the optimal adaptive therapy scheme, we further developed personalized treatment plans tailored to different patient populations.
In summary, the optimal adaptive therapy scheme offers precisely tailored drug dosages and treatment schedules for patients with varying conditions, providing a new therapeutic framework for clinical practice. Additionally, we can enhance this model’s applicability by integrating combination therapies involving multiple drugs into the evolutionary game model. However, implementing such treatment plans depends on having an accurate understanding of the system’s current state. Current clinical biopsy techniques [31,32] are not sufficiently advanced; traditional tissue biopsies require surgical or needle procedures to obtain tissue samples, which are invasive and carry risks such as infection and bleeding. Moreover, tissue biopsies only provide information about specific tumor sites and may not reflect the condition of the entire tumor or other metastatic locations. Overcoming these limitations in detection technology is a significant challenge for the implementation of adaptive therapy schemes.
Acknowledgments
The authors would like to thank the anonymous reviewers for their valuable comments and suggestions.
References
- 1. Demicco M, Liu X-Z, Leithner K, Fendt S-M. Metabolic heterogeneity in cancer. Nat Metab 2024;6(1):18–38. pmid:38267631
- 2. Zhang L, Ma J, Liu L, Li G, Li H, Hao Y, et al. Adaptive therapy: A tumor therapy strategy based on Darwinian evolution theory. Crit Rev Oncol Hematol 2023;192:104192. pmid:37898477
- 3. Cho J-W, Cao J, Hemberg M. Joint analysis of mutational and transcriptional landscapes in human cancer reveals key perturbations during cancer evolution. Genome Biol 2024;25(1):65. pmid:38459554
- 4. Brosda S, Aoude LG, Bonazzi VF, Patel K, Lonie JM, Belle CJ, et al. Spatial intra-tumour heterogeneity and treatment-induced genomic evolution in oesophageal adenocarcinoma: implications for prognosis and therapy. Genome Med 2024;16(1):90. pmid:39020404
- 5. Lilly MB, Wu C, Ke Y, Chen W-P, Soloff AC, Armeson K, et al. A phase I study of docetaxel plus synthetic lycopene in metastatic prostate cancer patients. Clin Transl Med 2024;14(3):e1627. pmid:38515274
- 6. Park SJ, Kweon S, Moyo MK, Kim HR, Choi JU, Lee NK, et al. Immune modulation of the liver metastatic colorectal cancer microenvironment via the oral CAPOX-mediated cGAS-STING pathway. Biomaterials 2024;310:122625. pmid:38820768
- 7. Gallagher K, Strobl MAR, Park DS, Spoendlin FC, Gatenby RA, Maini PK, et al. Mathematical model-driven deep learning enables personalized adaptive therapy. Cancer Res 2024;84(11):1929–41. pmid:38569183
- 8. Strobl MAR, Martin AL, West J, Gallaher J, Robertson-Tessi M, Gatenby R, et al. To modulate or to skip: De-escalating PARP inhibitor maintenance therapy in ovarian cancer using adaptive therapy. Cell Syst 2024;15(6):510–525.e6. doi: https://doi.org/10.1016/j.cels.2024.04.003. pmid:38772367
- 9. Tan RZ. Tumour growth mechanisms determine effectiveness of adaptive therapy in glandular tumours. Interdiscip Sci 2024;16(1):73–90. pmid:37776475
- 10. Gallaher J, Strobl M, West J, Gatenby R, Zhang J, Robertson-Tessi M, et al. Intermetastatic and intrametastatic heterogeneity shapes adaptive therapy cycling dynamics. Cancer Res 2023;83(16):2775–89. pmid:37205789
- 11. Park J, Newton PK. Stochastic competitive release and adaptive chemotherapy. Phys Rev E 2023;108(3–1):034407. pmid:37849192
- 12. Zhang J, Cunningham J, Brown J, Gatenby R. Evolution-based mathematical models significantly prolong response to abiraterone in metastatic castrate-resistant prostate cancer and identify strategies to further improve outcomes. Elife 2022;11:e76284.
- 13. Masud M, Kim E. Theoretical understanding of evolutionary dosing following tumor dynamics. Chaos Solitons Fract 2024:179(1);114451
- 14. Bukkuri A, Adler FR. Biomarkers or biotargets? Using competition to lure cancer cells into evolutionary traps. Evol Med Public Health 2023;11(1):264–76. pmid:37599857
- 15. West J, Ma Y, Newton PK. Capitalizing on competition: An evolutionary model of competitive release in metastatic castration resistant prostate cancer treatment. Journal of Theoretical Biology 2018 Oct 14;455:249–60
- 16. West J, Hasnain Z, Macklin P, Newton PK. An evolutionary model of tumor cell kinetics and the emergence of molecular heterogeneity driving gompertzian growth. SIAM Rev Soc Ind Appl Math 2016;58(4):716–36. pmid:29937592
- 17. Rajan MP, Nanditha CK. A multi-drug pharmacokinectic optimal control approach in cancer chemotherapy. J Optim Theory Appl 2022;195(1):314–33.
- 18. Ledzewicz U, Schättler H. Optimal controls for a model with pharmacokinetics maximizing bone marrow in cancer chemotherapy. Math Biosci 2007;206(2):320–42. pmid:16197967
- 19. Wan X. Some systems of ordinary differential equations from cancer modeling: Qualitative analysis and optimal treatment; 2011
- 20. Pillis L, Fister K, Gu W, Collins C, Daub M, Gross D, et al. Seeking bang-bang solutions of mixed immuno-chemotherapy of tumors. Electr J Differ Eq (EJDE) 2007;2007(2007):Paper-No
- 21. Sabir S, Raissi N, Serhani M. Chemotherapy and immunotherapy for tumors: A study of quadratic optimal control. Int J Appl Comput Math 2020;6(3).
- 22. Burden T, Ernstberger J, Fister K. Optimal control applied to immunotherapy. Discrete Continuous Dynamical Syst Ser B 2004;4(1):135–46
- 23. Fister KR, Donnelly JH. Immunotherapy: An optimal control theory approach. Math Biosci Eng 2005;2(3):499–510. pmid:20369936
- 24. Tan PL, Maurer H, Kanesan J, Chuah JH. Optimal control of cancer chemotherapy with delays and state constraints. J Optim Theory Appl 2022;194(3):749–70.
- 25. Alblowy A, Maan N, Ibrahim A. Optimal control strategies for SGLT2 inhibitors as a novel anti-tumor agent and their effect on human breast cancer cells with the effect of time delay and hyperglycemia. Comput Biol Med 2023;166:107552
- 26. Saldaña F, Camacho-Gutiérrez JA, Villavicencio-Pulido G, Velasco-Hernandez JX. Modeling the transmission dynamics and vaccination strategies for human papillomavirus infection: An optimal control approach. Appl Math Modell 2022;112(12):767–85
- 27. Omer S, Mambili-Mamboundou H. Assessing the impact of immunotherapy on oncolytic virotherapy in the treatment of cancer. J Appl Math Comput 2024;18(1):1–33
- 28. Newton PK, Ma Y. Nonlinear adaptive control of competitive release and chemotherapeutic resistance. Phys Rev E. 2019;99(2–1):022404. pmid:30934318
- 29. Gluzman M, Scott JG, Vladimirsky A. Optimizing adaptive cancer therapy: dynamic programming and evolutionary game theory. Proc R Soc B 2020:287(1925);20192454
- 30. Brady-Nicholls R, Enderling H. Range-bounded adaptive therapy in metastatic prostate cancer. Cancers 2022:14(21);5319
- 31. Li L, Jiang H, Zeng B, Wang X, Bao Y, Chen C, et al. Liquid biopsy in lung cancer. Clin Chim Acta 2024;554:117757. pmid:38184141
- 32. Urabe F, Sumiyoshi T, Tashiro K, Goto T, Kimura T, Kobayashi T. Prostate cancer and liquid biopsies: Clinical applications and challenges. Int J Urol 2024 March 29