Figures
Abstract
This paper addresses the joint optimization of system utility in Unmanned Aerial Vehicle (UAV)-enabled Mobile Edge Computing (MEC) networks. Unlike traditional approaches that optimize individual components, such as users, UAVs, or base stations, we propose a novel framework aimed at maximizing the overall system utility, which is defined as the difference between the total revenue of service providers (UAVs and base stations) and the total cost incurred by users. The proposed model incorporates realistic constraints, including limited computational resources and energy consumption, and formulates the problem as a Mixed-Integer Nonlinear Programming (MINLP) model. To solve this complex optimization problem, we develop an efficient algorithm that integrates the Block Successive Upper-Bound Minimization (BSUM) framework with heuristic methods, enabling the decomposition of the original problem into tractable subproblems that are solved iteratively. Simulation results demonstrate that the proposed approach significantly outperforms traditional heuristic algorithms in terms of system utility, while also exhibiting robust convergence and reliability across various network configurations. The results highlight the effectiveness of joint optimization in improving both the economic and operational efficiency of UAV-assisted MEC systems, providing a solid foundation for future research in network utility management.
Citation: Zuo H, Zhang E, Tang Y, Yin M, Dong W (2026) Joint optimization of system utility in UAV-enabled edge computing. PLoS One 21(2): e0342583. https://doi.org/10.1371/journal.pone.0342583
Editor: Zhiyuan Ren, Xidian University, CHINA
Received: April 8, 2025; Accepted: January 26, 2026; Published: February 26, 2026
Copyright: © 2026 Zuo et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data are in the paper and Supporting information files.
Funding: This work was supported by the Beijing Natural Science Foundation (1212010), the Important Project of Digital Education Research of the Beijing (BDEC2022619027), 2023 Project Proposal of Beijing Higher Education Association (MS2023168), the Research Project of Beijing Institute of Graphic Communication (20190122019, Ec202303, Ea202301, E6202405), the Disciplinary Construction and Postgraduate Education Project of Beijing Institute of Graphic Communication (21090122012, 21090323009). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: No authors have competing interests. The authors have declared that no competing interests exist.
Introduction
The rapid advancement of information technology has led to the integration of Unmanned Aerial Vehicle (UAV) and Mobile Edge Computing (MEC) technologies, emerging as a promising solution to address the challenges faced by modern wireless networks. UAVs, with their inherent mobility and flexibility, complement MEC systems by providing on-demand computational and communication support in areas where traditional infrastructure may be limited. Specifically, UAVs can dynamically deploy computational resources, offload tasks, and enhance communication coverage in remote, disaster-stricken, or high-density areas, thereby improving the overall performance of MEC systems. MEC, by bringing computational and storage resources to the network edge, reduces latency and bandwidth requirements, enabling faster data processing and an enhanced user experience [1–4]. When combined, these technologies form a powerful synergy that addresses a wide range of challenges, including resource allocation, task offloading, and network scalability.
Despite the advantages of UAV-assisted MEC networks, optimizing their performance remains a complex and unresolved problem. Most existing studies focus on optimizing individual performance metrics, such as computation delay, energy consumption, or task offloading efficiency, often treating MEC and UAV technologies as separate entities. These studies tend to concentrate on optimizing UAV trajectories, task offloading strategies, or resource allocation independently, without accounting for the interdependencies between UAVs, base stations, and users. As a result, they adopt fragmented optimization approaches that fail to maximize the overall system utility. Furthermore, existing research often overlooks the critical balance between the revenues of service providers (UAVs and base stations) and the costs incurred by users, which is essential for ensuring the long-term sustainability of UAV-assisted MEC networks in real-world applications.
The core problem addressed in this paper is the joint optimization of system utility in UAV-assisted MEC networks. Specifically, the goal is to maximize overall system utility, defined as the difference between the total revenue generated by service providers (UAVs and base stations) and the costs incurred by users. This system utility is influenced by multiple factors, including task offloading decisions, resource allocation (both computational and communicative), and pricing strategies set by UAVs and base stations. The primary challenge lies in optimizing these interrelated variables under several practical constraints, such as limited computational resources, energy consumption, and network load. Unlike traditional approaches that independently optimize individual components (e.g., task offloading or resource allocation), this paper formulates a comprehensive, multi-dimensional optimization problem that jointly considers the interdependent decisions of service providers and users. The optimization problem is further complicated by its non-convexity and the complex interactions among decision variables, necessitating an integrated solution approach.
This study aims to address the gap in existing research by providing a unified framework for optimizing system utility in UAV-assisted MEC networks. The objective is to jointly optimize task offloading, resource allocation, and pricing strategies, with the goal of balancing the revenues of service providers and the costs incurred by users. The key contributions of this paper are as follows:
- Proposing a novel optimization framework that simultaneously considers the interdependencies between UAVs, base stations, and users to maximize system utility.
- Introducing a Mixed-Integer Nonlinear Programming (MINLP) model that incorporates a range of practical constraints, including limited resources, energy consumption, and network load.
- Developing an efficient algorithm that integrates the Block Successive Upper Bound Minimization (BSUM) framework with heuristic methods to effectively solve the non-convex optimization problem.
- Verifying the effectiveness of the proposed approach through simulation results, which demonstrate substantial improvements in system utility compared to traditional methods.
Related work
In recent years, the rapid development of MEC and UAV technologies has led to a growing body of research focused on optimizing the collaborative operation of these two technologies. In UAV-assisted MEC networks, much of the research has concentrated on optimizing individual performance metrics, such as computation resource allocation, task offloading strategies, energy efficiency, and latency optimization. Current studies primarily focus on improving specific objectives, such as reducing computation delay, minimizing energy consumption, or enhancing task offloading efficiency, under predefined constraints. However, there remains a lack of research that comprehensively considers multiple performance metrics, particularly in the context of optimizing overall system utility.
Many studies focus on optimizing a single performance metric, such as computation delay, energy efficiency, or task offloading efficiency. For instance, [5] proposed a trajectory prediction-based edge offloading strategy to enhance the processing efficiency of computational tasks by optimizing the task offloading path. This method aims to reduce transmission latency through task offloading path optimization but does not consider the broader system utility, such as the balance between the utility of service providers (UAVs, base stations) and the costs incurred by users. Similarly, [2,6–10] applied game theory to examine how UAV-assisted MEC can improve system performance through computation offloading optimization, as well as optimize user scheduling, UAV management, and power allocation. While these game-theoretic approaches effectively optimize task offloading decisions, they focus mainly on optimizing individual objectives, such as energy efficiency or task offloading efficiency, without fully considering the global system utility. [11] proposed an efficient energy minimization scheme that optimizes UAV trajectory, resource allocation, and task offloading strategies, significantly reducing the network’s energy consumption. This approach transformed a non-convex optimization problem with multi-variable coupling into a more manageable form by introducing auxiliary variables, successfully achieving energy consumption optimization. However, the scheme primarily focuses on energy consumption. [6] optimized UAV trajectories and area division through a task offloading framework, reducing energy consumption during the task offloading process. The optimal transportation theory introduced in this study provided services to multiple users, successfully optimizing task offloading paths and energy efficiency. While it excels in terms of energy efficiency and offloading efficiency, it does not address the balance of benefits and costs between service providers and users, remaining limited to optimizing a single objective.
In UAV-assisted MEC systems, Deep Reinforcement Learning (DRL) [12–14] has been widely applied to optimize task offloading and trajectory design. [15] proposed a framework based on Double Deep Q-Learning (D-DQN) to optimize the 3D trajectory design and task offloading strategy of UAVs through online learning in dynamic environments. This method addresses dynamic optimization issues to some extent, enhancing task offloading and energy efficiency. However, it does not consider the trade-off between user costs and system revenues. [16] introduced a DRL-based approach aimed at optimizing UAV flight trajectories to maximize average secrecy rates in wireless security. While this study effectively improved system security, it focused solely on secrecy metrics and did not account for the balance among multiple performance metrics such as task offloading, energy efficiency, and latency. [17] proposed an iterative algorithm that jointly optimized UAV beamforming, CPU frequency, trajectory design, and user transmission power, successfully reducing the system’s energy consumption. Although this study provided valuable insights into resource allocation in UAV-assisted MEC systems, it remained focused on optimizing a single objective (energy efficiency).
In UAV-assisted MEC systems and Industrial Internet of Things (IIoT) environments [18,19], many studies have applied heuristic algorithms to optimize task offloading, resource allocation, and energy efficiency. [20] proposed a user satisfaction-oriented task offloading and UAV scheduling strategy to optimize task offloading and user satisfaction. This study employed a genetic algorithm (GA) and simulated annealing (SA) to combine task offloading decisions with UAV scheduling strategies, improving overall user satisfaction. However, it remains based on single-objective optimization and does not comprehensively balance multiple performance metrics. [21] focused on resource allocation in IIoT environments and proposed a learning-based resource allocation scheme using a Cooperative Particle Swarm Optimization (LCPSO) algorithm, successfully optimizing response time in forest fire monitoring. Although this approach improved response time, it did not sufficiently address the weight balancing problem across multiple performance metrics [22]. Furthermore, many existing studies, when faced with complex constraints, tend to rely on heuristic algorithms and convex optimization methods to solve problems. For example, an energy consumption minimization scheme based on a low-complexity resource allocation algorithm was proposed, which optimized user delay and energy efficiency, particularly in Heterogeneous Non-Orthogonal Multiple Access (HNOMA) networks. While this scheme significantly improved energy efficiency, it remained focused on energy efficiency alone, without a comprehensive improvement in system utility.
Previous research [23,24] applied the BSUM algorithm to optimize overall energy consumption and latency in task offloading within MEC systems. The novelty of this study lies in proposing a global system utility optimization scheme based on the BSUM framework. By comprehensively considering the differences between user costs and service provider (UAV and base station) revenues, a new multidimensional optimization framework is introduced. However, despite these advancements, most existing studies focus on optimizing individual performance metrics, such as computation delay, energy efficiency, or task offloading efficiency. These approaches typically overlook the balance between the revenues of service providers (UAVs and base stations) and the costs incurred by users. For instance, as shown in Table 1, studies like those of [23] primarily focus on optimizing specific metrics, such as task offloading and energy efficiency, but often fail to address the balance between provider revenues and user costs.
Furthermore, while some studies utilize the BSUM algorithm or other optimization techniques, they generally focus on single-objective optimization, such as minimizing energy consumption or enhancing task offloading efficiency, without considering the trade-off between service provider revenue and user costs. The current literature also lacks a unified, comprehensive framework capable of simultaneously addressing task offloading, resource allocation, and pricing strategies, while ensuring an optimal balance between service providers’ revenues and user costs.
System model and problem formulation
As shown in Fig 1, the system considers a UAV-assisted MEC network task offloading model. The model is divided into three layers: users, UAVs, and base stations, represented by sets B, J, and I, respectively. The UAV hovers above the users, providing edge computing offloading services. The UAVs’ positions are known to the users, and users prefer to offload tasks Di to the UAV or base station that is closest to them. Each user has a specific task to offload. Users can choose from four task offloading methods: local offloading, offloading to the UAV for computation, offloading to the base station for computation, or offloading to the base station via UAV relay. Due to the limited computational resources, users can adopt any of these four offloading methods to handle tasks. Meanwhile, both the UAV and base station charge the users corresponding fees to ensure their own benefits.
Base station layer
In the MEC network, ground base stations possess powerful computing capabilities and a continuous power supply. The set of users requiring task offloading is denoted as , where
represents the maximum number of users in the set. Similarly, the sets of base stations and UAVs serving these users are represented as
and
, where
and
denote the maximum number of base stations and UAVs in the respective sets.
For each base station b, user i can either directly offload tasks to the base station for computation or offload tasks to the base station via UAV j as a relay. The base station sets a price for the computational resources used by user i, and a price
for the relay UAV j. The base station’s computational resources Fb are limited, and the resources allocated to user i are denoted by
. Let
represent the user’s task offloading decision variable, where
indicates that the task is offloaded to the base station via a wireless link, and
otherwise. Similarly,
indicates whether the task of user i is offloaded to the base station through UAV j as a relay.
The offloading decisions can be mathematically expressed as follows:
The base station’s utility function is defined as , where Pb represents the profit of the base station from providing computation services to users, and Cb is the cost of using UAVs. These can be expressed as:
UAV layer design
In the MEC network, UAVs can serve either as base stations to receive and compute tasks offloaded by users, or as relay nodes to forward users’ tasks to base stations for computation. When a UAV functions as a base station, it sets a pricing for the computing resources allocated to user i. Similarly, the UAV’s computing resources Fj are limited. Let
denote the user task offloading decision variable, indicating whether user i offloads the task to UAV j for computation.
When the UAV acts as a base station, it sets the pricing for the computing resources allocated to users, and similarly, the UAV’s computing resources are also constrained. The utility function of the UAV layer can be expressed as , where
and
represent the UAV’s revenue when acting as a base station and a relay, respectively, while
and
represent the computational cost when the UAV acts as a base station and the communication cost for transmitting tasks as a relay, as shown below:
where represents the computing resources allocated to user i by UAV j, and
where Pj is the transmission power of UAV j, and is the transmission rate from UAV j to base station b. Let
represent the number of CPU cycles required to complete the computation for user i.
User layer design
In the MEC network, the utility of the user is determined by both the task offloading cost and the processing cost
, which includes computation delay and communication cost during task transmission. Let
represent the user’s decision variable indicating whether user i performs local computation for the task.
The task offloading cost is defined as:
The computation cost is expressed as:
where Pi is the transmission power of user i, and represent the user’s offloading decisions. Based on these, the utility of the user layer can be expressed as
(Table 2).
System utility and optimization problem
In the user offloading system designed for our MEC network, the goal is to maximize the overall system utility while ensuring that constraints, such as UAV and base station computing resources and offloading decisions, are satisfied. In the proposed system model, the interactions between users, UAVs, and base stations are critical for optimizing overall system utility. Users offload their computational tasks to the nearest UAV or base station, depending on resource availability and pricing strategies set by the UAV and base station. UAVs, acting as either relays or computation units, interact with both users and base stations to maximize the overall system utility by optimizing the offloading decisions, resource allocation, and pricing schemes. Base stations, with powerful computing resources, set prices and allocate resources to users either directly or via UAVs. Thus, the system-level utility depends on the joint decisions made by users, UAVs, and base stations, making their interactions a key part of the optimization problem.
The overall system utility is defined as the difference between the total revenue generated by the service providers (UAVs and base stations) and the costs incurred by the users. Therefore, the system utility is influenced by task offloading decisions, resource allocation (computation and communication), and pricing strategies. The utility function for the overall system is defined as U = Ub + Uj − Ui, where Ub represents the utility of the base station, Uj represents the utility of the UAV, and Ui represents the utility of the user.
To facilitate the optimization process and make the problem compatible with standard optimization techniques, we transform the objective from maximizing U to minimizing −U. This transformation is commonly used in optimization, as most solvers are designed to minimize objective functions. Maximizing U is equivalent to minimizing −U, as the optimal solution remains unchanged, and the problem is computationally more convenient when expressed as a minimization problem.
Thus, the objective becomes minimizing −U, which is mathematically equivalent to maximizing U. Our optimization problem can be reformulated as:
This transformation ensures that the optimization problem remains consistent with the goal of maximizing system utility, while leveraging optimization techniques that are optimized for minimization problems.
Finally, we can express the formula for the overall system utility as:
The bounds for the pricing and offloading parameters, such as ,
, and
, are determined based on practical considerations and typical network operations. The minimum values are constrained by the operational costs of maintaining the UAVs and base stations, ensuring that these services remain economically viable. On the other hand, the maximum values are constrained by market conditions and competitive pricing, ensuring that pricing does not become prohibitively high for users, which could lead to inefficiencies in offloading decisions. The offloading parameters are similarly bounded by the available computational and communication resources of the network, ensuring that offloading decisions remain feasible within the system’s capabilities.
These bounds ensure that the system operates within realistic parameters while maintaining both efficiency and feasibility.
Eq (17) represents the constraint on the computing resources allocated to all users by the base station and UAV. Eq (18) defines a range constraint on the pricing set by the base station and UAV for all users. Eq (19) indicates that each user can only choose one offloading method, and the decision is binary. Due to the coupling, non-linearity, binary constraints, and the non-convex structure of the decision variables in the objective function, convex optimization techniques cannot be directly applied to solve the problem.
To effectively solve the non-convex offloading optimization problem, we adopt the Block Successive Upper Bound Minimization (BSUM) method. BSUM decomposes the problem into several subproblems and constructs a local convex approximation for each subproblem, effectively avoiding the convergence issues that traditional methods, such as the Alternating Direction Method of Multipliers (ADMM), may encounter when dealing with non-convex problems. Additionally, BSUM offers lower computational complexity, making it suitable for solving large-scale, multi-task decision-variable optimization problems. In our case, BSUM demonstrates superior performance, particularly when handling complex non-convex optimization problems that involve both discrete and continuous decision variables.
In the BSUM framework, the upper bound of the objective function is minimized iteratively over the variables ,
, and
. To apply the BSUM framework, the binary decision variable constraints in Eq (19) are relaxed into continuous constraints, i.e.,
,
,
, and
. This relaxation is a standard approach in non-convex optimization that simplifies the problem, enabling the use of convex optimization techniques more effectively. By treating the binary variables as continuous, we can solve the problem using standard optimization methods. However, this relaxation introduces potential errors, as the continuous solutions may not directly satisfy the original binary constraints. To mitigate this, a post-processing step is applied in which the continuous values are rounded to the nearest binary values (0 or 1), ensuring that the final solution adheres to the original binary constraints.
These methods guarantee that the solution complies with the original problem’s binary constraints. Although the relaxation introduces some error, it does not significantly affect the optimality of the solution, as the resulting binary solution is close to the continuous optimal solution. This relaxation technique provides a more tractable approach to solving complex non-convex problems, while still maintaining feasibility and optimality with respect to the original binary constraints. We can now introduce the feasible set of ,
, and
.
Finally, we define the closest upper bound function Tn for the objective function in Eq (13). For each iteration t, , where N is the index set. To ensure that the upper bound function is more beneficial for our optimization, we apply a quadratic penalty to the objective function in Eq (13), as follows:
where is the penalty parameter, which can also be applied to other variables. Additionally, in each iteration t, the upper bound function is minimized with respect to
using the values
, which are the solutions from the previous iteration. The solution at iteration t + 1 can be updated by solving the following optimization problem:
The iterative procedure in the BSUM framework updates the decision variables (M, J, and ) in each iteration by minimizing a surrogate objective function for each respective variable. Each subproblem is solved independently, and the solution is updated iteratively until convergence. The convergence of this iterative process is guaranteed under the assumptions of the BSUM framework, which ensures convergence to a stationary point due to the block structure and the use of upper-bound minimization. However, since the problem is non-convex, convergence to the global optimum is not guaranteed. Each iteration involves solving three subproblems corresponding to M, J, and
, with the complexity of each subproblem depending on the number of users, UAVs, and base stations. Specifically, solving each subproblem requires linear or quadratic optimization, so the overall complexity per iteration is
⋅
, where n represents the number of decision variables and m is the number of iterations required for convergence.
In the BSUM framework, we must also solve subproblems for three sets of variables. Since task offloading, pricing, and resource allocation involve multiple decision variables and complex constraints, and the optimization objectives are often nonlinear, traditional exact optimization methods (such as linear programming and integer programming) are frequently unable to find the global optimal solution within a reasonable time for large-scale problems, due to the vast solution space. Heuristic algorithms, which do not rely on the continuity or differentiability of the problem, are more suitable for handling complex combinatorial optimization problems. For the optimization of pricing variables, resource allocation, and offloading strategies, the interdependencies between variables and the complex constraints make exact solutions difficult. Heuristic algorithms can flexibly adapt to these changes, handle discrete decision spaces and complex constraints, and thus provide effective solutions. Therefore, the use of heuristic algorithms, such as GAs and SA, offers an effective solution.
Algorithm 1 Joint optimization of system utility based on the BSUM framework.
1: Initialization: Set t = 0, , and find the initial feasible solution.
2: repeat 3: Select the index set N.
4: Set .
5: Set .
6: Interface with GA: Pass the continuous decision variables (,
,
) to the GA to optimize the discrete decision variables.
7: Update with GA output: Receive the optimized discrete variables from GA and update ,
,
.
8: Interface with SA: Pass the updated continuous variables to the SA algorithm for further refinement of the continuous decision variables.
9: Update with SA output: Receive the refined continuous variables from SA and update ,
,
.
10: Set t = t + 1.
11: unit
12: Final Solution: Set ,
, and
as the required solution.
Optimization of Pricing Variables: GA. The optimization of pricing variables involves the pricing set by the base station for users, the pricing
set by the base station for UAVs, and the pricing
set by the UAV for users. These pricing variables need to be optimized under multiple constraints, with the goal of maximizing system benefits and reducing costs.
The Genetic Algorithm (GA) is used to optimize the pricing variables, including those set by base stations and UAVs. The population size is set to 200, striking a balance between solution diversity and computational efficiency. Tournament selection is employed with a tournament size of 4, which helps maintain diversity and avoid premature convergence. The GA terminates either after 50 generations or when the fitness improvement is smaller than a threshold ().
The crossover rate is set to 0.9, and the mutation rate is set to 0.25, based on preliminary testing. A sensitivity analysis showed that the algorithm remains stable across a range of parameters, demonstrating robustness. The GA evaluates and selects solutions, iterating through crossover and mutation steps to improve the pricing strategy and maximize overall system utility.
Algorithm 2 Optimizing pricing variables using GA.
1: Randomly initialize the population .
2: Define the fitness function based on the system objective.
3: Select the index set N.
4: Select individuals based on their fitness.
5: Perform crossover and mutation operations to generate new individuals.
6: Update the population with the newly generated individuals.
7: Repeat the steps until convergence (maximum generations or fitness threshold is reached).
8: repeat
9:
10: until
11: Output the optimized pricing variables .
Optimization of Resource Allocation Variables: SA Algorithm. The optimization of resource allocation variables involves the distribution of computing resources and
by the base station and UAV. The optimization objective is to maximize the utilization of computing resources while ensuring that the resource allocation meets the system’s demands and constraints.
The SA algorithm is used to optimize the resource allocation variables. The SA algorithm perturbs the current solution randomly and uses the Metropolis criterion to decide whether to accept the new solution, thereby avoiding getting stuck in local optima.
Algorithm 3 Optimizing resource allocation variables using SA.
1: Initialize the resource allocation variables .
2: Define the objective function for resource allocation.
3: Set the initial temperature and cooling coefficient.
4: while the temperature is greater than the final temperature do
5: Generate a new solution by perturbing the current solution.
6: Calculate the objective function value of the new solution.
7: Accept the new solution based on the Metropolis criterion, with the acceptance probability depending on the temperature.
8: Lower the temperature.
9: end while
10: Output the optimized resource allocation variables .
The optimization of offloading strategy variables involves the user’s task offloading decisions . These decisions need to be optimized based on the user’s computational requirements, network conditions, and resource availability to maximize the system’s utility.
The offloading strategy variables are also optimized using the SA algorithm. The algorithm steps are outlined in Algorithm 3, where the offloading strategy is randomly adjusted, and the new offloading scheme is evaluated based on the objective function. This process gradually converges to the optimal solution.
The solution methods for these three optimization problems involve the use of GAs and SA algorithms to optimize the pricing variables, resource allocation variables, and offloading strategy variables, respectively. The GA optimizes pricing and resource allocation through a global search, while the SA algorithm optimizes the offloading strategy by gradually cooling down, ensuring optimal performance of the entire system under complex constraints.
Simulation and result analysis
To evaluate the proposed solution, the simulation environment consists of an MEC network with two base stations, six UAVs, and eight users. The locations of the base stations, UAVs, and users are distributed in a Cartesian coordinate system to facilitate task offloading. The remaining simulation parameters, including transmission power, resource allocation, and task offloading settings, are shown in Table 3. Specifically, the transmission power values for users and UAVs were chosen based on typical values used in UAV-assisted MEC systems from previous studies. The resource allocation values were adjusted through experimentation to balance system efficiency and user fairness.
A schematic illustrating the spatial distribution of base stations, UAVs, and users in the simulation setup is shown in Fig 2. The figure clearly depicts the placement of the nodes (base stations, UAVs, and users) in the Cartesian coordinate system. This layout allows for the simulation of offloading tasks, where users connect to either the base stations or UAVs for task offloading. The communication topology between the nodes is defined by wireless links, with users offloading tasks to the nearest base station or UAV based on distance and resource availability.
We present the convergence of different algorithms during the iteration process in Fig 3. As shown in the tables and figures, the BSUM algorithm demonstrates a clear advantage over the Block Coordinate Descent (BCD) and SA algorithms. For instance, with 8 users, BSUM achieves a system utility of 89,217, while BCD and SA achieve 575,832 and 122,182, respectively. This results in a percentage improvement of approximately 46.5% for BSUM over BCD and 26.9% over SA. As the number of users increases, the performance gap becomes even more pronounced, with BSUM outperforming BCD by more than 40% and SA by over 60% for 24 users. These results clearly demonstrate the superior efficiency and utility of the BSUM algorithm across different network configurations.
The BSUM algorithm not only converges in fewer iterations but also achieves a significantly higher total utility at the end, indicating that it performs more efficiently in handling complex optimization problems.
Fig 4 shows the comparison of system utility under different numbers of users (8, 16, and 24 users) for various offloading strategies. The overall utility using the BSUM algorithm is significantly higher than with other offloading methods. Particularly when the number of users is 8 and 24, the BSUM algorithm demonstrates a considerable improvement in utility. For the “all users local computation” scenario, the system utility is relatively low, indicating that local computation does not effectively enhance system performance. Furthermore, offloading tasks through UAVs or base stations provides some utility improvement, but it is not as effective as the optimization achieved by the BSUM algorithm.
Fig 5 further compares the total utility of the BSUM algorithm with the other two optimization algorithms (BCD and SA) under different numbers of users. It is evident that the BSUM algorithm outperforms both the BCD and SA algorithms in all user scenarios. Particularly when the number of users is 24, the BSUM algorithm demonstrates a clear advantage in terms of system utility, outperforming the other two algorithms. This indicates that the BSUM algorithm can effectively improve resource utilization efficiency and overall system performance in large-scale user environments and complex network settings.
Fig 6 shows the changes in base station revenue, UAV revenue, and user costs under different numbers of users. In all cases, the revenue of the base stations and UAVs is much lower than the costs of the users, especially as the number of users increases, leading to a significant rise in user costs.
To further investigate the effect of key parameters on system performance, Fig 7 presents the sensitivity analysis of system utility to the variation of and
. The parameter
is a penalty factor that controls the trade-off between maximizing system utility and ensuring that constraints are met, such as resource limits and cost balances. As
increases, system utility rises, reflecting that a stronger penalty on constraint violations leads to a more balanced and efficient system, optimizing the trade-off between revenue and user costs. On the other hand,
is related to the resource allocation efficiency or the system’s pricing strategy. As
increases, system utility decreases, indicating that higher values of
may increase costs or reduce resource allocation efficiency, ultimately affecting the system’s overall utility.
Conclusion
This paper addresses the system utility optimization problem in UAV-assisted MEC networks by proposing a new optimization framework. The goal is to maximize overall system utility, defined as the difference between the revenues of UAVs and base stations and the costs incurred by users. Unlike traditional single-objective optimization approaches, this study considers multiple constraints, including computational resources and energy consumption. The problem is formulated as a Mixed-Integer Nonlinear Programming (MINLP) problem, and the solution is obtained by combining the Block Successive Upper Bound Minimization (BSUM) framework with heuristic algorithms.
Simulation results show that the proposed BSUM algorithm performs effectively across various scenarios, significantly improving system utility. As the number of users increases, the system optimizes resource allocation and task offloading strategies more efficiently, although resource usage also rises with the user count. The analysis further compares the BSUM algorithm with traditional heuristics, highlighting its advantages in terms of resource consumption and utility under varying user loads.
However, the study has limitations. The simulations were conducted in a small-scale setup with idealized channel conditions, and real-world systems may involve more complex interactions in larger, dynamic networks. Additionally, the absence of mobility modeling for UAVs and users means that the impact of mobility on network performance was not considered. These factors may affect the system’s performance in practical deployments.
Despite these limitations, the proposed framework demonstrates strong convergence and robustness in controlled environments, indicating its scalability potential for larger networks. Future research will focus on incorporating mobility effects, realistic channel models, and testing the framework in larger, more dynamic network setups. Furthermore, we aim to explore energy-efficient task offloading strategies and real-time adaptability to address challenges such as network congestion and environmental factors in real-world applications.
Simulation setup and computational environment
All simulations were conducted using MATLAB R2019b. The experiments were executed on a laptop with the following hardware configuration:
- CPU: AMD Ryzen 7 7735H with Radeon Graphics, 8 cores, 16 logical processors, clock speed of 3.2 GHz.
- GPU: NVIDIA GeForce RTX 4060 Laptop GPU with 4 GB memory, core frequency of 480 MHz, and memory frequency of 6000 MHz.
The simulations were run for an average duration of five hours depending on the specific scenario, with the results obtained reflecting the performance of the proposed algorithm under typical system conditions. The choice of MATLAB and this hardware setup ensures the computational efficiency and scalability of the approach for the network sizes considered in this study.
These details are provided to enable reproducibility of our results and fair performance comparison by other researchers using similar setups.
Supporting information
S1 File. Code.
This file contains all the code for the experiments in the manuscript.
https://doi.org/10.1371/journal.pone.0342583.s001
(ZIP)
References
- 1. Li M, Cheng N, Gao J, Wang Y, Zhao L, Shen X. Energy-efficient UAV-assisted mobile edge computing: resource allocation and trajectory optimization. IEEE Transactions on Vehicular Technology. 2020;69(3):3424–38.
- 2. Yuan X, Xie Z, Tan X. Computation offloading in UAV-enabled edge computing: a stackelberg game approach. Sensors (Basel). 2022;22(10):3854. pmid:35632262
- 3. Zhao GY, Zhang X, Zhang J, Zhang J, Yuan P. A trajectory prediction-based edge offloading strategy for internet of vehicles. Journal of Shandong University (Engineering Science). 2024;54(1):52–62.
- 4. Mrabet M, Sliti M. Toward secure, trustworthy, and sustainable edge computing for smart cities: innovative strategies and future prospects. IEEE Access. 2025;13:174236–53.
- 5. Wang L, Wang K, Pan C, Xu W, Aslam N, Hanzo L. Multi-agent deep reinforcement learning-based trajectory planning for multi-UAV assisted mobile edge computing. IEEE Trans Cogn Commun Netw. 2021;7(1):73–84.
- 6. Hong M, Razaviyayn M, Luo Z-Q, Pang J-S. A unified algorithmic framework for block-structured optimization involving big data: with applications in machine learning and signal processing. IEEE Signal Process Mag. 2016;33(1):57–77.
- 7. Wang D, Tian J, Zhang H, Wu D. Task offloading and trajectory scheduling for UAV-enabled MEC networks: an optimal transport theory perspective. IEEE Wireless Commun Lett. 2022;11(1):150–4.
- 8. Wu Q, Chen J, Xu Y, Qi N, Fang T, Sun Y, et al. Joint computation offloading, role, and location selection in hierarchical multicoalition UAV MEC networks: a stackelberg game learning approach. IEEE Internet Things J. 2022;9(19):18293–304.
- 9. Zhou H, Wang Z, Min G, Zhang H. UAV-aided computation offloading in mobile-edge computing networks: a Stackelberg game approach. IEEE Internet Things J. 2023;10(8):6622–33.
- 10. Asheralieva A, Niyato D. Hierarchical game-theoretic and reinforcement learning framework for computational offloading in UAV-enabled mobile edge computing networks with multiple service providers. IEEE Internet Things J. 2019;6(5):8753–69.
- 11. Wölfl B, Te Rietmole H, Salvioli M, Kaznatcheev A, Thuijsman F, Brown JS, et al. The contribution of evolutionary game theory to understanding and treating cancer. Dyn Games Appl. 2022;12(2):313–42. pmid:35601872
- 12.
Ding Y, Lu W, Zhang Y, Feng Y, Li B, Gao Y. Energy consumption minimization for secure UAV-enabled MEC networks against active eavesdropping. 2023. p. 1–5.
- 13. Li Y, Aghvami AH, Dong D. Path planning for cellular-connected UAV: a DRL solution with quantum-inspired experience replay. IEEE Trans Wireless Commun. 2022;21(10):7897–912.
- 14. Li Y, Madhukumar AS, Ernest TZH, Zheng G, Saad W, Aghvami AH. Energy-efficient UAV-driven multi-access edge computing: a distributed many-agent perspective. IEEE Transactions on Communications. 2025;73(9):8405–15.
- 15. Li Y, Aghvami AH. radio resource management for cellular-connected UAV: a learning approach. IEEE Trans Commun. 2023;71(5):2784–800.
- 16. Liu C, Zhong Y, Wu R, Ren S, Du S, Guo B. Deep reinforcement learning based 3D-trajectory design and task offloading in UAV-enabled MEC system. IEEE Trans Veh Technol. 2025;74(2):3185–95.
- 17.
Jing L, Jia X, Lv Y, Wan N. Maximizing the average secrecy rate for UAV-assisted MEC: a DRL method. In: 2021 IEEE 5th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC). 2021. p. 2514–8. https://doi.org/10.1109/iaeac50856.2021.9390913
- 18. Morchid A, Ismail A, Khalid HM, Qjidaa H, Alami RE. Blockchain and IoT technologies in smart farming to enhance the efficiency of the agri-food supply chain: a review of applications, benefits, and challenges. Internet of Things. 2025;33:101733.
- 19. Morchid A, Oughannou Z, Khalid HM, Qjidaa H, El Alami R, Siano P. Fire risk assessment system for food and sustainable farming using ai and IoT technologies: benefits and challenges. Internet of Things. 2025;33:101704.
- 20. Liu B, Wan Y, Zhou F, Wu Q, Hu RQ. Resource allocation and trajectory design for MISO UAV-assisted MEC networks. IEEE Trans Veh Technol. 2022;71(5):4933–48.
- 21. Tian J, Wang D, Zhang H, Wu D. Service satisfaction-oriented task offloading and UAV scheduling in UAV-enabled MEC networks. IEEE Trans Wireless Commun. 2023;22(12):8949–64.
- 22. Sun L, Wan L, Wang X. Learning-based resource allocation strategy for industrial IoT in UAV-enabled MEC systems. IEEE Trans Ind Inf. 2021;17(7):5031–40.
- 23. Kota NR, Naidu K. Minimizing energy consumption in H-NOMA based UAV-assisted MEC network. IEEE Commun Lett. 2023;27(9):2536–40.
- 24. Tun YK, Park YM, Tran NH, Saad W, Pandey SR, Hong CS. Energy-efficient resource management in UAV-assisted mobile edge computing. IEEE Commun Lett. 2021;25(1):249–53.
- 25. Tun YK, Kim KT, Zou L, Han Z, Dán G, Hong CS. Collaborative computing services at ground, air, and space: an optimization approach. IEEE Transactions on Vehicular Technology. 2024;73(1):1491–6.
- 26. Asheralieva A, Niyato D. Hierarchical game-theoretic and reinforcement learning framework for computational offloading in UAV-enabled mobile edge computing networks with multiple service providers. IEEE Transactions on Vehicular Technology. 2024;73(1):1491–6.
- 27.
Du F, Wang Y, Huang J, Yang Y. Research on gridded UAV scheduling based on heuristic greedy algorithm. In: 2023 International Conference on Image Processing, Computer Vision and Machine Learning (ICICML). 2023. 979-8-3503-3141-7/23.
- 28.
Zhou T, Liu Z, Jin W, Han Z. Multiple targets assignment based on reinforcement learning hyper-heuristic algorithm of multi-UAV. In: Proceedings of the 44th Chinese Control Conference, Chongqing, China. 2025.
- 29. Tian J, Wang D, Zhang H, Wu D. Service satisfaction-oriented task offloading and UAV scheduling in UAV-enabled MEC networks. IEEE Trans Wireless Commun. 2023;22(12):8949–64.
- 30. Wang D, Tian J, Zhang H, Wu D. Task offloading and trajectory scheduling for UAV-enabled MEC networks: an optimal transport theory perspective. IEEE Wireless Commun Lett. 2022;11(1):150–4.
- 31. Sun L, Wan L, Wang X. Learning-based resource allocation strategy for industrial IoT in UAV-enabled MEC systems. IEEE Trans Ind Inf. 2021;17(7):5031–40.
- 32.
Ding Y, Lu W, Zhang Y, Feng Y, Li B, Gao Y. Energy consumption minimization for secure UAV-enabled MEC networks against active eavesdropping. In: 2023 IEEE 98th Vehicular Technology Conference (VTC2023-Fall). 2023. p. 1–5. https://doi.org/10.1109/vtc2023-fall60731.2023.10333674