Reconfigurable intelligent surface and UAV coordination for reliable THz wireless networks

M. Rudra Kumar; Ravi Uyyala; M. Ramchander; Y. Ramadevi; Ramesh Babu Palamakula; E. Padmalatha; Deema Mohammed Alsekait; Diaa Salama AbdElminaam; Premkumar Chithaluru

doi:10.1371/journal.pone.0345290

Abstract

Terahertz (THz) communication is a promising enabler for next-generation wireless networks because it can support ultra-high data rates. However, severe path loss, molecular absorption, and high sensitivity to blockage significantly limit coverage and reliability. To address these challenges, this work proposes a RIS-assisted UAV positioning (RAVP) framework that integrates reconfigurable intelligent surfaces (RIS) with unmanned aerial vehicles (UAVs) and jointly optimizes RIS configuration and UAV deployment to enhance THz communications. RISs provide controllable reflections to improve propagation conditions, while UAVs enable flexible placement of RISs at advantageous locations. A reinforcement learning (RL)-based strategy that combines modified K-means clustering with gradient-based optimization coordinates user grouping, RIS phase-shift adaptation, and UAV positioning within a unified framework. Simulation results show consistent gains in link robustness, achievable data rate, and user connectivity across different network configurations compared with conventional THz systems without RISs or UAV-assisted optimization. These findings highlight the potential of coordinated RIS-UAV optimization for future 6G-enabled wireless networks, including smart-city and Internet of Things (IoT) applications.

Citation: Kumar MR, Uyyala R, Ramchander M, Ramadevi Y, Palamakula RB, Padmalatha E, et al. (2026) Reconfigurable intelligent surface and UAV coordination for reliable THz wireless networks. PLoS One 21(3): e0345290. https://doi.org/10.1371/journal.pone.0345290

Editor: Zeashan Hameed Khan, King Fahd University of Petroleum & Minerals, SAUDI ARABIA

Received: July 31, 2025; Accepted: March 3, 2026; Published: March 23, 2026

Copyright: © 2026 Kumar et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: This study did not use, generate, or analyze any real-world or third-party datasets. All results reported in the manuscript were obtained through simulator-based computations using analytically modeled THz channel characteristics and algorithmically generated network scenarios. As no experimental, human-subject, proprietary, or sensitive data were involved, there are no ethical, legal, or institutional restrictions on data sharing. Consequently, no data access committee or third-party organization is applicable to this work. The results can be reproduced by re-running the simulation framework using the parameters reported in the manuscript.

Funding: This research was funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2026R435), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Wireless data demand is increasing rapidly, driving research toward new frequency bands that can provide higher data rates and more reliable connectivity. THz communication, operating in the frequency range of 0.1–10 THz, offers extremely large bandwidth and can enable ultra-high-speed data transmission beyond the capabilities of conventional radio-frequency (RF) systems [1]. These features make THz communication suitable for applications such as real-time 8K video streaming, low-latency cloud services, remote healthcare monitoring, and advanced sensing systems [2].

Despite these advantages, THz communication faces significant technical challenges. A major limitation is severe signal attenuation caused by molecular absorption and free-space path loss. These effects reduce transmission range and link reliability, particularly in outdoor and dense environments where obstacles can block or scatter signals [3]. As a result, maintaining stable links under practical deployment conditions remains difficult, which limits real-world adoption.

Challenges

Deploying THz systems in real-world environments introduces several challenges. Due to the high operating frequency, THz signals experience high propagation loss and strong reflections compared to lower-frequency waves. This limits communication range and creates coverage gaps. Physical obstacles such as buildings, trees, and vehicles can obstruct line-of-sight (LoS) links, leading to frequent interruptions. User mobility further complicates operation because the topology changes dynamically and direct links become difficult to maintain. In addition, variations in weather and temperature can affect signal propagation characteristics [4].

To address these challenges, adaptive and intelligent solutions are required. RIS and UAV technologies have emerged as promising options. RIS can manipulate propagation through programmable phase adjustments, while UAVs can reposition dynamically to act as aerial relays. Their joint use can improve coverage, enhance signal quality, and increase adaptability to varying network conditions [5].

Problem statement

Despite the promise of THz communication for ultra-high-data-rate wireless systems, practical deployment remains constrained by unresolved technical limitations. THz signals experience severe distance-dependent path loss, strong molecular absorption, and extreme sensitivity to blockage, which collectively lead to intermittent links and fragmented coverage in realistic environments. Existing mitigation approaches-including static or heuristically placed RISs and preplanned UAV deployments-often cannot adapt to rapid variations in user distribution, channel conditions, and blockage dynamics. Moreover, many optimization strategies treat user grouping, RIS configuration, and UAV trajectory planning as separate (or loosely coupled) problems, which can yield suboptimal overall performance. These limitations prevent conventional THz systems from achieving reliable connectivity and consistent quality of service, particularly in infrastructure-scarce or highly dynamic scenarios. Therefore, a coordinated and adaptive optimization framework is needed to jointly address propagation control, mobility management, and user association in THz networks.

Motivation

The growing demand for data-intensive and latency-sensitive applications has intensified interest in THz communication as a key enabler for future wireless networks. However, the propagation vulnerabilities of THz signals limit the effectiveness of solutions that rely on fixed infrastructure or static optimization assumptions. While RIS technology provides programmable control over reflections and UAVs provide mobility for flexible network support, their potential is often underutilized due to limited adaptability and a lack of intelligent coordination. In particular, conventional deployment methods may not respond effectively to time-varying channel conditions, user mobility, and blockage-induced link degradation. RL offers a principled way to address these challenges by enabling continual learning and decision-making based on environmental feedback. By jointly optimizing UAV positioning, RIS phase configuration, and user grouping within an RL-driven framework, the network can adapt to changing conditions, improve link robustness, and increase spectral efficiency. This motivates the development of an integrated RAVP framework for reliable, high-performance THz communication in dynamic environments.

Contributions

In light of the above discussion, this work introduces a novel RAVP framework that addresses key limitations of existing THz communication systems through coordinated and adaptive optimization. The proposed approach jointly handles user association, RIS phase adjustment, and UAV positioning within a unified RL-based framework, enabling real-time adaptation to dynamic network conditions. The main contributions of this paper are summarized as follows:

An intelligent RIS and UAV-assisted THz communication framework is proposed to mitigate signal blockage, coverage fragmentation, and reliability challenges in high-frequency wireless environments.
A joint optimization strategy is developed by integrating RL with modified K-means clustering and gradient-based optimization to simultaneously perform user grouping, RIS phase configuration, and UAV positioning.
Extensive performance evaluation demonstrates enhanced data rates, improved signal quality, and increased system efficiency, highlighting the effectiveness of the proposed RAVP framework for 6G and future wireless networks.

The remainder of this paper is organized as follows. The literature review summarizes related work. The system model section describes the proposed RIS-UAV system model and RL framework. The results and discussion section presents the simulation results and their implications. Finally, the conclusion summarizes the key findings and outlines future research directions.

Literature review

The literature highlights the challenge of providing wide-area connectivity in satellite-based IoT networks, particularly in terms of efficient spectrum utilization. One proposed solution is a non-orthogonal, slotted-ALOHA multiple-access scheme for satellite communications. This approach supports large-scale device access under limited bandwidth conditions. Although it may introduce additional transmission delay, it provides a practical mechanism for connecting many IoT devices in regions where terrestrial infrastructure is unavailable. Satellite-assisted communication can therefore extend network coverage to remote and underserved areas [5,6].

IoT applications

UAVs play a significant role in data collection and environmental monitoring because of their relatively low operating cost. Advances in onboard hardware and image-processing techniques have expanded UAV applications, particularly in agriculture. When UAVs and sensors are integrated through IoT frameworks, the collected data can be transmitted over the Internet, enabling better decision-making and improved operational efficiency [7,21].

This study examines the integration of UAVs into 5G new radio networks to enhance wireless communication (WC) capabilities. The focus is on providing reliable and cost-effective connectivity for IoT devices, especially in scenarios where conventional cellular infrastructure is unavailable. Experimental results demonstrate the effectiveness of the UAV-assisted framework in delivering seamless wireless connectivity, highlighting its potential to complement existing 5G wireless networks [8,24].

The emergence of 6G-IoT promises unprecedented advancements in network capacity and management. This work explores the potential of 6G-IoT and ongoing research efforts aimed at realizing its capabilities. Owing to its transformative features, 6G-IoT is expected to play a central role in shaping the future of connected systems [9,15,25].

Emerging UAV-assisted 6G networks

In 6G networks, UAVs play a critical role in WC; however, operating in open environments and relying on LoS links introduce security vulnerabilities. To enhance security, a three-dimensional beamforming approach with physical-layer support is employed in UAV systems. This method improves the average secrecy rate using an unsupervised neural network based on a Denoising Autoencoder (DAE). By adaptively adjusting beam directions and incorporating artificial noise, the approach enhances communication security in UAV-enabled 6G systems [10,11].

The integration of UAVs with Artificial Intelligence (AI) establishes a robust foundation for edge computing in vehicular environments. This architecture enables a smart and adaptive Vehicle-to-Everything (VoE) network operating in three-dimensional (3D) space. It addresses challenges such as fluctuating network conditions, high computational demands, and large data volumes generated by mobile vehicles. Since satellite networks often experience high latency and limited throughput, UAVs offer extended 3D coverage beyond fixed terrestrial networks, which is particularly relevant for future 6G deployments [10,11].

A real-world experimental evaluation using a 5G radio system demonstrates the performance of UAV-based communication under practical conditions. The network is divided into two slices: one dedicated to UAV control signaling and another for data transmission. The results show that network slicing effectively provides isolated and reliable communication channels for UAV operations [12,13].

Another approach integrates visible light communication (VLC), UAVs, and RISs to improve energy efficiency in wireless networks. In this framework, UAVs serve dual roles by providing network coverage and illumination, while RISs are strategically deployed to reflect signals and enhance connectivity. This combination improves energy utilization and strengthens communication links between UAVs and ground users [14–17].

In UAV-based systems, data management is commonly supported by a distributed database that interfaces with multiple external components through a unified platform. This architecture facilitates efficient data exchange across the network. As UAV operations require reliable and stable communication links, current 5G technologies are considered suitable for supporting both UAV-to-ground and UAV-to-UAV communications [18,19].

The large-scale deployment of Unmanned Autonomous Intelligent Systems (UAIS) in conjunction with 6G Non-Terrestrial Networks (NTN) enables autonomous interactions among self-driving vehicles, UAVs, and robotic systems for applications such as precision agriculture and disaster response. Ongoing research in 6G technologies aims to integrate connectivity and intelligence more seamlessly. Consequently, a comprehensive review of 6G NTN from a UAIS perspective is essential for supporting future developments in this domain [20,21].

Millimeter-wave spectrum band

This work introduces a passive relay architecture that steers signals in different directions based on frequency characteristics. Using this approach, UAVs can exploit the 5G millimeter-wave (mmWave) spectrum to transmit multiple directional beams, with each beam operating on a distinct channel. This enables simultaneous coverage of multiple regions. The proposed design relies solely on passive components, eliminating the need for complex and costly phased-array systems [12].

Future 6G networks aim to surpass the capabilities of 5G by introducing new functionalities such as ambient sensing and enhanced human–machine interaction. These advancements are driven by the integration of AI and emerging technologies, including THz communication, three-dimensional networking, quantum communication, holographic beamforming, backscatter communication, intelligent reflecting surfaces, and smart caching. Collectively, these innovations are expected to redefine next-generation WC systems [10].

Modulation techniques

The modulation techniques adopted in this framework comply with established network standards. These schemes dynamically adapt to user requirements, enabling efficient communication while adhering to predefined user configurations and latency constraints. Such flexibility enables effective management of core communication functions through passive processing mechanisms, even in interference-prone environments.

Data privacy

A layer-wise review methodology enables system designers and network operators to identify vulnerabilities within communication protocols used in 6G applications. By analyzing each protocol layer independently, this approach provides a structured understanding of privacy and security challenges while highlighting areas requiring further investigation. The findings emphasize the importance of strengthening security and data protection mechanisms to address evolving threats in 6G networks [14].

Maintaining secure and efficient communication requires careful examination of the impact of security mechanisms on system latency. Addressing these requirements necessitates robust security models that support controlled system access and efficient resource utilization. In parallel, the expansion of the Internet of Everything (IoE) introduces additional challenges, underscoring the need for intelligent and distributed security solutions suitable for resource-constrained devices [19].

Privacy and security are fundamental to the reliability of next-generation wireless networks and must be incorporated during the early stages of 6G design. For instance, Internet of Vehicles applications introduce new privacy and security concerns affecting road users. Accordingly, comprehensive frameworks are required to protect both system architecture and user data [20].

Wireless communication

Wireless communication is central to enabling diverse UAV applications and has attracted significant research interest. While conventional technologies such as direct links, Wi-Fi, and satellite communication remain relevant in remote areas lacking cellular coverage, emerging 5G and beyond cellular networks offer scalable and cost-effective solutions for supporting the growing number of UAVs [16].

The realization of telepresence and mixed-reality applications is supported by advances in high-resolution imaging, sensing technologies, accurate positioning systems, and next-generation wireless networks. These developments are expected to drive a transition from traditional smartphones to immersive extended-reality platforms delivered through lightweight wearable devices. Such systems will offer enhanced resolution, frame rates, and dynamic range, enabling more interactive and immersive user experiences [22].

The Internet of Everything facilitates connectivity in remote and underserved regions and represents a key application shaping 6G system architectures. These emerging use cases demand higher data rates and lower latency than current 5G systems, thereby accelerating the evolution toward next-generation WC technologies [23].

Machine learning in 6G IoT

Machine learning (ML) plays a critical role in enabling intelligent IoT applications across multiple layers, including the application, network, and perception layers. At the application layer, ML techniques are widely used for task offloading and resource allocation, improving system efficiency and scalability. Additionally, edge computing provides localized storage and processing capabilities, facilitating the deployment of edge intelligence. This work presents a comprehensive review of ML applications in IoT systems, emphasizing their role in advancing intelligent IoT environments [24,25]. Furthermore, a detailed survey of ML techniques applied to UAV networks categorizes existing methods based on network and communication characteristics. The study highlights the potential of ML-driven approaches to optimize UAV network performance by leveraging multi-source data and deep learning models [20].

The evolution of 6G networks is expected to introduce AI-driven autonomous systems as a core component. Among various 6G services, video-centric applications are anticipated to dominate data traffic. Key technologies contributing to 6G development include THz communication, AI, optical messaging, three-dimensional networking, UAV integration, and wireless power transfer (WPT). This work examines the combined impact of these technologies on the future landscape of WC [6].

Recent literature on RIS-UAV-THz communication

Pan et al. (2025) [26] introduced an adaptive resource allocation framework for IoT systems operating over RIS-UAV-aided NOMA-enabled THz communication networks. The study integrates computing power networks (CPNs) with intelligent resource management to jointly optimize communication and computation performance. The proposed scheme demonstrates effective improvements in spectral efficiency and task execution reliability under dynamic IoT traffic conditions. However, the work primarily focuses on centralized optimization and assumes ideal channel estimation, which may limit its applicability in highly dynamic UAV-assisted environments with imperfect channel knowledge.

Du et al. (2022) [27] investigated the performance limits and optimization strategies of RIS-aided THz communication systems. The authors developed analytical models to characterize path loss, beam misalignment, and reflection efficiency in THz bands, followed by optimization of RIS phase configurations. The results confirm that RIS deployment significantly enhances signal coverage and link reliability in THz networks. Despite its strong theoretical foundation, the study does not consider UAV-assisted mobility or learning-based adaptation, which restricts its applicability to static or semi-static network scenarios.

Pan et al. (2025) [28] presented an intelligent resource optimization framework for CPN-enabled IoT systems using RIS-UAV-assisted NOMA-THz communication. The approach employs learning-based optimization to jointly manage communication, computation, and networking resources. Simulation results demonstrate improved system throughput and reduced latency compared to conventional resource allocation schemes. Nevertheless, the framework relies on iterative learning mechanisms with notable computational overhead, which may challenge real-time deployment in large-scale IoT networks.

Song et al. (2025) [29] proposed a miniature UAV-aided cooperative THz network incorporating reconfigurable energy harvesting holographic surfaces. The work introduces a novel energy-aware THz communication model that combines cooperative UAV relaying with simultaneous wireless information and WPT. The proposed system achieves enhanced energy efficiency and extended network lifetime. However, the reliance on specialized holographic surfaces and energy harvesting hardware introduces practical deployment constraints and increases system complexity.

Gaps in existing literature

In THz communication systems, signal propagation remains highly vulnerable to blockage caused by obstacles such as buildings and human bodies, leading to significant degradation in link reliability and coverage. Existing solutions do not fully address the need for robust transmission mechanisms under dynamically obstructed environments.
The increasing deployment of UAVs and IoT devices places substantial pressure on spectrum resources, creating challenges in efficient spectrum allocation and utilization. Current approaches provide limited support for meeting the stringent data rate and latency requirements of emerging UAV-assisted IoT services.
Although extensive research is underway toward 6G wireless networks, the operational capabilities and performance boundaries of 6G systems remain insufficiently characterized. This uncertainty restricts a clear understanding of how future networks will simultaneously satisfy latency, throughput, and reliability demands.
While 5G and emerging 6G architectures aim to support heterogeneous services, existing studies offer limited insight into addressing joint latency, throughput, and reliability constraints in UAV-centric and highly dynamic communication scenarios.

S1 Table presents a comparative analysis of representative UAV-enabled IoT and 6G communication frameworks with the proposed approach. Existing works primarily focus on isolated aspects such as data collection efficiency, experimental 5G NR connectivity, secrecy enhancement through beamforming, or energy efficiency using RIS and VLC integration. While these approaches demonstrate notable improvements in specific performance metrics, most rely on fixed configurations, limited adaptability, or partial optimization strategies. In contrast, the proposed framework integrates UAVs, RIS, and THz/mmWave communication within a unified 6G IoT architecture and employs RL for joint optimization of communication and deployment parameters. This enables improved coverage and data-rate performance while addressing adaptability challenges inherent in dynamic 6G environments, albeit with practical hardware constraints.

Proposed methodology

The proposed model is a unified system that integrates RIS-aided UAV-enabled WPT and THz-enabled multi-IRS-assisted UAV WC. The overall workflow of the proposed approach is illustrated in S1 Fig.

The workflow illustrated in the diagram presents a sequential process designed to enhance the performance of a technical system, particularly in the context of network control. It begins with the Initialization phase, during which key system parameters are configured and the required data are loaded. This phase establishes the basis for subsequent operations.

In the Group Update Iteration step, user groups are iteratively reviewed and updated to improve system performance. The adjustments performed in this stage support the formation of more efficient group structures, thereby contributing to smoother system operation. Subsequently, the Optimization Problem Formulation step defines the system objectives and constraints. This formulation provides a structured framework for achieving key goals, such as enhanced network coverage and reduced energy consumption. After defining the objectives and constraints, the process proceeds to the User Grouping Optimization step, where users are organized into optimized clusters to improve resource utilization. The method then focuses on two critical components: IRS phase-shift optimization and UAV location optimization. By tuning the reflection characteristics of the IRS and optimizing UAV placement, the system enhances signal directionality and coverage, leading to improved network performance.

RIS-aided UAV-enabled WPT system

The system considers a deployment in which the RIS is mounted on top of a structure. The RIS supports both WC and WPT. The network consists of N single-antenna users randomly distributed within the service area. Based on user locations, the area is partitioned into K service zones (SZ) to improve system efficiency. Each service zone is served by a UAV, denoted as U_AV, which is also equipped with a single antenna. The RIS, UAVs, and users jointly enable WC and WPT across the region. A UAV is an unmanned aerial device capable of autonomous or remote-controlled flight. In WC, UAVs assist in tasks such as monitoring, data collection, and network coverage enhancement. For each service zone, the UAV operates at a fixed altitude h_t to maintain a consistent transmission range T_r. The horizontal position of the UAV is defined as . Compared to variations in altitude h_t, horizontal position adjustments introduce fewer stability constraints, as illustrated in S2 Fig.

The RIS is mounted on a wall at a specified height, denoted by Rh_t, as illustrated in S3 Fig.

The horizontal placement of the RIS is determined by the coordinates . The RIS enhances WC by adaptively adjusting the phase θ and amplitude R of the incident signal I_S. This configuration enables precise control over the reflected signal S_R, resulting in improved signal strength S_S, reduced interference, and enhanced overall system performance. Here, I_S denotes the signal incident on the RIS.

The received signal power after reflection by the RIS, denoted R_P, is used to evaluate the effectiveness of the RIS in enhancing W_C. It represents the signal strength S_S after reflection. In contrast, T_P denotes the power of the transmitted signal incident on the RIS, i.e., the signal strength prior to reflection [5,19].

Phase Adjustment:

where θ represents the complex reflection coefficient for phase adjustment, and Φ denotes the phase shift.

Amplitude Adjustment:

where α represents the complex reflection coefficient for amplitude adjustment, and R denotes the desired amplitude-scaling factor. The term denotes the distance between the U_AV in service zone S_Z and user . It is computed as the Euclidean distance between the horizontal position of the U_AV, , and the position of user n, , while accounting for the UAV altitude term Rh_t (squared in the distance expression). Accordingly, the distance is expressed as follows [5,19]:

This equation computes the straight-line distance between the U_AV and the user by accounting for both horizontal separation and UAV altitude. Based on this formulation, the remaining distances in the system are defined as follows [5,19].

Here, denotes the distance between the RIS and the U_AV operating in S_Z.

Similarly, represents the distance between the RIS and user n in set N, where C_D is a constant and the distance remains fixed during the UAV flight.

THz-enabled multi-IRS-assisted UAV

In addition to the RIS-aided W_PT system, the network supports downlink D_L data transmission using THz frequencies. The system architecture includes a base station (B_S) operating at THz frequencies and equipped with multiple antennas. The direct communication channels between the B_S and users in set N are obstructed by obstacles such as buildings. Consequently, D_L data transmission is facilitated through airborne IRSs (A_IRS). The A_IRS are modeled as uniform planar arrays and assist in reflecting the incident signal I_S to enhance communication performan A uniform planar array is a common antenna configuration in W_C systems. It consists of multiple antenna elements arranged on a two-dimensional grid with uniform spacing. The proposed system integrates RIS-aided W_PT with THz-enabled multi-IRS-assisted U_AV-based W_C, thereby enabling both WPT and data transmission for users distributed across multiple service areas, as illustrated in S4 Fig.

The total number of reflective elements E in the IRS is denoted by , where E_lx and E_ly represent the number of elements within the IRS area A_IRS along the X-axis and Y-axis, respectively. The IRS adaptively adjusts the phase shifts P_S of these elements to reflect the incident signal I_S, thereby improving the performance of the THz network. For analytical convenience, a relative coordinate system associated with the IRS surface A_IRS is defined by x, y, z, representing the directions along the X, Y, and Z axes, respectively. These coordinates serve as reference points for measuring distances and angles, which enables accurate determination of signal propagation paths and reflection directions. The distances and angles with respect to the reference coordinates x, y, z are computed using standard trigonometric relations. The distance d between two points with coordinates and is calculated using the Euclidean distance formula [5,19]:

The U_AV is assumed to maintain a constant altitude over the target area. In addition, the energy consumption associated with the movement duration of the U_AV is neglected. These assumptions simplify the analysis of the THz network performance by excluding detailed operational constraints of the U_AV. Likewise, the angles between vectors are computed using dot products and inverse trigonometric functions. For instance, the angle θ between two vectors v_c1 and v_c2 is obtained as

Time-slot allocation

The system operation relies on a time-division multiple access (TDMA) protocol, as demonstrated in S5 Fig. In particular, the entire operation time indicated as T, is split into S_Z + N time slots. The interval from 0 to is allocated for downlink WPT (), while the remaining time slots are assigned for uplink (U_L) wireless data transfer (W_DT). In this model, S_WT denotes the S_Z-th W_PT time slot, and N_DT represents the n^th W_DT time slot. Here, T_x and R_x indicate packet transmission and reception, respectively.

The constraints for time-slot allocation are defined as [5,19]:

During the time-slot S_WT, the U_AV remains stationary at a predefined location within the service area and transmits power to all N users in that area simultaneously. Since adjusting the transmit power P_t is impractical due to payload and power constraints, it is assumed to be constant. The power received by the n-th user in the S_Z-th service area is expressed as:

Here, represents the composite channel gain between the U_AV and user n. The parameter η denotes the energy conversion efficiency , and is the diagonal reflection coefficient matrix R_CM of the D_L RIS with a fixed reflection amplitude , representing the phase shift of the i^th reflective element. The channels h_tn, , and denote the links between the RIS and user n, the U_AV and RIS, and the U_AV and user n, respectively. Both the U_AV and the user are equipped with single antennas. Due to this single-antenna configuration, the channels associated with the D_L and U_L links are expressed using the same formulation.

During the time-slot N_DT, user n transmits data to the U_AV. As each time slot operates independently, inter-user interference is neglected. The power allocated to user n is denoted by P_tn in the U_L W_DT phase. Accordingly, the data rate d_rn from user n to the U_AV is given by [5,19]:

Here, represents the RIS reflection matrix R_CM, denotes the noise power, and P_S corresponds to the power consumed by the RIS for information processing. To maintain the self-sustainability of the W_PT structure, the energy consumed by user n during the U_L W_DT phase must not exceed the energy harvested during the D_L W_PT phase. Therefore, the energy constraint e_c is defined as:

Since, the RIS is installed on building rooftops and the U_AV operates at a high altitude, the transmission between them experiences minimal obstruction. Hence, a LoS channel model is adopted for the channel vector c_v between the U_AV and the RIS, expressed as:

Here, represents the channel power gain at the reference distance d₀ = 1 m, , d denotes the antenna spacing, and is the carrier wavelength. For the links between the U_AV and users, as well as between the RIS and users, a Rician fading channel model is employed to capture the effects of scattering in the user vicinity. This model incorporates both LoS and non-line-of-sight (NLoS) components, reflecting the propagation conditions in WC environments. Accordingly, the channels g_s,n and h_n are modeled as [5,19]:

In the context of WC, denotes complex Gaussian noise. The terms and represent the scattering and deterministic LoS components, respectively, where

The parameters and denote the path loss exponents, while and represent the corresponding propagation distances. These parameters characterize the attenuation of signal power with distance in WC systems. Without loss of generality, the base station B_S is located at , while the locations of the users and the U_AV are defined accordingly.

Channel model

Spreading loss and molecular absorption are the primary factors influencing signal propagation in the THz band. Spreading loss arises from geometric wavefront expansion during propagation, which reduces received power. Molecular absorption results from interactions between THz waves and atmospheric molecules, causing frequency-dependent attenuation. Together, these effects restrict communication range and link reliability in THz-based systems. The channel transfer function is expressed as [5,19]:

In this expression, f denotes the carrier frequency, c represents the speed of light, and captures the path-loss effect due to molecular absorption. The carrier frequency of the IRS corresponds to the frequency of the electromagnetic wave manipulated by the IRS for signal reflection and enhancement. This frequency determines the operational range of the IRS and directly affects its ability to improve W_C links. The absorption loss coefficient k(f) depends on the transmission frequency and is given by:

Here, P_STP and T_STP denote the standard pressure and temperature, respectively. P and T represent the ambient pressure and temperature, Q_i,g characterizes the propagation environment, and denotes the molecular absorption coefficient at frequency f.

The total propagation distance d is expressed as the sum of two segments, , where d₁ denotes the distance between the base station B_S and the aerial IRS A_IRS, and d₂ denotes the distance between the A_IRS and the user N. These distances are computed as [5,19]:

Direct links between the B_S and the users are assumed to be blocked. Consequently, aerial platforms such as the A_IRS are employed to establish reliable wireless links for data transmission. An IRS mounted parallel to a hovering UAV is considered, with its reflecting elements aligned along the x- and y-axes. Let N_x and N_y denote the number of reflecting elements along each axis, resulting in a total of IRS elements. The channel gain C_G of the cascaded link involving the B_S, A_IRS, and user N is denoted by g(t). The IRS beamforming matrix θ is defined as [5,19]:

Each diagonal entry corresponds to the reflection coefficient of the n-th IRS element, where and j denotes the imaginary unit. For analytical simplicity, is assumed, and the phase shift . The cascaded channel gain from the B_S to the A_IRS and then to the user N at time t is expressed as:

Here, r_t(t) represents the propagation distance from the B_S to the A_IRS, and r_o(t) denotes the distance from the A_IRS to the user. The relative phase difference between the signal received from the B_S and the reference IRS element is given by:

where denotes the relative position vector of the IRS element. The array response vector from the B_S to the IRS at time t is defined as:

The reflected signal phases are jointly adjusted to enable directional beamforming toward the intended users. Accordingly, the transmit array vector from the A_IRS to the k^th user is expressed as [5,19]:

All multi-IRS-enabled UAVs operate within the same frequency band, which introduces co-channel interference denoted by ψ. The signal-to-interference-plus-noise ratio (S_INR) at a user served by a UAV at time t is given by:

Here, B_i denotes the allocated bandwidth, and represents the additive white Gaussian noise power. The achievable data rate R_m of user R_m, derived from the corresponding S_INR, is expressed as:

To address the complexity of the considered communication system, the original optimization problem is decomposed into tractable subproblems. The proposed framework adopts a two-stage strategy: first, optimal user grouping is performed; second, joint optimization of the transmit power P_S and UAV placement is carried out to maximize the average system data rate. This decomposition improves computational efficiency while enhancing overall network performance in terms of d_rn.

K-means grouping

This section describes a step-by-step procedure for improving user grouping in the network. The objective is to iteratively refine the group configuration until convergence. The method focuses on enhancing the value of d_rn while satisfying the associated constraints. To simplify the optimization process, the overall problem is decomposed into smaller subproblems. Initially, users are grouped using an improved variant of the K-means clustering algorithm.

The core principle of this clustering strategy is to partition users into distinct groups based on spatial proximity and the d_rn constraints. In large-scale THz communication scenarios, managing a high number of users becomes challenging. To address this issue, K-means clustering is applied to group users according to minimum distance and rate requirements. This strategy reduces computational complexity and improves the effectiveness of user grouping. K-means is well suited to large-scale, unlabeled datasets and is widely used due to its simplicity, fast convergence, and low computational overhead (Ref. Algorithm 2 and Algorithm 3). In the proposed implementation, the initial user clusters are selected carefully to accelerate convergence and improve clustering stability.

Algorithm 1 User Grouping Algorithm

Require: Number of users, distance threshold, d_rn threshold.

Ensure: Optimal user groups.

1: Initialization: Randomly generate user locations and initialize d_rn for each user.

2: Group update iteration:

3: For each user m = 1 to M do

4: Estimate distance to neighboring users.

5: Assign random d_rn for each user.

6: Assign the user to the group with the smallest d and the largest d_rn within the cluster.

7: End For

8: Repeat steps for all users until convergence.

At each iteration, the Algorithm 1 identifies the minimum distance d between user n and its neighbors. A user with the smallest distance to a neighbor is assigned to the group served by A_IRS. Each user is initially assigned a random distance value d_rn. Group formation is then refined based on the aggregate d_rn values within each group to improve cluster compactness. Subsequently, the cluster coverage probability is evaluated to verify the suitability of the resulting groups. The target area T_r is computed using the altitude of A_IRS and the beam angle θ of the U_AV in the THz communication setup, as given by [5,19]:

In this expression, θ denotes the half-power beam angle, corresponding to the narrow radiation width of the antenna.

Algorithm 2 K-means Clustering

1: Initialization: Select centroids randomly from the dataset: .

2: Assignment: Assign each point x_i to the nearest centroid c_j based on distance: .

3: Update Centroids: Update centroid positions by computing the mean of data points assigned to each centroid:

where s_j represents the set of data points allocated to centroid c_j.

4: Repeat: Repeat until a stopping criterion is met.

5: Optimal Grouping: Once convergence is achieved, the data points are grouped based on the final centroid positions.

Therefore, the problem of maximizing d_rn can be expressed as follows:

subject to

Algorithm 3 Deterministic Rules Estimation

1: Execute the selected action a_t and observe the next state s_t+1 and reward r_t.

2: Store the transition in the experience replay buffer R.

3: Sample a mini-batch of transitions from the replay buffer R.

4: Estimate the target Q-value y_i using the target critic network:

5: Update the critic network by minimizing the loss function:

6: Update the actor network using the tested R_G:

7: Update the actor parameters using the sampled R_G.

8: Update the target networks and using soft updates:

9: Repeat steps until convergence or a predefined number of iterations.

Here, the constraint () guarantees the minimum achievable rate for all N users, thereby satisfying the quality requirements for each n. The constraint limits the total transmit power T_P so that it does not exceed the maximum allowable power budget. The condition () indicates that each IRS reflecting element operates with full reflection, where the reflection coefficient magnitude of all IRS elements is set to unity. The constraints () describe the phase-shift characteristics of the IRS reflecting matrices. Each matrix corresponds to an IRS reflecting matrix P_S, implying that all incident signals are reflected without power loss, thereby improving signal transmission efficiency. The restriction () confines the UAV trajectory to a predefined feasible region. The optimization problem in () is non-convex and challenging to solve directly. Hence, the original problem is decomposed into sequential subproblems, focusing on optimal user clustering and the joint optimization of UAV locations and IRS configurations.

This framework provides a systematic way to select the U_AV location and IRS placement P_S to enhance system performance. By employing RL techniques, the model adapts to varying network conditions, improves performance, and reduces power usage and computational cost. The framework consists of a state s_t, an action a_t, a reward r_t, and an IRS-U_AV agent. The agent interacts with the environment by taking actions, receiving rewards, and updating its state accordingly. Rewards provide feedback that guides the agent toward actions that yield better outcomes. Through repeated interactions, the agent progressively improves its decision-making capability.

In RL, actions are selected sequentially to maximize cumulative reward under varying conditions. The agent observes the current state and chooses an action that is expected to yield a higher reward. Through repeated actions and feedback, the agent accumulates experience and gradually refines its policy, improving performance over time. The integrated design combines UAVs and RISs to improve energy efficiency in wireless links. In RL, the state represents the current environment observed by the agent and contains the information required for decision-making. The state at time t, denoted by s_t, consists of the IRS placement P_S and the U_AV position. Let t_i = t represent the previous UAV horizontal position.

The action defines the set of operations available to the agent at a given time step and represents possible changes to the current system configuration. The action taken at time t denoted by a_t, includes the IRS configuration P_S and the movement of the U_AV [5,19]:

The reward quantifies the immediate benefit obtained after taking an action in a given state and serves as the basis for learning. After executing action a_t in state s_t at time t, the agent receives a reward . The reward corresponds to the aggregated value d_rn across all clusters:

The agent selects actions in a continuous manner, guided by the observed state, to achieve higher rewards. The proposed algorithm enables the agent to learn optimal strategies under dynamically changing THz network conditions.

Advantages of the proposed method

The proposed optimization framework offers several advantages for THz communication systems in future wireless networks.

The joint optimization of user grouping, RIS phase configuration, and UAV positioning improves convergence behavior compared to conventional separate or iterative optimization approaches.
The integration of RL with modified K-means clustering enables adaptive and scalable user association under dynamic network conditions.
UAV-assisted deployment enhances network coverage and link reliability in blockage-prone THz environments, particularly in infrastructure-limited scenarios.
RIS-based passive beam control improves signal quality without introducing significant power or hardware overhead.
The proposed framework is compatible with emerging 6G architectures and supports intelligent, autonomous network operation.

Limitations of the proposed method

Despite these advantages, the proposed framework has certain limitations that warrant further investigation.

The RL process introduces additional computational complexity during the training phase, which may impact real-time deployment in highly resource-constrained systems.
The performance of the framework depends on accurate channel state information and environmental awareness, which can be challenging to maintain in rapidly changing UAV-assisted networks.
The current study considers a limited number of UAVs and RIS elements, and scalability to ultra-dense deployments requires further validation.
The optimization framework is evaluated under controlled simulation settings, and real-world implementation may introduce additional constraints such as hardware imperfections and regulatory restrictions.
Energy consumption associated with UAV mobility is not explicitly optimized and may affect long-duration operations.

Results and discussion

This section evaluates the effectiveness of the proposed approach using RIS and IRS in communication systems. The analysis is based on simulation results and demonstrates the role of these techniques in enhancing the performance of W_C systems. The integration of RIS and IRS into the network results in noticeable improvements in key performance aspects such as data rate, coverage, and link reliability.

Simulation setup

This subsection describes the key factors contributing to the effectiveness of RIS and IRS in the proposed framework. By enabling controllable signal reflection, these technologies improve signal propagation, mitigate interference, and support more efficient utilization of network resources. The coordinated integration of RIS and IRS facilitates enhanced spectrum usage and improved communication reliability under dynamic channel conditions.

The performance evaluation was conducted using the proposed RAVP routing and optimization framework. The simulation environment consisted of 101 IoT nodes deployed over a two-dimensional area of m². UAV-assisted RIS elements were adaptively positioned at controlled altitudes to support THz-band communication. The total simulation duration was set to 200 time units. All experiments were executed over multiple Monte Carlo trials with fixed random seeds, and the reported results represent statistically averaged performance metrics, as summarized in S2 Table. S2a Table summarizes the reproducibility and statistical-robustness reporting used in this study.

Reachable Hops with and without RIS

S6 Fig and S3 Table report the number of reachable hops with and without RIS. The results indicate that using RIS increases the number of reachable hops, implying that the network can maintain connectivity across more nodes and over longer paths. The proposed RIS-assisted approach helps address coverage limitations, extends communication range, and improves communication stability. Overall, these observations show that RIS can enhance the performance of W_C systems, particularly in large-scale and dynamic network environments.

Integrating RIS into the network yields a clear improvement in the number of reachable hops. For example, in the single-user scenario, reachable hops increase from 24 (No RIS) to 87 (RIS). As the number of users increases, reachable hops remain consistently higher with RIS: 77–178 (2 users), 165–292 (3 users), 272–376 (4 users), 362–482 (5 users), 436–563 (6 users), 534–574 (7 users), and 558–594 (8 users). All measurements were obtained over a simulation duration of 200 units. Overall, these results confirm that RIS extends communication paths across varying user configurations.

Average data rate comparison

S4–S7 Tables and S7–S10 Figs compare the average data rate as a function of the number of users and the number of IRS elements for different protocols. The proposed RAVP consistently achieves the highest data rates among the evaluated schemes. This improvement is primarily attributed to effective RIS utilization and its coordinated operation with other system components. Overall, RIS integration improves signal propagation, reduces interference, and enhances channel conditions, which collectively increase achievable data rates.

The average data rate (bps/Hz) achieved by the proposed RAVP framework consistently exceeds that of existing methods, including PPO, Phase Shift, and Random Phase, across a wide range of user counts. For example, with 10 users, RAVP attains 405 bps/Hz, compared with 387 bps/Hz (PPO), 365 bps/Hz (Phase Shift), and 340 bps/Hz (Random Phase). This advantage persists as the number of users increases, with RAVP achieving 467 bps/Hz (20 users), 503 bps/Hz (30 users), and a peak value of 555 bps/Hz (60 users). Even at 90 users, RAVP maintains a higher average data rate of 530 bps/Hz, outperforming the other approaches across all evaluated scenarios.

S4a Table provides a compact summary of the main performance outcomes (reachable hops, data rate, satisfaction, and signal-quality proxies) extracted from S3–S8 Tables, while explicitly tying the reporting to the simulation protocol in S2 Table. For each highlighted scenario, it identifies a representative baseline and the corresponding RAVP value and states that results are to be reported with mean ± standard deviation and 95% confidence intervals over N = 100 independent Monte Carlo trials (with fixed seeds), with additional run-to-run variability reporting for RL-based methods.

Data rate comparison of existing vs. proposed methods

By exploiting the reflective properties of RIS and strategically integrating it with other network components, the proposed RAVP demonstrates significant improvements in data rate performance compared with existing methods. These results highlight the importance of RIS in enabling high-speed and reliable communication in future wireless networks.

The data rate performance of the proposed RAVP algorithm was evaluated and compared with several existing methods, including the proximal policy optimization (PPO) algorithm with 16-IRS and 64-IRS configurations, the Phase Shift algorithm, and the Random Phase Shift algorithm. For the single-user scenario, the UAV algorithm achieved a data rate of 235 bps/Hz, whereas the proposed RAVP with 16 IRS achieved 402 bps/Hz. When the proposed RAVP algorithm was configured with 64 IRS, it achieved the highest data rate of 451 bps/Hz. As the number of users increased, the proposed RAVP algorithm with 64 IRS consistently outperformed all other algorithms, reaching a maximum data rate of 551 bps/Hz with five users. These results demonstrate the improved efficiency of the proposed RAVP algorithm, particularly when combined with 64 IRS, in enhancing data rate performance across different user scenarios.

S6 Table presents a comparison of data rates (in bps/Hz) under different conditions, including the absence of IRS and the use of various algorithms such as PPO, Phase Shift, Random Phase Shift, and the proposed RAVP. Across all considered scenarios (4, 16, 32, 56, and 64 IRS elements), IRS-assisted configurations consistently achieve higher data rates than the case without IRS. Among the evaluated methods, the proposed RAVP attains the highest data rate in every scenario, followed by the PPO algorithm, the Phase Shift algorithm, and the Random Phase Shift method. Furthermore, data rates generally increase as the number of IRS elements increases, with the proposed RAVP demonstrating the most pronounced performance improvement.

S7 Table compares satisfaction rates for five algorithms-the UAV algorithm, PPO, Phase Shift, Random Phase Shift, and the proposed RAVP-across user counts ranging from 10 to 70. The proposed RAVP consistently achieves the highest satisfaction rate, starting at 0.90 for 10 and 20 users and gradually decreasing to 0.51 for 70 users. In contrast, the other algorithms yield lower satisfaction rates across all scenarios. The UAV algorithm performs worst, particularly as the number of users increases. PPO and Phase Shift show moderate performance, while Random Phase Shift generally outperforms them but remains inferior to RAVP.

S11 Fig and S8 Table compare existing and proposed algorithms using three key metrics: signal propagation, interference mitigation, and channel conditions. Without IRS technology, baseline performance is limited (150 m signal propagation, 10 dB interference mitigation, and 4.0 bps/Hz channel conditions). With IRSs, performance improves substantially. PPO increases signal propagation to 200 m, improves interference mitigation to 15 dB, and enhances channel conditions to 5.5 bps/Hz. Phase Shift further improves these metrics to 250 m, 18 dB, and 6.0 bps/Hz, respectively. Random Phase Shift yields moderate gains (220 m, 16 dB, and 5.7 bps/Hz). The proposed RAVP achieves the best performance across all metrics: 280 m, 20 dB, and 6.5 bps/Hz. Overall, these results demonstrate the effectiveness of IRS-based optimization, with RAVP delivering superior signal quality and improved network efficiency.

S9 Table compares the proposed RAVP framework with representative state-of-the-art methods in RIS- and UAV-assisted THz (or related) wireless networks using aligned metrics from the Results and discussion section. Du et al. (2022) [27] provides theoretical insights into RIS-aided THz channels but does not report key metrics such as reachable hops or adaptability under dynamic conditions. Pan et al. (2025) [26] and Pan et al. (2025) [28] report moderate-to-high data rates using resource allocation and iterative learning, but they do not quantify coverage extension or robustness under dynamic UAV-assisted scenarios. Song et al. (2025) [29] emphasizes energy-aware design and improved signal propagation; however, reliance on specialized hardware can constrain scalability. In contrast, the proposed RAVP framework achieves higher reachable hops and average data rates and improves signal propagation, interference mitigation, and channel conditions, while supporting dynamic adaptability and scalability through RL-based joint optimization. Overall, these results demonstrate the performance advantages of RAVP under the simulation conditions considered in this work (S2 Table).

S9a Table reports a quantitative metric comparison between the proposed RAVP coordination strategy and representative recent RIS/THz optimization studies [26–29] under consistent evaluation assumptions (same channel model and system parameterization as documented in S2 Table and averaged over N = 100 Monte Carlo trials with fixed seeds tabulated in S2a Table). The reported measures are: (i) throughput (spectral efficiency, bps/Hz), (ii) E2E latency (ms) capturing scheduling/coordination delay effects, (iii) reliability (success probability, equivalently ), and (iv) energy efficiency (bits/J). In addition, the table explicitly reports the relative gains of RAVP versus the best SOTA entry among [26–29], showing a + 5.7% throughput improvement (555 vs. 525 bps/Hz), an 18.9% latency reduction (15.0 vs. 18.5 ms), and a + 2.1% absolute reliability increase (0.96 vs. 0.94). These significant gains are achieved because RAVP performs joint coordination across (a) user grouping (reducing contention and improving link matching), (b) RIS phase adaptation (constructive combining to increase effective channel gain/SINR), and (c) UAV positioning (shorter propagation distances and improved LoS probability). By optimizing these coupled degrees of freedom together, the proposed framework reduces packet retransmissions/outage events (higher reliability), shortens service times (lower latency), and increases the delivered information per unit power (higher energy efficiency), thereby outperforming approaches that optimize only a subset of these components.

S9b Table reports variability measures for the S9a Table results. Specifically, it states the Monte Carlo protocol (N = 100 independent trials per point) and summarizes the averaging procedure (mean over trials) with standard deviations and 95% confidence intervals for each key metric. Fixed and documented random-seed initialization is assumed across all methods to enable independent replication. For the RL-based evaluation of the proposed RAVP, multiple independent training runs (five runs with different seeds) are also summarized to quantify run-to-run variability. This separation of Monte Carlo channel randomness from learning stochasticity makes the robustness of the reported gains explicit.

Discussion

The results provide clear evidence that the proposed RAVP framework enhances the reliability and performance of THz communication systems through coordinated RIS and UAV optimization. Compared with conventional THz setups without RIS or UAV-assisted control, the proposed approach yields consistent improvements across multiple performance dimensions, including coverage, achievable data rate, signal quality, and user satisfaction.

A key observation from the reachable-hops analysis is the substantial expansion of communication paths enabled by RIS integration. The increase in reachable hops across varying user densities indicates that the proposed framework mitigates THz signal blockage and molecular absorption effects. This improvement translates into enhanced network coverage and more robust connectivity, particularly in dense or obstructed environments where conventional THz links suffer from frequent disconnections. Overall, RIS-assisted reflection combined with adaptive UAV positioning provides a practical mechanism for sustaining multi-hop THz communication under dynamic conditions.

The average-data-rate results further highlight the benefits of the proposed joint optimization strategy. Across all evaluated scenarios, the RAVP framework outperforms benchmark approaches such as PPO-based control, phase-shift optimization, and random phase configurations. These gains are attributed to coordinated adjustment of RIS phase shifts and UAV trajectories, which improves channel alignment and reduces attenuation. Unlike static or separately optimized schemes, the joint learning-based design can adapt to changes in user distribution and channel conditions, yielding sustained throughput improvements as network load increases.

The effect of RIS element density on data-rate performance reveals an important scalability characteristic of the proposed approach. As the number of IRS elements increases, the RAVP framework exhibits more pronounced gains than existing methods. This behavior indicates that the proposed optimization strategy effectively exploits the additional spatial degrees of freedom provided by larger RIS configurations. In contrast, baseline methods show diminishing returns under similar conditions, suggesting limited adaptability to increasing system complexity.

Further insights are obtained from the signal-propagation, interference-mitigation, and channel-condition evaluations. The proposed RAVP framework achieves the highest signal-propagation distance and strongest interference suppression among the compared methods. These improvements reflect the ability of the RL-based controller to steer reflected beams and reposition UAVs in response to environmental variation. Improved channel conditions (higher achievable capacity) confirm that the proposed approach enhances link quality rather than merely increasing transmit power or resource usage.

User-satisfaction results provide an application-level perspective on system performance. The consistently higher satisfaction rates achieved by RAVP across all user densities indicate improved fairness and quality-of-service provisioning. As the number of users increases, competing schemes show a sharp decline in satisfaction due to congestion and suboptimal resource allocation. In contrast, the proposed framework maintains relatively stable satisfaction levels, demonstrating its effectiveness in balancing coverage, throughput, and interference under high-load conditions.

Compared with recent state-of-the-art studies [26,29] on RIS-aided THz optimization and UAV-assisted communication, the proposed method offers distinct advantages. Prior works primarily focus on static RIS optimization, centralized resource allocation, or iterative learning mechanisms and often neglect mobility, scalability, or real-time adaptability. The RAVP framework addresses these limitations through a unified RL-based joint optimization strategy that simultaneously considers user grouping, RIS configuration, and UAV positioning. This integrated design improves convergence behavior and enhances system responsiveness in dynamic environments.

Despite these strengths, certain limitations remain. The proposed framework assumes accurate environmental sensing and control signaling between UAVs and RIS units, which may introduce overhead in large-scale deployments. In addition, hardware constraints related to RIS size and UAV endurance may affect practical implementation. These aspects motivate future work on distributed learning strategies, lightweight control signaling, and energy-aware optimization.

Conclusion and future scope

This study presented a joint RIS-UAV-assisted optimization framework to enhance THz communication performance. The proposed approach enables controllable signal reflections via RIS, improving propagation conditions and reducing interference inherent to high-frequency THz links. UAVs serve as flexible aerial platforms for adaptive RIS deployment, thereby extending network coverage and improving link reliability. An RL-based optimizer that integrates modified K-means clustering and rule-based gradient adjustment jointly addresses user grouping, RIS phase configuration, and UAV positioning. Simulation results demonstrate consistent improvements in coverage, achievable data rate, and signal quality across diverse THz communication scenarios, validating the effectiveness of the proposed framework. Despite these gains, certain limitations remain. The framework relies on RIS hardware with fine-grained phase control and UAV platforms capable of stable, precise positioning, which may introduce deployment complexity and cost constraints in large-scale networks. In addition, the computational overhead associated with learning-based joint optimization may affect scalability under stringent latency requirements. The impact of channel-estimation errors, fast-varying mobility, and dense blockage conditions was also not fully captured, which may influence performance in highly dynamic real-world environments. Future research directions include integrating deep RL to enhance adaptability under rapidly varying channel conditions and user mobility. Practical investigations involving real-time RIS and UAV deployments-such as hardware-in-the-loop experiments and outdoor field trials-can provide deeper insights into implementation feasibility. Further work can also focus on low-cost, energy-efficient RIS architectures and autonomous UAV platforms to improve scalability and support smart-city and IoT-oriented THz communication applications. These directions provide a pathway toward robust, high-performance THz communication systems for next-generation wireless networks.

Supporting information

S1 Fig. Workflow Schematic of the Proposed Approach.

https://doi.org/10.1371/journal.pone.0345290.s001

(ZIP)

S2 Fig. System Architecture of RIS-assisted UAV Communication Network.

https://doi.org/10.1371/journal.pone.0345290.s002

(ZIP)

S3 Fig. Height Optimization of UAV-IRS for Maximized Signal Strength.

https://doi.org/10.1371/journal.pone.0345290.s003

(ZIP)

S4 Fig. IRS placement and coverage area for optimal user communication.

https://doi.org/10.1371/journal.pone.0345290.s004

(ZIP)

S5 Fig. TDMA time-slot allocation in the proposed model.

https://doi.org/10.1371/journal.pone.0345290.s005

(ZIP)

S6 Fig. Reachable Hops Comparison With and Without RIS.

https://doi.org/10.1371/journal.pone.0345290.s006

(ZIP)

S7 Fig. Impact of Number of Users on Average Data Rate (bps/Hz).

https://doi.org/10.1371/journal.pone.0345290.s007

(ZIP)

S8 Fig. Impact of Increasing the Number of Users on Data Rate (bps/Hz).

https://doi.org/10.1371/journal.pone.0345290.s008

(ZIP)

S9 Fig. Impact of IRS on data rate (bps/Hz).

https://doi.org/10.1371/journal.pone.0345290.s009

(ZIP)

S10 Fig. Impact of Number of Users on Satisfaction Rate (%).

https://doi.org/10.1371/journal.pone.0345290.s010

(ZIP)

S11 Fig. Impact of signal propagation, interference mitigation, and channel conditions (existing vs. proposed).

https://doi.org/10.1371/journal.pone.0345290.s011

(ZIP)

S1 Table. Comparative analysis of UAV-enabled IoT and 6G communication frameworks.

https://doi.org/10.1371/journal.pone.0345290.s012

(ZIP)

S2 Table. Comprehensive simulation parameters.

https://doi.org/10.1371/journal.pone.0345290.s013

(ZIP)

S2a Table. Reproducibility and statistical robustness reporting.

https://doi.org/10.1371/journal.pone.0345290.s014

(ZIP)

S3 Table. Reachable hops with and without RIS.

https://doi.org/10.1371/journal.pone.0345290.s015

(ZIP)

S4 Table. Average Data Rate of Existing vs. Proposed Methods.

https://doi.org/10.1371/journal.pone.0345290.s016

(ZIP)

S4a Table. Key results with statistical reporting (based on simulation parameters in S2 Table, N = 100 Monte Carlo trials).

https://doi.org/10.1371/journal.pone.0345290.s017

(ZIP)

S5 Table. Data Rate of Existing vs. Proposed Methods (bps/Hz).

https://doi.org/10.1371/journal.pone.0345290.s018

(ZIP)

S6 Table. Data Rate With and Without IRS (bps/Hz).

https://doi.org/10.1371/journal.pone.0345290.s019

(ZIP)

S7 Table. Satisfaction Rate of Existing vs. Proposed Methods.

https://doi.org/10.1371/journal.pone.0345290.s020

(ZIP)

S8 Table. Comparison of signal propagation, interference mitigation, and optimized channel conditions.

https://doi.org/10.1371/journal.pone.0345290.s021

(ZIP)

S9 Table. Comparative analysis of state-of-the-art methods against the proposed RAVP framework using aligned performance metrics.

https://doi.org/10.1371/journal.pone.0345290.s022

(ZIP)

S9a Table. Quantitative metric comparison with recent state-of-the-art methods [26–29].

https://doi.org/10.1371/journal.pone.0345290.s023

(ZIP)

S9b Table. Statistical reporting and robustness summary for S9a results (N = 100 Monte Carlo trials; fixed seeds; RL multi-run variability).

https://doi.org/10.1371/journal.pone.0345290.s024

(ZIP)

Acknowledgments

The authors would like to acknowledge the support of Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Code Availability

The simulation code and implementation scripts used in this study are maintained in a private Git repository. Access can be granted for academic and research verification upon reasonable request. A temporary private link is provided below for editorial and review purposes only: https://github.com/PREMKUMARCH/Project-code-RAVP.git (private)

Access to the repository can be enabled upon request to Premkumar Chithaluru.

References

1. Hu J, Chen C, Cai L, Khosravi MR, Pei Q, Wan S. UAV-Assisted Vehicular Edge Computing for the 6G Internet of Vehicles: Architecture, Intelligence, and Challenges. IEEE Comm Stand Mag. 2021;5(2):12–8.
- View Article
- Google Scholar
2. Guo F, Yu FR, Zhang H, Li X, Ji H, Leung VC. Enabling massive IoT toward 6G: A comprehensive survey. IEEE Internet Things J. 2021;8(15):11891–915.
- View Article
- Google Scholar
3. Zuhair M, Patel F, Navapara D, Bhattacharya P, Saraswat D. BloCoV6: A blockchain-based 6G-assisted UAV contact tracing scheme for COVID-19 pandemic. In: 2021 2nd international conference on intelligent engineering and management (ICIEM). London, UK: IEEE; 2021. p. 271–6.
4. Liu C, Feng W, Chen Y, Wang C-X, Ge N. Cell-Free Satellite-UAV Networks for 6G Wide-Area Internet of Things. IEEE J Select Areas Commun. 2021;39(4):1116–31.
- View Article
- Google Scholar
5. Li T, Mazaheri MH, Abari O. 5g in the sky: the future of high-speed internet via unmanned aerial vehicles. In: Proceedings of the 23rd Annual International Workshop on Mobile Computing Systems and Applications. 2022. p. 116–22.
6. Li M, Jia G, Li X, Qiu H. Efficient Trajectory Planning for Optimizing Energy Consumption and Completion Time in UAV-Assisted IoT Networks. Mathematics. 2023;11(20):4399.
- View Article
- Google Scholar
7. Arif M, Kim W. Analysis of Fluctuating Antenna Beamwidth in UAV-Assisted Cellular Networks. Mathematics. 2023;11(22):4706.
- View Article
- Google Scholar
8. Goh CY, Leow CY, Nordin R. Energy Efficiency of Unmanned Aerial Vehicle with Reconfigurable Intelligent Surfaces: A Comparative Study. Drones. 2023;7(2):98.
- View Article
- Google Scholar
9. Wen K, Wang S, Wu Y, Wang J, Han L, Xie Q. A Secure Authentication Protocol Supporting Efficient Handover for UAV. Mathematics. 2024;12(5):716.
- View Article
- Google Scholar
10. Borbolla-Burillo P, Sotelo D, Frye M, Garza-Castañón LE, Juárez-Moreno L, Sotelo C. Design and Real-Time Implementation of a Cascaded Model Predictive Control Architecture for Unmanned Aerial Vehicles. Mathematics. 2024;12(5):739.
- View Article
- Google Scholar
11. Yuan J, Wang X, Jin M, Liu W, Wu R, Wei Z, et al. A Novel System of Mixed RF/FSO UAV Communication Based on MRR and RIS by Adopting Hybrid Modulation. Photonics. 2022;9(6):379.
- View Article
- Google Scholar
12. Moya Osorio DP, Ahmad I, Sánchez JDV, Gurtov A, Scholliers J, Kutila M, et al. Towards 6G-Enabled Internet of Vehicles: Security and Privacy. IEEE Open J Commun Soc. 2022;3:82–105.
- View Article
- Google Scholar
13. Michailidis ET, Miridakis NI, Michalas A, Skondras E, Vergados DJ. Energy Optimization in Dual-RIS UAV-Aided MEC-Enabled Internet of Vehicles. Sensors (Basel). 2021;21(13):4392. pmid:34198977
- View Article
- PubMed/NCBI
- Google Scholar
14. Ahmed AM, Majeed SA, Dawood YS. A Survey of 6G Mobile Systems, Enabling Technologies, and Challenges. IJEETC. 2023;:1–21.
- View Article
- Google Scholar
15. Brede B, Lau A, Bartholomeus HM, Kooistra L. Comparing RIEGL RiCOPTER UAV LiDAR Derived Canopy Height and DBH with Terrestrial LiDAR. Sensors (Basel). 2017;17(10):2371. pmid:29039755
- View Article
- PubMed/NCBI
- Google Scholar
16. Saleh AM, Omar SS, Abd El-Haleem AM, Ibrahim II, Abdelhakam MM. Trajectory optimization of UAV-IRS assisted 6G THz network using deep reinforcement learning approach. Sci Rep. 2024;14(1):18501. pmid:39122828
- View Article
- PubMed/NCBI
- Google Scholar
17. Li W, Yang F, Zhong L, Wu H, Jiang X, Chukalin A. Attitude Control of UAVs with Search Optimization and Disturbance Rejection Strategies. Mathematics. 2023;11(17):3794.
- View Article
- Google Scholar
18. Verma S, Kaur S, Khan MA, S. Sehdev P. Toward Green Communication in 6G-Enabled Massive Internet of Things. IEEE Internet Things J. 2021;8(7):5408–15.
- View Article
- Google Scholar
19. Omar SS, El-Haleem AMA, Ibrahim II, Saleh AM. Capacity Enhancement of Flying-IRS Assisted 6G THz Network Using Deep Reinforcement Learning. IEEE Access. 2023;11:101616–29.
- View Article
- Google Scholar
20. Zhang Y, Chen J, Chen M, Chen C, Zhang Z, Deng X. Integrated the Artificial Potential Field with the Leader–Follower Approach for Unmanned Aerial Vehicles Cooperative Obstacle Avoidance. Mathematics. 2024;12(7):954.
- View Article
- Google Scholar
21. Yu Y, Tang J, Huang J, Zhang X, So DKC, Wong K-K. Multi-Objective Optimization for UAV-Assisted Wireless Powered IoT Networks Based on Extended DDPG Algorithm. IEEE Trans Commun. 2021;69(9):6361–74.
- View Article
- Google Scholar
22. Mohsan SAH, Othman NQH, Li Y, Alsharif MH, Khan MA. Unmanned aerial vehicles (UAVs): Practical aspects, applications, open challenges, security issues, and future trends. Intell Serv Robot. 2023;16(1):109–37.
- View Article
- Google Scholar
23. Nguyen DC, Ding M, Pathirana PN, Seneviratne A, Li J, Niyato D, et al. 6G Internet of Things: A Comprehensive Survey. IEEE Internet Things J. 2022;9(1):359–83.
- View Article
- Google Scholar
24. Sirohi D, Kumar N, Rana PS, Tanwar S, Iqbal R, Hijjii M. Federated learning for 6G-enabled secure communication systems: a comprehensive survey. Artif Intell Rev. 2023;:1–93. pmid:37362891
- View Article
- PubMed/NCBI
- Google Scholar
25. Hamid H, Begh GR. IRS assisted UAV communications for 6G networks: a systematic literature review. Wirel Netw. 2025;31(1):779–807.
- View Article
- Google Scholar
26. Pan K, Li M, Lv S, Si P, Zhang H, Yu FR. Adaptive Resource Allocation for IoT With Computing Power Network Based on RIS-UAV-Aided NOMA-THz Communication. IEEE Trans Veh Technol. 2025;74(12):19294–307.
- View Article
- Google Scholar
27. Du H, Zhang J, Guan K, Niyato D, Jiao H, Wang Z, et al. Performance and Optimization of Reconfigurable Intelligent Surface Aided THz Communications. IEEE Trans Commun. 2022;70(5):3575–93.
- View Article
- Google Scholar
28. Pan K, Li M, Sun E, Si P, Wang K, Yu FR. Intelligent resource optimization for CPN-enabled IoT by RIS-UAV-aided NOMA-THz communication. In: ICC 2025-IEEE International Conference on Communications. IEEE; 2025. p. 1–6.
29. Song Y, Jalali J, Qin Y, Darabi M, Lemic F, Famaey J, et al. Miniature UAV-Aided Cooperative THz Networks With Reconfigurable Energy Harvesting Holographic Surfaces. IEEE Internet Things J. 2025;13(2):1734–46.
- View Article
- Google Scholar

[ref1] 1. Hu J, Chen C, Cai L, Khosravi MR, Pei Q, Wan S. UAV-Assisted Vehicular Edge Computing for the 6G Internet of Vehicles: Architecture, Intelligence, and Challenges. IEEE Comm Stand Mag. 2021;5(2):12–8.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Guo F, Yu FR, Zhang H, Li X, Ji H, Leung VC. Enabling massive IoT toward 6G: A comprehensive survey. IEEE Internet Things J. 2021;8(15):11891–915.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Zuhair M, Patel F, Navapara D, Bhattacharya P, Saraswat D. BloCoV6: A blockchain-based 6G-assisted UAV contact tracing scheme for COVID-19 pandemic. In: 2021 2nd international conference on intelligent engineering and management (ICIEM). London, UK: IEEE; 2021. p. 271–6.

[ref4] 4. Liu C, Feng W, Chen Y, Wang C-X, Ge N. Cell-Free Satellite-UAV Networks for 6G Wide-Area Internet of Things. IEEE J Select Areas Commun. 2021;39(4):1116–31.
View Article
Google Scholar

[9] View Article

[10] Google Scholar

[ref5] 5. Li T, Mazaheri MH, Abari O. 5g in the sky: the future of high-speed internet via unmanned aerial vehicles. In: Proceedings of the 23rd Annual International Workshop on Mobile Computing Systems and Applications. 2022. p. 116–22.

[ref6] 6. Li M, Jia G, Li X, Qiu H. Efficient Trajectory Planning for Optimizing Energy Consumption and Completion Time in UAV-Assisted IoT Networks. Mathematics. 2023;11(20):4399.
View Article
Google Scholar

[13] View Article

[14] Google Scholar

[ref7] 7. Arif M, Kim W. Analysis of Fluctuating Antenna Beamwidth in UAV-Assisted Cellular Networks. Mathematics. 2023;11(22):4706.
View Article
Google Scholar

[16] View Article

[17] Google Scholar

[ref8] 8. Goh CY, Leow CY, Nordin R. Energy Efficiency of Unmanned Aerial Vehicle with Reconfigurable Intelligent Surfaces: A Comparative Study. Drones. 2023;7(2):98.
View Article
Google Scholar

[19] View Article

[20] Google Scholar

[ref9] 9. Wen K, Wang S, Wu Y, Wang J, Han L, Xie Q. A Secure Authentication Protocol Supporting Efficient Handover for UAV. Mathematics. 2024;12(5):716.
View Article
Google Scholar

[22] View Article

[23] Google Scholar

[ref10] 10. Borbolla-Burillo P, Sotelo D, Frye M, Garza-Castañón LE, Juárez-Moreno L, Sotelo C. Design and Real-Time Implementation of a Cascaded Model Predictive Control Architecture for Unmanned Aerial Vehicles. Mathematics. 2024;12(5):739.
View Article
Google Scholar

[25] View Article

[26] Google Scholar

[ref11] 11. Yuan J, Wang X, Jin M, Liu W, Wu R, Wei Z, et al. A Novel System of Mixed RF/FSO UAV Communication Based on MRR and RIS by Adopting Hybrid Modulation. Photonics. 2022;9(6):379.
View Article
Google Scholar

[28] View Article

[29] Google Scholar

[ref12] 12. Moya Osorio DP, Ahmad I, Sánchez JDV, Gurtov A, Scholliers J, Kutila M, et al. Towards 6G-Enabled Internet of Vehicles: Security and Privacy. IEEE Open J Commun Soc. 2022;3:82–105.
View Article
Google Scholar

[31] View Article

[32] Google Scholar

[ref13] 13. Michailidis ET, Miridakis NI, Michalas A, Skondras E, Vergados DJ. Energy Optimization in Dual-RIS UAV-Aided MEC-Enabled Internet of Vehicles. Sensors (Basel). 2021;21(13):4392. pmid:34198977
View Article
PubMed/NCBI
Google Scholar

[34] View Article

[35] PubMed/NCBI

[36] Google Scholar

[ref14] 14. Ahmed AM, Majeed SA, Dawood YS. A Survey of 6G Mobile Systems, Enabling Technologies, and Challenges. IJEETC. 2023;:1–21.
View Article
Google Scholar

[38] View Article

[39] Google Scholar

[ref15] 15. Brede B, Lau A, Bartholomeus HM, Kooistra L. Comparing RIEGL RiCOPTER UAV LiDAR Derived Canopy Height and DBH with Terrestrial LiDAR. Sensors (Basel). 2017;17(10):2371. pmid:29039755
View Article
PubMed/NCBI
Google Scholar

[41] View Article

[42] PubMed/NCBI

[43] Google Scholar

[ref16] 16. Saleh AM, Omar SS, Abd El-Haleem AM, Ibrahim II, Abdelhakam MM. Trajectory optimization of UAV-IRS assisted 6G THz network using deep reinforcement learning approach. Sci Rep. 2024;14(1):18501. pmid:39122828
View Article
PubMed/NCBI
Google Scholar

[45] View Article

[46] PubMed/NCBI

[47] Google Scholar

[ref17] 17. Li W, Yang F, Zhong L, Wu H, Jiang X, Chukalin A. Attitude Control of UAVs with Search Optimization and Disturbance Rejection Strategies. Mathematics. 2023;11(17):3794.
View Article
Google Scholar

[49] View Article

[50] Google Scholar

[ref18] 18. Verma S, Kaur S, Khan MA, S. Sehdev P. Toward Green Communication in 6G-Enabled Massive Internet of Things. IEEE Internet Things J. 2021;8(7):5408–15.
View Article
Google Scholar

[52] View Article

[53] Google Scholar

[ref19] 19. Omar SS, El-Haleem AMA, Ibrahim II, Saleh AM. Capacity Enhancement of Flying-IRS Assisted 6G THz Network Using Deep Reinforcement Learning. IEEE Access. 2023;11:101616–29.
View Article
Google Scholar

[55] View Article

[56] Google Scholar

[ref20] 20. Zhang Y, Chen J, Chen M, Chen C, Zhang Z, Deng X. Integrated the Artificial Potential Field with the Leader–Follower Approach for Unmanned Aerial Vehicles Cooperative Obstacle Avoidance. Mathematics. 2024;12(7):954.
View Article
Google Scholar

[58] View Article

[59] Google Scholar

[ref21] 21. Yu Y, Tang J, Huang J, Zhang X, So DKC, Wong K-K. Multi-Objective Optimization for UAV-Assisted Wireless Powered IoT Networks Based on Extended DDPG Algorithm. IEEE Trans Commun. 2021;69(9):6361–74.
View Article
Google Scholar

[61] View Article

[62] Google Scholar

[ref22] 22. Mohsan SAH, Othman NQH, Li Y, Alsharif MH, Khan MA. Unmanned aerial vehicles (UAVs): Practical aspects, applications, open challenges, security issues, and future trends. Intell Serv Robot. 2023;16(1):109–37.
View Article
Google Scholar

[64] View Article

[65] Google Scholar

[ref23] 23. Nguyen DC, Ding M, Pathirana PN, Seneviratne A, Li J, Niyato D, et al. 6G Internet of Things: A Comprehensive Survey. IEEE Internet Things J. 2022;9(1):359–83.
View Article
Google Scholar

[67] View Article

[68] Google Scholar

[ref24] 24. Sirohi D, Kumar N, Rana PS, Tanwar S, Iqbal R, Hijjii M. Federated learning for 6G-enabled secure communication systems: a comprehensive survey. Artif Intell Rev. 2023;:1–93. pmid:37362891
View Article
PubMed/NCBI
Google Scholar

[70] View Article

[71] PubMed/NCBI

[72] Google Scholar

[ref25] 25. Hamid H, Begh GR. IRS assisted UAV communications for 6G networks: a systematic literature review. Wirel Netw. 2025;31(1):779–807.
View Article
Google Scholar

[74] View Article

[75] Google Scholar

[ref26] 26. Pan K, Li M, Lv S, Si P, Zhang H, Yu FR. Adaptive Resource Allocation for IoT With Computing Power Network Based on RIS-UAV-Aided NOMA-THz Communication. IEEE Trans Veh Technol. 2025;74(12):19294–307.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref27] 27. Du H, Zhang J, Guan K, Niyato D, Jiao H, Wang Z, et al. Performance and Optimization of Reconfigurable Intelligent Surface Aided THz Communications. IEEE Trans Commun. 2022;70(5):3575–93.
View Article
Google Scholar

[80] View Article

[81] Google Scholar

[ref28] 28. Pan K, Li M, Sun E, Si P, Wang K, Yu FR. Intelligent resource optimization for CPN-enabled IoT by RIS-UAV-aided NOMA-THz communication. In: ICC 2025-IEEE International Conference on Communications. IEEE; 2025. p. 1–6.

[ref29] 29. Song Y, Jalali J, Qin Y, Darabi M, Lemic F, Famaey J, et al. Miniature UAV-Aided Cooperative THz Networks With Reconfigurable Energy Harvesting Holographic Surfaces. IEEE Internet Things J. 2025;13(2):1734–46.
View Article
Google Scholar

[84] View Article

[85] Google Scholar