Capsule-based federated reinforcement learning adaptive sliding mode for anomaly detection and control of floating wind turbines

Hadi Mohammadian KhalafAnsar; Jafar Keighobadi; Mohsen Shahhosseini

doi:10.1371/journal.pone.0336410

Abstract

Floating wind turbines (FWTs) are now recognized as one of the most effective and affordable renewable energy sources. However, their performance is strongly influenced by dynamic environmental conditions, particularly sea waves under significant oscillatory conditions. Ocean wave and wind disturbance affect turbine positioning, underscoring the critical essential for adaptive and robust control mechanisms to manage the unpredictable external inputs. In this context, we present an innovative method based on federated deep learning for training capsule networks to detect disturbances and enable adaptive robust control of FWTs among the environmental uncertainty. Through the proposed technique, a unique mixture of sliding mode control and deep reinforcement learning (DRL) yields in the extraction of wide features and modeling of spatial relationships between sensor data in the capsule networks framework. Furthermore, by employing federated learning, the capsule-net model is trained in a distributed manner across multiple wind turbines. Therefore, enhanced accuracy and effectiveness of disturbance detection are guaranteed. Simulation results reveal effective identification of disturbances which in turn improves the performance and stability of FWTs under the coarse environmental situation. The global Lyapunov stability analysis proves the FWTs’ closed-loop stability. Performance of the superior DRL is evaluated in comparison with a radial basis function neural network (RBFNN) estimation. The innovative DRL method represents a significant advancement in the control of FWTs as a high potential of development for intelligent management of similar systems. As a final aim, this research work finds out the reliability and efficiency of FWTs in variable weather conditions (short-term) and erratic ocean environments (long-term). Moreover, the control system makes a substantial impact on the sustainable development of the wind and renewable energy sector.

Citation: Mohammadian KhalafAnsar H, Keighobadi J, Shahhosseini M (2025) Capsule-based federated reinforcement learning adaptive sliding mode for anomaly detection and control of floating wind turbines. PLoS One 20(12): e0336410. https://doi.org/10.1371/journal.pone.0336410

Editor: Zhipeng Zhao, Tongji University, CHINA

Received: August 6, 2025; Accepted: October 24, 2025; Published: December 4, 2025

Copyright: © 2025 Mohammadian KhalafAnsar et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the manuscript. The minimal dataset necessary to replicate the study findings has been deposited in the Zenodo repository (https://zenodo.org/) under DOI: [https://doi.org/10.5281/zenodo.17519490].

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Fossil fuels as the worldwide energy source along recent decades lead to polluted air, soil degradation, groundwater pollution, and greenhouse gas emissions that directly threaten the health of ecosystems and human communities. The combustion of fuels releases about 35 gigatons of carbon dioxide into the atmosphere annually, which accounts for nearly 89 percent of total global emissions. Given the lower level of natural processes, like ocean absorption, in reduction of this destructive phenomenon, the carbon concentration increases in the atmosphere each year. Such a growth in greenhouse gases brings some consequences including global warming, ocean acidification, and significant climate changes. The impact on human life is also significant; air pollution caused by burning fossil fuels affects millions of lives annually, and it is estimated that reduction in the use of these resources could save 3.6 million lives worldwide each year [1–3].

To overcome the mentioned threats, the global community has initiated a significant transition from fossil fuels to renewable energy sources. International agreements, such as the Paris Agreement and the United Nations Sustainable Development Goals, put emphasis on the requirement of decreasing dependence on non-renewable energy and substitute clean energy. Countries are investing in wind, solar, geothermal, and hydroelectric power projects to balance some energy demand currently afforded by fossil fuels. Renewable energy sources now comprise approximately 18 percent of total energy consumption, and this share is likely to increase significantly in the next decades. Among the alternative technologies, FWTs have emerged as a promising innovation, empowering the harnessing of strong and consistent winds in deep-water areas. This reduces reliance on shore-founded installations and minimizes the visual impact of offshore wind farms. Furthermore, relocating wind farms farther offshore not only provides access to more stable wind resources but also enhances opportunities for new transportation routes and fisheries [1–3].

Despite their abundant advantages, the steady process of FWTs encounters momentous challenges owing to the high variability of wind speeds, wave turbulence, and environmental degradation. Therefore, the development of progressive control methods is essential to ensure such systems remain dynamic and energy-efficient. Various methods have been proposed in recent years to enhance the reliability of FWTs. Among them, fractional-order sliding mode controllers (FOSMC) have obtained particular importance for their ability to improve transient response and keep the system stable in the presence of disturbances. Additionally, the application of fractional calculus in load frequency control (LFC) of island microgrids has validated that the above methods can more effectively overturn fluctuations caused by renewable energy sources. The use of meta-heuristic algorithms, such as the Sine Cosine Algorithm (SCA) and Harmony Search (HS), to fine-tune control parameters also enables efficient system control under unstable conditions [4].

Based on the mentioned techniques, further research explored fractional-order control methods for chaotic systems, such as stabilizing laser systems using fractional-order PID sliding mode control (SMC). This approach employs fractional derivatives to decrease chattering noise, to cause smoother control and to enhance the stability without requiring many computational resources. Moreover, adaptive-SMC was developed to address synchronization challenges in fractional-order chaotic neural network systems, overcoming issues such as input saturation and delays commonly encountered in real-world applications. Recently, these methods have been extended to synchronize chaotic systems in power networks, providing a promising foundation for controlling FWTs, where synchronization and stability are critical [5–7].

More generally, traditional feedback control techniques face problems in systems that are subject to multiple non-linearities and externalities. Consequently, adaptive control systems are used which automatically adjust the controller settings in response to changes in the system’s conditions. Sliding-mode controllers (SMC) are usually used when stability and resilience to failure are a priority. These controllers use sliding techniques and include adaptive mechanisms to ensure the stability of the system. Intelligent control algorithms, such as neural networks and fuzzy logic, have been developed for managing the uncertainties in the system’s performance and have proved their worth in a wide range of applications. In the area of floating wind turbines, adaptive control algorithms, including adaptive sliding mode control and neural network controllers, have been developed. Recent studies have also examined the use of reinforcement learning (RL) to control variable-speed wind turbines, adapt controllers to fluctuating wind conditions, and manage doubly fed induction generators in real-time. Hybrid intelligent controllers that combine conventional PID regulators with fuzzy systems have shown promising results in reducing turbine loads and increasing power output [8–15]. In addition to control problems, accurate modelling of FWT dynamics is an active field of research. Recent work on fractional time equations and mesh-free methods has provided promising approaches to modelling FWT. Polynomial fractional time equations, particularly those using the Caputo derivative, offer great flexibility for dynamic systems modelling [16]. For complex geometry, mesh-free methods such as the Local Radial Point Interpolation Method (RPIM) are ideal, but may be computationally expensive [17]. Another option is the Predynamic Differential Operator (PDO), which is good for simulating complex, highly interacting systems such as FWTs. However, the defining functions at each node may increase computational complexity [18]. Spectral collocation methods, especially those utilizing polynomials, show potential to solve time-fractional diffusion equations with stochastic time-fractional coefficients, and to model unpredictable and noisy dynamics of FWTs. These methods offer high accuracy; they come with computational costs and sensitivity irregular data [8].

However, a review of the technical literature reveals a clear gap in the research: the lack of a unified control framework combining the high reliability of conventional controllers with the advanced intelligence and distributed learning capabilities of FWTs. To fill the gap and provide a comprehensive solution, this study proposes a multi-faceted approach to the control and stabilization of floating wind turbines. This research develops an adaptive sliding mode controller using deep reinforcement learning and real time data feeding of dynamic tuning. The main innovations of this research are listed as:

Use of capsule networks for precise identification of disturbances instead of traditional neural networks
Application of federated learning for distributed training of controllers in order to preserve data privacy and increase scalability
Integration of sliding mode control with deep reinforcement learning to increase the stability and adaptability of the system to different environmental conditions.
These collective innovations cause to decrease of unwanted oscillations as chattering phenomena and increase of durability of mechanical structures, more energy efficiency. Besides, robust stability in the presence of uncertainty and disturbances is acquired based on the Lyapunov method. Hence, the results of this research work may serve as an effective step toward the sustainable development of renewable energy and the global movement to reduce dependence on fossil fuels.

Materials and methods

The main objective of this research is to design a control system that can keep the FWT stable against uncertain and variable conditions of waves and wind in ocean. Our control system is a hybrid framework that leverages the power and stability of classical control methods along with the captured intelligence of machine learning. The components of this framework are explained in a logical order as following steps of an applicable algorithm. Then, we go in details. This research did not require ethics approval as it involved only computational simulations and theoretical analysis without any human or animal participation.

Step 1: Control Foundation - Adaptive Sliding Mode Control (ASMC)

The main foundation of our controller is constructed based on SMC as an ideal choice for under disturbance offshore FWTs owing to robustness against external disturbances and model uncertainties. In simple terms, the SMC push the plant to move on a predefined and stable sliding surface path and stay on the surface as well. In a standard SMC, we need to have in advance the maximum possible disturbance bound, which is an unknown value in practice. To solve this problem, we design an adaptive control mechanism in which the controller should adjust characteristic parameters online. Therefore, according to the changing conditions tolerating the hypothesized offshore FWT, a real time control process is carried out without a priori knowledge of the disturbances.

In the main term − ρ∣∣s∣∣s, of the SMC responsible for trajectory tracking in the presence of disturbances and uncertainties, we aim intelligent and optimal estimate of the parameter ρ which represents the disturbance index at each moment.

Step 2: The Brain - Deep Reinforcement Learning (DRL) Neural Network

To solve the challenge of estimating ρ, we design a “smart brain” for the SMC gathered with the DRL. This neural network continuously observes the turbine state through sensor data of rotation, yaw angle and accordingly learns how to best adjust the parameter ρ to both maintain stability and prevent unwanted chattering oscillations. This approach transforms our controller from a merely robust system to an intelligent and self-regulating system.

Step 3: Innovations in network architecture

To increase the efficiency of this “smart brain,” we implemented two key innovations in its architecture:

1. Using a capsule network for accurate disturbance detection

Instead of using conventional neural networks, we used a capsule network to analyze the input data from sensors. Turbine sensor data for surge, sway, pitch, roll, heave, and yaw have complex spatial relationships with each other. Capsule networks, unlike traditional networks, are able to understand these hierarchical and spatial relationships. This feature allows complex disturbances caused by wind and waves to be detected with much higher accuracy.

2. Using federated learning for distributed training

In a wind farm, several turbines are operating simultaneously. To train a powerful model, data from all turbines would be used. Continuously sending huge amounts of raw data from each turbine to a central server is both costly and privacy- insecure. In this method, each turbine trains its own smart model locally through its own data. Then, merely the updated model parameters without the raw data are sent to a central server to be aggregated to form an improved global model. This global model is then fed back to all turbines. This process allows the final model to benefit from the “collective experience” of all turbines without violating data privacy.

Step 4: Mathematical proof of system stability

After designing of a complex system, a fundamental question about the stability of control system remains. Using the Lyapunov Stability Analysis as a standard and powerful mathematical method in control engineering, the system’s performance is assessed by a pseudo-energy function. The positive definite candidate for Lyapunov function yields in a negative definite time derivative showing that the system is stable. Therefore, all internal signals remain bounded, and the tracking error tends to zero, i.e., the controller will steer the turbine exactly to the desired path over time. This mathematical proof ensures the stability and reliability of our proposed control framework.

Modeling of the online adaptive intelligent control

In this section, we derive the governing equations prior to design of the feedback control system. Subsequently, an in-detail proof is provided to examine the convergence of the system’s weighting parameters and the asymptotic stability of the tracking error, i.e., the off-track of the actual states of system from corresponding desired values. Now, the assumed model of the multi-input FWT system is expressed as [9,14]:

(1)

With respect to Eq. (1), the reference model for the aforementioned adaptive control system is applied to generate tracking trajectories:

(2)

We establish the following assumptions for our analysis:

The eigenvalues of user defined matrix are located in the left−half of the complex Laplace plane, and the signal function is energy bounded.
There exist constant matrices, defined as and , such that:

The matrix is positive-definite.

Consequently, the ultimate upper bound of the uncertainty and disturbance is the next user-defined known and bounded parameter. This quantitative aspect implies that the system’s robustness against uncertainties and disturbances is guaranteed as long as they remain within this established bound:

Regarding the tracking error as [14]:

(3)

Taking the time derivate of Eq. (3) gives dynamics of the tracking error as:

(4)

By defining the integral surface as follows:

(5)

the sliding surface kinematics is obtained as:

(6)

To figure out the equivalent control input, yields [19]:

(7)

where and .

In this context, the proposed control signal is formulated as follows:

(8)

To address the innovation of the proposed sliding surface and the adaptive controller , the sliding surface introduces an integral term that dynamically adjusts the system’s trajectory by considering the history of states within the sliding window. This enhances robustness against disturbances and uncertainties. Unlike conventional sliding surfaces that rely solely on the current tracking error, the proposed surface allows for reduced sensitivity of the overall control system to sudden changes, as well as high adaptability over time. The designed adaptive controller incorporates real-time adaptation of the gains and in response to variations in the system dynamics and disturbances. The final term in the control law ensures that the control input effectively counteracts large deviations, which in turn leads to reduced chattering by modulating the control force based on the magnitude of the sliding surface. This approach provides a more nuanced response compared to conventional adaptive controllers, resulting in improved stability and performance under a broader range of operational conditions.

SMC is well-regarded for its robustness against uncertainties and external disturbances, making it a natural choice for controlling the complex dynamics of FWTs. However, standard SMC can suffer from the chattering phenomenon and requires precise model information, which can be challenging to obtain in real-world scenarios. On the other hand, DRL excels at learning optimal control strategies from data, adapting to changing conditions, and managing complex, high-dimensional systems. By integrating DRL with SMC, we aimed to create a control system that leverages the robustness of SMC while utilizing DRL to adaptively tune the controller, reducing chattering and improving performance under varying operational conditions. This approach addresses the specific challenge of maintaining stability and performance in the highly uncertain and dynamic environment of FWTs, where traditional control methods may fall short. By implementing an adaptive sliding mode (ASM) controller with DRL, the hybrid system becomes robust against any uncertainties or disturbances. Fig 1 depicts the block diagram of the developed closed-loop system in MATLAB^© Simulink. The implementation comprises several essential blocks, detailed as follows:

Download:

Fig 1. The block diagram of the simulated system in MATLAB^©.

https://doi.org/10.1371/journal.pone.0336410.g001

The adaptive laws are used to calculate the coefficients in Eq. (8). This block plays a key role in the dynamic adjustment of the controller parameters according to changing system conditions. The controller block is responsible for computing the final control signal after receiving the coefficients from the Adaptation Block. It is a key component of the closed-loop system and ensures effective control in response to the dynamic nature of uncertainties and disturbances. The hierarchical relationship among the blocks in MATLAB© Simulink offers profound insight into the holistic approach used to address uncertainties and manage system inconsistencies. The described methodology combines the advantages of adaptive control in sliding mode and DRL, resulting in a control system that dynamically responds to the complexities of real-world applications. The DRL neural network block estimates the upper bounds of uncertainty and interference based on adaptive laws derived from the Lyapunov function.

The Lyapunov function plays a central role in deriving the adaptation rules within a system. Therefore, by assuming a Lyapunov candidate function with positive-definite properties, the stability proof is terminated when the derivative of this function becomes negative semi-definite. This is a critical point in the analytical evaluation, where the Lyapunov function serves as the key mathematical tool for determining the stability and convergence characteristics of the adaptive system [9,14].

Substitution of the estimated values and in nominal control Eq. (8):

(9)

Substituting this control input into the original system dynamics yields the expected change in the system’s behavior.

(10)

Given the expression above for the derivative of the state variables, we can deduce the dynamics of the tracking error:

(11)

DRL neural network

DRL value will be utilized to estimate the uncertainty and disturbance upper limit denoted by . The neural network employed for this purpose comprises several layers, as illustrated in Fig 2. This study employs a hybrid architecture combining a Capsule Network and DRL to estimate the disturbance effects on a floating wind turbine. The model is trained using federated learning and leverages the advantages of the Capsule Network for hierarchical modeling and robustness to perturbations. The input consists of six system status variables collected from the wind turbine’s sensors. These variables include surge, sway, heave, roll, pitch and yaw. The input data is represented by the vector:

Download:

Fig 2. Disturbance estimation for floating wind turbine using capsule networks and Actor-Critic federated learning.

https://doi.org/10.1371/journal.pone.0336410.g002

where is the vector of the sensor data and index 6 is the number of input elements. The orange capsules are processing the input functions as shown. Each capsule stores data on part of the spatial structure as a vector. The properties of each layer are mapped to the predictions of the next layer by means of the transformation matrix . In a capsule network, the routing-by-assignment mechanism adjusts the routing weight according to the similarity and the concurrency of the capsules. In this mechanism, the output of each capsule i is transformed into a capsule j by means of a transformation matrix . The weighting is adjusted by a concave algorithm between the output vector of the capsules [20].

In the Capsule Network routing process, the initial weights of are usually initialized in a uniform way. These weights are then iteratively adjusted by agreement between the vector(s) of the predictor . The following update rule applies [20]:

The arrows in Fig 2 represent the collection of votes and the transfer of these votes to the output capsules. The agreement between capsule inputs is calculated and the weightings updated. This process continues until convergence or predetermined number of iterations have been achieved.

The middle capsule layer is responsible for modelling the middle level functions. This layer receives information from the convolution layer and transfers it to the output capsule layer. As the arrows in Fig 2 show, the characteristics of the input capsules are converted into predictions of the output capsules (the votes). Transformation matrices are used to perform these transformations . Each input capsule votes for a different output capsule. These votes, , are sent to the intermediate layers for combining and fine-tuning. The output of this layer is composed of three capsules which predict the primary error values of the following variables . Final output is defined by vector:

The activation of each output capsule is calculated as follows:

where is the output vector and σ is the squashing function (similar to the sigmoid function).

The Capsule network acts as an actor in the DRL architecture. The output values of the capsules are called actions. They are defined as follows:

Here, represents the action in step , and represents the function of the actor network. The critic network evaluates the performance of the actor and calculates the value function as follows:

where is the function of the critical network.

The Federated learning is integrated in the network modelling as shown in Fig 2 and a pseudo code for this algorithm is given in Table 1 where are the weights of the model in round , is the number of data samples in client , is the cost function of the model for the mini-set . In this algorithm, the server periodically receives the updated models from the clients and calculates their average to create a new model. Each wind turbine trains its local capsule model on the following local data:

Download:

Table 1. Federated averaging algorithm pseudocode.

https://doi.org/10.1371/journal.pone.0336410.t001

where is cost function and stands for turbine model parameters.

Local parameters of the turbines are sent to the central server and summarized:

where is the number of turbines.

This hybrid architecture, which makes use of capsule networks and deep reinforcement learning, provides an optimal method for predicting the faults in floating wind turbines.

Stability analysis

The final threshold for interference is expected to be:

(12)

In the described system, the input is the state variables, and the coefficients are given by and serve as weights. The output of the second layer is equal to the function .

Several key assumptions are taken into account according to [9,14,19], which is prevalent in literature review:

Optimal network weight values shall be determined by meeting the following equality criteria:

(13)

These assumptions are an additional basis for analyzing and investigating the behavior of the system and the optimization criteria.

The final threshold value relates to the following inequality:

(14)

Based on the derivative of the sliding surface as expressed in Eq. (15), we can determine the Lyapunov function as shown in Eq. (16):

(15)

(16)

This formulation is motivated by the desire to use the knowledge obtained from the sliding surface derivative to develop a Lyapunov function that is consistent with the stability analysis of the system. This encourages careful consideration of system dynamics and allows for evaluation of stability parameters. The choice to base the Lyapunov function on the derivative of the sliding surface shows a structured and systematic analysis which makes the theoretical framework robust and reliable.

We consider the selection process as an additional criterion and highlight its positive definitional nature. In this context, we envisage the following:

Taking the derivative of the expression Eq. (16) in time yields:

(17)

Substituting Eq. (15) in Eq. (17) results in:

(18)

For simplicity, the terms in brackets are set to zero. The traces of the internal expressions are expected to be equal, since the expressions concerned must maintain an internal trace balance. The reflection () returns the following:

(19)

The following results are obtained from Eq. (19) to calculate the values of and using the invariance property of the constant, i.e., applying a certain operation to the constant does not change its value.

(20)

Thus, the following results are obtained from the simplification of the adaptive rules Eq. (20). Substituting the rules into the derivative of the Lyapunov function, the resulting expression will be:

(21)

Given the above data in which for each variable it is set that it is less or equal to the norm value of the corresponding variable, it can be concluded that the next result will look like:

(22)

Where and were replaced by and , correspondingly using Eqs. (12) and (13). The value inside the brackets in this specific case is set equal to zero to obtain the formula allowing to express the adaptive rule for changing the weights in the neural network:

(23)

The reasoning is continued with excluding the weight factor from the derivative of the Lyapunov function, combined with the norm inequality surpassing the value of the variable. The purpose of this method is to differentiate and improve the analytical logic, while creating an opportunity to include more sophisticated analysis of mathematical issues:

(24)

Given the negative sign of the derivative of the Lyapunov function, it is seen that the parameters , and converge to zero. Consequently, the values of the parameters are bounded. This deduction is extracted from the derivative equation of the sliding surface that requires bounded. Integrating both sides of Eq. (24) given that:

Considering that and are finite, the integral is finite as well. Therefore, the combination of the finiteness of the integral and the derivative of the sliding surface, it is possible to conclude that the sliding surface, which is , “will vanish asymptotically” due to Barbalat’s lemma. This demonstrates that with the convergence of the sliding surface to zero, the error will tend to zero, which is shown in Eq. (5).

Results and discussion

The analysis of the FWT is well established, based on the NREL model of design in the USA. Three buoyant cylinders with a triangular shape and a central cylinder for the control tower were designed as a model of a turbine. The whole system weighs 13.5 kilotons and is purposely located 13.46 meters below sea level. Fig 3 represent design highlights of the complexity of the system. The design incorporates aerodynamic forces, buoyancy forces, linear rope forces and drag and inertia forces, each of which generates a torque, but the torque is omitted for reasons of schematic simplicity and focus [21].

Download:

Fig 3. General diagram of the nonlinear FWT.

https://doi.org/10.1371/journal.pone.0336410.g003

In order to formulate the motion equations for the system in question, it is necessary to define the key components, including the state variables named as and the control inputs named as and the disturbances introduced by and The non-linear function enclosing the system dynamics, named as can be briefly expressed as [3]:

(25)

Summing the force equation gives acceleration. In Eq. (25), the sum of all the forces on the structure can be expressed as.

(26)

In the given equation, the symbol m_g represents the total mass of the plant, and the symbol I_3 × 3 represents the unity matrix. The expression for the torque, f_F (x, u, v, w), is calculated as the sum of all the forces acting on the system.

The torque expression, f_T (x, u, v, w), derived from angular accelerations, is computed as the sum of all torques induced by the forces exerted on the plant:

(27)

In this context, the symbol I_g represents the tensor of inertia around the vertical axis, while the symbol 𝑅 represents the matrix of transformation. The term T_j (x, u, v, w) encapsulates all the moments of force that are exerted on a structure. The following derivations are given for the expressions f_1Q (x, u, v) and f_2Q (x, u, v,).

(28)

The calculation of aerodynamic efficiency is approached by taking C_p as the coefficient of performance as defined below:

(29)

The paper proposes that the defined paths for the required system movements are structured in a way that optimizes the behavior of the system to achieve maximum uniform energy. In mathematical terms, the pursuit of a maximum is conventionally expressed as the determination of the derivative of a function as being equal to zero. Therefore, this principle applies to the derivative of Eq. (29) both for countries and input variables, which results in the following expressions:

(30)

Given the minimal effect of state variable variations on captured energy, it is acceptable to ignore the state component of Eq. (30) and replace the relevant input(s).

(31)

The optimization target refers to the variable amount of angle β as the central control target:

(32)

The adjustment of the generator torque, the final control input, is performed in response to the generator speed fluctuations. In practical applications, the main principle of generator torque regulation is to maintain a constant rotor speed. This operational strategy is in line with the wider objective of achieving stability and control over the dynamic behavior of the system:

(33)

which gives:

(34)

By solving Eq. (34), the computation of the generator torque is achieved. In addition, the ultimate goal was to find an angle of yaw that corresponded to the direction of the wind. However, due to the limitations of the actuator, the use of the average path is required to ensure a smooth and easy implementation of the action. In addition to the nonlinear model controller, the NREL 5-MW controller defines the PI controller for floating offshore wind turbine (FOWT) as follows:

and

In this equation, is the inertia of the rotor, is the nominal speed of the rotor, and is the gear ratio. The parameters of the variables and are user-defined tuning variables. In addition, the coefficient of aerodynamic power, ∂P/∂β(v), is a wind speed dependent metric that determines the sensitivity of aerodynamic power to the angle of the blade. Traditionally, this sensitivity component has been assessed by using numerical linearized wind turbine models and an aerodynamic solver such as OpenFAST. Even for those familiar with this approach, obtaining these models and carrying out the subsequent analysis necessary to define this concept may prove to be time-consuming, inter alia. Given the system characteristics in Table 2, figure. Fig 4 shows a MATLAB simulation of FOWT, comparing the nonlinear controller proposed in this work with the PI gain controller and the output oscillation by using the controllers described above. The modest amount of variation between NREL’s reported work and our model demonstrates the correctness of the strategy given in this study.

Download:

Table 2. FWT’s properties.

https://doi.org/10.1371/journal.pone.0336410.t002

Download:

Fig 4. Desired system simulation in MATLAB for a) translational, b) rotational states.

https://doi.org/10.1371/journal.pone.0336410.g004

In this section we aim to examine the performance of the proposed controller by a thorough analysis of the proposed FWT. In addition, we rewrite the control equations in section Methods and modeling as MATLAB software-adapted, so that further implementation and analysis can be done in Simulink. The process described will allow a thorough and systematic assessment of the advantages and disadvantages of the controller, its behavior under various conditions and the actual dynamic processes taking place in the FWT. The use of MATLAB^© and Simulink provides a powerful computational framework in which to perform accurate and insightful simulations to determine the effectiveness of the proposed controller in the timely regulation of the dynamics of FWT.

The dynamic system of FWT of this paper is defined as follows:

where and are defined as follows.

Our controller’s enhanced smooth trajectory is attributed to its capability in accurately and instantaneously estimating complex disturbances and responding intelligently to them. This high precision is achieved through our DRL agent and two key technologies:

Capsule Network: This model creates a deep and multi-dimensional understanding of wind and wave conditions, modeling the system’s state far more accurately than linear models.

Federated Learning: This framework, by leveraging the collective experience of all turbines in a wind farm, renders the estimation model highly robust and generalizable.

Consequently, the combination of this precise disturbance estimation and the guaranteed stability of the SMC yields a more stable and smoother performance compared to the linear PI controller.

The vector represents the dynamic states of FWT, which are essential for characterizing its motion and overall behavior. The desired FWT trajectories are generated using inputs from the proposed adaptive controller, and these trajectories serve as a critical benchmark for evaluating the effectiveness of the new control strategy presented in this paper. By employing adaptive laws derived from Lyapunov theory, the controller successfully ensures that the tracking error is minimized, as illustrated in Fig 5. The results demonstrate that the tracking error consistently remains close to zero, thereby confirming both the reliability and the accuracy of the adaptive control method.

Download:

Fig 5. State tracking errors using RBFNN- and DRL-based controller (red diagrams are RBFNN and blue ones are DRL based).

https://doi.org/10.1371/journal.pone.0336410.g005

Furthermore, a comparative analysis highlights the superior performance of DRL-based approaches over radial basis function neural networks (RBFNNs). In particular, DRL methods exhibit faster convergence rates and improved tracking accuracy, enabling the system to achieve the desired response with minimal fluctuations. This enhanced stability and robustness emphasize the advantages of DRL controllers in managing the complex dynamics of FWT systems, while also underscoring their suitability for practical deployment in real-world scenarios.

In addition, the controller design is rigorously validated by demonstrating that its surfaces asymptotically converge to zero. This behavior is illustrated in Fig 6, where the continuous curves show the temporal evolution of the surfaces. The graphical results clearly confirm that all surfaces converge to zero, thereby validating the robustness of the controller design. In this context, the phrase asymptotically converges to zero is used interchangeably with converges to zero, emphasizing the systematic reduction and quantification of tracking errors. The presented evidence highlights that the controller surfaces were intentionally designed to achieve this property, reinforcing the overall effectiveness of the proposed control strategy.

Download:

Fig 6. RBFNN- and DRL-based defined sliding surfaces (red diagrams are RBFNN and blue ones are DRL based).

https://doi.org/10.1371/journal.pone.0336410.g006

Furthermore, the comparative results between the RBFNN and the DRL-based control system gives additional support for the advantages of DRL. The results reveal that DRL enables faster tracking with fewer fluctuations, allowing the system to reach the desired response more efficiently. This reduction in oscillations not only increases tracking accuracy but also extends actuator lifespan by minimizing excessive wear. Taken together, these findings underscore the superiority of DRL in delivering accurate and stable control performance, establishing it as a strong candidate for highly dynamic and complex control systems.

Finally, within the control task framework, three distinct control inputs are required to achieve the results presented in Figs 5 and 6. The corresponding control actions are depicted in Fig 7, which illustrates the nacelle pitch and yaw angles along with the generator torque. Specifically, the pitch angle should exhibit only small fluctuations around zero degrees, the yaw angle should stabilize at an average of around 4 degrees, and the generator torque should oscillate around 1.5 × 10⁴. Overall, these coordinated control actions yield the performance results highlighted above.

Download:

Fig 7. Pitch and yaw angles of the nacelle and torque of generator as control actions.

https://doi.org/10.1371/journal.pone.0336410.g007

The dynamics of a FWT involve complex interactions between the turbine and the floating platform. To simplify the analysis, a linear model can be employed to capture the fundamental behavior of the system. In this framework, the floating platform can be represented as a spring–damper system, where wave forces and the turbine’s mass govern its motion. The turbine itself may be modeled as a rotor system that incorporates both rotor and generator dynamics. Furthermore, the interaction between the platform and the turbine introduces coupling effects, such as the forces exerted by the rotor on the platform and the reactive forces of the platform on the rotor. The motion of the floating platform in response to wave excitation can therefore be expressed by the following equation:

where is the mass of the platform, is the damping coefficient, is the spring constant, is the displacement of the platform, is the external wave force, is the force exerted by the turbine on the platform.

The dynamics of the turbine rotor can be modeled as:

where: is the moment of inertia of the turbine rotor, is the damping coefficient of the turbine, is the spring constant related to the turbine’s stiffness, is the rotational displacement of the turbine rotor, is the torque from the wind, is the torque exerted by the platform on the turbine.

The coupling effects between the turbine and the platform can be incorporated as:

where is the coupling stiffness constant.

Assuming small oscillations, we linearize the system around its equilibrium points. The equations of motion become:

We transform these equations into the frequency domain using either the Laplace or Fourier transform. For simplicity, sinusoidal inputs are assumed, and the steady-state response is derived as:

where and are the transfer functions for the platform and turbine, respectively, is the angular frequency of the excitation, and are the Fourier transforms of and , respectively.

The amplitude response of the system is determined by the magnitude of its transfer function:

The peak amplitude arises when the denominator is minimized, which depends on the system’s natural frequency and damping ratio. The frequency at which this maximum amplitude occurs is referred to as the resonant frequency. To provide deeper insight into the energy consumption of a FWT and its relationship to the system’s nonlinear vibration characteristics, Fig 8 presents the spectral distribution of the dynamic response of the uniform structure across all degrees of freedom, plotted on a logarithmic scale. This figure highlights the frequency characteristics of the system, which are critical for understanding its potential in energy recovery. Furthermore, Table 3 lists the extracted natural frequencies, offering a clear reference for the frequencies considered in the analysis. Together, these additions strengthen the demonstration of the analogy between the spring–mass system and the frequency-dependent energy recovery characteristics of FWT.

Download:

Table 3. Extracted natural frequencies.

https://doi.org/10.1371/journal.pone.0336410.t003

Download:

Fig 8. Spectrum frequency of dynamic response for faultless structure in each degree of freedom (logarithmic scale).

https://doi.org/10.1371/journal.pone.0336410.g008

This study also evaluates the use of MEMS-based systems, piezoelectric biosensors, and periodic MEMS methods for stabilizing wind turbine structures. Aerodynamic and acoustic measurement systems based on MEMS offer considerable benefits for wind turbines because of their small size, excellent sensitivity, and durability in extreme conditions. These MEMS sensors measure aerodynamic forces by analyzing pressure distributions through the equation , assuming the known parameters , , and as the density of air, the speed of the wind, the lift coefficient, and the area of the blade, correspondingly. They also detect acoustic signals, where sound intensity is linked to acoustic pressure by , where , stand for the density of air and the velocity of sound, correspondingly. Data from MEMS accelerometers and microphones are utilized for monitoring vibrations and optimizing control systems, thus improving turbine performance and stability. Piezoelectric biosensors serve to track the structural stability and facilitate energy collection in floating wind turbines. These sensors transform mechanical strain into electrical signals, with the voltage expressed as with denoting the piezoelectric strain coefficient and the force applied. Furthermore, they are capable of collecting energy, which is determined as , assuming and as capacitance and voltage, respectively. In FWTs, piezoelectric sensors are crucial for identification of modifications in structural loads and vibrations, and the harvested energy can be derived from small turbine systems, improving efficiency and minimizing maintenance requirements. Grasping periodic solutions in MEMS is crucial for predicting and improving their performance in FWTs, especially in dynamic scenarios like varying wind speeds. Periodic behavior in MEMS can be examined using techniques like harmonic balance and Floquet theory. In a simple harmonic oscillator model, the equation that describes it is a differential equation:

where is the mass, is the damping coefficient, is the spring constant, is the amplitude of the force, and is the excitation frequency. The steady-state solution is:

with being the phase shift computed from:

Analysis of these solutions helps to assess the stability of MEMS devices under regular loads and guides the optimization of design parameters to avoid resonance and to ensure a reliable operation under varying conditions. This analysis is essential for the design of MEMS that can cope with the dynamic loads encountered in the applications of wind turbines.

The resonance peaks of the linearized model over the system’s natural frequencies in Table 3 reveal identification of critical structural vulnerabilities corresponding to wind and wave disturbances. The proposed power optimization controller through combining SMC and DRL mitigates these resonant oscillations and hence prevents energy stuck in the system’s natural modes. Unlike conventional linear controllers, which are limited to narrow operating ranges, our nonlinear controller ensures stability and robustness across diverse operational conditions through two synergistic mechanisms:

SMC Component: Utilizes a well-defined sliding surface to asymptotically drive the system toward equilibrium while maintaining robustness against parametric uncertainties.
DRL Component: Dynamically adapts SMC parameters in real time to environmental variations (e.g., fluctuating wind speed and wave height), ensuring optimal oscillation damping at critical frequencies.

To validate the controller’s efficacy, we injected a frequency-sweep chirp signal into the closed-loop system and analyzed its frequency response. Key findings from the Bode diagram (Fig 9) include:

Download:

Fig 9. Performance analysis of the proposed SMC-DRL compared to the PID controller and open-loop mode.

https://doi.org/10.1371/journal.pone.0336410.g009

The open-loop system exhibits pronounced resonance peaks at natural frequencies (e.g., Pitch and Surge), indicating high susceptibility to wave-induced disturbances.
A conventional PID controller normally attenuates these peaks, however our SMC-DRL controller suppresses resonance amplitudes by more attenuation about 20% in across all state variables, Surge, Sway, Heave, Roll, Pitch, Yaw, as depicted in the Resonance Peak Comparison bar graph at bottom-right of Fig 9.

Furthermore, the time-domain response to the chirp signal (bottom-left) demonstrates the controller’s adaptability to time-varying disturbances. While the PID controller yields significant oscillations, the SMC-DRL system maintains minimal amplitude deviation, underscoring its superiority in:

Stability margin enhancement (reduced resonance energy transfer),
Disturbance rejection (critical for power generation optimization under dynamic conditions).

These results conclusively establish the controller’s ability to decouple disturbance energy from structural modes, achieving robust performance in realistic marine environments.

Conclusions

In this study, a novel intelligent control framework was presented to stabilize and optimize the performance of FWT under uncertain environmental situations. The main goal is to overcome the challenges arising from nonlinear system dynamics and unpredictable wind and wave forces to maximize the efficiency and sustainability of clean energy exploitation. Our innovative framework combines the inherent stability of ASMC with the learning and optimal decision-making capabilities of DRL. To increase the accuracy of disturbance detection, a capsule network was used instead of traditional neural networks, which is capable of figuring out spatial and hierarchical relationships between sensor data. Furthermore, by employing federated learning, it is possible to train a robust model in a distributed manner on multiple turbines, which increases the overall accuracy of the system while preserving data privacy. Lyapunov stability analysis mathematically proved that the proposed controller is globally stable and the system tracking error converges to zero asymptotically, which ensures the reliability of the controller under various conditions. Simulation results also clearly agreed with the absolute superiority of this method compared to an advanced neural network-based controller (RBFNN); Our controller achieved faster convergence with less oscillations, which means reduced mechanical wear and increased turbine service life.

These achievements are an important step towards making FWTs a more reliable and efficient energy source. This advanced approach not only improves the stability and performance of turbines, but also has the potential to revolutionize the management of large-scale renewable energy systems. Despite the promising results, this research can be expanded in the future. It is proposed to evaluate the performance of this controller under extreme sea conditions and also considering time-delays in the control signals. Overall, this paper presents a comprehensive and intelligent solution that can significantly transform the stability and efficiency of the next generation of FWTs, paving the way for a wider exploitation of wind energy in deep waters.

Supporting information

S1 File. Uploaded file in Zenodo repository.

https://doi.org/10.1371/journal.pone.0336410.s001

(XLSX)

References

1. Brown AM, Bass AM, Garnett MH, Skiba UM, Macdonald JM, Pickard AE. Sources and controls of greenhouse gases and heavy metals in mine water: a continuing climate legacy. Sci Total Environ. 2024;906:167371. pmid:37758145
- View Article
- PubMed/NCBI
- Google Scholar
2. Sharifi A, Allam Z, Bibri SE, Khavarian-Garmsir AR. Smart cities and sustainable development goals (SDGs): a systematic literature review of co-benefits and trade-offs. Cities. 2024;146:104659.
- View Article
- Google Scholar
3. Keighobadi J, Mohammadian KhalafAnsar H, Naseradinmousavi P. Adaptive neural dynamic surface control for uniform energy exploitation of floating wind turbine. Appl Energy. 2022;316:119132.
- View Article
- Google Scholar
4. Roohi M, Mirzajani S, Basse-O’Connor A. A no-chatter single-input finite-time PID sliding mode control technique for stabilization of a class of 4D chaotic fractional-order laser systems. Mathematics. 2023;11(21):4463.
- View Article
- Google Scholar
5. Roohi M, Zhang C, Taheri M, Basse-O’Connor A. Synchronization of fractional-order delayed neural networks using dynamic-free adaptive sliding mode control. Fractal Fract. 2023;7(9):682.
- View Article
- Google Scholar
6. Rasooli Berardehi Z, Zhang C, Taheri M, Roohi M, Khooban MH. A fuzzy control strategy to synchronize fractional‐order nonlinear systems including input saturation. Int J Intell Syst. 2023;2023(1).
- View Article
- Google Scholar
7. Zhu Q, Shang H, Lu X, Chen Y. Adaptive sliding mode tracking control of underwater vehicle-manipulator systems considering dynamic disturbance. Ocean Eng. 2024;291:116300.
- View Article
- Google Scholar
8. Taimoor M, Lu X, Shabbir W, Sheng C. Neural network observer based on fuzzy auxiliary sliding-mode-control for nonlinear systems. Expert Syst Appl. 2024;237:121492.
- View Article
- Google Scholar
9. Yu J, Wu M, Ji J, Yang W. Neural network-based region tracking control for a flexible-joint robot manipulator. J Comput Nonlinear Dynam. 2023;19(2).
- View Article
- Google Scholar
10. Zhou Y, Bhowmick P, Zhang L, Chen L, Nagamune R, Li Y. A model reference adaptive control framework for floating offshore wind turbines with collective and individual blade pitch strategy. Ocean Eng. 2024;291:116054.
- View Article
- Google Scholar
11. Mazare M. Adaptive optimal secure wind power generation control for variable speed wind turbine systems via reinforcement learning. Appl Energy. 2024;353:122034.
- View Article
- Google Scholar
12. Zhou X, Ke Y, Zhu J, Cui W. Sustainable operation and maintenance of offshore wind farms based on the deep wind forecasting. Sustainability. 2023;16(1):333.
- View Article
- Google Scholar
13. Haque A, Malik A. Machine learning in renewable energy systems for smart cities. Smart cities: Power electronics, renewable energy, and Internet of Things. CRC Press; 2024. pp. 287–314.
14. Abdollahzadeh M, Pourgholi M. Adaptive fuzzy sliding mode control of magnetic levitation system based on Interval Type-2 Fuzzy Neural Network Identification with an Extended Kalman–Bucy filter. Eng Appl Artif Intell. 2024;130:107645.
- View Article
- Google Scholar
15. Jafari H, Malidareh BF, Hosseini VR. Collocation discrete least squares meshless method for solving nonlinear multi-term time fractional differential equations. Eng Anal Bound Element. 2024;158:107–20.
- View Article
- Google Scholar
16. Hosseini VR, Rezazadeh A, Zheng H, Zou W. A nonlocal modeling for solving time fractional diffusion equation arising in fluid mechanics. Fractals. 2022;30(05).
- View Article
- Google Scholar
17. Hosseini VR, Zheng H, Zou W. An efficient meshfree computational approach to the analyze of thermoelastic waves of functionally graded materials in a two-dimensional space. Alexandria Eng J. 2022;61(12):10495–510.
- View Article
- Google Scholar
18. Hosseini V Reza, Remazani M, Zou W, Banihashemii S. Stochastic model for multi-term time-fractional diffusion equations with noise. Therm sci. 2021;25(Spec. issue 2):287–93.
- View Article
- Google Scholar
19. Tong X, Ma D, Wang Z, Ming Z, Xie X. Model-free adaptive dynamic event-triggered robust control for unknown nonlinear systems using iterative neural dynamic programming. Inf Sci. 2024;655:119866.
- View Article
- Google Scholar
20. Ribeiro FDS, Leontidis G, Kollias S. Capsule routing via variational bayes. Proceedings of the AAAI Conference on Artificial Intelligence; 2020.
21. Jonkman J, Butterfield S, Musial W, Scott G. Definition of a 5-MW reference wind turbine for offshore system development. Golden, CO (United States): National Renewable Energy Lab. (NREL); 2009.

[ref1] 1. Brown AM, Bass AM, Garnett MH, Skiba UM, Macdonald JM, Pickard AE. Sources and controls of greenhouse gases and heavy metals in mine water: a continuing climate legacy. Sci Total Environ. 2024;906:167371. pmid:37758145
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Sharifi A, Allam Z, Bibri SE, Khavarian-Garmsir AR. Smart cities and sustainable development goals (SDGs): a systematic literature review of co-benefits and trade-offs. Cities. 2024;146:104659.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref3] 3. Keighobadi J, Mohammadian KhalafAnsar H, Naseradinmousavi P. Adaptive neural dynamic surface control for uniform energy exploitation of floating wind turbine. Appl Energy. 2022;316:119132.
View Article
Google Scholar

[9] View Article

[10] Google Scholar

[ref4] 4. Roohi M, Mirzajani S, Basse-O’Connor A. A no-chatter single-input finite-time PID sliding mode control technique for stabilization of a class of 4D chaotic fractional-order laser systems. Mathematics. 2023;11(21):4463.
View Article
Google Scholar

[12] View Article

[13] Google Scholar

[ref5] 5. Roohi M, Zhang C, Taheri M, Basse-O’Connor A. Synchronization of fractional-order delayed neural networks using dynamic-free adaptive sliding mode control. Fractal Fract. 2023;7(9):682.
View Article
Google Scholar

[15] View Article

[16] Google Scholar

[ref6] 6. Rasooli Berardehi Z, Zhang C, Taheri M, Roohi M, Khooban MH. A fuzzy control strategy to synchronize fractional‐order nonlinear systems including input saturation. Int J Intell Syst. 2023;2023(1).
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref7] 7. Zhu Q, Shang H, Lu X, Chen Y. Adaptive sliding mode tracking control of underwater vehicle-manipulator systems considering dynamic disturbance. Ocean Eng. 2024;291:116300.
View Article
Google Scholar

[21] View Article

[22] Google Scholar

[ref8] 8. Taimoor M, Lu X, Shabbir W, Sheng C. Neural network observer based on fuzzy auxiliary sliding-mode-control for nonlinear systems. Expert Syst Appl. 2024;237:121492.
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref9] 9. Yu J, Wu M, Ji J, Yang W. Neural network-based region tracking control for a flexible-joint robot manipulator. J Comput Nonlinear Dynam. 2023;19(2).
View Article
Google Scholar

[27] View Article

[28] Google Scholar

[ref10] 10. Zhou Y, Bhowmick P, Zhang L, Chen L, Nagamune R, Li Y. A model reference adaptive control framework for floating offshore wind turbines with collective and individual blade pitch strategy. Ocean Eng. 2024;291:116054.
View Article
Google Scholar

[30] View Article

[31] Google Scholar

[ref11] 11. Mazare M. Adaptive optimal secure wind power generation control for variable speed wind turbine systems via reinforcement learning. Appl Energy. 2024;353:122034.
View Article
Google Scholar

[33] View Article

[34] Google Scholar

[ref12] 12. Zhou X, Ke Y, Zhu J, Cui W. Sustainable operation and maintenance of offshore wind farms based on the deep wind forecasting. Sustainability. 2023;16(1):333.
View Article
Google Scholar

[36] View Article

[37] Google Scholar

[ref13] 13. Haque A, Malik A. Machine learning in renewable energy systems for smart cities. Smart cities: Power electronics, renewable energy, and Internet of Things. CRC Press; 2024. pp. 287–314.

[ref14] 14. Abdollahzadeh M, Pourgholi M. Adaptive fuzzy sliding mode control of magnetic levitation system based on Interval Type-2 Fuzzy Neural Network Identification with an Extended Kalman–Bucy filter. Eng Appl Artif Intell. 2024;130:107645.
View Article
Google Scholar

[40] View Article

[41] Google Scholar

[ref15] 15. Jafari H, Malidareh BF, Hosseini VR. Collocation discrete least squares meshless method for solving nonlinear multi-term time fractional differential equations. Eng Anal Bound Element. 2024;158:107–20.
View Article
Google Scholar

[43] View Article

[44] Google Scholar

[ref16] 16. Hosseini VR, Rezazadeh A, Zheng H, Zou W. A nonlocal modeling for solving time fractional diffusion equation arising in fluid mechanics. Fractals. 2022;30(05).
View Article
Google Scholar

[46] View Article

[47] Google Scholar

[ref17] 17. Hosseini VR, Zheng H, Zou W. An efficient meshfree computational approach to the analyze of thermoelastic waves of functionally graded materials in a two-dimensional space. Alexandria Eng J. 2022;61(12):10495–510.
View Article
Google Scholar

[49] View Article

[50] Google Scholar

[ref18] 18. Hosseini V Reza, Remazani M, Zou W, Banihashemii S. Stochastic model for multi-term time-fractional diffusion equations with noise. Therm sci. 2021;25(Spec. issue 2):287–93.
View Article
Google Scholar

[52] View Article

[53] Google Scholar

[ref19] 19. Tong X, Ma D, Wang Z, Ming Z, Xie X. Model-free adaptive dynamic event-triggered robust control for unknown nonlinear systems using iterative neural dynamic programming. Inf Sci. 2024;655:119866.
View Article
Google Scholar

[55] View Article

[56] Google Scholar

[ref20] 20. Ribeiro FDS, Leontidis G, Kollias S. Capsule routing via variational bayes. Proceedings of the AAAI Conference on Artificial Intelligence; 2020.

[ref21] 21. Jonkman J, Butterfield S, Musial W, Scott G. Definition of a 5-MW reference wind turbine for offshore system development. Golden, CO (United States): National Renewable Energy Lab. (NREL); 2009.

Figures

Abstract

Introduction

Materials and methods

Modeling of the online adaptive intelligent control

DRL neural network

Stability analysis

Results and discussion

Conclusions

Supporting information

S1 File. Uploaded file in Zenodo repository.

References