Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

AI-assisted optimization design of seismic performance parameters for timber structures

  • Dongqi Wei,

    Roles Conceptualization, Investigation

    Affiliation School of Architectural Engineering and Art Design, Liuzhou City Vocational College, Liuzhou, Guangxi, China

  • Yuqiang Ding,

    Roles Writing – original draft

    Affiliation School of Architectural Engineering and Art Design, Liuzhou City Vocational College, Liuzhou, Guangxi, China

  • Feng Zhou,

    Roles Data curation, Writing – review & editing

    Affiliation School of Architectural Engineering and Art Design, Liuzhou City Vocational College, Liuzhou, Guangxi, China

  • Xuan Zhang

    Roles Project administration

    zhangxuan1193@outlook.com

    Affiliation School of Environmental and Life Sciences, Nanning Normal University, Nanning, Guangxi, China

Abstract

Timber multi-story buildings offer environmental benefits, lightweight construction, and seismic resilience, but Artificial Intelligence (AI) based integrated frameworks for optimizing seismic parameters, including inter-story drift and roof displacement, remain limited. The Gradient Boosted Random Forest Machine with Scalable Cheetah Optimizer (GBRF-SCO) is proposed for improving prediction accuracy while facilitating optimal design decisions. The dataset consists of 4,000 timber building samples obtained from a publicly available Kaggle repository (Timber Seismic Performance Dataset). Data pre-processing employs normalization and outlier detection using Robust Scaling and Isolation Forest, ensuring high-quality inputs. For exploratory analysis, t-Distributed Stochastic Neighbor Embedding (t-SNE) is applied to visualize high-dimensional feature relationships and identify structural parameter patterns relevant to seismic performance. The proposed framework uses GBRF to predict seismic response metrics, with the SCO tweaking hyperparameters to optimize model performance. It also enables the optimization of seismic performance characteristics, guiding engineers in selecting structural designs that minimize drift and enhance robustness. Multiple Linear Regression (MLR) was employed to examine the influence of key structural and seismic elements on roof displacement, providing insights into the overall seismic performance of wood buildings. Comparative evaluation shows superior performance over conventional regression and ensemble methods, demonstrating a higher accuracy of 0.949, which corresponds to the classification of roof displacement levels (low, medium, high) under seismic loading conditions and seismic intensities using Python 3.10. By providing a strong and clever method for designing sustainable and earthquake-resilient buildings, the suggested GBRF-SCO framework successfully improves the seismic performance optimization of timber structures.

1. Introduction

Wood constructions play a vital role in modern building due to their exclusive grouping of strength, flexibility, and sustainability, promoting both builders and landlords of large-scale infrastructures [1,2]. Over the past 50 years, loans in engineering techniques have enabled wood to transition from a traditional framing material to a primary structural component in commercial and residential buildings. Engineered timber products enhance strength, stability, and fire resistance compared to conventional wood-framed structures, while offering a higher strength-to-weight ratio that facilitates faster construction and reduces environmental impact relative to concrete or steel [3,4]. Technological developments ensure timber remains a durable, adaptable, and sustainable material choice in contemporary design.

Earthquake performance is a critical consideration for timber buildings, as seismic forces can induce inter-story drift, roof displacement, base shear, acceleration, and energy dissipation [5,6]. Evaluating these performance indicators allows engineers to determine whether a structure possesses sufficient strength, stability, and energy-dissipation capacity to resist seismic events [7]. Key metrics include lateral displacement, inter-story drift, base shear, ductility, damping capacity, and the natural period of vibration, which collectively describe a timber building’s ability to withstand seismic loads without catastrophic failure. Adherence to earthquake-resistant design principles ensures human safety and structural resilience [8].

Engineers improve seismic performance by modifying structural systems, material selection, and construction details, while verifying that buildings are safe, serviceable, and compliant with codes [9,10]. Specific seismic performance parameters, essential for assessing timber structures, rely on intrinsic wood properties such as ductility, low mass, and flexibility [11,12]. Specifically, lateral displacement, inter-story drift, base shear, damping capacity, and energy dissipation are key indicators of timber structure resilience under seismic loading [13]. Moreover, the form, layout, and detailing of timber connections, as well as the use of engineered wood products, significantly influence seismic performance [14]. Proper assessment ensures timber structures can safely absorb and transfer seismic energy, limiting damage and protecting occupants. Conforming to current seismic codes and performance-based design standards requires careful consideration of these factors throughout the design and construction process [15]. Fig 1 shows the detailed structural framework of a timber building.

thumbnail
Fig 1. Structural framework of a timber building illustrating key components influencing seismic performance.

[Timber-Frames: Anatomy and Joinery – Custom Home Building and Remodeling].

https://doi.org/10.1371/journal.pone.0341961.g001

It is crucial to monitor seismic performance, such as lateral drifts and roof displacement, to ensure that structures are safe under earthquake loads.

The research introduces the GBRF-SCO framework, an AI-driven methodology that enhances essential seismic metrics, including inter-story drift and roof displacement. Engineers can use the model’s hyperparameter optimization to make better predictions and design buildings that are more resistant to earthquakes and less likely to drift. In general, the GBRF-SCO framework helps with safer, stronger, and faster design of timber buildings.

1.1. Key contribution of the research

  • Initially, analytical modeling was carried out using a comprehensive Timber Seismic Performance Dataset that included different seismic and structural characteristics such as height, density, and acceleration.
  • After collecting the data, the dataset was processed through robust scaling, isolation forest, outlier removal, and normalization to enhance the model’s accuracy and reliability.
  • t-SNE analysis to visualize relationships between structural and seismic variables, identifying key trends for performance optimization.
  • Demonstration that the GBRF-SCO framework achieves superior predictive performance and stability, with improved accuracy, precision, and F1-score over conventional models.

2. Literature review

2.1. AI and machine learning in timber seismic performance

The research aimed to develop a multi-objective optimization framework [16] leveraging Deep Learning (DL) for the design of 20-story cross-laminated timber (CLT) Coupled Wall systems, improving high-dimensional design handling. An autoencoder compressed the design space dimensions, while neural networks mapped input variables to latent spaces and linked latent variables to output responses for structural optimization. This framework generated the best Pareto front, accounting for uncertainties in connection elements, and outperformed three deterministic models using nonlinear time history seismic analysis. However, it relied on a two-dimensional numerical model, limiting full three-dimensional representation, and its effectiveness depended on training data quality and diversity. Complementary research [17] applied machine learning (ML) algorithms to predict design suitability of box-shaped timber members, achieving accuracies from 91.7% to 98.6%, with Logistic Regression performing best. Both studies were limited by small datasets and constrained structural dimensions, potentially overlooking full structural variability.

2.2. Hybrid and CLT timber systems

The research [18] evaluated the mechanical performance of a hybrid steel-grout connection for CLT panels, using quasi-static cycle tests to assess secant stiffness and residual slip. ML models predicted these properties based on material and geometric characteristics, highlighting rod and grout diameters, though results are specific to the tested connection and large-scale CLT assemblies. Another study [19] employed DL with a deep fully convolutional neural network (d-FCNN) using encoder-decoder architecture to segment cracks in 501 images of Yingxian Pagoda timber components. The model, trained with a batch size of six over 100 epochs, achieved average accuracy but is limited by dataset size and generalization. Anomaly detection in civil structures [20] combined Transfer Learning (TL) with Extended Node Strength Network (ENSN) to identify Regions-of-Uninterest, validated via laboratory shaking table tests. Hybrid RC-timber systems [21] for high-rise seismic zones showed up to 38.7% seismic force and 30.6% base shear reduction, though limited to numerical analysis.

2.3. Multi-hazard and performance-based design approaches

The study [22] evaluated the effectiveness of hybrid walls and Impact-Resilient Double Concave Frictional Pendulum (IR-DCFP) bearings in enhancing seismic performance of Light-Frame Timber Buildings (LFTBs) under severe ground motions. Using incremental dynamic analyses, archetype Chilean LFTB models were assessed for collapse margin ratios and fragility curves. The combined hybrid walls and IR-DCFP solution minimized collapse probability at Maximum Considered Earthquake (MCE) levels and provided functional isolation despite low wall density and compact bearings. Multi-hazard design frameworks [23] were tested on steel-timber hybrid connections in 18- and 36-story case buildings using Direct Displacement-Based Design (DDBD), controlling seismic drift and wind-induced accelerations, though limited to analytical and numerical modeling. Another study [24] quantified cap beams’ effects on East Asian wood frames supported by stone bases, showing 42% reduction in energy dissipation, 81% increase in lateral load capacity, and 77% improvement in elastic stiffness, though applicability was limited to specific joint types and stress conditions. Table 1 provides a comparative review of notable timber-related seismic studies, emphasizing their aims, modeling methodologies, main discoveries, and current research constraints to identify knowledge gaps for future development.

thumbnail
Table 1. Overview of innovative research on seismic behavior of timber buildings.

https://doi.org/10.1371/journal.pone.0341961.t001

2.4. Knowledge gap

This section exposes limits in current research on the seismic performance of wood structures, as well as a lack of integrated AI-based optimization frameworks. Table 2 provides a comparative review of important research gaps discovered in prior research on the seismic performance of wood structures, as well as a description of how the current work contributes to resolving them.

thumbnail
Table 2. Analysis of identified research gaps and research contributions.

https://doi.org/10.1371/journal.pone.0341961.t002

This systematic review highlights the evolution of timber seismic research, identifies limitations, and positions the GBRF-SCO framework as a robust AI-based solution addressing these critical gaps.

2.5. Distinction between existing AI-based approaches and the proposed GBRF-SCO framework

Previous AI-based seismic researches on timber structures mainly rely on single learning models with manually tuned or grid-searched hyperparameters, which often suffer from high computational cost and suboptimal convergence in high-dimensional design spaces. In contrast, the proposed GBRF-SCO framework integrates a hybrid Gradient Boosted Random Forest model with the Scalable Cheetah Optimizer for adaptive, population-based hyperparameter tuning. By coupling prediction and optimization within a unified learning loop, the framework enhances nonlinear modeling capability, robustness, and design-oriented seismic performance optimization.

3. Methodology

An optimization framework with AI help has been developed to reliably anticipate and improve seismic performance indicators of wood structures, such as inter-story drift and roof displacement, enabling engineers to design safer and more robust timber buildings under earthquake conditions. Fig 2 depicts the GBRF-SCO framework’s overall workflow for improving the seismic performance of timber structures, showing the processes of data collecting, preprocessing, feature analysis, model training, and prediction.

thumbnail
Fig 2. The completed flowchart for enhancing seismic performance parameters of timber structures.

https://doi.org/10.1371/journal.pone.0341961.g002

3.1. Data acquisition

The Timber Seismic Performance Dataset contains detailed information on the structural and seismic characteristics of timber buildings. It considers building height, number of storys, wall thickness, material density, and damping qualities, as well as seismic inputs such as peak ground acceleration and spectrum acceleration. Table 3 represents the collection of sample data.

thumbnail
Table 3. Sample data from the timber seismic performance dataset.

https://doi.org/10.1371/journal.pone.0341961.t003

The Timber Seismic Performance Dataset referenced in this work is a secondary dataset publicly available through the Kaggle repository (https://www.kaggle.com/datasets/freshersstaff/timber-seismic-performance-dataset/data). This dataset has been used under the terms and conditions provided by Kaggle. The data are accessible without restriction to any researcher, and no identifiable human subjects or sensitive personal information are included in the dataset.

3.2. Data preparation approaches

Data preparation for the research included tasks such as normalizing the data, detecting outliers, and extracting features. By ensuring that all input features are uniformly scaled, normalization enhances the efficiency and stability of model training. Outlier detection eliminates odd or inconsistent data points, which enhances the integrity of the dataset on the whole and is even further verified by experts familiar with the domain. The t-SNE procedure is used to extract and visualize important structural patterns and relationships among variables, which gives a clearer presentation of seismic performance characteristics.

3.2.1. Normalization.

The Standard Scaler assumes that information is normally distributed within each component and scales them such that the distribution is centered around 0, with a standard deviation (SD) of 1. The element’s mean and SD are computed, then the component is scaled based on seismic performance parameters expressed in Equation (1).

(1)

Normalized data by subtracting the mean (μ) and dividing by the standard deviation (σ), ensuring all features are on a comparable scale for improved model performance.

3.2.2. Outlier detection.

The Isolation Forest algorithm is used to find anomalies in the seismic and structural dataset by looking for samples that are different from the rest. The contamination parameter is set to 0.05, which means that about 5% of the observations are likely to be outliers. Samples with anomaly scores below the decision threshold are considered anomalous and taken out of the dataset because they show physically inconsistent or severe seismic–structural pairings. This method cuts down on noise and bias, makes the data more reliable, and ensures that the samples kept are a true representation of how timber structures behave in genuine seismic conditions. This makes model training more stable and effective.

3.2.3. Feature analysis using t-distributed stochastic neighbor embedding (t-SNE).

This method enables feature extraction through dimensionality reduction, or the compression of data while preserving the connections between data points through the seismic performance criteria. Visualizations of correlations enable better decision-making by transforming convoluted data into its patterns and illuminating similarities, correlations, and clusters between points in an informative manner. By means of conditional probabilities, t-SNE augments standard geometrical distances to represent these similarities. The conditional probabilities , which are stated in Equation (2), express that similar data points and became individuals.

(2)

It calculates the conditional probability , showing how similar point i is to point j. Here, and are their feature vectors, and controls how distance affects similarity. The numerator measures how close the two points are, while the denominator ensures all probabilities around j sum to one. This helps t-SNE identify clusters of similar data points. In the original space, the probabilities were specified by Equation (3).

(3)

It defines the joint probability , which represents the combined similarity between points i and j. It is calculated as . Here, and are the conditional probabilities measuring how similar each point is to the other, and m is the overall data points. It ensures the similarity measure is symmetric, meaning the relationship between two points is treated equally in both directions. Where 2m represents the dataset’s size. The smooth indicator of an efficient number of neighbors represents the perplexity parameter that the t-SNE algorithm takes as an input, as shown in Equation (4).

(4)

It defines the perplexity of point as . represents the Shannon entropy of the probability distribution for point j. Perplexity indicates how broadly the probability distribution spreads across nearby points, effectively showing the close neighbors. A greater perplexity value means that more neighboring points are considered when measuring similarity. Here, is the bit value of the Shannon entropy , as shown in Equation (5).

(5)

It defines the Shannon entropy as . In this equation, represents the conditional probability that point i is similar to point j. The entropy measures the uncertainty or spread of these probabilities. A higher entropy value indicates that the similarities are distributed more evenly among many points, while a lower value means that only a few points are strongly similar to j. Equation (6) defines the probabilities at low-dimensional using this distribution.

(6)

and are the low-dimensional representations of data points j and i. The numerator, , measures how close the two points have a higher similarity. The denominator ensures that all probabilities in the low-dimensional integral sum to one. The t-SNE technique determines the lower-dimensional projection of the input data as , therefore reducing the divergence among and . After data preparation, the data is divided into two categories: testing (30%) and training (70%).

3.3. Gradient boosted random forest machine with scalable cheetah optimizer (GBRF-SCO) for accurate seismic predictions in timber structures

The GBRF-SCO framework employs an AI-based predictive ensemble combining gradient boosting and random forest to model nonlinear seismic–structural relationships. SCO adaptively optimizes GBRF hyperparameters using a cheetah-inspired search strategy, enhancing prediction accuracy, stability, convergence, and computational efficiency, thereby supporting the design of safer, more earthquake-resilient timber structures.

3.3.1. Gradient boosted random forest (GBRF) for accurate seismic response prediction.

The GBRF model merges aspects of gradient boosting and random forest methods. The model successfully captures the intricate, nonlinear connections between seismic and structural events. The linear combination of many weak learners increases overall prediction accuracy and generalization. This model is used for estimating inter-story drift and roof displacement in timber structures.

  • Gradient Boosting (GB) algorithm

The GB improves prediction accuracy by successively merging many weak learners to reduce residual errors, making it particularly efficient at capturing complicated nonlinear correlations between structural and seismic data. The GB architecture is represented in Fig 3.

The GB algorithm can approximate the underlying function given an input matrix w and a vector of molecular properties. This function maps the relationship between the molecular descriptor and the biological activity. The function is constructed in an additive manner. Equation (7) is used in gradient boosting to combine multiple weak learners into a strong predictive model by gradually minimizing the overall error.

(7)

Here, is the total number of iterations, represents the total error of the model, is the error from the iteration or tree, and is the rate of learning that determines how much each new model contributes to the last prediction. After the first iteration, minimize the following goal given a loss function, which assesses the quality of predictions pi in relation to real readouts. Equation (8) defines the optimization step for the iteration in the gradient boosting process.

(8)

represents the minimized error function at iteration n, and denotes the loss function that calculates the variations among the predicted values and the true outcomes Z. The term is the gradient of the loss function with respect to the previous prediction, indicating the direction of steepest increase in error. By taking the negative gradient, the model updates in the direction that reduces the error most effectively. The expression means that the algorithm searches for the function F that minimizes this difference. In simple terms, this equation guides the model to adjust its predictions step by step to reduce the overall error during training.

  • Random Forest (RF) classifier

The RF enhances model resilience and generalization by combining findings from several Decision Trees (DTs), decreasing overfitting, and ensuring consistent predictions of seismic reactions in various timber structure configurations. The recommended binary classification strategy makes use of the RF algorithm. Following the training phase, RF builds a huge number of decision trees and produces a class with an average prediction. RF hyperparameters were changed using grid search. Fig 4 denotes the architecture of the RF classifier.

This research uses 1000 random trees with a maximum depth of 10, utilizing a 0.5 confidence vote method and the Gini impurity criterion. The Gini impurity can be determined as follows in equation (9):

(9)

Here, represents the data point probability that belongs to class i, and D is the overall classes. The term measures how mixed or impure the data is for each class. When all data points belong to one class, H becomes zero, indicating perfect purity. Higher values of Hmean indicate greater uncertainty or diversity among the classes.

3.3.2. Traditional Cheetah Optimizer (TCO).

The TCO is a cheetah-inspired metaheuristic optimizing timber structure parameters to enhance seismic performance by balancing global exploration and local exploitation. It employs three strategies to avoid local optima. Fig 5 illustrates SCO phases: initial exploration, transition, attacking, and final convergence, systematically guiding the search toward optimal solutions. In the figure, represents the initial position of the cheetah in the search space, and , , , and are its updated positions during different hunting phases. and are random factors controlling movement, while and are adaptive coefficients that balance exploration and convergence. denotes the best or attacking position guiding the cheetah toward the optimal solution.

thumbnail
Fig 5. The optimization process of SCO (a) the exploration phase, where cheetahs search for potential solutions, (b) transition phase moving toward promising regions, (c) the attacking phase with rapid convergence to the best solutions, and (d) depicts the final convergence phase, where the algorithm stabilizes around the optimal result.

https://doi.org/10.1371/journal.pone.0341961.g005

Scalable Cheetah Optimizer (SCO) for Timber Structure Seismic Parameter Optimization: The Scalable Cheetah Optimizer (SCO) is a metaheuristic optimization algorithm designed for efficient hyperparameter tuning and seismic parameter optimization in timber structures. The SCO extends the traditional Cheetah Optimization (CO) framework by introducing adaptive control mechanisms, scalable step-size updates, and opposite learning concepts, ensuring high accuracy, rapid convergence, and improved stability in high-dimensional optimization problems.

Initialization of individuals: The optimization process begins with initializing the positions of cheetahs (candidate solutions) in a d-dimensional search space. The initial value of the variable for the j-th individual, , is defined as Equation (10)).

(10)

where and are the lower and upper bounds of the i-th variable, r denotes a random number, c is the total number of variables, and q is a uniform random number between 0 and 1. This ensures a uniform random distribution of initial solutions across the defined search space.

Searching technique: During the search phase, cheetahs evaluate their environment, including prey location, cover, and their own status. Each individual’s position is updated according to the same initialization principle (Equation 11), maintaining randomness in early exploration. The step size is dynamically adapted at each iteration based on the current iteration ratio , the distance between candidate solutions, and random perturbations expressed in Equation (11).

(11)

It describes the adaptive control parameter , which adjusts the step size for each variable during the optimization process. When , the value of is determined by the ratio of the current iteration. The overall number of iterations S, multiplied by the variable range . When , the value depends on the distance between the target position , current position , and other reference positions like and . The constants , , and control the weight of each term, while q adds a variation’s random factor.

Sitting-and-Waiting Model: The SCO also incorporates a sit-and-wait strategy, reflecting the cheetah’s behavior of remaining stationary to reduce unnecessary movements are expressed in Equation (12).

(12)

Here, the variable remains unchanged between iterations, allowing the algorithm to conserve computational resources while maintaining candidate diversity.

Attacking Strategy: During the attacking phase, cheetahs adjust positions dynamically based on prey, nearby individuals, and the leader’s position. The position update is expressed as Equation (13 and 14).

(13)(14)

where is the leader or reference position, is a random scaling factor, and represents the difference between the best solution and the current candidate. This allows efficient exploitation of promising areas in the search space.

SCO for Hyperparameter Optimization: The SCO framework ensures fast and reliable convergence in predicting seismic parameters by dynamically adjusting GBRF hyperparameters. Unlike the traditional TCO algorithm, which suffers from slow convergence, early stagnation, and inefficiency in high-dimensional problems, SCO is adaptive, scalable, and provides improved optimization performance. The opposite learning strategy further enhances exploration in Equation (15).

(15)

where is the opposite position of the j-th individual in dimension l, is a scaling factor, and k, v defines the exploration center.

Control Parameter Update Strategy: The CO algorithm updates control parameters adaptively to match nonlinear optimization requirements. The step size is updated as Equation (16 and 17).

(16)(17)

Here, is the target position, the best-known solution, and the reference position. Constants b2 and b3 control the contribution of each term, while q introduces randomness. This strategy balances global exploration with local fine-tuning.

Updating Individual Locations: To enhance individual updates and prevent redundant searches, the SCO employs the following update mechanism for binary and probabilistic optimization is expressed in Equation (18).

(18)

The sigmoid function bounds updated values between 0 and 1, suitable for probabilistic decisions. For discrete optimization is compute as Equation (19).

(19)

This converts continuous positions to binary states, enabling SCO to handle both continuous and discrete seismic optimization problems effectively. In simple terms, this decides whether each variable can take the value –1 or 1 in the next iteration. The SCO algorithm employs initialization, searching, and attacking strategies with adaptive control, enhancing convergence, stability, and scalability while optimizing GBRF hyperparameters, improving accuracy and efficiency in seismic parameter optimization for timber structures. Table 4 defines the key hyperparameters and their mathematical formulations used in the GBRF-SCO for seismic performance prediction.

thumbnail
Table 4. Hyperparameter for GBRF and SCO optimization framework.

https://doi.org/10.1371/journal.pone.0341961.t004

The selection of SCO hyperparameters depends on how well they converge, how fast they can be computed, and how many dimensions the seismic optimization problem has. The population size (r = 8–10) makes sure that there are enough different solutions without making the computation too expensive in feature spaces with a lot of dimensions. Smaller populations limit research, but bigger ones provide slight advances in accuracy. Convergence analysis, which is when the optimization error stabilizes, tells you the maximum number of iterations (S = 15–20). This setup strikes a good mix between exploration and exploitation, which makes GBRF hyperparameter tuning steady and dependable.

Pseudo code 1 represents the GBRF-SCO framework, which integrates gradient boosting for learning seismic behaviour with the SCO to adaptively tune model parameters, achieving accurate and efficient seismic performance prediction for timber structures.

Pseudo Code 1: GBRF–SCO Framework for Seismic Performance Prediction in Timber Structures

Input:

# Structural and seismic features

# Continuous seismic response (roof displacement / drift)

# SCO population size

# SCO iterations

# Hyperparameter bounds

Output:

# Optimized seismic response prediction

Step 1: Data Preprocessing

1. Remove outliers using Isolation Forest

2. Normalize features using Robust Scaling

3. Split data into training and testing sets

Step 2: Gradient Boosted Random Forest (GBRF) Modeling

1. Initialize GBRF hyperparameters:

2. Train GBRF model on

3. Predict seismic response

Step 3: Scalable Cheetah Optimizer (SCO)

1. Initialize the cheetah population:

2. Forto :

Evaluate fitness:

Identify the best cheetah

Update position:

Apply boundary constraints

3. Return optimal hyperparameters

Step 4: Final Prediction

1. Retrain GBRF using

2. Generate optimized seismic response:

Step 5: Seismic Damage Classification (Optional)

Return:

The GBRF-SCO framework partitions data, learns structural–seismic relationships by minimizing errors, and optimizes hyperparameters via cheetah-inspired SCO. Repeated learning identifies optimal parameters, while MLR assesses key factors, enabling the retrained GBRF to predict accurate, reliable seismic performance in timber structures.

4. Performance analysis and discussion

The objective is to provide an AI-assisted optimization framework for precisely predicting and improving the seismic performance characteristics of wood buildings, with an emphasis on inter-story drift and roof displacement. Table 5 presents the hardware and software specifications used for implementing and testing the GBRF-SCO framework.

thumbnail
Table 5. Experimental setup for GBRF-SCO model implementation.

https://doi.org/10.1371/journal.pone.0341961.t005

4.1. Multiple linear regression analysis for evaluating seismic performance relationships

The MLR approach analyzes the effects associated with different structural and seismic indicators, such as building height, wall thickness, and the acceleration of motion on the roof displacement of timber buildings. Table 6 presents the MLR results showing how key structural and seismic parameters influence roof displacement and GBRF-SCO’s overall seismic performance in timber structures.

4.2. Distribution of roof displacement by structural system type

Determine whether there are structural system-specific differences in the movement of roofs subjected to seismic forces. In order to determine which system can withstand an earthquake better in terms of stability and resilience. Better knowledge of structural behavior and more efficient design of wood structural systems lead to stronger buildings. Fig 6 shows the distribution of roof displacement among structural system types under seismic loading.

thumbnail
Fig 6. Roof displacement variation for timber structural system types.

https://doi.org/10.1371/journal.pone.0341961.g006

These findings suggest that the GBRF-SCO model may provide a good representation of the structural system-specific variation in roof displacement. By consistently outperforming the others in displacement, the hybrid system proves that the suggested framework improves the precision of seismic performance forecasts.

4.3. Comparative analysis of seismic ground and structural parameters

The research examines the consequences that various seismic intensity parameters have on the lateral load resistant capabilities of varying buildings, focusing on the influence these parameters have on the stability, strength, and seismic performance of residential structures, which would assist in optimizing structural performance for diverse ground motions, allowing more factual assessment of overall earthquake resistance, depicted in Fig 7 for each structural sample under seismic acceleration characteristics and lateral load resisting ratios.

The patterns that overlap indicate a high correlation between the intensity of ground motion and the load-resisting capability of the structure. There is a little positive correlation between building height and roof displacement, as shown in Fig 8(a) by the trend line. Fig 8(b) Roof displacement versus peak ground acceleration is shown, with materials color-coded according to density. Fig 8(c) shows the overall patterns of fluctuation in roof displacement as a function of data indices, including the rolling mean variation.

thumbnail
Fig 8. Findings of (a) Relationship between building height and roof displacement, (b) Variation of roof displacement with peak ground acceleration, (c) Trend of roof displacement across samples showing overall fluctuations patterns.

https://doi.org/10.1371/journal.pone.0341961.g008

The findings suggest that there is a connection between essential seismic parameters and roof displacement. The maximum ground acceleration varies from lightly felt to moderately felt to strongly felt; this is also true for roof displacement, which experiences a slight increase with greater height. The overall oscillation trends of the series of timber building samples reinforce some general consistent structural performance and stability of the GBRF-SCO.

4.4. Comparative analysis of roof displacement and structural feature relationships for different roof types

To investigate how roof geometry and critical structural characteristics affect roof displacement and overall seismic behavior, the study analyzes their influence on structural stability, deformation patterns, and the seismic performance of timber buildings. Fig 9(a) shows the variation of roof displacement with the number of storys for different roof types, indicating that displacement generally increases with height. Fig 9(b) illustrating relationships among structural features and roof displacement.

thumbnail
Fig 9. Graphical findings of (a) roof displacement with number of storys for different roof types, and (b) Parallel coordinate analysis of structural features influencing roof displacement across rooftypes.

https://doi.org/10.1371/journal.pone.0341961.g009

The results show that the geometry of the roof has a major impact on the earthquake reaction, especially for sawtooth and hip roof types, and that more storys result in more displacement of the roof. Roof displacement is also significantly affected by wall thickness and material density, according to the study, which shows how geometry and material parameters work together to affect seismic performance.

4.5. Pairwise correlation analysis of key structural and seismic parameters

The examination is to explore interrelationships among structural parameters such as height, storys, wall thickness, and material density, and their combined influence on roof displacement under seismic loading. Fig 10 showing relationships between building and material parameters with roof displacement, helping identify key factors influencing seismic performance.

thumbnail
Fig 10. Pairwise correlation analysis of key structural and seismic parameters.

https://doi.org/10.1371/journal.pone.0341961.g010

The findings indicate that roof displacement rises modestly with building height and number of storys, although wall thickness and material density have less association in GBRF-SCO. This suggests that geometric characteristics have a stronger effect on seismic reaction than material attributes in wood constructions.

4.6. Assessment of interdependencies among seismic response variables

Roof displacement and inter-story drift are two seismic and structural variables that the GBRF-SCO hopes to shed light on. These variables include building height, material properties, and ground acceleration. The correlation heatmap shown in Fig 11 illustrates the strength and direction of connections among structural, material, and seismic elements.

thumbnail
Fig 11. Correlation analysis of seismic and material parameters in timber structures.

https://doi.org/10.1371/journal.pone.0341961.g011

There is a considerable positive connection between building height (0.35–0.54) and the number of storys (0.66), and the results show that there is a strong association (0.67) between inter-story drift and roof displacement.

4.7. Assessment of inter-story drift and roof displacement consistency trends

To assess the patterns of structural deformation, compare the inter-story drift with the movement of the roof. This shows that the structure is generally stable and resilient, which is consistent with the GBRF-SCO’s evaluation of the seismic forces’ propagation through wood structures. See how inter-story drift compares to roof deformation in wood buildings in Fig 12.

thumbnail
Fig 12. Comparison of roof displacement and inter-story drift in timber structures.

https://doi.org/10.1371/journal.pone.0341961.g012

The result illustrates that the values of roof displacement are consistently greater than those of inter-story drift, suggesting that lateral deformation accumulates along the building’s height under seismic loading conditions.

4.8. Classification performance of roof displacement categories using a confusion matrix

The investigation verifies that the model accurately predicts the degree of roof displacement, which is necessary for correctly categorizing structural seismic reactions. Fig 13 displays the results of the roof displacement classes’ classification performance.

thumbnail
Fig 13. Confusion matrix representation of roof displacementclassificarion results.

https://doi.org/10.1371/journal.pone.0341961.g013

Based on quantile thresholds from the dataset’s empirical distribution, roof displacement values are divided into three ordinal classifications such as low, medium, and high. Values of displacement that are less than the 33rd percentile are considered low, values that are between the 33rd and 66th percentiles are considered medium, and values that are greater than the 66th percentile are considered high. This classification, based on quantiles, keeps the class proportions even, keeps the ordinal structure of seismic reaction severity, and doesn’t let people choose the threshold. The established criteria provide a uniform interpretation of the confusion matrix and elucidate the lack of significant misclassifications between low and high displacement categories.

A small number of incorrect classifications show that the model can distinguish between wood structures with varying degrees of seismic reactivity when using GBRF-SCO. During seismic parameter prediction, Fig 14 shows that the model learned steadily and converged strongly.

thumbnail
Fig 14. Performance analysis of GBRF-SCO model across training epochs.

https://doi.org/10.1371/journal.pone.0341961.g014

According to the results, the suggested GBRF-SCO model steadily increased testing and training accuracy across all epochs. To make the accuracy and loss trends easier to see, the legend was moved outside the graph. The GBRF-SCO model achieved a final training accuracy of 94.9% by displaying strong convergence and little validation variance.

4.9. Comparative analysis

This experiment evaluates the accuracy and efficiency of prediction models for seismic performance in timber structures, comparing the proposed Gradient Boosting Random Forest with Scalable Cheetah Optimizer (GBRF-SCO) to existing methods, including Enhanced Deep Line Segment Detection (LSD) [29], Support Vector Machine (SVM) [30], Adaptive Boosting (AdaBoost) [30], Extreme Gradient Boost (XGBoost) [30], Stacking Artificial Neural Network (ANN) [30], and Gradient Boosting ANN [30]. Performance is assessed using R², Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Accuracy, ROC-AUC, Precision, Recall, and F1-Score, ensuring a comprehensive evaluation of predictive fit, error magnitude, classification correctness, and positive prediction balance. Table 7 shows the numerical outcomes of error metrics.

thumbnail
Table 7. Evaluation of predictive modelling approaches with error metrics.

https://doi.org/10.1371/journal.pone.0341961.t007

The proposed GBRF-SCO model performed best with R² (0.988), MAE (0.54), and RMSE (0.20). The Enhanced Deep LSD model showed lower performance with R² (0.98), MAE (0.68), and RMSE (0.23). Overall, the proposed model gives more accurate and reliable results in Table 8. Fig 15 shows the Benchmark assessment of GBRF-SCO classification algorithms using quantitative performance metrics.

thumbnail
Table 8. Comparison of different classification model performances.

https://doi.org/10.1371/journal.pone.0341961.t008

thumbnail
Fig 15. Evaluation of classification model accuracy across multiple performance indicators.

https://doi.org/10.1371/journal.pone.0341961.g015

To reduce bias and have a fair comparison, all the ML models used hyperparameter settings taken directly from their original and peer reviewed studies. The only change made on the testing was the data set used for training. The data before and after counting, testing data, and tools to analyze the counting were the same. Such all-in-one testing method guarantees each model’s performance relies only on its skill and not on anything else apart from the model. The hyperparameter configurations for all the benchmark models used in the comparative analysis were used as they were in their original and peer-existing research to make the fair and unbiased comparison. Enhanced Deep LSD model makes use of best line confidence thresholds, crack length filter, and CNN based features extractor. The SVM makes use of polynomial kernel and a regularization parameter C = 45. AdaBoost makes use of a decision tree base estimator (max depth = 10), 180 estimators and learning rate of 1.0. The calculator uses a gbtree booster with β = 0.1, 450 estimators, Lambdas = 1.0 and Ï« = 0.01 with subsampling ratios of 0.7. The Stacking ant neural network integrated multiple ANN base learners to a meta learner of logistic regression (C = 20). The Gradient boosting NN made use of 25 boosting iterations with a learning rate of 0.2 and the weak learners was an ANN with two hidden layers with 128 neurons each. The proposed GBRF-SCO hyperparameters are fine-tuned by the SCO, which allows for an adaptive adjustment of the hyperparameters for better seismic performance estimation. All models in general were analysed under the same data preparation and experimental conditions so that every model could be objectively compared and their use could be re-produced.

The GBRF-SCO model outperformed all other methods, achieving the highest overall accuracy of 0.949, according to the performance comparison of several classification models. Along with the best ROC AUC Score (0.988), it had excellent performance across all other measures as well, including Precision (0.889), Recall (0.884), and F1-Score (0.897). The suggested model outperforms conventional models in terms of GBRF-SCO’s predictive power and class separation efficiency. Table 9 shows the results of the ablation studies that compared the efficiency of the individual and combined models.

thumbnail
Table 9. Ablation research results for model performance comparison.

https://doi.org/10.1371/journal.pone.0341961.t009

This ablation research showed that the combined GBRF-SCO model outperformed its individual components, GBRF and SCO. This integration led to higher accuracy, improved prediction stability, and better overall seismic performance optimization.

4.10. Discussion

The safety and resilience of wood buildings during earthquakes were enhanced through the development of the GBRF-SCO, capable of accurately forecasting and optimizing seismic performance parameters. Previous deep learning (DL) models for wood material identification showed promise but were dataset-specific and lacked generalizability [29]. Similarly, machine learning (ML) effectively assesses seismic damage in reinforced concrete (RC) structures, though its accuracy depends heavily on data quality and regional calibration [30]. Addressing these limitations, the study introduces the GBRF-SCO model, integrating AI-based prediction and optimization to improve seismic performance accuracy and design efficiency across diverse wood structure types.

4.10.1. Practical implications.

Engineers can make safer, more resilient buildings by designing wood structures with excellent seismic properties using the GBRF-SCO framework. It offers a data-driven decision-support tool that real construction projects can use to make buildings more seismically compliant and safer.

5. Conclusion

An AI-assisted GBRF-SCO framework was proposed for accurately predicting and optimizing key seismic performance parameters of timber structures. By integrating GBRF-SCO, the framework effectively captures complex nonlinear structural-seismic relationships while adaptively optimizing model hyperparameters. The proposed approach demonstrates clear performance advantages over conventional machine learning and ensemble models, achieving an R² of 0.988, MAE of 0.54, and RMSE of 0.20. Beyond predictive accuracy, the GBRF-SCO framework provides practical engineering value by supporting data-driven design decisions aimed at reducing inter-story drift and roof displacement, thereby enhancing the safety, resilience, and sustainability of timber buildings in seismic regions.

5.1. Limitations and future work

The GBRF-SCO framework has robust predictive and optimization capabilities; nevertheless, the study is limited by data diversity and availability, since the dataset predominantly depends on recorded and simulation-based data rather than comprehensive real-world experimental observations. The performance of the model depends on how good and complete the structural and seismic input parameters are. Also, the SCO-based optimization makes it more expensive to compute for high-dimensional feature spaces. Furthermore, extrapolation during intense seismic occurrences or atypical timber structural systems may be constrained by inadequate representative samples. Future efforts will focus on comprehensive experimental testing, field-based sensor data collection, multi-hazard scenario integration, and enhancements in computing efficiency to bolster resilience and practical application across various timber structures.

Supporting information

S1 File. We have uploaded the minimal dataset underlying the findings of this study as Supporting Information with the submission.

In addition, the dataset is publicly available without restriction from the Kaggle repository at the following link: https://www.kaggle.com/datasets/freshersstaff/timber-seismic-performance-dataset/data

https://doi.org/10.1371/journal.pone.0341961.s001

(XLSX)

References

  1. 1. Ilgın HE. Analysis of the main architectural and structural design considerations in tall timber buildings. Buildings. 2023;14(1):43.
  2. 2. Karaca F, Tleuken A, Pineda-Martos R, Ros Cardoso S, Orel D, Askar R, et al. Cultivating sustainable construction: stakeholder insights driving circular economy innovation for inclusive resource equity. Buildings. 2024;14(4):935.
  3. 3. Al-Najjar A, Dodoo A. Modular multi-storey construction with cross-laminated timber: life cycle environmental implications. Wood Mater Sci Eng. 2022;18(2):525–39.
  4. 4. Bergsagel D, Heisel F, Owen J, Rodencal M. Engineered wood products for circular construction: a multi-factor evaluation of lamination methods. npj Mater Sustain. 2025;3(1).
  5. 5. Yan L, Klingner R, Al-Qudsi A, Chen H, Dand JA. Current market landscape and industry voices in key timber construction markets. Buildings. 2025;15(18):3381.
  6. 6. De Araujo V. Timber construction as a multiple valuable sustainable alternative: main characteristics, challenge remarks and affirmative actions. Int J Constr Manag. 2021;23(8):1334–43.
  7. 7. Ilgın HE, Aslantamer ÖN. High-rise timber offices: main architectural and structural design parameters. Buildings. 2024;14(7):1951.
  8. 8. Vladimirova E, Gong M. Advancements and applications of wood-based sandwich panels in modern construction. Buildings. 2024;14(8):2359.
  9. 9. Shareef SS. Earthquake consideration in architectural design: guidelines for architects. Sustainability. 2023;15(18):13760.
  10. 10. Yang Z, Liu D, Zhou J, Zhang L, Liao W. Seismic vulnerability assessment of the cliff-attached buildings equipped with energy dissipation devices under obliquely incident seismic waves. Buildings. 2024;14(11):3488.
  11. 11. Zaker Esteghamati M. Leveraging machine learning techniques to support a holistic performance-based seismic design of civil structures. Woodhead Publishing; 2024. pp. 25–49.
  12. 12. Yang N, Qian X. Prediction of monitoring data and state assessment of heritage stone-timber structures based on spatio-temporal graph neural networks. Eng Struct. 2025;338:120619.
  13. 13. Liu Y, Xie J, Zhang W, Yu H. Research on hysteretic performance prediction method of novel connection dampers for external wall panels based on the mining of temporal combination characteristics. Structures. 2025;78:109285.
  14. 14. Yuanyao M, Dongbo L, Chunyan L, Yan W, Xiguang L, Bo W. Multi-objective optimization of traditional residential timber frames based on response surface methodology. J Comput Methods Sci Eng. 2024;24(6):3493–503.
  15. 15. Lyu H, Oshio H, Matsuoka M. Deep learning-based collapsed building mapping from post-earthquake aerial imagery. Remote Sens. 2025;17(17):3116.
  16. 16. Das S, Teweldebrhan BT, Tesfamariam S. High-dimensional multi-objective optimization of coupled cross-laminated timber walls building using deep learning. Eng Appl Artif Intell. 2024;136:109055.
  17. 17. Cosut M, Bekdas G, Nigdeli SM, Isikdag U. Predicting design suitability of box-shaped sustainable timber structural members using machine learning and hyperparameter optimization. Neural Comput Applic. 2025;37(29):24123–48.
  18. 18. Feujofack K BV, Loss C. Experimental and machine learning study on a novel high-performance hybrid steel-grout connector for cross-laminated timber panels under pre-yield cyclic loads. Eng Struct. 2024;303:117530.
  19. 19. Zhang L, Xie Q, Wang H, Han J, Wu Y. Deep‐learning‐based crack identification and quantification for wooden components in ancient chinese timber structures. Struct Control Health Monit. 2024;2024(1).
  20. 20. Wang S, Saida T, Nishio M. Optical flow‐based structural anomaly detection in seismic events from video data combined with computational cost reduction through deep learning. Struct Control Health Monit. 2025;2025(1).
  21. 21. Tsai M-T, Hsu C-C. Assessment of structural performance, materials efficiency, and environmental impact of multi-story hybrid timber structures in high seismic zone. Case Stud Const Mater. 2024;21:e03695.
  22. 22. Quizanga D, Almazán JL, Torres-Rodas P, Guindos P. Seismic performance of timber buildings retrofitted with hybrid walls and impact-resilient isolators. Soil Dyn Earthq Eng. 2025;198:109586.
  23. 23. Ciabattoni M, Petrini F, Pampanin S. Multi-hazard design of low-damage high-rise steel–timber buildings subjected to wind and earthquake loading. Eng Struct. 2024;303:117522.
  24. 24. Wu Y-J, Liu C, Wang L, Xie Q-F, Zhang L-P. Seismic performance of stone supported traditional timber frames with or without cap beams. J Build Eng. 2023;70:106395.
  25. 25. Wu Y-J, Zhang X-Y, Xie Q-F, Zhang L-P. Evaluation of seismic performance of traditional heavy timber frames with different types of column foot joints. J Build Eng. 2025;103:112123.
  26. 26. Solis F, Parra PF, Cendoya P, Gonzalez-Böhme LF, Quitral-Zapata F, Gallardo R. Seismic Behavior of a Timber Structure Based on a Soft-Kill BESO Optimization Algorithm. Buildings. 2025;15(6):980.
  27. 27. Auad G, Valdés B, Contreras V, Colombo J, Almazán J. Effects of the ductility capacity on the seismic performance of cross-laminated timber structures equipped with frictional isolators. Buildings. 2025;15(8):1208.
  28. 28. Wenzel A, Vera S, Guindos P. Energy and structural optimization of mid-rise light-frame timber buildings for different climates and seismic zones in Chile. Eur J Wood Prod. 2024;82(4):967–82.
  29. 29. Luo J, Guo Y, Liu Z, Hu Q, Hoque MA, Ahmed A. Enhancing deep line segment detection and performance evaluation for wood: a deep learning approach with experiment-based, domain-specific implementations. Forests. 2024;15(8):1393.
  30. 30. Luk SH. Machine learning-based methods for the seismic damage classification of RC buildings. Buildings. 2025;15(14):2395.