Figures
Abstract
The progressive wear of pantograph sliding strips on metro trains necessitates timely replacement to ensure safe and reliable operations. This study proposes an adaptive, data-driven framework for predicting the remaining useful life (RUL) of these components, leveraging operational data from Chongqing Metro Line 6. A Gamma-process model is employed to capture the wear behavior under real-world operating conditions, integrating historical records and new observations through Bayesian inference. Markov chain Monte Carlo (MCMC) sampling is then applied to solve the posterior distribution, with three parameter-estimation approaches compared and the model’s predictive accuracy evaluated across different life-cycle stages. The results demonstrate that incorporating prior knowledge significantly improves prediction accuracy. To showcase practical utility, the study devises a maintenance-scheduling strategy that integrates RUL forecasts with regular vehicle-maintenance intervals, thereby extending service life and reducing costs. Validated using real-world data, the proposed methodology offers a pragmatic tool for predictive maintenance in metro systems and can be adapted to similar engineering applications.
Citation: Liu J, Wu C (2025) Predicting the remaining useful life of metro pantograph sliding strips using gamma processes and its implications for maintenance scheduling. PLoS One 20(7): e0327769. https://doi.org/10.1371/journal.pone.0327769
Editor: Ke Feng, The University of British Columbia, AUSTRALIA
Received: March 13, 2025; Accepted: June 21, 2025; Published: July 11, 2025
Copyright: © 2025 Liu, Wu. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data and code are publicly available on GitHub (https://github.com/jery761210/sliding-strips.git);
Funding: This work was supported by the Science Technology Research Program of Chongqing Municipal Education Commission (Grant No. KJQN202203405, KJQN202303402, KJQN202203417). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
1 Introduction
Sliding strips are critical components in Metro pantograph systems, enabling the transfer of electrical power from overhead contact lines to trains. The widely employed Steemann single-arm pantograph (Fig 1) incorporates two interchangeable carbon sliding strips in its upper structure. These strips experience ongoing deterioration due to frictional wear, electrical arcing, and mechanical stress, necessitating replacement when their thickness diminishes to approximately 24.5 mm. Fig 2 illustrates the degradation process, contrasting the initial and worn profiles of two decommissioned strips, underscoring the substantial material erosion observed during operation.
The highlighted area shows the designated position for the sliding strip installation.
The dashed line represents the original profile, and the solid line indicates the current profile.This study analyzed thickness data from 1,207 sliding strips extracted from 20 trains operating on Chongqing Metro Line 6 between 2018 and 2021. The dataset comprehensively documented the thickness evolution of each strip from initial installation to eventual removal. The findings revealed that only 7% of replacements were due to critical issues like melting or cracking, while the remaining 93% were attributed to regular wear and tear. Consequently, the analysis focused on wear-related replacements, which constituted the vast majority of maintenance interventions.
The current maintenance protocols face substantial challenges. Technicians frequently conduct manual thickness assessments during inspections, relying on subjective judgment to determine the need for replacing strips approaching the 24.5 mm threshold. This approach often leads to premature replacements and suboptimal strip utilization. The dataset presented in Fig 3 reveals that out of 1,120 wear-related replacements, 368 strips (32.9%) continued functioning below the threshold, 30 strips (2.7%) were replaced precisely at the threshold, and 722 strips (64.4%) were replaced while still above it. This distribution highlights significant inefficiencies, with approximately 65% of strips replaced prematurely and about 33% operating beyond the recommended limit.
The red sector represents strips below the threshold, blue indicates strips at the threshold, and green represents strips above the threshold.The economic implications of the current pantograph sliding strip replacement practices are substantial. The average service life of these components is only 71 days, with each replacement costing approximately 5,000 RMB, resulting in significant maintenance expenditures. Field data indicate that 64.5% of the replaced strips still possessed adequate thickness for continued safe operation, suggesting potential cost savings through optimized replacement timing. Conversely, 33% of the strips exceeded the recommended threshold between 2018 and 2021 without reported safety incidents, though the safety implications of extended use were not assessed in this study.
The present study sought to develop predictive models for forecasting the degradation of pantograph sliding strips and estimating their RUL, defined as the time until a strip reaches a critical wear threshold. Accurate RUL estimation can facilitate maintenance planning that aligns with the actual wear progression, thereby reducing waste and enhancing economic efficiency.
Existing research on pantograph strip wear has primarily focused on predicting strip thickness or classifying its condition, rather than directly estimating the RUL. Kuźnar et al. [1] utilized artificial neural networks to forecast strip thickness with less than 12% error. Xu et al. [2] achieved 97.3% accuracy in wear trend fitting using support vector regression, while Trilla et al. [3] reduced thickness prediction uncertainty to 1.39 mm by employing a time delay neural network ensemble that integrates seasonal variables. Classification methodologies have also been explored, with Kuźnar et al. [4] and Kuźnar and Lorenc [5] demonstrating high accuracy in categorizing strip conditions.
The existing literature exhibits a dearth of studies aimed at predicting the RUL of pantograph sliding strips. To date, only two notable contributions have been identified in this domain. Guan et al. [6] proposed a Wiener process model tailored to address the challenge posed by unevenly spaced wear data in carbon contact strips of railway pantographs. Their methodology integrated the Wiener process with a weighted linear regression model for unevenly spaced intervals, effectively predicting RUL using authentic operational data from the Guangzhou Metro. Conversely, Li et al. [7] developed a reliability-centered framework for predicting RUL by investigating wear patterns under varying contact pressure fluctuations. Through experiments on sliding electrical contacts, they formulated both a longevity prediction model and a probability distribution model for the phase of stable wear.
Despite their potential, both methods exhibit notable limitations that hinder their practical application. The statistical assumptions underlying these models often fail to adequately capture the complexities of real-world scenarios, and they do not seamlessly integrate with current maintenance planning systems. This disconnect between theoretical frameworks and practical needs substantially diminishes their utility for informing maintenance decisions. Moreover, the accuracy of RUL predictions is highly sensitive to the precision of thickness measurements, posing a significant challenge. A mere 1 mm discrepancy in thickness prediction can result in a 33-day deviation in RUL estimation based on typical wear rates. Such considerable inaccuracies could lead to premature replacements or, even worse, unforeseen failures that jeopardize service reliability.
This study addresses the aforementioned challenges through two key contributions. Firstly, we introduce a data-driven prognostic methodology that employs a gamma stochastic process to characterize the progressive wear patterns of sliding strips and forecast their RUL. This approach effectively captures the inherent variability in wear and produces reliable RUL predictions through Monte Carlo sampling, offering a viable alternative to conventional Wiener process models [6]. Secondly, we propose a maintenance optimization strategy that synchronizes strip replacements with routine train maintenance schedules, optimizing strip utilization while upholding safety standards. We provide a quantitative evaluation of the economic advantages derived from this optimized scheduling strategy. These contributions collectively establish a comprehensive RUL-centered maintenance framework that advances methodological development and enhances practical utility in transit operations, significantly improving maintenance efficacy and decision-making precision.
The paper is structured as follows. Section 2 analyzes the dataset characteristics and justifies the selected degradation model. Section 3 Construct experimental data and preprocess all data.Section 4 introduces the gamma process for predicting RUL and validates the prediction accuracy. Section 5 demonstrates how RUL predictions can enhance replacement scheduling and quantifies the associated cost savings. Section 6 investigates the variation in prediction accuracy with and without prior data, providing practical implementation insights and an in-depth discussion. Finally, Section 7 concludes the study and proposes potential future research directions.
2 Stochastic degradation model selection and validation
2.1 Review of remaining useful life estimation methodologies
Accurately predicting the RUL of a sliding strip before it reaches a critical wear threshold is essential for efficient predictive maintenance. Current RUL estimation techniques fall into three primary categories: model-based approaches [8], data-driven methods [9], and hybrid methodologies [10]. Model-based strategies utilize analytical or physical models to predict component behavior and degradation, providing precise estimates without requiring extensive empirical data. Nonetheless, accurately modeling the intricate stochastic degradation and interactions among various factors in a purely physics-based model poses notable challenges.
The increased focus on data collection in operational processes has resulted in a rise in the popularity of data-driven techniques. These approaches utilize historical monitoring data along with statistical or machine learning algorithms to model degradation patterns, reducing the need for detailed physical understanding. While effective, data-driven methods typically necessitate extensive datasets, and their predictive accuracy may decrease when real-world operating conditions differ significantly from the historical training data. Hybrid methodologies, which combine aspects of both data-driven and physics-based approaches, seek to achieve a balance between accuracy and data quantity. Nevertheless, existing research has not definitively proven that hybrid methods consistently outperform either individual approach.
The effectiveness of pantograph sliding strips is impacted by various factors such as sliding speed, current collection intensity, relative height to the contact wire, and contact force [11]. The complex interaction among these parameters presents a substantial obstacle in formulating a precise theoretical degradation model. Nevertheless, the wealth of empirical data obtained from regular pantograph strip assessments establishes a solid basis for data-driven methodologies.
Estimation of RUL has been approached using diverse data-driven methodologies, such as regression models, similarity-based techniques, stochastic process models, and deep learning methods [12–21]. Researchers have introduced random coefficient regression frameworks to characterize degradation patterns and deduce lifetime distributions from data obtained through condition monitoring [12]. In cases where failure records are lacking, analysts may redefine the prognostic task or apply a Bernstein distribution to establish priors for random coefficients [13,14]. Similarity-based approaches involve identifying reference trajectories that closely resemble the unit under examination and calculating RUL as a weighted average of the most similar historical instances [15–18]. However, these methods often depend on manually defined similarity thresholds that may lack generalizability across different operational settings. On the other hand, stochastic process models, such as Wiener process models, have shown superior performance in capturing the inherent variability in degradation [19]. Various deep learning architectures, including autoencoders, deep belief networks, convolutional neural networks, and recurrent neural networks, have been utilized in equipment health monitoring for prognostic purposes [20]. Despite their robust nonlinear modeling capabilities, deep learning techniques necessitate substantial computational resources and extensive labeled datasets, which could impede their practical application. The existing literature does not provide a definitive conclusion on whether deep learning models consistently outperform traditional prognostic methods in predicting RUL, particularly in scenarios with limited failure data [21].
Researchers have utilized Wiener [22,23], Gamma [24,25], and Inverse Gaussian (IG) [26,27] processes in stochastic process models to characterize the deterioration of engineering components. The Gamma and IG processes, classified as Lévy processes with parameters, generate degradation paths with consistently positive increments, making them well-suited for modeling processes like wear or crack growth involving irreversible damage accumulation [24]. In contrast to the conventional stochastic model driven by Brownian motion, which can capture non-monotonic degradation and provide an approximate representation of monotonic degradation, offering considerable flexibility. For instance, a nonlinear Wiener process has been applied to predict bearing conditions, effectively capturing non-monotonic trends in vibration-based health indicators [28]. Hence, our research employs a stochastic process model based on a comprehensive review of existing literature. While all three processes can simulate wear progression, selecting the most suitable model hinges on the data characteristics, particularly the presence of monotonic patterns in the degradation trajectory and the optimal probability distribution for modeling wear increments. Although both Gamma and IG processes ensure monotonic paths, they diverge in their assumptions regarding increment distributions (Gamma distribution for Gamma processes and IG distribution for IG processes). The choice between these models should be guided by empirical validation to ascertain which increment distribution best aligns with the observed data.
2.2 Empirical analysis of wear monotonicity characteristics
The pantograph sliding strips inevitably experience thinning over time due to irreversible material loss from wear. Our wear analysis corroborates this trend, with data indicating either a steady decrease or constancy in strip thickness. As illustrated in Fig 4, the wear trajectories for twenty strips consistently demonstrate a decline in thickness, although minor deviations may arise from measurement errors. This consistent wear pattern represents a critical aspect that must be incorporated into any robust degradation model [29].
The figure shows twenty typical wear trajectories, demonstrating a consistent decrease in thickness.
2.3 Goodness-of-fit test for wear increments
The task at hand involves the selection of an appropriate probability distribution for these increments. Two potential distributions, namely Gamma and Inverse Gaussian (IG), were evaluated using Anderson–Darling (AD) goodness of fit tests [30]. Each strip’s positive wear increments were analyzed by fitting both a Gamma distribution and an IG distribution. Subsequently, the AD test was employed to compare the observed increment data with the cumulative distribution function (CDF) of each fitted distribution. The null hypothesis posited that the increments in the strip follow the specified distribution. If, at a significance level of 5%, the AD test indicated no statistically significant deviation between the empirical data and the fitted CDF, the distribution was considered a suitable model for the strip’s increments, demonstrating a successful outcome of the goodness of fit test for that distribution. Conversely, if the AD test detected a significant deviation at the 5% level, the distribution was rejected as an inadequate fit for that strip.
Table 1 displays the goodness-of-fit outcomes for wear increment data under two distributional assumptions. The Gamma distribution notably fits the wear increment data well across the entire fleet. Individual train pass rates for the Gamma distribution range from approximately 89% to 100%, with an overall 97.9% of the 1,120 increment series showing no significant deviation from the fitted Gamma model (Column 3 in Table 1). Specifically, only 18 series were deemed incongruent with the Gamma distribution by the Anderson-Darling (AD) test. Almost all trains adhered to the Gamma distribution, with 17 out of 20 trains having at least 95% of their strips passing the Gamma test; the one exception still achieved around 89%. This strong support highlights the excellent fit of wear increments with the Gamma distribution in nearly all cases.
While the Inverse Gaussian distribution achieved an impressive overall pass rate of 95.8%, its consistency across the entire fleet was somewhat lower. Individual train pass rates varied from 83% to 100% under this model, with 38 series showing significant deviations, approximately double the outliers compared to the Gamma distribution. It is notable that some trains, which had perfect pass rates under the Gamma distribution, encountered failures in one or two instances when tested using the Inverse Gaussian model. This highlights the superior ability of the Gamma distribution to capture the full range of observed wear increment patterns.
After comprehensive analysis, the Gamma process emerges as the optimal stochastic model for sliding-strip wear. Among the three processes evaluated (Wiener, Gamma, and IG), only the Gamma and IG processes exhibit consistent sign increments, leading to monotonically decreasing paths. Although the Wiener process is mathematically tractable, it may yield unrealistic thickness fluctuations when unrestricted. Moreover, the wear increment distribution associated with the Gamma distribution, boasting a 97.9% success rate, aligns both theoretically and empirically with models of sliding strip degradation.
3 Experimental dataset construction and preprocessing methodology
3.1 Analysis of factors influencing wear rate and representative sample selection
A test dataset was created using wear records to illustrate the methodology for predicting Remaining Useful Life (RUL) and planning maintenance. The dataset was structured based on train number, strip position, and manufacturer. Information from all 20 trains operating on Chongqing Metro Line 6 was gathered for this investigation, covering the complete fleet. As depicted in Fig 5, every six-car train in the fleet is fitted with two pantographs, each containing two sliding strips, totaling four strips per train (designated as positions 1–4). These strips were supplied by four distinct manufacturers (designated as 1–4).
The trailer cars (“TC”) are at both ends, and the four intermediate cars serve as motor cars. Two motor cars (“MP1” and “MP2”) are equipped with roof-mounted pantographs, while the other two (“M1” and “M2”) lack pantographs.To obtain a representative sample, we first identified variables influencing the rate of sliding-strip wear, defined as the average daily thickness reduction per strip. A one-way ANOVA analysis, followed by a Compact Letter Display post-hoc assessment, unveiled significant differences among the 20 trains [31,32]. Fig 6 classifies the fleet into three distinct groups based on wear rates, demonstrating statistical significance. These variances are likely linked to operational factors such as the duty cycle and average operating speed of each train.
The x-axis represents train numbers, and the y-axis shows the wear rate in millimeters per day. Outliers are marked with red dots. Compact Letter Display labels above each box indicate statistical groupings: identical letters denote nonsignificant differences, while distinct letters represent significant differences among trains.Fig 7 illustrates consistent wear rates among strips 1–4 within a single train, suggesting uniform rates of wear within the same train.
The x-axis denotes the position, and the y-axis represents the wear rate in millimeters per day. Outliers are marked with red dots. Statistical groupings are indicated by Compact Letter Display labels above each box, with matching letters suggesting no significant difference between positions.The study assessed the influence of manufacturers on wear rates, but faced challenges in making a valid comparison due to the exclusive use of specific manufacturers by certain trains (e.g., Trains 14–17 exclusively used Manufacturer 4). As a result, the manufacturer variable was omitted from the variance analysis, with train identity identified as the key factor driving variations in wear rates.
To preserve fleet diversity and facilitate analysis, we selected a single strip randomly from each train, yielding 20 representative samples. Table 2 presents information on the train, position, and manufacturer of each sample, while Fig 8 depicts the thickness-time trajectories corresponding to these samples, all demonstrating a uniform, gradual reduction in thickness.
The x-axis represents the number of days, and the y-axis shows thickness in millimeters. The color-coded lines depict the variability in wear progression rates observed within the fleet.
3.2 Degradation data transformation
In practical scenarios, raw thickness measurements commonly show a declining pattern over time, in contrast to the anticipated increasing degradation measure in a conventional Gamma process model [22]. To reconcile this disparity, we propose the incorporation of a linear transformation to the thickness data of individual strips. Let denote the maximum recorded thickness of strip i, and
represent the wear threshold. The transformed degradation indicator
for the j-th observation of strip i is formulated as.
Let represent the total number of strips in the sample, and
denote the number of observations for strip i. This conversion standardizes the data of each strip to a scale ranging from to 1: at installation
, and upon reaching the threshold
(or
if the threshold is surpassed).
Fig 9 depicts normalized wear accumulation paths, starting at zero and reaching one gradually; values exceeding one signify excessive wear. A consistent preprocessing methodology is employed for all thickness data to uphold data integrity and computational dependability.
The x-axis represents time elapsed in days, and the y-axis displays a dimensionless degradation indicator ranging from 0 to 1. Each curve corresponds to a specific test sample.
4 Gamma process modeling and bayesian estimation of remaining useful life
4.1 Gamma process model selection via hypothesis testing
This integrated methodology merges historical degradation patterns identified in Section 3 with present monitoring data within a Bayesian framework. This enables the ongoing adjustment of model parameters to uphold the dependability of predictive maintenance. The algorithm chooses historical trajectories by aligning static attributes (e.g., train identity and strip position) with those of each pending strip. As depicted in Fig 10, each pending strip aligns with three to seventeen corresponding historical replacements.
The horizontal axis lists the twenty pending samples detailed in Section 3, and the vertical axis shows the count of past strips with the same static attributes.Two primary Gamma-based degradation models commonly used in reliability analysis are the conventional model and the random-effect model [33]. The conventional model assumes that degradation follows a stationary process with consistent behavior across strips. In contrast, the random-effect model allows for variations in degradation rates among different strips. Let represent the measurement time of the j-th observation for the i-th sliding strip. Consistent with previous studies [34], we define the time increment as
, where time follows a power-law relationship controlled by a positive parameter γ. The corresponding decrease in the degradation indicator between consecutive observations is denoted as
, which has been empirically shown to display stochastic behavior.
The probability density function (PDF) governing the wear increment in a conventional Gamma degradation model is specified by Equation (2) [29]
The model utilizes shape and scale parameters, denoted as α and β, respectively, with the Gamma function represented by
. In contrast, the random-effect Gamma process model incorporates variability in the degradation rate among items by treating it as a random variable. Consequently, the integrated comprehensive PDF of undefined accounts for this random effect [35]
The PDF representing a Gamma distribution with shape parameter η and scale parameter δ is denoted as . This extended model accounts for the variability in wear rates at the individual level. Although more complex versions of the Gamma process have been suggested [36], their heightened intricacy could hinder parameter estimation. Therefore, we choose the aforementioned simpler formulation for this study.
To evaluate the optimal alignment of conventional and random-effect Gamma processes, a hypothesis test is performed under two critical assumptions: (1) the independence of historical degradation paths and (2) the uniform and independent distribution of increments within each path. The methodology for this test is outlined in reference [37].
Null Hypothesis (H₀): β is unknown but treated as non-random.
Alternative Hypothesis (H₁): β is treated as a random variable.
Under the constraints outlined in Equations (2) and (3), γ is assigned a value of 1 for H₀ and 2 for H₁ to simplify the model, reducing the number of independent parameters. This simplification improves computational stability and aids in predicting the RUL. The likelihood functions for each model are described in Equations (4) and (5), where represents the difference in the degradation indicator between consecutive measurements.
and the decision criterion relies on the maximum log-likelihood difference:
If the statistic equals or exceeds a critical value (the 1-θ quantile of a chi-square distribution with 1 degree of freedom [38]; for instance,
= 3.8415 at a.95 significance level), then H₀ is rejected in favor of H₁, and the random-effect model is selected. Otherwise, H₀ is not rejected, and the standard model is retained. Nonlinear optimizations required for maximum likelihood estimation are conducted using MATLAB’s IPOPT solver [39]. The outcomes of the model selection test are outlined in Table 3: solely strips 9, 12, and 19 exhibit a statistically significant value to justify the random-effect model, while the standard model adequately characterizes all remaining strips.
Table 3 presents the results of model selection for sliding strips employing a critical value of 3.8415 at a significance level of.95 in the chi-square distribution. The column “Maximum Log-Likelihood Difference ” shows the computed variances, and the column “Selected Model Index” indicates the favored model (1 = Conventional Gamma, 2 = Random-Effect Gamma).
Depending on the chosen model, we subsequently employ a Bayesian parameter estimation method (as outlined in Cases 1 and 2) to incorporate prior knowledge and new pending data.
4.2 Estimation of model parameters via bayesian inference
In Case 1, with Model Index set to 1, the shape parameter α and scale parameter β are assumed to be independent, each following a Gamma prior distribution. The posterior distribution of the parameters and the likelihood function for new observations are represented by equations (7) and (8), respectively.
Let and
represent the observed data vector for the pending strip, with asterisks denoting that this data is not included in the historical training set. By incorporating these observations into the likelihood function, we derive an updated expression
that is influenced by the new data. In Equation (8), the variable p denotes the percentage of the strip’s life cycle completed at the time of prediction. Higher values of p signify a larger amount of in-service data accessible for the strip.
In order to assess the probability, it is necessary to establish prior distributions for α and β. We employ an iterative method to derive informative priors from the historical dataset.
- Initialization: Initialize α and β as Gamma-distributed random variables; set iteration counter
.
- Data Extraction: Collect historical samples (degradation paths) with attributes matching the pending strip.
- Optimization was conducted utilizing the IPOPT solver to maximize the likelihood as defined in Equation (4) for the standard Gamma model by iteratively adjusting the parameters
and
. A convergence tolerance was
, with a maximum of 20 iterations permitted, incrementing
after each optimization iteration.
- Iteration Check: If
, return to Data Extraction with updated parameters; otherwise, proceed.
- Parameter Compilation: Aggregate the optimized values of
and
from each iteration into a sample dataset.
- Estimate Gamma Distributions: Fit the combined samples of
and
to Gamma distributions to obtain the final prior estimates of α and β for the forthcoming strip.
Case 2 pertains to Model Index = 2. When opting for the random-effect model, two additional parameters (η and δ) are integrated into both the posterior and likelihood formulations to characterize the distribution of the random effect β. Equations (9) and (10) delineate the posterior distribution and likelihood function for this specific case. The estimation procedure in Case 2 mirrors that of Case 1, albeit with the inclusion of an additional parameter, resulting in the estimation of three parameters (α, η, δ) instead of two.
Fig 11 and Fig 12 displays Gamma prior distributions obtained through Bayesian updating for the model parameters of individual pending strips. Each subplot corresponds to a particular pending strip and presents the estimated parameter distribution based on historical data.
Each subplot presents a histogram (blue bars) of parameter estimates, including shape (α) and scale (β), obtained from historical data, along with a fitted Gamma probability density function (red curve). These prior distributions serve as the basis for Bayesian updates as additional data is acquired.
Each subplot presents a histogram (blue bars) of parameter estimates, including shape (α) and scale (β), obtained from historical data, along with a fitted Gamma probability density function (red curve). These prior distributions serve as the basis for Bayesian updates as additional data is acquired.
4.3 Remaining useful life estimation and prediction error analysis
To dynamically solve Equations (7) and (9), we utilize Markov Chain Monte Carlo (MCMC) techniques through the DREAM toolbox [40,41], enabling effective sampling and convergence evaluation. The toolbox provides three point-estimation approaches for model parameters based on posterior samples [42]:
- Maximum A Posteriori (MAP): selects parameter values maximizing posterior probability;
- Maximum Likelihood Estimation with Posterior Constraints (MLE): optimizes the likelihood function under specified posterior constraints;
- Mean Estimate of Parameters (MEP): uses the mean of the posterior parameter sample.
These methodologies form the basis for assessing estimation techniques in predicting the RUL. After estimating parameters through these approaches, the RUL of the strip is calculated directly. In the conventional model, the RUL at the j-th observation time of the i-th strip is determined by Equation (11) [43,44].
Let represent the degradation indicator at time t. Assuming the scale parameter β follows a probability distribution with parameters η and δ, the mean value
can be utilized to determine the RUL [45]. Thus, the predicted RUL of the component can be calculated as follows:
If the degradation indicator of a test strip does not reach the specified threshold value of 1 prematurely, denoted as , Equations (11) and (12) should be adjusted by replacing the numerical value “1.” When considering a fixed parameter undefined in the two model scenarios, the final expressions for RUL are unified into Equation (13).
Fig 13 presents a comparative analysis of the RUL predictions generated by three estimation methods (MAP, MLE, and MEP) in relation to the actual remaining life of each test strip at different percentages of life cycle completion . This comparison enables a thorough evaluation of predictive precision as the strip’s lifespan advances.
Each subplot displays RUL in days on the vertical axis and the percentage of completed life cycle (p) on the horizontal axis. The lines labeled “True,” “MLE,” “MAP,” and “MEP” represent actual remaining life and RUL estimates derived from three distinct parameter estimation techniques.The three parameter estimation methods exhibit a general convergence toward the true remaining life as more data are collected over the course of the life cycle, as illustrated in Fig 12. However, during the early- to mid-life stages when data are limited, the prediction errors tend to be larger, with a few exceptions, such as test samples 5, 6, and 8, which deviate slightly from this overall trend. Furthermore, the MAP method demonstrates greater variability compared to both MLE and MEP, suggesting that it is less robust, particularly evident in the higher errors observed for samples 9 and 19 when using the MAP estimates.
To quantitatively assess prediction accuracy, we employ the mean absolute percentage error (MAPE) as the principal performance measure, as defined in Equation (14).
The variable denotes MAPE at a specific percentage of the life cycle, p, while
represents the number of test samples. Fig 14 illustrates the fluctuations in prediction errors (MAPE) for MAP, MLE, and MEP at various p values. As anticipated, the overall error decreases as more monitoring data becomes available, corresponding to higher p values.
The x-axis represents the life cycle completion percentage, and the y-axis denotes MAPE. The graph shows a decrease in error as the amount of monitoring data increases.These results underscore the substantial influence of the parameter estimation technique on predictive precision in our particular investigation. The MEP consistently yields the most accurate predictions at key stages of the life cycle (10%, 30%, and 90%), indicating its specific suitability for early-life, mid-life, and late-life phases. The MLE method provides the most precise and dependable estimates around the midpoint of the strip’s lifespan, typically at 50%. In contrast, the MAP method generally demonstrates higher prediction errors compared to other approaches, except near the 70% threshold, suggesting a lower level of robustness relative to MLE and MEP.
5 Maintenance scheduling based on remaining useful life predictions
5.1 Predictive maintenance criteria and cost-benefit calculation methodology
The adoption of predictive maintenance approaches has empowered metro operators to enhance the scheduling of sliding strip inspections. Traditionally, these inspections were integrated into standard vehicle maintenance procedures. Yet, with the advent of projected RUL data, it is now feasible to proactively plan additional maintenance tasks for the sliding strips beyond regular maintenance cycles. Maintenance choices are systematically modeled during each scheduled vehicle inspection to guarantee efficient resource distribution and enhance system dependability.
The decision to perform extra maintenance on a sliding strip between the j-th and (j + 1)-th scheduled maintenance events depends on a binary decision variable , indicating the need for additional maintenance. This decision is determined by Equation (15).
Predicting the remaining lifespan of the sliding strip before it reaches the critical wear threshold is essential for optimizing material efficiency and extending its service life. Conversely, once the strip has exceeded this threshold, further maintenance is deemed unnecessary. Notably, these scenarios hinge on a singular maintenance intervention.
The maintenance cost () is defined as the total expenditure on additional maintenance tasks, calculated using Equation (16).
In the equation provided, denotes team size,
represents average monthly salary per worker,
signifies average maintenance duration in hours, and
stands for monthly working hours per worker. This equation quantifies the labor costs associated with performing extra maintenance activities.
Utilizing RUL prediction can prolong the operational lifespan of a sliding strip by preemptively replacing it before it reaches its wear threshold. This practice enhances material efficiency with a quantifiable probability. The anticipated advantage, labeled as , is formally articulated in Equation (17).
Where represents the market price of the sliding strip,
denotes the average lifespan of the replaced sliding strips in the historical dataset, and
(representing the 90th percentile prediction error) signifies the probability of the predicted RUL deviating from the actual remaining life by an acceptable margin. The choice of the 90th percentile is justified by the prevalent practice of extending a strip’s usefulness primarily in its later stages of life, making this threshold a practical choice.
A reduction in strip thickness below a critical threshold prior to scheduled maintenance can result in operational instability. While RUL predictions can mitigate this risk, they are not infallible. Overreliance on forecasts for extending strip usage may result in surpassing the wear threshold. As noted in the Introduction, evaluations of benefits do not consider potential costs associated with operational disruptions, given the low likelihood of accidents due to delayed replacements. Consequently, beyond the defined threshold, the analysis focuses solely on identifying potential improvements without assigning monetary value to them.
The expected net benefit is determined by subtracting the maintenance cost from the aggregated expected benefits across all samples (denoted as).
5.2 Parameter specification and analysis of predictive maintenance scheduling results
Accurate assessment of these metrics necessitates precise definition of parameter values. The parameter is derived from our predictive model, reflecting the error estimations outlined in Section 3 (Fig 10). Results for MLE, MAP, and MEP suggest that stands at around 5.69%, 5.70%, and 5.13%, respectively. In contrast, historical data informs the parameter
, with an unspecified cost for a sliding strip estimated at approximately ¥5,000 based on industry surveys. The study draws upon data from Chongqing Metro Line 6 as a benchmark. Please consult Table 4 for a detailed compilation of parameter values.
The results of planned maintenance according to RUL forecasts utilizing MLE, MAP, and MEP methods are illustrated in Fig 15 and Fig 16. Scheduled maintenance events for vehicles are depicted by black circles, while projected maintenance actions from each parameter estimation technique are shown as colored triangles, with each color corresponding to the specific method as delineated in the legend.
Each line represents maintenance events marked by black circles, with adjacent numbers indicating the corresponding maintenance dates. Anticipated maintenance occurrences, predicted through various estimation techniques, are represented by colored triangles, with distinct colors indicating temporal intersections as per the legend’s specifications.
Each line represents maintenance events marked by black circles, with adjacent numbers indicating the corresponding maintenance dates. Anticipated maintenance occurrences, predicted through various estimation techniques, are represented by colored triangles, with distinct colors indicating temporal intersections as per the legend’s specifications.In instances where the final degradation indicator is below one, it is essential to conduct RUL prediction and determine maintenance actions during the last scheduled maintenance period. Specifically, Samples 1, 3, 5, 6, 7, 11, 15, 16, 17, and 20 are classified in this category. In such cases, besides the potential extension of the service life at the end (which necessitates an additional maintenance task), no extra maintenance is required in preceding intervals. Nevertheless, Samples 16 and 20 present unique situations. For example, Sample 16 reaches a degradation indicator of.77 on day 38, triggering scheduled maintenance on day 48 (MLE), 49 (MEP), or 50 (MAP). Conversely, Sample 20, with a degradation indicator of.64, indicates that maintenance should be executed on day 61 according to all three approaches.
Predicting the remaining useful life when the degradation indicator from the most recent maintenance event reaches or exceeds one is considered unnecessary. Utilizing forecasts from the maintenance event prior to the last one can effectively reduce the likelihood of surpassing the threshold. For example, in Sample 2, where the final degradation indicator exceeds one, all approaches suggest maintenance on day 92, preempting the scheduled maintenance on day 96 and preventing threshold exceedance. Comparable results are evident in Samples 4 and 13.
The MLE, MAP, and MEP methods generally provide consistent predictions for maintenance timing, with occasional deviations. For instance, in Sample 12 (final indicator value of.95), the estimated maintenance timing falls short of preventing threshold exceedance. In Sample 8, both the MLE and MEP methods recommend an additional maintenance step to avoid surpassing the threshold, unlike the MAP method. Conversely, in Sample 9, MAP and MEP suggest an extra maintenance step to prevent threshold exceedance, while MLE does not. Notably, Sample 18, with an actual indicator of.75 on day 42, necessitates maintenance on day 56. In contrast, Samples 14 and 19 exhibit no discrepancies, as all three methods propose similar maintenance schedules. Overall, the three methods exhibit a high level of agreement in anticipated maintenance timing, with only a few exceptions. Further analysis involved calculating costs, benefits, and relevant metrics using Equations (16) and (7), with results detailed in Table 5. Combined, the three approaches require 19 additional maintenance tasks, incurring a labor cost of ¥56.25 each, totaling approximately ¥1,068 in labor expenses. Discrepancies in projected maintenance timing and prediction accuracy result in varying expected benefits, with the MEP method showing the lowest projected net gain. Nevertheless, for decision-making purposes, using the lowest expected benefit as a conservative benchmark is recommended.
In scenarios where the wear threshold is exceeded, MLE exhibits the highest performance, while MAP displays the lowest performance, and MEP falls in an intermediate position. Therefore, when the objective is to optimize measurable net benefits or improve operational dependability upon surpassing the threshold, adopting an MEP-centric approach emerges as a balanced strategy for subsequent parameter estimation.
6 Discussion
6.1 Impact of prior information on predictive accuracy and implementation details
This study does not directly compare RUL prediction errors with previous research for two main reasons. Firstly, our analysis goes beyond solely reducing errors to optimizing maintenance and replacement schedules using RUL estimates. Secondly, the absence of accessible source code in most prior studies makes replication unattainable within the scope of our research.
The Bayesian updating process adheres to a specific protocol where the likelihood function is updated solely when historical data with corresponding attributes are revised, while new observations from the target directly modify the likelihood. This dual-input approach ensures that the posterior distribution incorporates both past patterns and current observations, with prior parameters significantly influencing predictive precision.
The importance of incorporating prior information is highlighted in the evaluation of prediction accuracy, as demonstrated in Fig 17. Omitting prior information consistently results in increased prediction errors across all stages of the life cycle. Without prior information, MAP and MLE methods produce identical results, as the posterior distribution reduces to the likelihood function alone. This effect is substantial, with prediction errors increasing by 42.07%, 39.31%, 37.27%, 24.85%, and 10.91% at life cycle completion thresholds of 10%, 30%, 50%, 70%, and 90%, respectively. These results unequivocally emphasize the critical importance of incorporating historical knowledge.
The x-axis represents the life cycle completion percentage, and the y-axis indicates MAPE. The graph shows a reduction in error as additional monitoring data is accumulated, excluding prior information.The framework is operationalized in MATLAB for data processing, employing IPOPT for maximum likelihood estimation and DREAM for Markov Chain Monte Carlo sampling. While alternative programming environments could theoretically support this approach, the adaptation of these specialized optimization and sampling techniques presents practical implementation difficulties. The current implementation requires manual intervention at certain points; nevertheless, all code and data are publicly available on GitHub (https://github.com/jery761210/sliding-strips.git) to foster collaboration and progress.
6.2 Emerging trends in predictive maintenance and cross-domain applicability
Recent technological advancements offer significant opportunities to enhance and broaden our approach to predicting RUL. Digital twin technology allows for the creation of real-time virtual replicas of physical assets, enabling detailed simulation and predictive analysis. In the context of pantograph systems, digital twins can incorporate the Gamma process model to simulate wear progression by adjusting parameters based on operational variables. Additionally, the integration of diagnostic frameworks that combine data from various sensors, such as vibration signals, acoustic emissions, temperature readings, and electrical data, is essential for comprehensive equipment health evaluations. By amalgamating data from multiple sensor types, the proposed framework can identify degradation patterns that might be missed by single-sensor methodologies.
The methodology under consideration is easily transferable to the examination of analogous transportation systems, like high-speed rail and other transit networks, as evidenced in a recent investigation on gearbox functionality [46]. These systems exhibit fundamental resemblances to pantograph strips, such as uniform wear patterns under different operational loads, standardized inspection procedures, and established safety protocols. Tailoring the model to these alternative systems mainly involves adjusting the data sampling methodology to accommodate distinct operational traits and validate the training data’s precision, along with refining pertinent parameters instead of necessitating substantial system restructuring.
7 Conclusions
This study presents a framework for predicting the remaining useful life of Metro pantograph sliding strips using a stochastic degradation model grounded in the Gamma process. Analysis of 1,207 sliding strips obtained from Chongqing Metro Line 6 yields four significant findings.
- Statistical analysis confirms the effectiveness of the Gamma stochastic process in representing the behavior of sliding strip wear, as 97.9% of strips show no significant deviation from the Gamma distribution. This validation underscores the critical importance of choosing the correct degradation model.
- The predictive accuracy improves steadily as additional condition monitoring data is incorporated. The Mean Estimate of Parameters (MEP) method demonstrates superior performance compared to the Maximum Likelihood Estimation (MLE) method at 10%, 30%, and 90% life cycle stages, with MLE exhibiting better performance around the 50% milestone. The Maximum A Posteriori (MAP) method generally results in higher errors, except in the proximity of the 70% life cycle stage.
- Incorporating prior distributions improves prediction accuracy by 10% to 42% across different life stages compared to methods that do not utilize historical data. This substantial improvement highlights the crucial role of leveraging previous knowledge in achieving reliable predictions of Remaining Useful Life.
- Implementing maintenance scheduling based on Remaining Useful Life (RUL) results in a net gain of approximately ¥3,048 per test case through optimized replacement timing. This financial validation highlights the practical importance of predictive maintenance approaches within the Metro operations framework.
The Gamma process framework harmonizes theoretical robustness with operational utility, enabling precise degradation characterization and enhancing maintenance planning. Through the integration of Bayesian inference with practical maintenance constraints, a methodology is formulated that is readily applicable in practical settings.
Future research should prioritize enhancing the framework’s capabilities and validating its effectiveness across various wear mechanisms and operational cycles, such as high-speed rail gearboxes and other transportation assets. The integration of continuous Internet of Things sensor data could facilitate real-time updates on Remaining Useful Life through efficient online Bayesian algorithms. Hybrid models that merge Gamma processes with deep learning could adeptly capture nonlinear degradation patterns while maintaining interpretability. Enhancing computational efficiency via advanced sampling methods or parallel processing would support real-time fleet-wide implementation. By optimizing maintenance activities for critical components, overall lifecycle expenses could be reduced. These advancements have the potential to elevate the prototype into a comprehensive predictive maintenance platform suitable for deployment in urban and high-speed rail networks.
Acknowledgments
The authors would like to acknowledge the Chongqing Rail Transit Group for their data supports.
References
- 1. Kuźnar M. Prediction of thickness of pantograph contact strips using artificial neural networks. Czasopismo Techniczne. 2019;(12):173–80.
- 2. Xu W, Peng J, Qiu C. Prediction model of subway pantograph slide plate wear trend based on LSSVR. Urban Rail Transit. 2020;29(1):77–81.
- 3.
Trilla A, Fernández V, Cabré X. Enhancing railway pantograph carbon strip prognostics with data blending through a time-delay neural network ensemble. In: PHM Society European Conference; 2020. https://doi.org/10.1109/ICNC.2020.9374617
- 4. Kuźnar M, Lorenc A, Kaczor G. Pantograph sliding strips failure-reliability assessment and damage reduction method based on decision tree model. Materials (Basel). 2021;14(19):5743. pmid:34640149
- 5. Kuźnar M, Lorenc A. A method of predicting wear and damage of pantograph sliding strips based on artificial neural networks. Materials (Basel). 2021;15(1):98. pmid:35009248
- 6. Guan Q, Wei X, Jia L, He Y, Zhang H. RUL prediction of railway PCCS based on wiener process model with unequal interval wear data. Applied Sciences. 2020;10(5):1616.
- 7. Li B, Sui Y, Wang Z, Zhong L. Wear characteristics analysis and residual life prediction of pantograph-catenary system slide plate. Journal of Liaoning Technical University (Natural Science). 2021;40(5):454–9.
- 8. Kang R, Gong W, Chen Y. Model-driven degradation modeling approaches: Investigation and review. Chinese Journal of Aeronautics. 2020;33(4):1137–53.
- 9. Si X-S, Wang W, Hu C-H, Zhou D-H. Remaining useful life estimation – A review on the statistical data driven approaches. European Journal of Operational Research. 2011;213(1):1–14.
- 10. Liao L, Köttig F. A hybrid framework combining data-driven and model-based methods for system remaining useful life prediction. Applied Soft Computing. 2016;44:191–9.
- 11. Bucca G, Collina A. A procedure for the wear prediction of collector strip and contact wire in pantograph–catenary system. Wear. 2009;266(1–2):46–59.
- 12. Elwany AH, Gebraeel NZ. Sensor-driven prognostic models for equipment replacement and spare parts inventory. IIE Transactions. 2008;40(7):629–39.
- 13. Gebraeel N, Elwany A, Jing Pan. Residual Life Predictions in the Absence of Prior Degradation Knowledge. IEEE Trans Rel. 2009;58(1):106–17.
- 14. Chen Q, Nicholson G, Ye J, Zhao Y, Roberts C. Estimating residual life distributions of complex operational systems using a Remaining Maintenance Free Operating Period (RMFOP)-based methodology. Sensors (Basel). 2020;20(19):5504. pmid:32992886
- 15.
Wang T, Jianbo Yu, Siegel D, Lee J. A similarity-based prognostics approach for Remaining Useful Life estimation of engineered systems. In: 2008 International Conference on Prognostics and Health Management, 2008:1–6. https://doi.org/10.1109/phm.2008.4711421
- 16. Li LL, Ma DJ, Li ZG. Residual useful life estimation by a data-driven similarity-based approach. Quality and Reliability Engineering International. 2017;33(8):1959–67.
- 17. Bektas O, Jones JA, Sankararaman S, Roychoudhury I, Goebel K. A neural network filtering approach for similarity-based remaining useful life estimation. Int J Adv Manuf Technol. 2018;101(1–4):87–103.
- 18. Yu W, Kim IY, Mechefske C. An improved similarity-based prognostic algorithm for RUL estimation using an RNN autoencoder scheme. Reliability Engineering & System Safety. 2020;199:106926.
- 19. Le Son K, Fouladirad M, Barros A, Levrat E, Iung B. Remaining useful life estimation based on stochastic deterioration models: A comparative study. Reliability Engineering & System Safety. 2013;112:165–75.
- 20. Gebraeel NZ, Lawley MA. A neural network degradation model for computing and updating residual life distributions. IEEE Trans Automat Sci Eng. 2008;5(1):154–63.
- 21. Wang Y, Zhao Y, Addepalli S. Remaining useful life prediction using deep learning approaches: a review. Procedia Manufacturing. 2020;49:81–8.
- 22. Zhang Z, Si X, Hu C, Lei Y. Degradation data analysis and remaining useful life estimation: A review on Wiener-process-based methods. European Journal of Operational Research. 2018;271(3):775–96.
- 23. Shen Y, Shen LJ, Xu W. A Wiener-based degradation model with logistic distributed measurement errors and remaining useful life estimation. Quality and Reliability Engineering International. 2018;34(6):1289–1302.
- 24. Le Son K, Fouladirad M, Barros A. Remaining useful lifetime estimation and noisy gamma deterioration process. Reliability Engineering & System Safety. 2016;149:76–87.
- 25. Xu A. Online Bayesian prediction of remaining useful life for gamma degradation process under conjugate priors. arXiv. 2022.
- 26. Pan D, Liu J-B, Cao J. Remaining useful life estimation using an inverse Gaussian degradation model. Neurocomputing. 2016;185:64–72.
- 27. Chen XD, Sun XL, Si XS, Li GD. Remaining useful life prediction based on an adaptive inverse Gaussian degradation process with measurement errors. IEEE Access. 2020;8:3498–3510.
- 28. Guo J, Wang Z, Li H, Yang Y, Huang C-G, Yazdi M, et al. A hybrid prognosis scheme for rolling bearings based on a novel health indicator and nonlinear Wiener process. Reliability Engineering & System Safety. 2024;245:110014.
- 29. Lawless J, Crowder M. Covariates and random effects in a gamma process model with application to degradation and failure. Lifetime Data Anal. 2004;10(3):213–27. pmid:15456104
- 30. Guida M, Penta F. A gamma process model for the analysis of fatigue crack growth data. Engineering Fracture Mechanics. 2015;142:21–49.
- 31. Nwobi F, Akanno F. Power comparison of ANOVA and Kruskal‒Wallis tests when error assumptions are violated. Adv Meth Stat. 2021;18(2).
- 32. Hanusz Z, Tarasińska J. Normalization of the Kolmogorov–Smirnov and Shapiro–Wilk tests of normality. Biometrical Letters. 2015;52(2):85–93.
- 33. Wang H, Xu T, Mi Q. Lifetime prediction based on Gamma processes from accelerated degradation data. Chinese Journal of Aeronautics. 2015;28(1):172–9.
- 34. van Noortwijk JM. A survey of the application of gamma processes in maintenance. Reliability Engineering & System Safety. 2009;94(1):2–21.
- 35. Hao H, Su C, Li C. LED Lighting System Reliability Modeling and Inference via Random Effects Gamma Process and Copula Function. International Journal of Photoenergy. 2015;2015:1–8.
- 36. Rodríguez-Picón LA, Rodríguez-Picón AP, Méndez-González LC, Rodríguez-Borbón MI, Alvarado-Iniesta A. Degradation modeling based on gamma process models with random effects. Communications in Statistics - Simulation and Computation. 2017;:0–0.
- 37. Liu G, Guan Q, Tang Y, Tzeng Y. Interval Modeling for Gamma Process Degradation Model. Symmetry. 2022;14(5):954.
- 38. Woody MS, Lewis JH, Greenberg MJ, Goldman YE, Ostap EM. MEMLET: An Easy-to-Use Tool for Data Fitting and Model Comparison Using Maximum-Likelihood Estimation. Biophys J. 2016;111(2):273–82. pmid:27463130
- 39. Lee L-F, Umberger BR. Generating optimal control simulations of musculoskeletal movement using OpenSim and MATLAB. PeerJ. 2016;4:e1638. pmid:26835184
- 40. Andrieu C, de Freitas N, Doucet A, Jordan MI. Machine Learning. 2003;50(1/2):5–43.
- 41. Vrugt JA. Markov chain Monte Carlo simulation using the DREAM software package: Theory, concepts, and MATLAB implementation. Environmental Modelling & Software. 2016;75:273–316.
- 42. Laloy E, Vrugt JA. High‐dimensional posterior exploration of hydrologic models using multiple‐try DREAM(ZS) and high‐performance computing. Water Resources Research. 2012;48(1).
- 43. Park C, Padgett WJ. Accelerated degradation models for failure based on geometric Brownian motion and gamma processes. Lifetime Data Anal. 2005;11(4):511–27. pmid:16328574
- 44. Wang H, Liao H, Ma X, Bao R. Remaining Useful Life Prediction and Optimal Maintenance Time Determination for a Single Unit Using Isotonic Regression and Gamma Process Model. Reliability Engineering & System Safety. 2021;210:107504.
- 45. van Noortwijk JM, van der Weide JAM, Kallen MJ, Pandey MD. Gamma processes and peaks-over-threshold distributions for time-dependent reliability. Reliability Engineering & System Safety. 2007;92(12):1651–8.
- 46. Wu H, Wei J, Wu P, Zhang F, Liu Y. Dynamic response analysis of high-speed train gearboxes excited by wheel out-of-round: experiment and simulation. Vehicle System Dynamics. 2024;:1–27.