Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Machine learning approaches for predicting Cracking Tolerance Index (CTIndex) of asphalt concrete containing reclaimed asphalt pavement

  • Lan Ngoc Nguyen,

    Roles Conceptualization, Data curation, Methodology, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation University of Transport and Communications, Hanoi, Vietnam

  • Thanh-Hai Le,

    Roles Conceptualization, Data curation, Investigation, Methodology, Supervision, Visualization, Writing – original draft, Writing – review & editing

    Affiliation University of Transport Technology, Hanoi, Vietnam

  • Linh Quy Nguyen,

    Roles Investigation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation University of Transport Technology, Hanoi, Vietnam

  • Van Quan Tran

    Roles Conceptualization, Investigation, Methodology, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    quantv@utt.edu.vn

    Affiliation University of Transport Technology, Hanoi, Vietnam

Abstract

One of the various sorts of damage to asphalt concrete is cracking. Repeated loads, the deterioration or aging of material combinations, or structural factors can contribute to the development of cracks. Asphalt concrete’s crack resistance is represented by the CT index. 107 CT Index data samples from the University of Transport Technology’s lab are measured. These data samples are used to establish a database from which a Machine Learning (ML) model for predicting the CT Index of asphalt concrete can be built. For creating the highest performing machine learning model, three well-known machine learning methods are introduced: Random Forest (RF), K-Nearest Neighbors (KNN), and Multivariable Adaptive Regression Spines (MARS). Monte Carlo simulation is used to verify the accuracy of the ML model, which includes the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and coefficient of determination (R2). The RF model is the most effective ML model, according to analysis and evaluation of performance indicators. By SHAPley Additive exPlanations based on RF model, the input Aggregate content passing 4.75 mm sieve (AP4.75) has a significant effect on the variation of CT Index value. In following, the descending order is Reclaimed Asphalt Pavement content (RAP) > Bitumen content (BC) > Flash point (FP) > Softening point > Rejuvenator content (RC) > Aggregate content passing 0.075mm sieve (AP0.075) > Penetration at 25°C (P). The results study contributes to selecting a suitable AI approach to quickly and accurately determine the CT Index of asphalt concrete mixtures.

1. Introduction

Asphalt concrete is one of the most common types of pavement surface materials used in the world. Asphalt concrete mixtures are materials with many advantages, such as being easy to construct with high productivity, open to traffic immediately after construction, good quality, uniform, and easy to repair. However, asphalt concrete mixtures are a typically temperature and humidity sensitive material, so in the operation process, under the effect of heavy loads, large vehicle traffic combined with environmental impacts such as high temperatures and humidity, asphalt pavement is easily degraded in quality leading to damage, rutting, permanent deformation, cracking. Cracking is one of many types of damage to asphalt concrete. Cracks develop due to repeated loading, deterioration or aging of material mixtures or structural conditions. All these factors affect the durability of the pavement. Cracks are not early treated, it will widen and grow. Rainwater will penetrate the crack, causing further cracking and possibly leading to structural damage.

An extensive literature review conducted as part of the National Cooperative Highway Research Program (NCHRP) 9–57 identified various crack tests [1]. Seven cracking tests were finally selected by the NCHRP 9–57 members and invited experts for further field validation, namely bending beam fatigue (BBF) test, overlay test (OT), disk-shaped compact tension (DCT) test, indirect tensile creep and strength test (IDT-CST) with full instrumentation, and three versions of semi-circular bend (SCB) tests [2]. The above tests have disadvantages such as: cutting samples, long testing time, high cost of equipment, etc.

In 2017, Zhou et al. [2] researched and developed a new test for cracking of asphalt concrete mixture called IDEAL-CT (indirect tensile asphalt cracking test) for determining the CT index representing the crack resistance of asphalt concrete. The IDEAL-CT test is sensitive to the composition of the asphalt concrete mixture and has been carried out at the State Departments of Transportation and in the contractors’ laboratories in the United States. Compared with the above seven concrete cracking test methods, the IDEAL-CT method can be easily implemented with the following advantages: no instrumentation, cutting, gluing, drilling, or notching of specimens) and practicality (minimum training needed for routine operation), and repeatability (coefficient of variance less than 25%) [3, 4]. It is also reported that the CT Index measured in the IDEAL-CT test correlated well with the field performance in terms of fatigue cracking from the FHWA accelerated loading facility full-scale testing.

Yan et al. [5] studied and compared the cracking tolerance index of asphalt concrete mixtures between 3 test methods SCB-IFIT, un-notched SCB-IFIT, and IDEAL-CT. The cracking tolerance index (CT index) results from IDEAL-CT correlate very well with the FI from SCB-IFIT. CT index results show less variation than the FI results (average COV of 5.3% versus 23.0%). Since the IDEAL-CT requires no specimen cutting or notching, it could be a promising alternative to evaluate the cracking resistance of asphalt mixtures.

Standard Test Method for Determination of Cracking Tolerance Index of Asphalt Mixture Using the Indirect Tensile Cracking Test at Intermediate Temperature ASTM D8225-19 has been promulgated by ASTM in 2019. Experimental tests can evaluate the material properties and physical and mechanical properties of the asphalt mixture. However, the time to carry out these tests is quite long and does not provide nearly complete information about the main and essential factors among the variables. Accidentally, in the process of making an error at a certain step, the experiment will have to be repeated, which causes a great loss in cost and time. Therefore, it is necessary to use new approaches to overcome such obstacles.

In the past few years, artificial intelligence or Machine Learning (ML) model has been one of the advanced techniques in the industrial 4.0 era that has been applied in many fields of technical science and natural science to solve real-life problems, initially showing outstanding effectiveness and benefits. These methods also predict many essential pavement parameters in the transport sector. Le et al. [6] developed an alternative numerical tool using an artificial neural network (ANN) to predict SMA mixtures’ Marshall Stability and Marshall Flow. Nguyen et al. [7] used an adaptive open neural inference system to predict the international roughness index IRI. Le et al. [8] developed three AI models, namely GAANFIS, PSOANFIS, and Support Vector Machine (SVM), to predict the Marshall Parameters of Stone Matrix Asphalt. The ability and effectiveness of artificial intelligence techniques or ML model in predicting asphalt concrete problems have also been evaluated and confirmed in many other studies [911]. Using a classification and regression tree (CART), Arifuzzaman et al. [12] examined the aging behavior of asphalts treated with styrene-butadiene-styrene (SBS) and styrene-butadiene (SB), and carbon nanotube (CNT). Additional explanatory connections between numerous factors that affect how oxidized asphalt behaves at different levels of the tree are revealed by the CART analysis. And it was shown to be more accurate than the outcomes of the regression model. The above studies show that it is feasible to apply artificial intelligence techniques to predict the CT index of asphalt concrete mixtures. Liu et al. [13] used ANN and SVM combining dimensionality reduction techniques as Principal Component Analysis (PCA) for predicting alligator cracking and longitudinal cracking of asphalt pavement. Five different ML algorithms including SVM, RF, ANN and Gradient Boosting (GB), and multiple linear regression were proposed to predict rutting depth of asphalt pavement [14]. In the investigation of Liu et al. [15], international roughness index (IRI) of asphalt pavement is also successfully predicted by machine learning models including SVR, ANN, GB, RF, Gaussian process regression and Extra-trees.

In this study, 107 data samples of CTIndex were used, each CTIndex data sample was determined as the average of three individual samples in the laboratory. The experimental procedure is described in the section 3. Based on these data samples, a database is created to build Machine Learning model to predict CT index of asphalt concrete. Three popular Machine Learning algorithms such ash Random Forest (RF), K-Nearest Neighbors (KNN), and Multivariable Adaptive Regression Spines (MARS) are introduced for building the best performance ML model. The performance of the ML models was assessed using four performance indexes such as the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and coefficient of determination (R2). The performance of ML model is validated by Monte Carlo simulation. Based on the best performance ML model, SHAPley Additive exPlanations (SHAP value) [16] is a visualization tool that is introduced to make machine learning model’s output more understandable. Therefore, in order to make clear the effect of each input variable including 8 factors such as (1) Aggregate content passing 4.75 mm sieve (AP4.75), (2) Aggregate content passing 0.075 mm sieve (AP0.075), (3) Bitumen content (BC), (4) Penetration at 25°C (P), (5) Flash point (FP), (6) Softening point (°C), (7) Reclaimed Asphalt Pavement content (RAP) and (8) Rejuvenator content (RC) on the CTIndex, the feature importance analysis will be performed by SHAPley Additive exPlanations in the last section. The study’s results contribute to selecting a suitable AI approach to quickly and accurately determine the CT index of asphalt concrete mixtures.

2. Significance research

CT index is a relatively new index introduced to determine the shear resistance of asphalt pavement. However, the experiment to determine CT index of asphalt pavement is relatively expensive and time consuming. Application of machine learning models has been carried out to study many properties of pavement. However, the study of CT index by machine learning model has not been proposed. Therefore, this study focuses on proposing to build a high performance machine learning model to evaluate CT index from 8 input boundaries including (1) Aggregate content passing 4.75 mm sieve (AP4.75), (2) Aggregate content passing 0.075 mm sieve (AP0.075), (3) Bitumen content (BC), (4) Penetration at 25°C (P), (5) Flash point (FP), (6) Softening point (°C), (7) Reclaimed Asphalt Pavement content (RAP) and (8) Rejuvenator content (RC), and evaluate and quantify the influence of these input variables on the CT index value.

3. Experimental procedure and experimental results

3.1. Material used

The asphalt mixtures used in the evaluation are shown in Table 1 with Dense Graded Asphalt (DGA) and gab-grade of Stone Matrix Asphalt (SMA). The two sources of aggregate used for the mixtures are designated type I and type II, respectively. The properties of these two aggregate sources are shown in Table 2.

Valdés et al. [17] shows the acceptable range of Reclaimed Asphalt Pavement (RAP) varying from 10% to 50% weight of asphalt pavement. Three types of rejuvenator consisting of sasobit, Zycotherm and Prephalt are used as additive binder for Dense Graded Asphalt (DGA) According to Anderson et al. [18], the sasobit content as 1.5% weight of binder should be used. Ayazi et al. [19] proposed the used Zycotherm to be equal to 0.15% weight of binder. Three contents of prephalt including 3.0, 5.0 and 7.0% weight of binder are recommended to use according to Shi et al. [20]. Therefore, the study also used cracking test data of Warm Mixtures Asphalt (WMA) and Hot Mixtures Asphalt (HMA) with RAP content of 0%, 20%, 30%, 40% and 50% weight of binder, respectively, to analyze accumulate. The WMA uses Sasobit and Zycotherm additives, HMA uses Prephalt rejuvenator additive at the contents of 3%, 5% and 7% respectively. The properties of the virgin and modified binders in unaged condition are shown in Table 3.

Flash point should have the significant effect on performance of asphalt pavement at high temperature, and Penetration at 25°C seems to have not any meaningful of binder performance. In this study, the results of CT index testing of asphalt mixes using 5 types of bitumen binder are described in Table 1. To distinguish 5 types of bitumen binder for the CT index evaluation test in unaged condition, four properties including Flash point, Penetration at 25°C, Softening point and Ductility at 25°C described in Table 3 are considered, properties of bitumen binder for Short-term aged (RTFO) and Long-term aged (PAV) are not considered in this investigation. Ductility at 25°C of the 5-bitumen binders has similarity value (> 100 cm). Evaluating the influence of bitumen binders’ properties on CT index by ML technique-SHAP value is also the aim of this study.

Therefore, 3 input variables including Flash point, Penetration at 25°C and Softening point are used to build a machine learning model to determine and evaluate the CT index.

The study used asphalt mixtures currently used in Vietnam, these mixtures were designed according to the Marshall method. When preparing the CTindex test sample, the gyratory compaction is used to ensure the air voids of the mixture according to the regulations of ASTM D8225 is equal to be 7±0.5%. There are 17 mixtures of DGA and SMA designed according to Marshall method, the specifications of the mixtures all meet the specifications according to MS-02 and AASHTO M325. Fig 1 shows the gradation curves of a mixture of SMA and DGA designed using aggregate type I and II.

thumbnail
Fig 1.

Mixture gradation aggregates: (a) Stone Mastic Asphalt (SMA) samples, (b) Dense Graded Asphalt (DGA) samples.

https://doi.org/10.1371/journal.pone.0287255.g001

3.2 Methodology of experiment

The IDEAL-CT was used to determine the intermediate-temperature cracking resistance of asphalt mixtures. The indirect tensile asphalt cracking test was recently developed as a practical cracking test that could be routinely used in asphalt mix design as well as for quality control/quality assurance (QC/QA) [2]. The test is performed on 150-mm diameter and 62-mm high gyratory specimens that are typically compacted to a target air void level of 7 ± 0.5 percent. Test specimens are conditioned for two hours at 25°C, and then tested using an indirect tension load frame. A minimum of three replicates are typically tested for a mixture. Testing can be performed using a stand-alone servo-hydraulic machine capable of sampling load and displacement data at a rapid rate (40 Hz), as shown in Fig 2(A). Specimens are loaded monotonically at a rate of 50 mm/min in load line displacement (LLD) until failure. A plot of load versus LLD is shown in Fig 2(B). The plot of load versus displacement is then analyzed to determine the CT index: (1) where:

Gf−fracture energy, J/m2

|m75| ‐ slope at 75% peak load, N/m

D ‐ specimen diameter, mm

t ‐ specimen thickness, mm

l75 ‐ displacement corresponding to 75% of the peak load at the post-peak stage, mm

thumbnail
Fig 2.

IDEAL-CT Test Device and Typical Result: (a) Test Device; (b) Recorded Load (P) versus Load-Line Displacement Curve according to Zhou et al [2].

https://doi.org/10.1371/journal.pone.0287255.g002

The specimen diameter D is the standard value 150 mm; therefore, this is a constant which can be neglected. Moreover, the Eq (1) helps to calculate experimentally CT index value in the experimental part (2.1 and 2.2 of this study). Therefore, the specimen diameter D is not considered in for building Machine Learning model in the next sections.

3.3. Experimental results and analysis

In this study, the database on cracking tolerance index CTIndex was collected from experimental studies in the laboratory of the University of Transport Technology. The database includes 107 records on the CTIndex cracking tolerance index of asphalt mixtures, SMA mixes, and Reclaimed asphalt pavement (RAP) using common asphalt, polymers, and additives, such as Sasobit, Zycotherm, and Prephalt.

The input data includes 8 variables: (1) Aggregate content passing 4.75 mm sieve (AP4.75), (2) Aggregate content passing 0.075 mm sieve (AP0.075), (3) Bitumen content (BC), (4) Penetration at 25°C (P), (5) Flash point (FP), (6) Softening point (°C), (7) Reclaimed Asphalt Pavement content (RAP) and (8) Rejuvenator content (RC) on the CTIndex. The output data is the cracking tolerance index CTIndex (CT).

The dataset used in the study was randomly divided into two sub-datasets using a uniform distribution, in which 70% of the data was used to build the training model and 30% of the data was used for verification. The statistical parameters such as minimum, median, maximum and standard deviations… are presented in detail in Table 4.

Fig 3 depicts the distribution of input parameters in this study. The plots for the distribution of the data for the parameters are shown through the histograms from Fig 3A3H. Most of the variation in the number of samples has a relatively large distribution, often concentrated in a relatively narrow range of values. Aggregate content passing 4.75 mm sieve (AP4.75) varied from 24.8% to 53.1%, with most samples in the 50% content range. Aggregate content passing 0.075 mm sieve (AP0.075) ranges in content from 6.4% to 9.6%. A fairly large number of samples are concentrated in the 7% range, others in the 8–9% range. Bitumen content (BC) has a large number of samples in the 5% range. Penetration at 25°C (P) has the number of samples located mainly in the range of 60–65 g/cm3. Flash point (FP) ranges from 247-362(mm), but most data points are in the 250–275 mm range. Softening point has distribution values concentrated mainly at 50°C. With Reclaimed Asphalt Pavement content (RAP) and rejuvenator content (RC) visible, most samples are concentrated in the 0% range. The distribution of the number of samples of the output variables in the experimental data (Fig 3I) shows that most of the samples have the CT index value of cracking tolerance index in the range of 0–200.

thumbnail
Fig 3.

Distribution of input variable (a) Aggregate content passing 4.75 mm sieve (AP4.75), (b) Aggregate content passing 0.075 mm sieve (AP0.075), (c) Bitumen content (BC), (d) Penetration at 25°C (P), (e) Flash point (FP), (f) Softening point, (g) Reclaimed Asphalt Pavement content (RAP), (h) Rejuvenator content (RC), and (i) CT Index.

https://doi.org/10.1371/journal.pone.0287255.g003

The correlation matrix of the input experimental data is depicted in Fig 4. As can be seen, most of the parameters have an influence on the output CT index. The highest observed correlation coefficient between input and output variables is -0.87. In addition, the input parameters also have a significant correlation with each other, expressed through the correlation coefficients between them, with the highest coefficient equal to -0.96. Therefore, the relationships between these variables need to be analyzed further.

thumbnail
Fig 4. Pearson correlation between inputs variables and CTindex.

https://doi.org/10.1371/journal.pone.0287255.g004

In actuality, the quantity, quality, and number of features (number of input variables) are the key determinants of a machine learning model’s accuracy. The quantity of data and number of characteristics might vary depending on each situation, such as a medical, pharmacological, or civil engineering challenge. For instance, there are a ton of Machine Learning research into the mechanical characteristics of concrete, such as compressive strength, elastic modulus, shear strength, and tensile strength, as well as the properties of soil, such as UCS, CBR, resilient modulus, and coefficient of permeability. Jeremiah et al. [21] identified 23 Machine Learning experiments for soil parameters, with data ranging from 49 to 283 samples and features from 4 to 14. Chaabene et al. [22] compiled the results of 47 Machine Learning experiments on the mechanical characteristics of concrete. In the 47 investigations, there are between 74 and 2817 samples of data, and there are between 4 and 13 characteristics. The computing time is significantly influenced by the number of characteristics. The accuracy and performance of a machine learning model with the same quantity of data and features heavily depend on the machine learning method. As a result, even when there are limited data and numerous features, choosing the right Machine Learning method can assist to increase the model’s capacity for prediction.

The authors of this paper use 3 machine learning algorithms, Random Forest (RF), K-Nearest Neighbors (KNN), and Multivariate Adaptive Regression Splines (MARS), with 107 data points and 8 features to investigate the choice of the best machine learning model. In addition, 8 characteristics are required to study how each input affects the CT index using Shapley Additive Explanation based on ML models. Monte Carlo simulation has been used to confirm the ML model’s dependability. The min-max value range of the variable indicated in Table 4 of the revised paper should be the only range in which the Machine Learning model of this study is used.

4. Machine learning methods

4.1 Random forest (RF)

Fig 5 presents briefly the process of Random Forest algorithm according to conception of Breiman [23]. The random forest (RF) algorithm is a supervised ML algorithm [23]. As the name, it is used to somehow create a forest and make it random. The number of trees in the forest is directly related to the results obtained by the algorithm: the more trees, the more accurate the output. But it should be highlighted that creating a forest is not equivalent to using information gain or creating an index to make a decision. It uses a tree-like graph to show possible results. If a training set with targets and features is input into a decision tree, it will show some regularities. These regularities can then be utilized to make predictions. The process of collecting these information nodes and forming rules is the process of using the information gain method and Gini coefficient calculation. To obtain an objective estimate of the test set error in RF, neither cross-validation nor a separate test set is required. Internally, it is computed using extrapolation from the categorization of the excluded person. Numerous studies have shown that the out-of-bag estimate is unbiased [24, 25]. The difference between the random forest algorithm and the decision tree algorithm is that in the random forest, the process of finding the root node and dividing the characteristic nodes will be carried out in a random manner.

thumbnail
Fig 5. Process of Random Forest algorithm according to conception of Breiman [23].

https://doi.org/10.1371/journal.pone.0287255.g005

4.2 K-Nearest Neighbors (KNN)

K-Nearest Neighbors (KNN) is a common classification and regression algorithm. Determine the kind of the object by looking at its neighbors. Based on the similarity in a covariate space between the population unit for which a prediction is intended and the sample units for which data are provided, the KNN technique and its modifications are intuitive, non-parametric approaches to either univariate or multivariate prediction [26, 27]. Usually, the distance between an object and its neighbors uses the Euclidean distance. In j-dimensional space Euclidean distance of points a(x1, x12, …, x1j) and b(x2, x22, …, x2j) and N objects closest to the object to be checked (2)

A large collection of training data or objects of the known kind is required for KNN. The premise is that if the nearest item is “a”, then the unknown thing must be “a”, but if the next j nearest objects all is “b”, then the unknown object must be “b”. Normally, no simple voting mechanism is used; each neighbor’s vote is weighted depending on its distance from the unknown object, so closer objects have a greater voice in recognizing it. The disadvantage of the k-neighbor algorithm is that it cannot give any basic structure information about the data, so cannot know what characteristics the average instance sample and the typical instance sample have.

4.3 Multivariate Adaptive Regression Splines (MARS)

Multiple Adaptive Regression Splines (MARS) is a form of regression analysis introduced by Friedman [28], which is a non-parametric regression technique that can be seen as an extension of a linear model that simulates the nonlinearity and interaction between variables, a general insertion points for the MARS model. It is a nonlinear, nonparametric regression method based on a segmentation strategy. This method does not need to assume a potential specific functional relationship between the input variable and the output variable but divides the data training set into independent sections, line segments, with different gradients, and each segment is called a basis function; the endpoint of each segment is nodes, a node marks the end of one region of data and the beginning of the next. The resulting basis functions will give the model greater flexibility, allowing bending and threshold values that deviate from linearity. MARS generates basis functions through a stepwise search and uses an adaptive regression algorithm to select node locations. The MARS algorithm is divided into two steps: forward selection and backward pruning: the forward selection process is to divide the input sample data, use the spline function instead of the divided cells to fit to get a new basis function, and then get a fitting model; the backward pruning process is to screen the generated basis functions, and eliminate the basis functions that contribute less to the model fitting, so as to avoid overfitting and generate the optimal model. MARS is a data modeling process. Due to make the model conform to the formula, the forward selection procedure should be performed on the training set of the data to make the model conform to the formula. The training error is reduced as much as possible by using the constant term and basis function pair to generate the model; for a model containing N basis functions, the next pair of basis functions added to the model is obtained by the least square method. When a new basis function is added to the model, its interaction with the existing basis functions in the model is also considered. If the number of basis functions reaches the maximum number predetermined by the model, an overfitting model will be generated. The numerator is the mean square error of the MARS model data training set, and the denominator is the penalty function. The more the model complexity, the greater the variance.

4.4 Performance evaluation of models

Model assessment is a crucial step in the development of a machine learning model that determines if the model is of high quality.

In order to use the proper assessment measures, model evaluation aids in the selection of models that are appropriate for a given situation. The coefficient of determination (R2), the root mean square error (RMSE), the mean absolute error (MAE), and the mean absolute percent error (MAPE) were the four statistical performance metrics employed in this investigation [25].

The following criteria establish these ratings: (3) (4) (5) (6)

Where Mj is the experimental value of CT index, Qj is the predicted value and N is the number of samples.

4.5 Methodology flowchart

The general outline of the predicted cracking tolerance index of asphalt concrete mixtures is detailed in Fig 6. This flowchart consists of three main steps, as follows: Step 1: The data used in this study were collected from the laboratory with the 107 samples presented in Section 2.3 earlier. These data include 8 inputs: Aggregate content passing 4.75 mm sieve (AP4.75), aggregate content passing 0.075 mm sieve (AP0.075), bitumen content (BC), penetration at 25°C(P), flash point (FP), softening point (SP), Reclaimed asphalt pavement content (RAP), and rejuvenator content (RC). The output is the cracking tolerance index CTindex (CT). This data is randomly split into two datasets, with 70% used for training the models and the remaining 30% for testing.

Step 2: The math facts are trained to perform the CT index prediction task, the model is built, and it is selected with an appropriate structure.

Step 3: Validate the model. Models are tested and validated using test datasets. The predictive ability of the models was evaluated through the criteria R2, RMSE, MAE, and MAPE. The model with optimal performance will be selected to predict the CT index and analyze the sensitivity to evaluate the influence of input parameters on the CT index.

5. Results and discussion

5.1 Performance evaluation of predictive machine learning algorithms

Fig 7 shows the accuracy of the training and testing model. Predictive performance is evaluated across four metrics, namely R2, RMSE, MAE, and MAPE. It can be observed that the RF and MARS models have R2 values on the training and testing datasets that are relatively similar (Fig 7A). Which, the RF model has a higher R2 value than the other two models with an average R2 value on the training set of 0.9927, and 0.9757 on the test data set. In the test data set, RF model can achieve R2 max value up to 0.9953. Similarly, looking at Fig 7B–7D, it is easy to see that the KNN model has the poorest performance with the RMSE, MAE, and MAPE indices always achieving much larger values than the two MARS and RF models. Indeed, the average performance of the predictive models is presented in Table 5. The mean value of the RMSE index for the KNN model on the training set is 25.2973 and RMSE = 34.7225 on the test set. This is up to two times larger than RF with RMSE = 11.2123 on the training set and RMSE = 17.4854 on the test set. Besides considering the mean value of the evaluation criteria, the standard deviation of the performance values is also considered to comprehensively compare the predictive performance of the models. The table of standard deviation values of the three models on the two training and test data sets is listed in Table 5. The RF model has a slightly smaller standard deviation score than the other two models with precision The standard deviations of the performance indicators R2, MAE, and MAPE are 0.0260, 2.8049, and 0.0294, respectively. This result shows that the RF model has superior accuracy compared to MARS and KNN. In addition, the relatively small deviation avoids causing large differences in the prediction performance of the model.

thumbnail
Fig 7.

Performance value of machine learning models after 3000 simulations (a)R2, (b) RMSE, (c) MAE and (d) MAPE values.

https://doi.org/10.1371/journal.pone.0287255.g007

thumbnail
Table 5. Comparison of three algorithms for compressive prediction using mean performance value and Std performance value.

https://doi.org/10.1371/journal.pone.0287255.t005

5.2 Prediction of typical machine learning algorithm

After the models’ rankings are validated by performance evaluation, the model that can run consistently and with exceptional performance will be chosen. Fig 8 compares the actual value and the predicted value of the RF model. Observe the difference between the experimental value and the predicted value from the RF model. The predicted values of RF on the training set are quite comparable to the training data as red round points. The prediction results obtained on the test set are surprisingly accurate with the predicted values very close to the true value (blue squares).

thumbnail
Fig 8. Comparison between true and predicted CT index of Random Forest model for training and testing set.

https://doi.org/10.1371/journal.pone.0287255.g008

It can be observed that the predicted values are very close to the fit line with the evaluation indexes on the experimental data set, which are R2 = 0.9953, RMSE = 8.7535, MAE = 7.2763, and MAPE = 0.0795, respectively. The error values between the experimental value and the predicted value for the training data set and the test data are compared in Fig 9. Based on the brown frequency distribution line, the error frequency can easily be seen in samples in the range. For the training database, the frequency of samples with errors in the range [-10;10] is about 5. Similar to the testing set, the frequency of samples with errors in the range [-10;10] is about 3. Finally, it can be concluded that the RF model is capable of predicting CT index of asphalt mixtures with good performance and reliable results.

thumbnail
Fig 9.

Error value between predicted and true CT index of asphalt concrete (a) training part, (b) testing part.

https://doi.org/10.1371/journal.pone.0287255.g009

In Fig 8, eight data samples including 6 data samples for training dataset and 2 data samples for testing dataset have CT index value higher than 200 are identified. These CT index value belong to Stone mastic asphalt (SMA) mixtures using Sasobit and without Sasobit additive. The SMA blend has a high binder content, while using cellulose fibers. The cellulose fibers in the SMA blend have the ability to cross and cross, forming a spatial network structure, this type of structure has the effect of transmitting force while preventing the slip between the particles, the overall connection of the mixture, and the resistance. resist and retard the formation and growth of cracks. Therefore, the SMA composite has better performance than the BTNC blend in terms of top-down crack resistance, bottom-up crack resistance and thermal crack resistance. This conclusion is consistent with the results of Wu et al. [29] and Habbouche et al. [30].

In this study, 3 machine learning algorithms including RF, KNN and MARS to build machine learning models for predicting CT index. These machine learning algorithms have hyperparameters which can be (i) tuned for each specific case or (ii) defaults values defined in the open-source library of language programming Python [25]. In this study, the default hyperparameters of three ML algorithms are used. Based on the database including 107 samples and 8 input variables, 3 algorithms are fitted on 70% of the database samples and 8 input variables to predict CT index.

Machine learning models can solve very complex problems, non-linear correlations such as CT index prediction with high accuracy, but the disadvantage of RF, KNN and MARS machine learning models cannot provide equations describing the correlation between outputs and variables as empirical [31]. Therefore, in order to increase the applicability of the RF machine learning model with high performance in predicting CT index and designing pavement components, an excel file derived from the RF model allows predicting CT index by excel formulas (including logic function) attached to this study (S1 File).

5.3 Feature importance analysis

The cracks, a common road surface disease, often do not attract people’s attention, and these are only regarded as normal asphalt aging phenomena. However, in recent years, the phenomenon of road surface subsidence caused by road cracks and traffic accidents has occurred from time to time. The cracks like scars are dormant on the road, and with the erosion of rainwater and the increase of vehicle load, they continue to spread vertically and horizontally and may cause collapse at any time [13]. On the other hand, the appearance of cracks will also increase road maintenance’s difficulty and the burden of later road maintenance. Many studies on the influence of composition and manufacturing methods on the mechanical behavior of asphalt mixtures [3234]. In the study by Sangsefidi et al. [35], two gradations (fine-graded and coarse-graded) were evaluated with two levels of binder content and air voids content to reflect, respectively, construction variable levels of binder content and density. Rutting, tensile cracking, and moisture susceptibility were the three characteristics of mixture performance that were assessed. It has been discovered that for estimating rutting performance, the volume content is more useful than the weight content and that the effective binder content should be considered rather than the overall binder content. Without further breakdown into effective binder content and absorbed binder content, the total binder content may adequately characterize the indirect tensile strength (ITS) values for tensile cracking. Gradation deviation had no discernible impact on cracking.

There are 8 data inputs including Aggregate content passing 4.75 mm sieve (AP4.75), Aggregate content passing 0.075 mm sieve (AP0.075), Bitumen content (BC), Penetration at 25°C (P), Flash point (FP), Softening point, Reclaimed Asphalt Pavement content (RAP), and Rejuvenator content (RC). The SHAP values of these data are calculated, and the results are summarized in Fig 10. These data are sorted from top to bottom according to their influence on the CT index output (cf. Fig 10B). With the SHAP values, the picture of feature importance is created clearer, and more abstract, with blue for low values, and gradually red for high values (cf. Fig 10A). Looking at the plot, it is easy to see that SG is the least important feature, and AP4.75 is the feature that has a great influence on the CT index. Based on color, with AP4.75 and RAP feature, with high values (red) SHAP values will be low, so it will push towards layer 0. By SHAPley Additive exPlanations based on RF model, the input Aggregate content passing 4.75 mm sieve (AP4.75) has a significant effect on the variation of CT Index value. In following, the descending order is Reclaimed Asphalt Pavement content (RAP) > Bitumen content (BC) > Flash point (FP) > Softening point > Rejuvenator content (RC) > Aggregate content passing 0.075mm sieve (AP0.075) > Penetration at 25°C (P).

thumbnail
Fig 10. SHAP value for feature importance analysis of CT index prediction.

https://doi.org/10.1371/journal.pone.0287255.g010

6. Conclusions

In this study, RF, KNN, and MARS models are developed to predict the cracking tolerance index of asphalt concrete mixtures. 107 experimental results were collected to build databases and develop models. In which, 70% of the data is randomly selected for the training phase and the remaining 30% is used for the testing phase of the machine learning models. Through analysis and evaluation of performance indicators, the RF model is the most stable and has predictive results close to the real value. On the test set, the RF model has R2, RMSE, MAE, and MAPE values of 0.9953, 8.7535, 7.2763, and 0.0795, respectively. Through sensitivity analysis, it can be seen that the crack resistance CT index varies greatly for different combinations of input variables. Of which, Aggregate content passing 4.75 mm sieve (AP4.75) has a significant effect on the variation of CT Index output. Next is Reclaimed Asphalt Pavement content (RAP) > Bitumen content (BC) > Flash point (FP) > Softening point > Rejuvenator content (RC) > Aggregate content passing 0.075 mm sieve (AP0.075) > Penetration at 25°C (P). In addition, the correlation between the input variables is quite complex, which cannot be reflected by a clear equation. The study has quantified the influence of the parameters on the output results, supporting the design of the optimal composition for asphalt concrete in the future. The number of data used in this study is relatively small, which affects the general application of machine learning models in predicting CT index. Therefore, the addition of data samples to the database is very necessary in further studies to build a highly universal machine learning model to improve the applicability of the machine learning model in designing mixture asphalt pavement including SMA using RAP.

References

  1. 1. Zhou F et al., “Experimental design for field validation of laboratory tests to assess cracking resistance of asphalt mixtures,” NCHRP Proj., no. 9–57, 2016.
  2. 2. Zhou F., Im S., Sun L., and Scullion T., “Development of an IDEAL cracking test for asphalt mix design and QC/QA,” Road Mater. Pavement Des., vol. 18, no. sup4, pp. 405–427, 2017.
  3. 3. Bennert T., Haas E., and Wass E., “Indirect tensile test (IDT) to determine asphalt mixture performance indicators during quality control testing in New Jersey,” Transp. Res. Rec., vol. 2672, no. 28, pp. 394–403, 2018.
  4. 4. West R. et al., “Phase VI (2015–2017) NCAT test track findings,” Auburn University. National Center for Asphalt Technology, 2019.
  5. 5. Yan C., Zhang Y., and Bahia H. U., “Comparison between SCB-IFIT, un-notched SCB-IFIT and IDEAL-CT for measuring cracking resistance of asphalt mixtures,” Constr. Build. Mater., vol. 252, p. 119060, Aug. 2020,
  6. 6. Le T.-H., Nguyen H.-L., Pham C.-T., Hoang H.-G. T., and Nguyen T.-A., “Development of Artificial Neural Network Model for Prediction of Marshall Parameters of Stone Mastic Asphalt,” in CIGOS 2021, Emerging Technologies and Applications for Green Infrastructure, Springer, 2022, pp. 1795–1803.
  7. 7. Nguyen H.-L. et al., “Adaptive network based fuzzy inference system with meta-heuristic optimizations for international roughness index prediction,” Appl. Sci., vol. 9, no. 21, p. 4715, 2019.
  8. 8. Le T.-H. et al., “Artificial Intelligence-Based Model for the Prediction of Dynamic Modulus of Stone Mastic Asphalt,” Appl. Sci., vol. 10, no. 15, Art. no. 15, Jan. 2020,
  9. 9. Ngo Q. T., Nguyen H.-L., and Le T.-H., “Investigation of Artificial Neural Network Models for Predicting the International Roughness Index of Rigid Pavements,” in CIGOS 2021, Emerging Technologies and Applications for Green Infrastructure, Ha-Minh C, Tang A. M., Bui T. Q., Vu X. H., and Huynh D. V. K., Eds., in Lecture Notes in Civil Engineering. Singapore: Springer, 2022, pp. 1851–1858. https://doi.org/10.1007/978-981-16-7160-9_187
  10. 10. Huang J., Shiva Kumar G., Ren J., Zhang J., and Sun Y., “Accurately predicting dynamic modulus of asphalt mixtures in low-temperature regions using hybrid artificial intelligence model,” Constr. Build. Mater., vol. 297, p. 123655, Aug. 2021,
  11. 11. Noura S., Al-Sabaeei A. M., Safaeldeen G. I., Muniandy R., and Carter A., “Evaluation of measured and predicted resilient modulus of rubberized Stone Mastic Asphalt (SMA) modified with truck tire rubber powder,” Case Stud. Constr. Mater., vol. 15, p. e00633, Dec. 2021,
  12. 12. Arifuzzaman M., Gazder U., Alam M. S., Sirin O., and Mamun A. A., “Modelling of Asphalt’s Adhesive Behaviour Using Classification and Regression Tree (CART) Analysis,” Comput. Intell. Neurosci., vol. 2019, p. e3183050, Aug. 2019, pmid:31511770
  13. 13. Liu J., Liu F., Gong H., Fanijo E. O., and Wang L., “Improving asphalt mix design by predicting alligator cracking and longitudinal cracking based on machine learning and dimensionality reduction techniques,” Constr. Build. Mater., vol. 354, p. 129162, Nov. 2022,
  14. 14. Liu J., Liu F., Zheng C., Zhou D., and Wang L., “Optimizing asphalt mix design through predicting the rut depth of asphalt pavement using machine learning,” Constr. Build. Mater., vol. 356, p. 129211, Nov. 2022,
  15. 15. Liu J., Liu F., Zheng C., Fanijo E. O., and Wang L., “Improving asphalt mix design considering international roughness index of asphalt pavement predicted using autoencoders and machine learning,” Constr. Build. Mater., vol. 360, p. 129439, Dec. 2022,
  16. 16. Lundberg S. M. and Lee S.-I., “A Unified Approach to Interpreting Model Predictions,” in Advances in Neural Information Processing Systems, Curran Associates, Inc., 2017. Accessed: Dec. 21, 2021. [Online]. Available: https://proceedings.neurips.cc/paper/2017/hash/8a20a8621978632d76c43dfd28b67767-Abstract.html
  17. 17. Valdés G., Pérez-Jiménez F., Miró R., Martínez A., and Botella R., “Experimental study of recycled asphalt mixtures with high percentages of reclaimed asphalt pavement (RAP),” Construction and Building Materials, vol. 25, no. 3, pp. 1289–1297, Mar. 2011,
  18. 18. Anderson K. et al., “Warm Mix Asphalt Final Report,” 2014.
  19. 19. Ayazi M. J., Moniri A., and Barghabany P., “Moisture susceptibility of warm mixed-reclaimed asphalt pavement containing Sasobit and Zycotherm additives,” Petroleum Science and Technology, vol. 35, no. 9, pp. 890–895, May 2017,
  20. 20. Shi N., Xiao H., Xiao J., and Liu Q., “Hot-in-place recycling engineering application of low penetration aged asphalt,” E3S Web Conf., vol. 375, p. 01051, 2023,
  21. 21. Jeremiah J. J., Abbey S. J., Booth C. A., and Kashyap A., “Results of Application of Artificial Neural Networks in Predicting Geo-Mechanical Properties of Stabilised Clays—A Review,” Geotechnics, vol. 1, no. 1, Art. no. 1, Sep. 2021,
  22. 22. Ben Chaabene W., Flah M., and Nehdi M. L., “Machine learning prediction of mechanical properties of concrete: Critical review,” Constr. Build. Mater., vol. 260, p. 119889, Nov. 2020,
  23. 23. Breiman L., “Random Forests,” Mach. Learn., vol. 45, no. 1, pp. 5–32, Oct. 2001, :1010933404324.
  24. 24. Touw W. G. et al., “Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle?,” Brief. Bioinform., vol. 14, no. 3, pp. 315–326, May 2013, pmid:22786785
  25. 25. Pedregosa F. et al., “Scikit-learn: Machine Learning in Python,” J. Mach. Learn. Res., vol. 12, no. null, pp. 2825–2830, 2011.
  26. 26. McRoberts R. E., Magnussen S., Tomppo E. O., and Chirici G., “Parametric, bootstrap, and jackknife variance estimators for the k-Nearest Neighbors technique with illustrations using forest inventory and satellite image data,” Remote Sens. Environ., vol. 115, no. 12, pp. 3165–3174, Dec. 2011,
  27. 27. Chirici G. et al., “A meta-analysis and review of the literature on the k-Nearest Neighbors technique for forestry applications that use remotely sensed data,” Remote Sens. Environ., vol. 176, pp. 282–294, Apr. 2016,
  28. 28. Friedman J. H., “Multivariate Adaptive Regression Splines,” Ann. Stat., vol. 19, no. 1, pp. 1–67, Mar. 1991,
  29. 29. Wu S., Wen H., Chaney S., Littleton K., and Muench S., “Evaluation of Long-Term Performance of Stone Matrix Asphalt in Washington State,” J. Perform. Constr. Facil., vol. 31, no. 1, p. 04016074, Feb. 2017,
  30. 30. Habbouche J., Boz I., and Diefenderfer B., Laboratory and Field Performance Evaluation of Pavement Sections with High Polymer-Modified Asphalt Overlays. 2021.
  31. 31. Feng D.-C., Wang W.-J., Mangalathu S., and Taciroglu E., “Interpretable XGBoost-SHAP Machine-Learning Model for Shear Strength Prediction of Squat RC Walls,” J. Struct. Eng., vol. 147, no. 11, p. 04021173, Nov. 2021,
  32. 32. Rahbar-Rastegar R., “Cracking in Asphalt Pavements: Impact Of Component Properties And Aging On Fatigue And Thermal Cracking,” Dr. Diss., Jan. 2017, [Online]. Available: https://scholars.unh.edu/dissertation/2284
  33. 33. Holleran G., Mccaffrey L., and Wilson D., “The effect of particle shape and angularity on SMA mix characteristics,” presented at the ARRB Conference, 23rd, 2008, Adelaide, South Australia, Australia, Jul. 2008. Accessed: Feb. 06, 2023. [Online]. Available: https://trid.trb.org/view/885958
  34. 34. Brence M., “The use of foundry sand in the composition of the asphalt mixture,” Sep. 2016. Accessed: Feb. 06, 2023. [Online]. Available: https://www.semanticscholar.org/paper/The-use-of-foundry-sand-in-the-composition-of-the-Brence/06ee9708bd89eff7fb93b361e0021296bce6f5d4
  35. 35. Sangsefidi E., Ziari H., and Sangsefidi M., “The effect of aggregate gradation limits consideration on performance properties and mixture design parameters of hot mix asphalt,” KSCE J. Civ. Eng., vol. 20, no. 1, pp. 385–392, Jan. 2016,