Figures
Abstract
The prediction of key indicators in the flotation process is crucial for optimizing operations, improving quality, and reducing consumption. However, indicator prediction itself suffers from complex nonlinear relationships, difficulties in model construction, and noise interference. To solve the above problems, this paper proposes a new model based on a multilayer tree structure belief rule base (MTS-BRB), termed MTS-BRB with attribute reliability (MTS-BRB-R). First, an initial prediction model is constructed using the MTS-BRB framework. Second, the attribute reliability is embedded into the model structure to enhance the robustness of its inference and prediction accuracy. Finally, the prediction of the tailings silica content in the iron ore flotation process is used as a case study to verify the effectiveness of the proposed model.
Citation: Dong H, Gong A, He W (2026) A new multilayer tree structure belief rule base-based prediction method for key indicators of flotation process. PLoS One 21(2): e0336336. https://doi.org/10.1371/journal.pone.0336336
Editor: Dariusz Siudak, Lodz University of Technology: Politechnika Lodzka, POLAND
Received: September 13, 2025; Accepted: October 23, 2025; Published: February 2, 2026
Copyright: © 2026 Dong et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The data underlying the results presented in the study are available from (https://www.kaggle.com/datasets/edumagalhaes/quality-prediction-in-a-mining-process).
Funding: This work was supported in part by Key Laboratory of Equipment Data Security and Guarantee Technology, Ministry of Education under Grant No. GDZB2024050100 in part by the Natural Science Foundation of Heilongjiang Province under Grant No. PL2024G009, in part by the Basic Research Support Program for Outstanding Young Teachers in Provincial Undergraduate Universities of Heilongjiang Province under Grant No. YQJH2024116.
Competing interests: The authors have declared that no competing interests exist.
1 Introduction
The flotation process is an indispensable key step in the modern mining industry [1,2]. Its efficiency directly affects the comprehensive utilization of minerals. Improving flotation efficiency is a popular issue in the mining industry [3]. Realizing real-time detection of key indices in the flotation process is very important for improving flotation efficiency [4].
Currently, prediction methods for key indicators rely heavily on traditional data-driven modeling approaches. Many studies have explored this topic in the context of data-driven modeling [5,6]. For example, Zhang et al. [7] proposed a neural network model based on an encoder-decoder structure for predicting the tailings grade in the zinc flotation process. Nakhaei et al. [8] developed a prediction model for copper concentrate grade via artificial neural networks with process variables such as froth thickness, make-up water volume, and feed rate as inputs and provided guidance for production parameter optimization. A. Jiahedsaravani et al. [9] modeled the relationship between froth characteristics and performance parameters through image analysis and neural networks. Yuanyuan Pu et al. [10] developed a new deep learning network, FlotationNet, for predicting grade and recovery in the flotation process. Guichun He et al. [11] used a two-step approach, first using the box-and-line diagram method and filtering algorithm to address outliers and then using various modeling methods to predict flotation recovery. Fardis Nakhaei et al. [12] developed a recurrent neural network (RNN) model to predict copper flotation performance. The RNN achieved high accuracy, with correlation coefficients of 0.92 for grade and 0.9 for recovery, outperforming other compared models. Yousef Salehi et al. [13] proposed a vision model that uses an autoencoder for image inpainting and a multirate ARX model with image features to predict slow-rate process variables. The model employs the EM algorithm and a Kalman filter for parameter estimation and state smoothing, demonstrating effective fast-rate prediction of interface level in a separation cell. Shahbazi et al. [14] used a random forest algorithm to identify key variables such as bubble surface area flux, Reynolds number, and particle size, and successfully constructed a model for predicting flotation rate constants and recoveries with high accuracy. Tian et al. [15] designed a model predictive control (MPC) scheme for the nonlinear coupled partial differential-orthogonal differential equation (PADE) model of mineral flotation columns by means of a Cayley-Tustin time-discretization scheme. Tustin time discretization to address constraints and perturbations and achieve optimal control within the guaranteed steady-state output and constraints. Chen et al. [16] simulated the flotation process of chalcopyrite in a three-dimensional mineralized tube by coupling computational fluid dynamics (CFD) simulation with flotation dynamics models, including collision, attachment, and detachment probability models and bubble slip velocity models. Dong et al. [17] proposed a new optimization control method based on condition identification and multi model switching. Through the integration of fuzzy control, expert system and LS-SVM algorithm, the intelligent collaborative optimization of flotation column foam layer depth and reagent dosage was realized, providing an effective solution for intelligent control of complex industrial processes.
Data-driven modeling approaches rely heavily on the quality and scale of monitoring data [18]. Moreover, their predictive performance is largely constrained by the completeness and accuracy of the training samples. Although these models can handle high-dimensional data and achieve high prediction accuracy in specific scenarios, their modeling process usually lacks interpretability and requires high data volume, which leads to insufficient model reliability.
To address these inherent shortcomings of purely data-driven models, researchers have explored hybrid modeling approaches that fuse expert knowledge and data. Among them, Belief Rule Base (BRB) was proposed by Yang et al. [19]. As a hybrid-driven model, it can utilize both expert knowledge and data. In addition, as a rule-driven model, one of the significant advantages of the BRB model is its good interpretability, so BRB models are usually highly reliable. Owing to these advantages, BRBs have been widely used in the field of forecasting. However, traditional BRB models face the rule combination explosion problem when dealing with multi-parameter problems. The hierarchical BRB (H-BRB) model is a hierarchical model that can effectively alleviate the rule explosion problem [20]. However, during its construction, it relies on manually designing the model structure, which leads to a lack of self-organization ability and makes its reasonableness difficult to ensure.
To address the limitations of H-BRBs, Yang et al. [21] proposed a self-organized BRB with a tree hierarchical structure (MTS-BRB) and verified the effectiveness of the model by predicting the project risk. Gong et al. [22] used the MTS-BRB to predict the remaining battery life and proposed an interpretable optimization algorithm. The MTS-BRB performs well in terms of structural self-organization. Nevertheless, the traditional method does not fully consider disturbing factors in the input data during modeling, which makes the model susceptible to noise and compromises the stability and accuracy of its predictions.
Therefore, to solve the above problems, this paper introduces the concept of attribute reliability into the MTS-BRB model. A new prediction model, MTS-BRB with attribute reliability (MTS-BRB-R), is proposed to handle environmental noise through calculated reliability measures. The MTS-BRB-R model can provide more accurate and reliable predictions while modeling complex systems.
The main contributions of this paper are as follows:
(1) A reliable MTS-BRB model is proposed. Owing to the limitations of the BRB model, it can easily lead to a rule combination explosion when there are too many feature attributes. MTS-BRB, as a model based on the H-BRB, can address the multi-attribute rule explosion problem effectively. On this basis, the attribute reliability is derived and added to the model by analyzing the parameters to obtain the MTS-BRB-R model.
(2) Proposed iron ore flotation model based on the MTS-BRB-R. The model defines its initial parameters by incorporating expert knowledge. These parameters are subsequently adjusted through algorithm optimization and sample training. Finally, better prediction results can be achieved. Compared with other prediction methods, this method is more accurate, reliable, and flexible.
This paper is organized as follows. Sect 2 reviews the basics of the BRB model and defines the problem. Sect 3 first describes the construction of the MTS-BRB-based prediction model in detail. It then mechanistically analyzes the process of constructing and training the prediction model via the MTS-BRB-R approach. In Sect 4, simulation experiments are designed via a specific case study. A comparative study with other methods is then conducted to validate the proposed model. Finally, Sect 5 concludes the study.
2 Preliminaries and problem description
In Sect 2.1, the basics of BRBs are reviewed to provide the basis for the new prediction model proposed in this paper. In Sect 2.2, the research problem is described.
2.1 BRB Basics
In this section, the main elements of the MTS-BRB model are reviewed to provide a basis for proposing the new prediction model in this paper.
The BRB expert system consists of a series of belief rules. Usually, the first belief rule in a traditional BRB can be written as:
where is the reference value of the ith antecedent attribute in the kth rule.
is a belief degree to which the consequent DN is believed to be true.
is the weight of the kth belief rule.
is the weight of the ith attribute. L is the number of rules. M is the number of attributes.
To illustrate conventional BRB in detail, this paper takes key flotation process indicators as the prediction problem: the tailings silica content (namely D) is a function of slurry pH value (namely X1) and slurry density (namely X2). When there are three assessment ratings e.g. , provided for X1 and X2, and three consequents, e.g.
, provided for D by domain expert, a conventional BRB can then be constructed by covering all possible combinations of each assessment rating for all antecedent attributes, as shown in Table 1.
From Table 1, it can be found that a belief rule such as R1 contains that when pH value is Low and slurry density is Low, then 60% sure that the tailings silica content is High and 30% is Medium. The weight of R7 is 1.0, indicating the importance of R7 over other rules. The weight of slurry pH and slurry density is 0.7 and 0.9, respectively, indicating the different importance of these two attributes. where denotes the nth antecedent reference attribute for the mth rule.
denotes the belief degree of the Kth rule for the Nth result.
A key limitation of the traditional BRB is that it involves generating an exponential number of combinations from all assessment ratings [23]. Specifically, when there are M antecedent attributes and each has N reference values, the rule base size becomes NM. This finding demonstrates that the size of the conventional BRB model is exponentially related to the number of antecedent attributes and their assessment ratings. Consequently, any increase in these numbers causes the rule base to grow exponentially, leading to the rule explosion problem.
2.2 Problem description
This section formulates the problem.
Problem 1: In the traditional BRB, the number of rules grows exponentially with the increasing number of features M. Assuming that there are three assessment levels, the relationship between the number of rules and the number of features is shown in the following equation: rule = 3M. Multi-parameter problems suffer not only from rule explosion but also from model construction. Taking the traditional hierarchical model as an example, ordinary hierarchical structures tend to delineate levels on the basis of empirical experience or multiple statistical data. This method lacks a set of complex and standard modeling processes. Therefore, a reasonable construction method is needed to construct hierarchical models.
Problem 2: In BRB, attributes are used as inputs to the model, and their reliability directly affects the reliability of the model. Using unreliable attributes may produce unreliable model outputs. Therefore, it is necessary to improve the existing model to increase its reliability and accuracy.
3 Proposed MTS-BRB-R modeling and training process
The H-BRB is a method designed to address the combinatorial rule explosion problem. It employs a hierarchical design that decomposes the complex decision-making process of a system into multiple levels, each dedicated to solving specific subproblems, thereby effectively alleviating the issue. This approach emphasizes incorporating all the attributes. It builds them into a large-scale, expert-knowledge-driven model in a top-down manner. This allows all attributes to function within a single BRB structure, as illustrated in Fig 1.
Although the H-BRB performs well in handling complexity and uncertainty, it lacks a standardized methodology for selecting attributes to use as inputs for its sub-models. This shortcoming leads to the following problems in the modeling process: first, the modeler must rely on subjective experience and domain knowledge to filter the attributes, which not only increases the complexity and time cost of modeling but also may lead to unstable model performance; second, owing to the lack of standardized selection guidelines, the consistency and repeatability of the H-BRB model across different domains and application scenarios are difficult to ensure.
To solve the above problems, Yang et al. [21] designed a multilayer tree structure to provide an effective guidance scheme for H-BRB modeling. The MTS-BRB model structure is shown in Fig 2.
The MTS-BRB consists of three types of nodes: root nodes, internal nodes, and leaf nodes. Root nodes: Nodes with no parent nodes become root nodes, and the root node consists of the set of all the resultant attributes. Internal nodes: Nodes with parent and child nodes become internal nodes, and each internal node can be interpreted as a child BRB containing the next level of leaf nodes or internal nodes. Leaf nodes: Nodes without children become leaf nodes, and each leaf node corresponds to a feature attribute.
However, since MTS-BRB uses attributes as inputs to the model, their reliability directly affects the reliability of the model. To address the impact of attribute unreliability on the model, this paper introduces attribute reliability within the BRB structure, which is composed of the first internal node. The location of its introduction is shown in Fig 2.
The modeling process of the MTS-BRB-R can be divided into three steps: the construction of the multilayer tree structure (MTS), the calculation of attribute reliability, and the construction of the BRB. The construction of the MTS is divided into three parts: determining the root node, determining the leaf nodes, and generating the internal nodes. The BRB model is first constructed by establishing a rule base with expert knowledge. Its inference process then involves four stages: (1) converting input data, (2) calculating rule activation weights, (3) integrating rules to update belief degrees, and (4) aggregating the results to produce a final output.
3.1 MTS-BRB-R modeling process
The process of MTS is described in detail in Part A, the calculation method of attribute reliability is described in Part B, and the specific process of generating the corresponding MTS-BRB-R system on the basis of the MTS tree structure is described in Part C.
Part A: MTS structure construction.
Suppose that the problem to be modeled is described as , where Xm denotes the mth antecedent attribute. D denotes the consequent attribute. The model is built upon M antecedent attributes, where the mth attribute is characterized by Jm discrete reference values
. These reference values, known as attribute references, define the possible states for each attribute. The formula for calculating the attribute correlation is as follows [21,24].
where denotes the correlation between the antecedent attributes Xm and Xn, which can be understood as the proportion of uncertainty reduction of Xm after Xn is known. Therefore, the smaller the AR value between the attributes is, the greater the independence of the two and the lower the correlation.
denotes the mutual information and
denotes the entropy of the two, which is obtained via the Eqs (3)–(4).
where denotes the probability that attribute Xm is equal to the reference value Am,i and attribute Xn is equal to the reference value An,j; similarly,
denotes the probability that attribute Xm is equal to the reference value Am,i. Their calculation process is described below.
Step 1: Determine the root node In the MTS, the root node is set as the consequent attribute root = D. The root node is the only node without a parent.
Step 2: Identify the first internal node The first internal node inner1 consists of the antecedent attribute . The first internal node contains all nodes except the root node.
Step 3: Generate leaf nodes with the remaining internal nodes
Step 3.1: specify the cluster size ε
The cluster size determines the number of elements of any internal node. The internal node elements include two kinds of elements: leaf nodes and internal nodes: inner = Ut.
Step 3.2: Calculate the AR between each attribute
Assume that each antecedent attribute Xm has an attribute reference value Am,j with a utility value of . When there are T inputs
where
, the following belief distribution can be obtained via a technique that is based on equivalent changes in utility:
where denotes the belief degree of the reference level Am,j on the antecedent attribute Xm.
can be obtained from the Eq (6).
On the basis of the Eqs (5)–(6), and
can be obtained via the Eqs (7)–(8).
Finally, the two are brought into Eqs (3) and (4) to calculate the AR values obtained. The table of AR values obtained is shown in Table 2.
As shown in Table 2, the AR value of any item with itself is 1. This result indicates that the knowledge of the item’s presence reduces the uncertainty about its own presence by 100%, which is a logically inevitable and perfect association.
Step 3.3: Hierarchical clustering of antecedent attributes based on correlation between attributes
In Step 2, the first internal node inner1 is , where M denotes the number of antecedent attributes in the problem. The ε pivot elements are generated according to the cluster size, and the ε pivot elements are chosen by Eq (9).
The Eq (9) shows that the pivot point element Pt has minimal correlation with the elements within .
Next, each pivot point element is used to generate a subset of attributes , both
.
Then, all the elements are assigned to the ε subsets via the following assignment process:
For each subset , its new pivot point
should be selected via the Eq (11). If
is exactly equal to
, then the clustering process is complete. When the number of elements in any subset is greater than the cluster size
, this subset is considered as U0 and re-execute Step 2. If at the end of clustering, there is a subset with the number of elements equal to 1, this subset is considered a leaf node.
The final constructed structure is shown in Fig 3.
As shown in Fig 3, each internal node in the structure contains ε elements. The first internal node encompasses all other internal nodes and all leaf nodes. The elements contained within an internal node can be one of two types: (1) other internal nodes, (2) leaf nodes. Leaf nodes are terminal elements that are defined by a set of attributes.
Part B: Calculating attribute reliability.
In engineering practice, the observation data collected by sensors may fluctuate due to interference factors. When the observation data fluctuate greatly, the current system is considered unreliable [25]. Therefore, unreliable observation data can be selected according to the tolerance range. If the observation data are out of the tolerance range, they do not represent the correct system information and are unreliable.
Suppose the input to the ith antecedent attribute Xi is . Ti denotes the number of input data for the antecedent attribute Xi. ψ is the expert-determined tolerance range.
and
represent the mean and standard deviation of the ith antecedent attribute observation, respectively.
The attribute reliability ri of the ith antecedent attribute is subsequently calculated via the Eq (12).
where ri denotes the reliability of the ith antecedent attribute. M represents the number of antecedent attributes. In particular, for the consequent attribute output by the sub-BRB, the default is fully reliable, which means that the consequent attribute reliability of the sub-BRB is 1.
Part C: BRB model structure construction.
After obtaining the MTS in part A, a BRB model is constructed for each internal node. It is worth noting that attribute reliability is incorporated only into the first internal node during the construction of the BRB models for internal nodes, as this node is responsible for training the overall model. The remaining internal nodes are constructed following the standard BRB modeling approach.
The initial parameters of the BRB model including rule weights, attribute weights, and belief degrees are determined by domain experts leveraging extensive operational experience and historical data from the flotation process. Specifically, rule weights are designed to reflect the relative importance of different operating conditions, attribute weights represent the influence of each process variable, and belief degrees characterize uncertain relationships between inputs and outputs. This expert-driven initialization not only incorporates domain knowledge but also provides a physically meaningful starting point for subsequent parameter optimization.
Step 1: Calculate attribute reliability
After constructing the MTS, the corresponding attribute reliability can be calculated using the method described in Part B.
Step 2: Calculate the match between the input data sample and the belief rule
For input xi, its match to the attribute reference is obtained via Eq (13).
where xi denotes the input data for the ith antecedent attribute. Ai,k and Ai,k + 1 denote the reference values of the ith antecedent attribute within the + 1] rule. Ki denotes the number of rules containing the ith antecedent attribute within the whole system.
Step 3: Aggregate attribute reliability and attribute weights
The formula for aggregating attribute reliability and attribute weights is as follows.
where Ci denotes the combination parameter that considers both attribute weights and attribute reliability. ri denotes the reliability of the ith attribute. denotes the weight of the ith attribute.
denotes the relative weight of the ith attribute. When the attribute is completely reliable ri = 1, then
.
Step 4: Calculate rule activation weights
According to the calculated combination parameter Ci, which considers both attribute weights and attribute reliability, the formula for calculating the rule activation weights is as follows:
Step 5: Generate belief degree
The belief degree in the results is generated by the parsing ER algorithm fused with the activation rule, which is calculated via Eqs (16) and (17).
where N represents the number of assessment levels and L represents the number of rules. represents the belief degree of the kth rule for the ith assessment result.
Step 6: Generate utility value
The final belief degree after aggregation can be expressed as:
Suppose that the utility of a single consequent Dn is represented as un, and the expected utility of S(x) can be expressed as:
3.2 MTS-BRB-R training process
Traditional BRB models rely on the relationship between the output results and the actual results to continuously optimize the parameters to achieve the effect of training the model. However, unlike the single-layer BRB model, since the H-BRB model takes the output of the sub-BRB model as the input of another BRB model, the sub-BRB model cannot optimize the model parameters directly on the basis of the actual results. Therefore, H-BRB adopts a training method in which the expert gives the parameters of the sub-BRB and trains only the BRB model that produces the final result, as shown in Fig 4. In this paper, the whole model is trained such that the expert gives the parameters of the BRB model of the internal node and trains the first internal node during training. For parameter optimization, we employ the Projected Covariance Matrix Adaptation Evolution Strategy (P-CMA-ES) algorithm to handle the constrained optimization problem in BRB training [26,27]. The parameter settings for the P-CMA-ES are provided in Table 3.
The training process is as follows:
Suppose that a set of input data is known , where
. T represents the number of input data. M represents the number of antecedent attributes. yt is the observation in numerical form. For the input data xt, the distributed output generated via the ER is as Eq (20).
where denotes the belief degree generated by the ER algorithm with respect to the evaluation result Dn. The average utility of the output can be obtained from the above equation as Eq (21).
where denotes the utility of the output portion of the rule.
is determined by the expert.
The training process of MTS-BRB-R can be summarized as the desire to have the output of the highest-level BRB system close to the possible approximation yt. Its optimization process can be expressed as Eq (22).
where denotes the parameter vector of inner1 in the MTS-BRB-R,
,
, and
. L, M, N represent the number of rules, the number of attributes, and the number of evaluation levels in inner1, respectively.
4 Case study
In this section, the prediction of the tailings silica content during iron ore flotation is used as an example to verify the validity of the model.
4.1 Experimental definitions
This paper uses publicly available datasets provided by the Kaggle platform. Kaggle is the world’s leading platform for data science and machine learning and is widely used in machine learning research and practice. The experimental data come from the Quality Prediction in a mining process dataset on the Kaggle platform (Kaggle, 2018). The dataset contains 22 parameters from March 2017–September 2017 of the iron ore concentration process in a flotation plant. The first column of the dataset indicates the time and date, the second and third columns represent the percentage of iron ore content and the percentage of silicon content before the iron ore slurry is fed into the flotation plant, and the ninth through twenty-second columns represent the process data during the flotation process. The specific selected dataset feature information is shown in Table 4.
Among them, the iron content of the ore feed (X1), the silicon content of the ore feed (X2), the starch content (X3), the pH value of the slurry (X4), the density of the slurry (X5), the air intake of the flotation column 03 (X6), the liquid surface height of flotation column 01 (X7), the liquid surface height of flotation column 02 (X8), and the iron content of the concentrate (X9) are taken as input characteristics. The concentrated silicon content (X10) is used as a prediction.
In this experiment, to ensure the objectivity and reliability of the model performance evaluation, a standard data division strategy is adopted, and the original dataset is divided at a ratio of 7:3.
Specifically, given the time-series nature of the flotation process data, we employed a chronological split to prevent data leakage: the first 70% of the data in temporal order are allocated to the training set, while the remaining 30% are used as an independent test set. This chronological partitioning ensures that the model is evaluated on future unseen data, preventing any temporal information leakage and providing a realistic assessment of the model’s predictive performance in practical applications.
Data standardization processing is a key step in eliminating the differences in feature scales and improving the stability of the algorithm. In this paper, there are features of different scales in the data, which, if directly input into the model, will lead to the distance-based algorithm being biased toward large-valued features owing to the difference in scales, reducing the prediction or clustering effect. Therefore, in this work, through Z-score standardization, each feature is converted to a distribution with a mean of 0 and a standard deviation of 1, which ensures that all the features participate in the calculation under the same scale, thus balancing their contributions to the model weights and enhancing the robustness and convergence efficiency of the algorithm.
To comprehensively assess the predictive performance and statistical reliability of the model, the following three evaluation indices are selected in this paper:
(1) Mean square error (MSE).
The MSE can effectively quantify the overall prediction deviation of the model by calculating the mean of the second-order moments between the predicted value and the true value, which is one of the core evaluation indices of the regression problem. Its calculation formula is as follows:
where n denotes the total number of samples. yi denotes the true value of the ith sample. denotes the predicted value of the ith sample.
(2) Coefficient of determination (R2).
R2 is a core metric used in statistics and machine learning to assess the goodness-of-fit of a regression model and is used to quantify the model’s ability to explain the variance of the target variable. Its calculation formula is as follows:
where n denotes the total number of samples. yi denotes the true value of the ith sample. denotes the predicted value of the ith sample.
denotes the mean of the true values.
(3) Core indicator decay rate (PDR).
PDR is used to quantify the degree of performance decay of the model under nonideal conditions such as noise interference, data distribution offset, or adversarial attack, reflecting the robustness of the model. It is an important evaluation index for evaluating the robustness of a model. Its definition is as follows:
where RMSEnoise and RMSEclean denote the root mean square errors with and without added noise, respectively.
4.2 Modeling
The model construction is divided into two parts: Part A is the construction of the multilayer tree structure. Part B is the construction of the BRB model.
Part A: Constructing a multi-level tree structure.
This part describes the construction of a Multi-Level Tree Structure.
Step 1: Determine the root node and the first internal node
First, the prediction consequent attribute is determined, and the silicon content X10 is concentrated as the root node: Root = X10. Next, set the attribute X10 reference value as SCn, where , which takes the values shown in Table 5. Finally, the first internal node U0 is set as the set consisting of all input features, denoted as
Step 2: Calculate the AR value between features
According to the step-by-step formula shown in Part A of Sect 3.1, the AR values between all the features in the internal node U0 are computed to obtain the AR value table shown in Table 6.
Step 3: Clustering on the basis of the distance between features
The maximum number of clusters ε is set to 3, and the first three main factors are selected on the basis of the computed AR value with the following selection Eqs (27)–(29):
X1, X2, and X9 are the three smallest features selected on the basis of the sum of the AR values, which serve as the initial three main elements, and the rest of the sets of elements are added to the set marked by the initial main elements according to the size of the AR values of the initial elements, for example:
According to the above equation, X4 should be added to the set {P3} where the pivot point P3 is located to form a new set . In addition, according to the above method, the remaining elements are added to the cluster where the main element is located, and the final formation of the result is shown in Fig 5.
Part B: Modeling the BRB.
This part describes the modeling of the Multi-Level Tree Structure.
Step 1: Calculate attribute reliability
According to Part B of Sect 3.1, only the attribute reliability of the first internal node U0 is calculated in this paper. The internal node U0 consists of attribute X1, internal node U1, and attribute X9. According to Eq (12), this study obtain that attribute X1 reliability r1 = 0.9809, attribute X9 reliability r9 = 0.8995, and internal node U1 reliability = 1.0, where the tolerance ψ given by experts is 1.88.
Step 2: Build the model
After completing the attribute clustering, a total of three sub-BRBs and one prediction BRB are generated by assigning three reference values to each feature attribute and five reference points
to the consequent attribute X10 and, in particular, three reference values
to the outputs of the sub-BRBs. Each BRB adopts a three-attribute structure, 27 rules are generated for each sub-BRB, and a total of 108 rules are generated.
In the MTS-BRB-R model for the prediction of the silica content in ore flotation concentrates, the belief rule for predicting the BRB can be described as Eq (31).
where and
are the reference values of the two attributes; in particular,
is the reference value of the sub-BRB result. SubBRB represents the result of the sub-BRB output. Both attributes X1 and attributes X9 and SubBRB have three reference values. Thus, 27 combinations of the three attributes form the 27 rules for predicting the BRB in the MTS-BRB-R model. The initial parameters of the predicted BRB are given by the expert, as shown in Table 7.
4.3 Experimental analysis
(A) Analysis of model structure.
The complexity of the H-BRB model can be reflected in two aspects: the number of rules and the rational basis for constructing the model. As the number of rules increases, the model complexity increases. To make a more intuitive comparison, it is assumed that there are M antecedent attributes with 3 evaluation levels for each attribute, which leads to the construction of 4 types of BRBs, including traditional BRBs and MTS-BRBs with 2/3/4 subnodes, and the following figure represents the number of rules for these BRBs, where the number of subnodes indicates the maximum number of subnodes for each internal node.
Fig 6 clearly shows that the MTS-BRB approach results in approximately linear growth in model size with increasing number of attributes for different subnode configurations (2, 3, or 4 subnodes). A quantitative comparison shows that the number of rules required by the MTS-BRB system is significantly lower than that of the traditional BRB method. Specifically, when the number of attributes reaches 10, the traditional BRB system needs to generate 310 = 59,049 rules, whereas only 81 (2-subnode), 117 (3-subnode), or 243 (4-subnode) rules are required when the MTS-BRB method is used, which are equivalent to 0.14%, 0.20%, and 0.41% of the number of rules of the traditional method, respectively. This comparison fully demonstrates the efficiency of the MTS-BRB method in controlling the number of rules, which effectively reduces the model complexity through the hierarchical tree structure. In addition, compared with the traditional H-BRB, which relies on manual organization construction, the MTS-BRB has greater rationality because the MTS-BRB organizes the model structure via a computational method that is based on entropy and mutual information.
(B) Model robustness analysis.
The robustness of the model is reflected mainly in the performance of the model in the presence of noise interference. Its evaluation index PDR is defined by Eq (25). where PDR represents the performance change caused by noise, and the larger the value is, the weaker the robustness of the model. In this work, three scenarios are designed to verify the model robustness: (1) adding Gaussian noise (2) adding impulse noise (3) combining Gaussian noise and impulse noise. Each scenario is designed with three interference strengths: weak, medium, and strong. The PDR changes of the global noise and characteristic interference are tested separately. In this study, all models, trained on clean data, were evaluated on noisy test sets to simulate online inference. The variation in the PDR is analyzed in the following two figures. The scenarios are designed as shown in Table 8.
In this paper, the effects of global noise and feature noise on model performance are evaluated separately. As shown in Figs 7 and 8, both global noise and feature noise lead to a systematic increase in PDR for all the models. Notably, MTS-BRB-R has superior stability in both noise scenarios, but its advantage is more prominent in feature noise.
According to the experimental data, the performance degradation rates (PDRs) of all the tested models significantly monotonically increased with increasing noise intensity (σ/p). This linear increasing relationship indicates a significant positive correlation between noise intensity and model performance degradation. Among the six compared models, the BPNN model exhibits the best noise immunity performance, with its PDR value remaining the lowest at all noise levels. Notably, among the BRB series of models, the MTS-BRB-R model exhibits superior robustness. The experimental data show that under the influence of moderate global noise (σ = 0.2), the PDR of the MTS-BRB-R (66.5%) is reduced by approximately 12% and 14.2% compared with those of the MTS-BRB and H-BRB, respectively. As shown in Figs 7 and 8, the PDR growth rate of MTS-BRB-R is significantly lower than that of H-BRB and MTS-BRB, confirming its better noise stability.
The superior performance of MTS-BRB-R may stem from the introduction of attribute reliability, which is designed to effectively suppress the propagation of noise in the inference process, thus maintaining greater stability. In contrast, the standard MTS-BRB and H-BRB models are more sensitive to noise disturbances and exhibit more substantial performance degradation, especially in high-noise environments.
(C) Analysis of model effects.
After the MTS-BRB-R model was built, a total of 2091 data points per hour in March from the mining flotation dataset were used as training data, and the test set was selected and normalized with data per hour in April. To illustrate the effectiveness of the MTS-BRB-R, in this paper, the MTS-BRB-R model is compared with other models.
The compared models are divided into two main categories: other BRB models and machine learning models. The BRB models include the BRB-PCA model constructed using data after PCA dimensionality reduction and the general hierarchical H-BRB and MTS-BRB models. Among them, PCA is a classical linear dimensionality reduction method based on orthogonal transformation, whose core idea is to preserve the maximum variance information in the data by projecting the high-dimensional data into a low-dimensional subspace spanned by principal components. The BRB-PCA model selects two principal components as inputs to construct the model. For H-BRBs, the model is constructed by randomly assigning three features to sub-BRBs. The MTS-BRB model constructs the model results in an ordered manner by calculating the AR values. Machine learning models include Bayesian regression, SVR, and BPNN. The performance analysis of each model is shown in Table 9.
In the diagonal scatter plot in Fig 9, ideally, all points should fall on the diagonal line (y = x), and deviations from the diagonal line indicate prediction errors. The points above the diagonal line indicate that the predictions are systematically high, and the points below the focus line indicate that the predictions are systematically low. As seen from the scatterplot, most models perform better in the region of small values (<2). Among them, the H-BRB, MTS-BRB, and MTS-BRB-R models show a trumpet trend distribution in the small-value region, which indicates that the error shows an increasing trend with increasing value. In the median region, the three models show a more regular distribution of outlier points, among which the degree of the outlier points of the MTS-BRB-R model is weaker than that of the remaining two models, and on the whole, the outlier points show a more regular distribution with the improvement of the model. With the improvement of the model, the outliers show a convergence trend; in the large value region, H-BRB shows a more obvious systematic underprediction, and with the improvement of the model, the underprediction problem shows an obvious improvement trend. A horizontal comparison of the MTS-BRB-R model reveals that the advantage of the MTS-BRB-R model is reflected mainly in the small-value region, which has a more favorable convergence trend, and in the large-value region, which overcomes the problem of systematic underprediction. Combining the horizontal and vertical comparisons, it can be proven that the MTS-BRB-R model has a superior modeling effect.
Combining the model robustness and model accuracy, the MTS-BRB-R model is more robust and accurate than the H-BRB model is. The Bayesian, SVR, and BPNN models have superior robustness, but their robustness is based on a lower initial accuracy. The MTS-BRB-R model is more robust than the other BRB models are; thus, it is reasonable and effective to solve the BRB multiattribute input problem with the MTS structure.
The superior performance of the proposed MTS-BRB-R model can be attributed to its unique structural and mechanistic advantages over both traditional BRB models and data-driven neural networks. Compared to traditional BRB, the multilayer tree structure effectively alleviates the rule explosion problem by decomposing the complex high-dimensional input space into manageable sub-modules, each focusing on a subset of attributes with high mutual correlation. This not only reduces model complexity but also enhances interpretability by providing a clear hierarchical reasoning path.
When compared to neural network models (e.g., BPNN, SVR), the MTS-BRB-R combines the strengths of both expert knowledge and data-driven learning. While neural networks often act as “black boxes" with limited interpretability, our model maintains transparent decision-making through its belief rule structure. The incorporation of attribute reliability further enhances robustness against noisy industrial data, which is a common challenge in flotation process monitoring. This hybrid modeling approach allows the MTS-BRB-R to achieve both high accuracy and reliability, making it particularly suitable for complex industrial processes where both performance and interpretability are crucial.
(D) Sensitivity analysis of tolerance parameter ψ.
The calculation of attribute reliability depends on the tolerance parameter ψ provided by domain experts. To verify the robustness of the model against variations in this parameter, we conducted a sensitivity analysis by testing ψ values ranging from 0.5 to 3.0. The performance metrics under different ψ values are summarized in Table 10.
The results demonstrate that the MTS-BRB-R model maintains stable performance across a wide range of ψ values (1.25 to 2.25), with RMSE variations within 0.2
5 Conclusion
Indicator prediction in the flotation process is a complex structured problem. Traditional BRB methods often lead to rule explosion when handling multiple parameters, as the number of rules grows via the Cartesian product of input references. Existing hierarchical models also largely rely on empirical segmentation without systematic construction frameworks. To overcome these issues, this paper proposes an MTS-BRB-R model based on a multilayer tree structure, which incorporates attribute reliability to enhance robustness and prediction accuracy.
The MTS-BRB-R model is particularly suited for complex systems like flotation with correlated inputs and local feature noise. It enables structured decomposition and reliability-aware inference. The experimental results demonstrate that the MTS-BRB-R model achieves superior performance compared to both traditional BRB and neural network approaches, owing to its effective integration of structural optimization through multilayer tree decomposition, enhanced robustness via attribute reliability mechanisms, and maintained interpretability through transparent belief rule reasoning.
However, limitations remain. Currently, only antecedent attribute reliability is considered, while sub-BRB outputs are assumed fully reliable, potentially affecting credibility assessment. The clustering of element sets still depends on subjective definitions, influencing structural optimization. Moreover, while competent under local noise, the model’s robustness against global systematic noise requires improvement. The mechanism through which attribute reliability affects overall model reliability also needs deeper investigation. Future work will focus on: (1) improving reliability computation for sub-BRBs; (2) developing adaptive clustering to automate element set partitioning; (3) enhancing robustness under global noise; (4) establishing a comprehensive reliability evaluation framework linking attribute-level and model-level performance.
References
- 1. Nagaraj DR, Farinato RS. Evolution of flotation chemistry and chemicals: a century of innovations and the lingering challenges. Minerals Engineering. 2016;96–97:2–14.
- 2. Bergh LG, Yianatos JB. The long way toward multivariate predictive control of flotation processes. Journal of Process Control. 2011;21(2):226–34.
- 3. Bartolacci G, Pelletier P Jr, Tessier J Jr, Duchesne C, Bossé P-A, Fournier J. Application of numerical image analysis to process diagnosis and physical parameter measurement in mineral processes—Part I: Flotation control based on froth textural characteristics. Minerals Engineering. 2006;19(6–8):734–47.
- 4. Maldonado M, Desbiens A, del Villar R. Potential use of model predictive control for optimizing the column flotation process. International Journal of Mineral Processing. 2009;93(1):26–33.
- 5. Dong H, Wang F, He D, Liu Y. Flotation equipment automation and intelligent froth feature extraction in flotation process: a review. Reviews in Chemical Engineering. 2025;41(3):225–39.
- 6. Gharai M, Venugopal R. Modeling of flotation process—an overview of different approaches. Mineral Processing and Extractive Metallurgy Review. 2016;37(2):120–33.
- 7. Zhang H, Tang Z, Xie Y, Luo J, Chen Q, Gui W. Grade prediction of zinc tailings using an encoder-decoder model in froth flotation. Minerals Engineering. 2021;172:107173.
- 8.
Nakhaei F, Sam A, Mosavi MR, Zeidabadi S. Prediction of copper grade at flotation column concentrate using Artificial Neural Network. In: IEEE 10th International Conference on Signal Processing Proceedings. 2010. p. 1421–4. https://doi.org/10.1109/icosp.2010.5656938
- 9. Jahedsaravani A, Marhaban MH, Massinaei M. Prediction of the metallurgical performances of a batch flotation system by image analysis and neural networks. Minerals Engineering. 2014;69:137–45.
- 10. Pu Y, Szmigiel A, Chen J, Apel DB. FlotationNet: a hierarchical deep learning network for froth flotation recovery prediction. Powder Technology. 2020;375:317–26.
- 11. He G, Liu M, Zhao H, Huang K. Ensemble prediction modeling of flotation recovery based on machine learning. International Journal of Mining Science and Technology. 2024;34(12):1727–40.
- 12. Nakhaei F, Irannajad M. Application and comparison of RNN, RBFNN and MNLR approaches on prediction of flotation column performance. International Journal of Mining Science and Technology. 2015;25(6):983–90.
- 13. Salehi Y, Chiplunkar R, Huang B. Robust-to-occlusion machine vision model for predicting quality variables with slow-rate measurements. Computers & Chemical Engineering. 2024;182:108581.
- 14. Shahbazi B, Chehreh Chelgani S, Matin SS. Prediction of froth flotation responses based on various conditioning parameters by Random Forest method. Colloids and Surfaces A: Physicochemical and Engineering Aspects. 2017;529:936–41.
- 15. Tian Y, Luan X, Liu F, Dubljevic S. Model predictive control of mineral column flotation process. Mathematics. 2018;6(6):100.
- 16. Chen W, Wang B, Yan X, Zhang H, Wang L. CFD investigation of chalcopyrite flotation prediction coupled with flotation kinetic model. Separation and Purification Technology. 2024;345:127203.
- 17. Dong Z, Wang R, Fan M, Fu X. Switching and optimizing control for coal flotation process based on a hybrid model. PLoS One. 2017;12(10):e0186553. pmid:29040305
- 18. Solomatine DP, Ostfeld A. Data-driven modelling: some past experiences and new approaches. Journal of Hydroinformatics. 2008;10(1):3–22.
- 19. Yang JB, Liu J, Wang J, Sii HS, Wang HW. Belief rule-base inference methodology using the evidential reasoning approach-RIMER. IEEE Transactions on systems, Man, and Cybernetics-part A: Systems and Humans. 2006;36(2):266–85.
- 20. Yang J-B, Wang Y-M, Xu D-L, Chin K-S, Chatton L. Belief rule-based methodology for mapping consumer preferences and setting product targets. Expert Systems with Applications. 2012;39(5):4749–59.
- 21. Yang L-H, Ye F-F, Liu J, Wang Y-M. Belief rule-base expert system with multilayer tree structure for complex problems modeling. Expert Systems with Applications. 2023;217:119567.
- 22. Gong A, He W, Ma N, Cao Y. Research on the application of multi-layer tree-structured belief rule base system in aviation engine performance evaluation. IEEE Access. 2024;12:139469–85.
- 23. Zhou Z, Feng Z, Hu C, Hu G, He W, Han X. Aeronautical relay health state assessment model based on belief rule base with attribute reliability. Knowledge-Based Systems. 2020;197:105869.
- 24.
MacKay DJ. Information theory, inference and learning algorithms. Cambridge University Press; 2003.
- 25. Feng Z, Zhou Z-J, Hu C, Chang L, Hu G, Zhao F. A new belief rule base model with attribute reliability. IEEE Trans Fuzzy Syst. 2019;27(5):903–16.
- 26. Zhang X, Gong A, He W, Cao Y, He H. A lithium battery health evaluation method based on considering disturbance belief rule base. Batteries. 2024;10(4):129.
- 27. Han P, Zhang Q, He W, Chen Y, Zhao B, Li Y, et al. A double inference engine belief rule base for oil pipeline leakage. Expert Systems with Applications. 2024;240:122587.