Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Comparison of the data mining and machine learning algorithms for predicting the final body weight for Romane sheep breed

  • Cem Tırınk ,

    Roles Conceptualization, Data curation, Formal analysis, Software, Visualization, Writing – original draft, Writing – review & editing

    cem.tirink@gmail.com

    Affiliation Department of Animal Science, Faculty of Agriculture, Igdir University, Iğdır, Türkiye

  • Hasan Önder,

    Roles Conceptualization, Data curation, Formal analysis, Software, Writing – original draft, Writing – review & editing

    Affiliation Department of Animal Science, Faculty of Agriculture, Ondokuz Mayis University, Samsun, Türkiye

  • Dominique Francois,

    Roles Resources, Writing – original draft, Writing – review & editing

    Affiliation GenPhySE, INRA, INPT, INP‐ENVT, Université de Toulouse, Castanet Tolosan, France

  • Didier Marcon,

    Roles Resources

    Affiliation INRAE, UE P3R, Osmoy, France

  • Uğur Şen,

    Roles Methodology, Writing – original draft, Writing – review & editing

    Affiliation Department of Agricultural Biotechnology, Faculty of Agriculture, Ondokuz Mayis University, Samsun, Türkiye

  • Kymbat Shaikenova,

    Roles Methodology, Writing – original draft, Writing – review & editing

    Affiliation Department of Technology and Processing of Livestock Production, Faculty of Veterinary and Animal Husbandry Technology, Saken Seifullin Kazakh Agrotechnical University, Astana, Kazakhstan

  • Karlygash Omarova,

    Roles Methodology, Writing – original draft, Writing – review & editing

    Affiliation Department of Technology and Processing of Livestock Production, Faculty of Veterinary and Animal Husbandry Technology, Saken Seifullin Kazakh Agrotechnical University, Astana, Kazakhstan

  • Thobela Louis Tyasi

    Roles Methodology, Writing – original draft

    Affiliation Department of Agricultural Economics and Animal Production, School of Agricultural and Environmental Sciences, University of Limpopo, Sovenga, Limpopo, South Africa

Abstract

The current study aimed to predict final body weight (weight of fourth months of age to select the future reproducers) by using birth weight, birth type, sex, suckling weight, age at suckling weight, weaning weight, age at weaning weight, and age of final body weight for the Romane sheep breed. For this purpose, classification and regression tree (CART), multivariate adaptive regression splines (MARS), and support vector machine regression (SVR) algorithms were used for training (80%) and testing (20%) sets. Different data mining and machine learning algorithms were used to predict final body weight of 393 Romane sheep (238 female and 155 male animals) were used with different artificial intelligence algorithms. The best prediction model was obtained by CART model, both training and testing set. Constructed CART models indicated that sex, suckling weight, weaning weight, age of weaning weight, and age of final weight could be used as an indirect selection measure to get a superior sheep flock on the final body weight of Romane sheep. If genetically established, the Romane sheep whose sex is female, age of final weight is over 142 days, and weaning weight is over 28 kg could be chosen for affording genetic improvement in final body weight. In conclusion, the usage of CART procedure may be worthy of reflection for identifying breed standards and choosing superior sheep for meat yield in France.

Introduction

Sheep is one of the first domesticated animals within the scope of archaeological and genetic studies [1, 2]. As a result of this domestication process, it has led to an improvement in sheep breeding over time. Multi-purpose sheep breeding is very noteworthy not only for the development of healthy civilizations but also for obtaining animal products such as milk, meat, and fleece, especially in the development processes of rural economies [36]. According to this importance for human being, researchers aimed sheep farms to reach a more profitable, innovative, and sustainable [7]. According to the FAO, 5.5 million heads of sheep are adopted to France’s several regions [8]. Principal sheep breeds include Charollais, Ile de France, Berrichon du Cher, Blanc du Massif Central, Roussin de La Hague, Préalpes du Sud, Lacaune and Romane in France [911]. Sheep breeds suitable for the regions have been adapted for sustainable sheep breeding, along with the optimization of care, food, and other environmental requirements [12].

Knowing the live weights of the animals in the herd in sheep breeding is essential in determining the breeding strategy to be applied in the herd and herd management. In this context, it will provide convenience in calculating the optimum amount of feed per sheep, determining the drug doses, marketing price more reliably, and optimum slaughter time of the animals [1315].

In literature, there are many studies in which multivariate statistical methods are used together with biometric measurements to create a characterization for breeds [1619]. Within the framework of the regression analysis assumptions, requirements such as linearity, normal distribution, constant variance, and explanatory variable independence should be sought in multivariate statistical approaches [2023]. However, various data mining and machine learning algorithms do not need these assumptions [24, 25]. In this context, many studies conducted within the scope of multivariate statistical methods have been carried out on different breeds and species, in which characteristics such as birth weight, live weight, weaning weight, and final weight are tried to be estimated [2628]. The main purpose of trying to predict such different traits is that these traits vary among animal materials. Expressing these differences will provide convenience in herd management issues such as selection practices and breed characterization.

Using information on sex, birth weight, suckling weight, age of suckling weight, weaning weight, age at weaning weight, and age at final weight, this study attempted to estimate the ideal final weight (weight at the fourth month of age) to choose the future reproducers (rams and ewes)). Three algorithms, including the CART, MARS, and SVR algorithms, were utilized and compared to achieve this goal.

Materials and methods

Data on the Romane sheep breed was provided by INRA in order to compare the algorithms. Romane sheep (Berrichon du Cher by Romanov crossbred) was developed to increase the productivity of the French sheep herd. INRA created the synthetic strain INRA 401 by crossing the Berrichon du Cher breed (good meat yield and quality but not very prolific, not very maternal, white fleece) with the Romanov breed (very prolific, maternal, but low butchering skills and coloured fleece). The line obtained was named Romane breed in 2006. It has an average litter size of 2 lambs per birth (for 3-year-old ewes), excellent fertility in the off-season, and good viability of the lambs at birth [29]. Housing conditions during the suckling period: ewes and lambs in the barn, ewes are fed with a total mix ration composed of wrapped forage bales and concentrates, and lambs are fed ad-libitum with concentrates. This study used data from 393 animals (155 male and 238 female) born in August, September and October 2021. Sex, birth weight, suckling weight, age at suckling weight (about 30 days), weaning weight, age at weaning weight, and age at final body weight were used to estimate the final weight that weight of fourth months of age to select the future reproducers (rams and ewes), still young and not mature adults.

Descriptive statistics were showed to response and explanatory variables according to each sex. The comparison of each variable was compared by using the independent samples t-test (p<0.05).

The Classification and Regression Tree (CART) algorithm was proposed by Breiman et al. [30]. With the CART algorithm, a binary split tree structure created by splitting a variable homogeneously includes the two sub-nodes. In the CART algorithm, the process begins from the root node, including the initial data set, and continues until many homogeneous sub-nodes are gotten, which will supply the minimum error variance.

Multivariate Adaptive Regression Splines (MARS) algorithm suggested to overcome the classification and regression type problems by Friedman [31]. The MARS algorithm is one of regression procedure that facilitates a more influential description of interaction, linear and nonlinear effects between explanatory and response variables. There is no need for the MARS algorithm assumptions as in linear regression [27, 3234].

The MARS algorithm generates base functions according to a step-by-step procedure, taking into account all possible interaction effects between candidate nodes and explanatory variables. The algorithm includes two different steps such as forward and backward pass steps, respectively [35]. The initial steps is the forward pass process which begins to determine the term of intercept in the initial pattern, and to improve the model, iteratively contains the initial patterns coupled with the least training error. The forward pass steps typically products an overfitted configuration that achieves extreme complexity [27, 31]. The model built from the forward pass process fits predominantly worthy. However, it can be difficult to overfitting in terms of generalization ability. The initial patterns that specify the smallest amount of the prediction model are abolished in the backward pass process, and this process is hand-me-down in the resolution to this problem [18, 35, 36]. The MARS algorithm is a significant instrument that can take linear and nonlinear relationships between dependent and independent variables [37, 38]. The equation for MARS procedure utilized to predict body weight from explanatory variables is below.

(1)

here, : the predicted value of BW, β0: the intercept of the model, βm: the basis functions’ coefficient, Km: the parameter that determines the degree of interaction, hkm(Xv(k,m)): the determined basis functions of the model.

Generalized cross-validation error (GCV) is eliminated by using variable selection (forward and backward) and thus the performance of the model is increased. The calculation of GCV is [27, 36, 39]: (2) here; n: the size of training set, yi: response variable’ observed value (BW), (): the estimation of the response variable (BW), M(λ): is called the penalty term for the complexity for the model that includes the λ terms.

At the initiation of the MARS procedure, the relationship between the explanatory variables, called multicollinearity, was tested and it was determined that there was no multicollinearity problem between the explanatory variables. To predict BW utilization in the training set. The cross-validation procedure helped to select the greatest MARS model among 72 MARS models (degree = 1:2 and nprune = 2:38). In addition, for training data set, ten-fold cross-validation technique was utilized for optimal MARS model.

A significant twig of the support vector machine (SVM), which is a machine learning algorithm, is the support vector regression (SVR) algorithm [40]. Here, struggling with classification is named support vector classification (SVC), while struggling with modeling and prediction is named SVR [33, 4143]. Although SVR is also a supervised learning method, the prediction performance obtained from SVR varies depending on the training and test set [33, 44].

The main goal in the linear SVR model is to define a function of f(x) which can have the maximum deviation (ε) from the training set. Training set points are built into the boundary between −ε to +ε [44]. However, in most studies, it cannot be shown within the scope of linear properties. Therefore, when using nonlinear SVR, the input data is matched to a higher dimensional Hilbert space (H) so that the regression line can be linear [40].

The hyperplane of the nonlinear SVR to be obtained is as follows; (3)

In this equation, weight vector is defined by w, non-linear kernel functions are defined by ϕ(x), indicates vector inner product is defined by 〈.,.〉 and the term of b is a bias term. There are many nonlinear kernel functions. In the current study, gaussian radial basis kernel function were used.

In comparison of the model performances, the following goodness of fit criteria were used [27, 28, 33, 36, 45]:

  1. 1. Root-mean-square error (RMSE):
(4)
  1. 2. Akaike information criterion (AIC):
(5)
  1. 3. Standard deviation ratio (SDratio):
(6)
  1. 4. Global relative approximation error (RAE):
(7)
  1. 6. Mean absolute percentage error (MAPE):
(8)
  1. 7. Pearson correlation coefficient (r):
(9)
  1. 8. Performance Index (PI):

(10) where, n is the size of training data set, k is called the number of parameters for the model, yi is the real value of the response variable (BW), is the predicted value for response variable (BW), sd is the standard deviation for the response variable (BW), sm is the standard deviation for optimum model’s errors [33].

RMSE, SDratio, CV, PI, RAE, MAPE and AIC, r and R2 criteria were used to compare the performance of the model. For this aim, it needs to determine best model for smallest RMSE, SDratio, CV, PI, RAE, MAPE and AIC values for train and test set and also, the highest r, R2 value for all algorithms [46].

All statistical evaluation was performed using R software [47]. To required knowledge of the data, descriptive statistics were utilized. The descriptive statistics for explanatory and response variables were predicted by using “psych” package in R environment [48]. The “caret” packages in the R software were used to perform analyze of the CART and MARS algorithms [49]. Also, support vector regression algorithm was performed by using “e1071” package in R software [50]. To display the performances of the constructed all models, the “ehaGoF” package was utilized [51].

Results

Descriptive statistics for all variables by sex (male and female) factor were expressed as mean ± standard error and are given in Table 1.

thumbnail
Table 1. Descriptive statistics for the whole characteristics.

https://doi.org/10.1371/journal.pone.0289348.t001

In Table 2, Pearson’s correlation coefficients for defining the association with response and explanatory variables except for age of suckling weight, age of weaning weight, and age of final weight. Final body weight had a small correlation coefficient with the birth weight, suckling weight, weaning weight with the coefficients of 0.29, 0.42, 0.52, respectively. Among the explanatory variables, relatively high correlation was determined only between weaning weight and suckling weight. The other correlation coefficients were low or moderate and can be interpreted as having no relationship.

thumbnail
Table 2. Correlation coefficients of the explanatory variables.

https://doi.org/10.1371/journal.pone.0289348.t002

Fig 1 points out the constructed CART regression tree diagram in estimation of final weight from explanatory variables such as birth weight (bw), suckling weight (sw), age at suckling weight (asw), weaning weight (ww), age at weaning weight (aww), and age at final weight (afw) and sex (1 = male, 2 = female) factor.

The final weight of the Romane sheep breed was divided into two groups based on sex. On the left side of the diagram, if the sex was female, the mean of the final body weight was determined as approximately 36 kg. The female Romane sheep were divided into two groups based on weaning weight, that is, weaning weight <23 kg and weaning weight ≥23 kg. If the weaning weight was ≥23 kg, the tree was divided into the age of final weight<106 days with the mean of 37 kg. In addition, if the age of the final weight was above 106 days, the final weight was divided into two groups as for weaning weight was over and under 27 kg. According to Fig 1, if the sex was female, weaning weight ≥23 kg, age at final weight≥106 days, and weaning weight≥27 kg, the predicted final weight was determined as 45 kg. On the right side of the diagram, if the sex was male, the mean of the final body weight was determined as approximately 42 kg. The male Romane sheep were divided into two groups for the age at final weight at 142 days. If the age of final weight is under 142 days, the final body weight was divided into for suckling weight with the coefficient of 18 kg. In addition, if the sex was male, the age of final weight was over 142 days, weaning weight over 28 kg, the final body weight was determined as 64 kg.

Table 3 gives the results of the CART algorithm depending on the cross-validation procedure. The constructed CART regression tree algorithm made 12 terminal knots (size of regression tree) with relative error (0.148) and the cross-validation error (0.312) which means that coefficient of determination (R2) and cross-validation R2 were close to each other with 0.852 (1–0.148) and 0.688 (1–0.312), respectively (Table 3). The relative error can estimate the R2, and cross-validation error can be defined as the mean of the cross-validated prediction errors and cross-validation R2 is calculated by 1–relative error. Cross-validation standard deviation can be defined as the standard deviation of the cross-validated prediction errors (Table 3).

thumbnail
Table 3. Results of CART algorithm based on the cross-validation technique.

https://doi.org/10.1371/journal.pone.0289348.t003

Table 4 shows that the final body weight can be explained with the eight basic functions in the MARS prediction model. The first term of the MARS prediction model is the intercept with value of 36.770. In the second term, the case where the sex is female for a negative coefficient of -4.369. The third term (Suckling weight-12.8) had a cutpoint of 12.8 kg with a coefficient of 0.814. The fourth term was the suckling weight age, with a cutpoint of 35 with a coefficient of 0.415. For the age of suckling weight is 35 days, the effect of term four on final body weight was unavailable. The fifth and sixth terms were for weaning weight, with cutpoint 17.4 and 24.4 with a coefficient of 1.0817 and -0.616, respectively (Table 4). When weaning weight was 17.4 and 24.4 kg or lighter, no contribution of the weaning weight to the final body weight was found. For the weaning weight lighter than 17.4 kg, term 5 was useless, and then only the term 6 (Weaning weight-24.4) contributed to the final body weight. The same procedure was for the term of 6. If the weaning weight was lighter than 24.4 kg, the term of 6 was useless. The seventh and the last term of the MARS prediction model were for the age of final weight with a cutpoint of 133 days.

To estimate the final body weight, the obtained best prediction MARS model allows breeders to make more precise decisions considering herd management, such as the necessary feed amount, medicinal drug doses for Romane sheep breed, and establishing the selling value of each sheep.

At the start of the SVR algorithm was instructed to the training set. Later performing the training processes, the SVR was assessed to predict the final body weight for Romane sheep breed. In the current study, the gaussian radial basis kernel function was used for estimating the final body weight of the Romane sheep breed. The consistency of the model is based on the selection parameters such as cost (C), gamma and epsilon. The current study determined cost, gamma, and epsilon values as 1, 0.083, and 0.1, respectively. These parameters were verified for several values, and analysis was affected for C and epsilon values. So, it would provide the greatest trustworthy model. To determine which variable is more effective, sensitivity analysis was carried out. The sensitivity analysis was calculated to estimate the relative importance values of the affected explanatory variables on final body weight. In Fig 2, the sensitivity analysis results were given.

thumbnail
Fig 2. Sensitivity analysis for support vector regression algorithm.

https://doi.org/10.1371/journal.pone.0289348.g002

According to Fig 2, the age of final weight was a more effective variable on final body weight for support vector regression algorithms. The second effective variable was weaning weight. Apart from these variables, sex variables (male and female), suckling weight, age of weaning weight, and birth weight were also important in determining the final body weight. The least effective variables in determining the final body weight were the age of suckling weight, birth type (2, 3, and 4), and number of co-suckled lambs.

For comparing all models, the goodness of fit criteria was used. The model performance results for CART, MARS, and SVR algorithms, based on the goodness-of-fit criteria, were provided in Table 5. Table 5 shows the finest prognostic model was realized for CART algorithm. The CART algorithm had the smallest RMSE, SDratio, CV, PI, RAE, MAPE, and AIC values for the train and testing set. Also, the highest R2 value was determined for the CART algorithm.

thumbnail
Table 5. Goodness-of-fit criteria for CART, MARS and SVR algorithms.

https://doi.org/10.1371/journal.pone.0289348.t005

In addition, The Pearson correlation coefficient was determined as 0.923 for the original data and 0.902 for the training and test sets for the predicted data, respectively. In addition, since the coefficient of variation (CV) values for both the train and the testing set in each model are below 30%, it is concluded that the results obtained from the applied models are reliable.

Discussion

Various statistical methods can be used to explain the relationship between body weight and various characteristics. The important point here is to use the correct statistical method. There are even some studies to estimate live weight in different breeds; no study has been found for the Romane sheep breed. In the present study, the final weight of male Romane sheep breed had a greater mean with comparison female Romane sheep breed (p<0.05).

Alonso et al. [43] utilized the SVR algorithm to predict the carcass weight in Asturiana de los Valles beef. To predict the carcass weight, the study was made by using 390 measurements from 144 animals. According to the study’s results, 150 days before slaughter time the ideal carcass weight prediction was obtained. They found MAPE values as 4.12 and 4.91 for train and test, respectively. The results of the MAPE values were smaller than our results. The achievement of their study may be caused by a sample size greater than ours.

Celik et al. [3] aimed to compare CART, CHAID, Exhaustive CHAID, MARS, MLP, and RBF for Mengali rams. The greatest prediction model was described as the CART algorithm in the scope of the model comparison criteria such as R2, RMSE and SDratio. The CART and MARS algorithms we used in our study were also used in this study. Celik et al. found that the CART algorithm was more reliable than the MARS algorithm [3]. This result supports our current study results.

Iqbal et al. [52] was aimed to examine performances of the random forests, regression tree, SVR algorithm, and gradient boosting machine algorithms. For this aim, Iqbal et al. [52] was used the data obtained from Beetal goats. To predict the body weight, they some biometric measures such as sex, and body length. As model comparison criteria such as Pearson’s correlation, R2, RMSE, MAPE, and MAE were used. The results of Iqbal et al. [52] showed that the gradient boosting machine was stated to be the greatest model for estimating the body weight of Beetal goats. However, the random forest regression algorithm was determined to be the second-best algorithm. Even if the SVR algorithm does not give the best results, even the data structure impacts on the prediction performances.

Marco et al. [53] tried to use AdaBoost ensemble learning method and RFR for different data sets. Several machine learning algorithms, for different data sets MLP, SVR, CART, kNN, and RFR were applied. According to the results obtained from most of the datasets applied in the study, they stated that it is a reliable and successful algorithm for RFR. However they indicated that the CART algorithm was also a useable algorithm, even if the CART algorithm wasn’t very sensitive to parameter tuning when RFR was so sensitive to parameter tuning, which results in stable prediction performance.

Tırınk [54] compared various artificial intelligence methods such as Multivariate Adaptive Regression Splines, Random Forest Regression, Bayesian Regularized Neural Network, and Support Vector Regression algorithms to estimate body weight from biometric measurements for the Thalli sheep breed. For this aim, 270 female Thalli sheep breeds were used. According to the results, the MARS algorithm was the best prediction model for Thalli sheep breeds inside the bayesian regularized neural network, random forest regression and support vector regression. These results support our results because the MARS model showed more consistent results than the SVR algorithm.

Tırınk et al. [33] compared CART, SVR, and RFR artificial intelligence methods using body measurements to estimate body weight at a different share of Polish Merino in the genotype of crossbreds (share of Suffolk and Polish Merino genotypes). To compare the estimation performances of the evaluated algorithms and determine the best model for estimating body weight, various body measurements and sex and birth type characteristics were assessed. According to test sets results, using random forest regression was recommended instead of using CART and SVR algorithms. The performance of the SVR (R2 = 0.714) algorithm was found to be more reliable than CART (R2 = 0.578) algorithm by Tırınk et al. [33]. These findings were not compatible with our results. In our study, the CART (R2 = 0.810) algorithm was better than the SVR (R2 = 0.761) algorithm. But it should be considered that there were such big differences in the point of determination coefficient. They used the data for 344 animals, while our sample size was 393. Although these sample sizes were similar, this cannot be the reason for the differences between the algorithms. The structure of the data may be the reason for these differences.

Kumar and Kumar-Singh [55] aimed to compare MLR, MARS, SVR, and RFR techniques in hydrological time-series modelling. Their results showed that according to the RMSE and R2 values, they reported that the SVR algorithm was superior and stated that it was applicable to predict the weekly pan evaporation values for the Ranichauri region. Their results were comparable only for MARS and SVR algorithms with our results. They suggested the use of SVR, but our results had a different recommendation to use MARS algorithms when compared with the study of Kumar and Kumar-Singh [55]. The coefficient of determination difference was only 0.01, which can be ignored that both MARS and SVR can be interpreted as similar, as in our results.

Komadja et al. [56] aimed a model development to predict peak particle velocity (PPV) in opencast mines using CART, MARS, and SVR algorithms. The models were developed using a record of 1001 real blast-induced ground vibrations, with ten corresponding blasting parameters from 34 opencast mines/quarries from India and Benin. The results showed that the MARS model outperformed other models in this study with lower error (RMSE = 0.227) and R2 of 0.951, followed by SVR (R2 = 0.87), CART (R2 = 0.74), and empirical predictors. Their results completely contrast to our findings that our study showed the CART algorithm was the best. Also, Komadja et al. [56] gave 50 results of previous studies results in a list. Interpretation of the list showed no consensus on the algorithm selection. The contrast in the results may be caused by data type and sample size.

Compared to the abovementioned research, artificial intelligence was used for many species and breeds. The extensive variation in previous studies was attributed to the physiologic phase of the animals, raising systems differences, and selection of statistical methods to apply. When our study was compared with other results, it was determined when the selected goodness-of-fit criteria are examined, it is understood that the models used in this study give similar results. However, proposing some statistical methods for BW estimation using biometric traits is very important for animal production, characterization and breeding purposes. The results obtained showed that much more work needs to be done on this subject.

Conclusion

Finally, in order to provide breeders and researchers with access to a superior population of Romane sheep for use in upcoming research, the CART method should be recommended. The findings of the current study, which uses the goodness of fit criteria to choose the best model for both CART and other methods, demonstrated that data mining and machine learning algorithms can be successfully used to estimate body weight based on other explanatory variables obtained. Even if there are some variations brought on by the breed factor, as was mentioned in the discussion section, more accurate models can be created by carrying out comparable research in addition to using alternative algorithms.

References

  1. 1. Zeder MA. Domestication and early agriculture in the Mediterranean Basin: origins diffusion, and impact. Proc. Natl. Acad. Sci. 2008; 105: 11597–11604. pmid:18697943
  2. 2. Şirin E. The effect on fertility of using different doses of PMSG in Anatolian merino sheep. BSJ Agric. 2022; 5(3): 278–280.
  3. 3. Celik S, Eyduran E, Karadas K, Tariq MM. Comparison of predictive performance of data mining algorithms in predicting body weight in Mengali rams of Pakistan. Braz J Anim Sci.2017; 46(11): 863–872.
  4. 4. Erkan Can M, Boğa M. The importance of sheep shelters in cold climate region. BSJ Agric. 2018; 1(1): 1–5.
  5. 5. Alarslan E, Aygün T. Determination of some milk yield and reproduction characteristics of Kıvırcık sheep in Yalova. BSJ Agric. 2019; 2(2): 86–92.
  6. 6. Sahin Y, Olfaz M. Investigating the breeding and lamb growing practices applied by sheep farmers in Tokat region. BSJ Agric. 2019; 2(3): 171–180.
  7. 7. Ateş A, Çam MA. Structural features of sheep farms in Ordu province. BSJ Agric. 2022; 5(3): 323–328.
  8. 8. FAO. Faostat, Live Animals Data. Available online: http://www.fao.org/faostat/en/#data/QL (accessed on 20 October 2021).
  9. 9. Huby M, Griffon L, Moureaux S, De Rochambeau H, Danchin-Burge C, Verrier E. Genetic variability of six French meat sheep breeds in relation to their genetic management. Genet. Sel. Evol. 2003; 35: 637–655. pmid:14604512
  10. 10. Vitezica ZG, Moreno CR, Bouix J, Barillet F, Perret G, Elsen M. A study on associations between PrP genotypes and meat traits in French sheep breeds. Anim Sci. 2005; 81(3): 325–330.
  11. 11. Vitezica ZG, Moreno CR, Bodin L, François D, Barillet F, Brunel JC, et al. No associations between PrP genotypes and reproduction traits in INRA 401 sheep1. J Anim Sci. 2006; 84: 1317–1322.
  12. 12. Sahin O, Aytekin I, Boztepe S, Keskin I, Karabacak A, Altay Y, et al. Relationships between FAMACHA© scores and parasite incidence in sheep and goats. Trop Anim Health Prod. 2021; 53: 331.
  13. 13. Sabbioni A, Beretti V, Superchi P, Ablondi M. Body weight estimation from body measures in Cornigliese sheep breed. Ital J Anim Sci. 2020; 19(1): 25–30.
  14. 14. Ağyar O, Özköse E, Ekinci MS, Akyol İ. Investigation of live weight measurements of morkaraman lambs according to various times in terms of different variables. BSJ Agri. 2020; 3(3): 193–199.
  15. 15. Sakar ÇM, Ünal İ, Okuroğlu A, Coşkum Mİ, Zülkadir U. Prediction of live weight from chest girth from birth to 12 months of age in Yerli Kara cattle. BSJ Agric. 2020; 3: 200–204.
  16. 16. Khan MA, Tariq MM, Eyduran E, Tatliyer A, Rafeeq M, Abbas F, et al. Estimating body weight from several body measurements in Harnai sheep without multicollinearity problem. J Anim Plant Sci. 2014; 24(1): 120–126.
  17. 17. Ali M, Eyduran E, Tariq MM, Tirink C, Abbas F, Bajwa MA, et al. Comparison of artificial neural network and decision tree algorithms used for predicting live weight at post weaning period from some biometrical characteristics in Harnai sheep. Pakistan J Zool. 2015; 47: 1579–1585.
  18. 18. Faraz A, Tirink C, Eyduran E, Waheed A, Tauqir NA, Nabeel MS, et al. Prediction of live body weight based on body measurements in Thalli sheep under tropical conditions of Pakistan using CART and MARS. Trop Anim Health Prod. 2021; 53(2): 1–12. pmid:33931799
  19. 19. Yavuz E, Şahin M. Investigation of Parametric, Non-Parametric and Semiparametric Methods in Regression Analysis. Sakarya Univ J Sci, 2022; 26(6): 1111–1116.
  20. 20. Mendes M, Akkartal E. Regression tree analysis for predicting slaughter weight in broilers. Ital J Anim Sci, 2009; 8: 615–624.
  21. 21. Sahin M, Yavuz E, Uckardes F. Multicollinearity Problem and Bias Estimates in Japanese Quail. Pakistan J Zool. 2018; 50(2): 757–761.
  22. 22. Tırınk C. Estimating of birth weight using placental characteristics in the presence of multicollinearity. BSJ Eng. Sci. 2020; 3: 138–145.
  23. 23. Yavuz E, Şahin M. Semiparametric regression models and applicability in agriculture. BSJ Agri. 2022; 5(2): 160–166.
  24. 24. Tyasi TL, Eyduran E, Celik S. Comparison of tree-based regression tree methods for predicting live body weight from morphological traits in Hy-line silver brown commercial layer and indigenous Potchefstroom Koekoek breeds raised in South Africa. Trop Anim Health Prod, 2021; 53: 7.
  25. 25. Kurnaz B, Önder H, Piwczynski D, Kolenda M, Sitkowska B. Determination of the best model to predict milk dry matter in high milk yielding dairy cattle. Acta Sci. Pol. Zootechnica, 2021; 20(3): 41–44.
  26. 26. Eyduran E, Zaborski D, Waheed A, Celik S, Karadas K, Grzesiak W. Comparison of the predictive capabilities of several data mining algorithms and multiple linear regression in the prediction of body weight by means of body measurements in the indigenous Beetal goat of Pakistan. Pakistan J Zool, 2017; 49(1): 257–265.
  27. 27. Eyduran E, Akin M, Eyduran SP. Application of Multivariate Adaptive Regression Splines through R Software; Nobel Academic Publishing; Ankara, Türkiye, 2019.
  28. 28. Olfaz M, Tirink C, Onder H. Use of CART and CHAID algorithms in Karayaka sheep breeding. J. Kafkas Univ. Vet. Fak. Derg. 2019; 25(1): 105–110.
  29. 29. Ricordeau G, Tchamitchian L, Brunel JC, Nguyen TC, François D. La race ovine INRA 401: un exemple de souche synthétique. INRA Prod. Anim. 1992; 1992: 255–262.
  30. 30. Breiman L, Friedman JH, Olshen RA, Stone CJ. Classification and regression trees; Chapman and Hall Wadsworth Inc.; New York, NY, USA, 1984.
  31. 31. Friedman J. Multivariate adaptive regression splines. Ann Stat, 1991; 19(1): 1–67.
  32. 32. Akin M, Eyduran SP, Eyduran E, Reed BM. Analysis of macro nutrient related growth responses using multivariate adaptive regression splines. Plant Cell, Tissue Organ Cult. 2020; 140: 661–670.
  33. 33. Tırınk C, Piwczyński D, Kolenda M, Önder H. Estimation of body weight based on biometric measurements by using random forest regression, support vector regression and CART algorithms. Animals, 2023; 13: 798. pmid:36899654
  34. 34. Coşkun G, Şahin Ö, Altay Y, Aytekin İ. Final fattening live weight prediction in Anatolian merinos lambs from some body characteristics at the initial of fattening by using some data mining algorithms. BSJ Agri. 2023; 6: 47–53.
  35. 35. Arthur CK, Temeng VA, Ziggah YY. Multivariate Adaptive Regression Splines (MARS) approach to blast-induced ground vibration prediction. Int. J. Min. Reclam, 2020; 34(3): 198–222.
  36. 36. Zaborski D, Ali M, Eyduran E, Grzesiak W, Tariq MM, Abbas F, et al. Prediction of selected reproductive traits of indigenous Harnai sheep under the farm management system via various data mining algorithms. Pakistan J Zool, 2019; 51: 421–431.
  37. 37. Çanga D, Yavuz E, Efe E. Prediction of Egg Weight Using MARS data mining Algorithm through R. KSU J. Agric. Nat. 2021; 24(1): 242–251.
  38. 38. Çanga D. Use of MARS Data Mining algorithm based on training and test sets in determining carcass weight of cattle in different breeds. J. Agri. Sci. 2022; 28(2): 259–268.
  39. 39. Çanga D, Boğa M. Detection of correct pregnancy status in lactating dairy cattle using MARS data mining algorithm. Turk J. Vet. Anim. Sci. 2022; 46: 809–819.
  40. 40. Nguyen QT, Fouchereau R, Frénod E, Gerard C, Sincholle V. Comparison of forecast models of production of dairy cows combining animal and diet parameters. Comput. Electron. Agric. 2020; 170: 105258.
  41. 41. Smola AJ, Schölkopf BA. Tutorial on support vector regression. Stat. Comput. 2004; 14: 199–222.
  42. 42. Kavaklioglu K. Modeling and prediction of Turkey’s electricity consumption using Support Vector Regression. Appl. Energy, 2011; 88: 368–375.
  43. 43. Alonso J, Castañón ÁR, Bahamonde A. Support Vector Regression to predict carcass weight in beef cattle in advance of the slaughter. Comput. Electron. Agric. 2013; 91: 116–120.
  44. 44. Patel AK, Chatterjee S, Gorai AK. Development of a machine vision system using the support vector machine regression (SVR) algorithm for the online prediction of iron ore grades. IAPR Mach Vis Appl, 2017; 12: 197–210.
  45. 45. Grzesiak W, Zaborski D. Examples of the use of data mining methods in animal breeding. In: Data mining applications in engineering and medicine (ed. A Karahoca); InTech, Rijeka: Croatia, 2012.
  46. 46. Tatliyer A. The Effects of Raising Type on Performances of Some Data Mining Algorithms in Lambs. KSU J. Agric. Nat. 2020; 23(3): 772–780.
  47. 47. R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2022, URL: https://www.R-project.org/.
  48. 48. Revelle W. psych: Procedures for Personality and Psychological Research, Northwestern University, Evanston, Illinois, USA, 2022, https://CRAN.R-project.org/package=psychVersion=2.2.9.
  49. 49. Kuhn M. caret: Classification and Regression Training. R package version 6.0–93, 2022, URL: https://CRAN.R-project.org/package=caret.
  50. 50. Meyer D, Dimitriadou E, Hornik K, Weingessel A, Leisch F. e1071: Misc Functions of the Department of Statistics, Probability Theory Group (Formerly: E1071), TU Wien. R package version 1.7–12, 2022, URL: https://CRAN.R-project.org/package=e1071.
  51. 51. Eyduran E. ehaGoF: Calculates Goodness of Fit Statistics. R package version 0.1.1, 2020, URL: https://CRAN.R-project.org/package=ehaGoF.
  52. 52. Iqbal F, Waheed A, Faraz A. Comparing the Predictive Ability of Machine Learning Methods in Predicting the Live Body Weight of Beetal Goats of Pakistan. Pakistan J Zool, 2022; 54(1): 231–238.
  53. 53. Marco R, Ahmad SSS, Ahmad S. Bayesian hyperparameter optimisation and Ensemble Learning for Machine Learning Models on software effort estimation. Int. J. Adv. Comput. Sci. Appl. 2022; 13(3): 419–429.
  54. 54. Tırınk C. Comparison of Bayesian Regularized Neural Network, Random Forest Regression, Support Vector Regression and Multivariate Adaptive Regression Splines Algorithms to Predict Body Weight from Biometrical Measurements in Thalli Sheep. J. Kafkas Univ. Vet. Fak. Derg. 2022; 28(3): 411–419.
  55. 55. Kumar P, Kumar-Singh AA. Comparison between MLR, MARS, SVR and RF Techniques: Hydrological Time-series Modeling. J. Human Earth Future 2022; 3(1): 90–98.
  56. 56. Komadja GC, Rana A, Glodji LA, Anye V, Jadaun G, Onwualu PA, et al. Assessing Ground Vibration Caused by Rock Blasting in Surface Mines Using Machine-Learning Approaches: A Comparison of CART, SVR and MARS. Sustainability 2022; 14: 11060.