Figures
Abstract
Sex assessment in cremation contexts represents a key step for reconstructing funerary rituals and demographic profiles. However, the high degree of fragmentation and thermal alteration of skeletal elements significantly hampers this operation. Most often, the morphological traits of pelvis and skull are not preserved while the application of metric criteria follows those developed from modern reference series. We used a prehistoric Italian sample (n = 155) with gender specific grave goods as a proxy for sex and developed binary logistic regression and Bayesian models based on 21 postcranial metric variables. We compared the results with the cut-off point method and validated them using an independent Austrian prehistoric sample (n = 45) sex estimation based on morphological sex traits. The Bayesian model achieved the highest accuracy (89%), outperforming cut-off points and regression models (68–88%, depending on the variable). In the Austrian sample, cut-off points produced the highest proportion of classified individuals but misclassified four times more cases than the other methods. Bayesian and regression models yielded higher rates of ambiguous classifications but maintained low error rates. Inconsistencies between the Bayesian sex prediction and morphological sex estimation can be explained by less reliable morphological traits in the morphological assessments. We also present CRISP, a new open-source software using the Bayesian approach for sex prediction. Multivariate statistical methods proved both more reliable and more adaptable for sex estimation in human cremated remains than univariate cut-off approaches. Validation demonstrated that metric methods substantially increase the number of individuals for which sex can be estimated and retain robustness when morphological indicators are ambiguous. The tool can further be used to identify deposits of cremated human remains composed of more than one individual. Minimal interpopulation differences between the Italian training and Austrian validation samples suggested CRISP is suitable for both Mediterranean and Central European contexts.
Citation: Waltenberger L, Beitel S, Bischeri M, Rebay-Salisbury K, Sperduti A, Cavazzuti C (2026) CRISP: Cremated remains inference of sex probabilities – A software for Bayesian sex estimation in human cremated remains. PLoS One 21(5): e0346813. https://doi.org/10.1371/journal.pone.0346813
Editor: Peter F. Biehl, University of California Santa Cruz, UNITED STATES OF AMERICA
Received: September 26, 2025; Accepted: March 24, 2026; Published: May 5, 2026
Copyright: © 2026 Waltenberger et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Yes - all data are fully available without restriction; All relevant data are within the paper and its Supporting Information files. The source code for CRISP software is published at Github: https://github.com/quadraBits/CRISP.
Funding: This research was funded in whole or in part by the Austrian Science Fund (FWF) project University of Vienna Open Access Department 10.55776/P33533E. The work of Claudio Cavazzuti was funded by the European Union-Next Generation EU, Mission 4 Component 1 CUPJ53D23000200001, MUR code: 2022ELZECR “MALTHUS”. For open access purposes, the author has applied a CC BY public copyright licence to any author accepted manuscript version arising from this submission.
Competing interests: The authors declare no conflict of interest.
Introduction
The identification of anthropological parameters, such as sex and age-at-death is a crucial component in bioanthropology and bioarchaeology. Sex estimation is integral to the reconstruction of osteobiographies and biological identities of individuals, as well as the understanding of funerary rituals, demographic structure, and the social organization of past communities. While more or less robust methods do exist for the analysis of unburnt skeletal remains [1], the analysis of cremated human remains is often difficult to perform and precarious due to extensive fragmentation, shrinkage, warping, and other heat-induced changes of bones [2–5]. These limitations often result in a greater number of undiagnosed individuals and higher error rates compared to unburnt skeletal remains [5].
Sex estimation in human cremated remains traditionally relies on either the assessment of sexually dimorphic morphological traits—primarily of the pelvis and cranium—or on metric approaches based on discriminant functions derived from single bone measurements [6]. Although morphological methods are generally preferred by anthropologists [7] as their validity has been extensively tested on unburnt skeletal samples, their application to cremated remains is severely constrained by fragmentation. Cremation frequently results in the partial or complete destruction of the most dimorphic skeletal elements, particularly the pelvis and skull [8–10]. Moreover, even when sex-diagnostic traits are observable, heat-induced deformation, delamination, and dimensional reduction substantially alter their morphology, causing cremated bones to appear more gracile than their unburned counterparts and increasing the risk of misclassification, particularly of male individuals.
As for the most metric methods, researchers have developed correction factors to handle these shrinkage effects, while applying cut-off points or discriminant functions developed from unburnt series. Metric approaches face similar limitations. Bone shrinkage induced by heat can reach up to 30% of the original dimensions, as demonstrated experimentally on ovine bones [11], and varies considerably according to skeletal element, bone composition, and burning conditions [12–14]. Significant shrinkage has been documented at temperatures exceeding 700°C, corresponding to complete calcination, whereas bones exposed to lower temperatures tend to retain dimensions closer to those of unburnt material [15]. This tends to increase the number of misclassified males. Earlier studies attempted to compensate for these effects by applying generalized correction factors (e.g., 10%) to measurements prior to using discriminant functions developed on unburnt samples [16]. However, such approaches oversimplify a process characterized by marked variability.
More recent research advocates chemosteometric approaches, which may provide greater accuracy than uniform correction factors, particularly in cases where shrinkage is highly irregular [15]. These developments underscore the need for metric sex estimation methods specifically calibrated on completely calcined human remains, in order to improve comparability and minimize bias introduced by heterogeneous burning conditions. Applying correction factors to adapt methods developed on unburnt skeletal material remains problematic, as sexual dimorphism already contributes substantial variability to prediction models, which is further amplified by element-specific and non-linear shrinkage during cremation. At present, the reliability of such corrected methods is difficult to assess and would require systematic testing on identified reference collections of cremated individuals.
Several studies have proposed sex estimation methods based on modern cremated remains [16–18]. While these approaches are valuable for forensic contexts, their applicability to archaeological cremation burials is limited as ancient populations are often biologically distinct (e.g., in terms of body mass) from the modern reference samples on which these methods are developed. Modern cremations are performed in highly controlled settings, using industrial furnaces with regulated oxygen supply and consistently high burning temperatures [19]. In contrast, prehistoric and ancient cremations were typically carried out on open-air wooden pyres, characterized by a heterogenous and fluctuating thermal conditions, variable pyre management, diverse funerary practices, and complex post-depositional processes [20–22].
For these reasons, sex estimation is often not possible for a substantial proportion of individuals recovered from archaeological cremation burials. Reported sex estimation rates vary widely across sites, depending on the study and the preservation of the remains, and range between 3% and 40% [7,23,24]. This highlights the need for further development of dedicated methods and analytical strategies specifically tailored for ancient cremations.
Recent studies have suggested potential methodological pathways to overcome, or at least reduce, these limitations. Among these, Cavazzuti, Bresadola [25] proposed a novel method developed on Bronze and Iron Age cremation contexts from Italy, employing gender-specific grave goods as proxies for sex. The study established cut-off points for 21 metric variables from various postcranial bones. The results demonstrated the persistence of high sexual dimorphism for the majority of the variables and moderate to good accuracy for 18 of them. This finding suggests that gender functions as a good proxy for sex, at least in this particular context. The method was subsequently validated through its application to archaeological cremated remains [18,26–28]. Particularly, the study of Bischeri, Alhaique [28] tested the correspondence between sex estimations, based on Cavazzuti et al.’ s metric method implemented by morphological observations, and typically gendered archaeological materials from the Etruscan sepultures of Tolle. In addition to grave goods (weapons for males; ornaments and spinning/weaving tools for females), gender was also indicated by the sexual attributes represented in the canopies (ossuaries shaped as a human face and body). Biological sex, estimated by osteologists, and archaeological gender, marked by grave goods and vessel features, matched in 95.6% of the cases.
The literature indicates that the majority of metric sex estimation methods in human cremated remains are predicated on statistical prediction techniques, including binary logistic regression, discriminant function analysis, and cut-off points (see [17,23,25]). These methods demonstrate accuracies ranging from 80 to 90%. Recently, the employment of advanced statistical methods, including machine learning algorithms, has been demonstrated to enhance accuracy to over 90% [18]. However, the application of these methods is challenging for researchers lacking familiarity with these intricate statistical techniques and programming. Consequently, they remain infrequently used in practice. A recent survey by Hlad, Löffelmann [7] revealed a lack of standardized practices in the osteological analysis of human cremated remains. The majority of practitioners rely on morphological assessments with minimal training in cremation-specific methods. Levels of confidence are notably low, particularly in the context of sex estimation, and the utilization of metric or computational approaches is limited. Given the lack of training opportunities in the analysis of human cremated remains and the significant experience and time required for professional analysis, there is an urgent need for simple and replicable methods of estimating sex of human cremated remains. Such methods should be applicable with adequate accuracy, even by researchers with limited experience. The objective of this study is to present and validate CRISP, a software for metric sex estimation based on Bayesian statistics. Such approaches have been utilized with success in the past, as evidenced by the DSP software for a metric sex estimation based on pelvic measurements [29]. The objective of this study is to present a method that is both highly accurate and user-friendly, thereby facilitating the analysis of human cremated remains also for researchers, who are not well-versed in these methods and computer coding.
Material
The development of the Bayesian method was based on the compilation and analysis of previously published osteometric data from cremation burials at several Italian necropolises, dating from the Final Bronze Age to the Iron Age. These sites include the Narde di Frattesina necropolis in Veneto (Final Bronze Age, 12th-9th century BCE; [30,31]); Chiavari, Liguria (Iron Age, 7th-6th century BCE, [32]) Castenaso, Madonna del Buon Consiglio, Emilia Romagna (Iron Age-Villanovian: 7th century BCE, [28]); Narce, Lazio (Iron Age-Faliscan, 8th-7th century BCE, [33]); and Pontecagnano, Campania (Iron Age: 8th-7th century BCE, [34]). These osteometric data were published in S1 in [25] and contain information on 124 individuals (50 males and 74 females). The osteological material and documentation are currently stored at the Bioarchaeology Service at the Museo delle Civiltà in Rome. Additionally, we included data from 31 individuals (16 males and 15 females) obtained from the Etruscan necropolis in Tolle, Tuscany (8th-5nd century BCE, [28]). The inclusion criteria were single burials of adult individuals (complete skeletal development), bone color ranging from grey to calcined white, absence of osteoarthritic lesions or other pathological lesions that could affect the measurements, and gender-specific urn shapes and/or grave goods. Cavazzuti, Bresadola [25] based their gender estimations on grave goods – such as weapons and razors for men, and spindle whorls, simple-arch or ‘leech’ fibulas, and faience or glass beads for women. In Tolle, the gender of individuals was inferred from the design of canopic urns featuring anthropomorphic characteristics. These urns depict an idealized representation of the deceased, often including indicators of rank, jewelry, clothing, and, in many cases, gender-specific iconography [28].Additionally, we tested the applicability of the diagnostic method to Central European populations using a sample from the Late Bronze Age Urnfield Culture in Austria. The dataset included 45 individuals with osteometric data from the following sites in Lower Austria: St. Pölten Fuhrmannsgasse (14th century BCE, [22]); Getzersdorf (13th-12th century BCE, [35,36]); Inzersdorf (12th-10th century BCE, [37–39]); and Franzhausen-Kokoron (10th-8th century BCE, [40,41]). These sites were subjected to osteological analysis as part of the “Unlocking the Secrets of Cremated Human Remains” project. All necessary permits were obtained for the described study, which complied with all relevant regulations. The Federal Monuments Authority of Austria and Natural History Museum of Vienna granted access to the cremated remains from Austria and approved this study from an ethical perspective. The use and access of published data by Cavazzuti, Bresadola [25] is granted by the Bioarchaeology Service at the Museo delle Civiltà in Rome (Italian Ministry of Culture). Access to the Additional information regarding the ethical, cultural, and scientific considerations specific to inclusivity in global research is included in the Supporting Information (S4 Checklist).
Methods
Fig 1 provides an overview of the measurement definitions of the variables utilized. The analysis was executed in R-statistics 4.4.1 by using the packages boot 1.3–30 [42], caret 6.0–94 [43], dplyr 1.1.4 [44], ggplot 2 3.5.1 [45], LogicReg 1.6.6 [46], rstatix 0.7.2 [47], tidyverse 2.0.0 [48], verification 1.42 [49], and visreg 2.7.0 [50]. Initially, t-tests were employed to identify potential group differences between the sexes, thereby identifying variables exhibiting significant sexual dimorphism. Furthermore, we calculated Chakraborty & Majumder D-values [51] to assess the nonoverlap of the distributions between both sexes per variable. Variables exhibiting significant sexual dimorphism were incorporated into a prediction model for sex.
A-A: Axis: anterior-posterior diameter, B-B: Axis: transverse diameter, C-C: Mandible: condyle width, D-D: Radius: maximum head diameter, E-E: Humerus: vertical head diameter, F-F: Humerus: capitulum maximum diameter, G-G: Humerus: trochlea minimum diameter, H-H: Humerus: trochlea maximum diameter, I-I: Lunate: maximum width, J-J: Lunate: maximum length, K-K: Femur: vertical head diameter, L-L: Patella: maximum width, M-M: Patella: maximum height, N-N: Patella: maximum thickness, O-O: Talus: maximum length, P-P: Talus: head-neck length, Q-Q: Talus: trochlea length, R-R: Talus: trochlea width, S-S: Navicular: maximum length, T-T: MT1: dorsoplantar width of the head, U-U: MT1: mediolateral width of the head.
To accomplish the objective, the emphasis was placed on three distinct approaches: Initially, binary logistic regression models were calculated as an extension of the cut-off points published in Cavazzuti, Bresadola [25]. In general, simple cutoff criteria facilitate the establishment of binary classification of sex based on a threshold value. However, these cutoffs do not encompass the complete distribution of measurements observed within each designated sex. Thus, individuals with values approximating the cut-off values are at elevated risk of misclassification. Conversely, binary logistic regression models incorporate these sex-specific distributions and estimate the probability that an individual belongs to a specific sex based on the observed measurement, offering a more nuanced and probabilistic approach to sex estimation.
A second approach to sex estimation involved the combination of multiple measured variables predicated on the premise that each variable reflected an aspect of sexual dimorphism. The integration of these variables has been demonstrated to improve classification accuracy. However, given the inherent fragmentation of archaeological cremation burials, multivariate models required a high degree of flexibility, as the complete documentation of all 21 target variables within a single burial was exceptionally rare. It is important to note that, generally, multiple binary logistic regression models and many other multivariate statistical models cannot manage missing data. In such instances, the imputation of missing values is imperative for the subsequent estimation of sex. Given that in our sample most of the individuals yielded one to three measurements out of the 21 method-defined variables, the final prediction may rely heavily on imputed values rather than actual observations. This has the potential to increase the risk of overfitting and reduce the model’s reliability.
Consequently, we developed a Bayesian model for sex prediction, offering both high accuracy and flexibility. To develop the method, we used a training set containing the recorded measurements from the Italian sample. The Bayesian model estimated the distributions of each variable based on this training dataset and calculated individual likelihoods for being male or female. As prior sex ratio we used equal priors for males and females (1:1) which reflects the sex ratio in adults. We also tested slightly skewed priors of 1.05:1 which corresponds to the natural sex ratio at birth. Comparison of the resulting posterior probabilities and classification outcomes showed only negligible differences in sex estimation accuracy between these prior settings, indicating that model predictions are primarily driven by the observed data rather than by prior assumptions. These measurements were then amalgamated into a posterior probability, thereby enabling the estimation of sex estimation even in circumstances where only a subset of the measurements was available. The Bayesian model employs distributions of the variables derived from a training set and assigns relative weights to the individual probabilities of each variable for sex estimation based on the probability distributions learned from the training set and subsequently integrating these probabilities to formulate a posterior prediction probability. A notable benefit of the Bayesian approach is its capacity to accommodate uncertainty and generate probabilistic classifications rather than binary outputs. This is of particular value in archaeological contexts, where data are often incomplete. In essence, the Bayesian model has the capacity to integrate measurements from one to 21 traits while concurrently providing a probabilistic sex classification. The predictive efficacy of the model is known to enhance as more measurements become available. Moreover, prior knowledge, such as expected sex ratios or population-specific distributions, can be integrated, enhancing the model’s relevance and interpretability and allowing for straightforward model updates in the future with increased data sets. The utilization of Bayesian models necessitates a foundation of statistical knowledge and proficiency in a programming language, such as R, Python, to execute the code. To address this need, we integrated the Bayesian model into an application. The model was updated in Python (Pycharm 2022.2.2, libraries: matplotlib 3.10.3, [52]; numpy 2.2.6, [53]; openpyxl 3.1.5, [54]; pandas 2.3.0, [55]; pathlib 1.0.1, [56]; PyQt6 6.9.1, [57]; scipy 1.16.0, [58]; and seaborn 0.13.2, [59]). In addition, an user interface was developed. The application can be downloaded and utilized offline. Beyond the capacity to predict individual cases, the application possesses the functionality to accommodate multiple case predictions derived from Excel spreadsheets.
The sex estimation models were evaluated using a leave-one-out cross-validation approach. The accuracy of the binary logistic regression models, the Bayesian models and the cut-off points (for which cut-off point accuracies were published in Cavazzuti, Bresadola [25]) were evaluated. In addition to reporting the raw classification accuracies, post-hoc filtering was also evaluated. In general, accuracy is calculated as the proportion of correctly classified individuals out of all cases using a default cut-off of 50% probability. However, in the discipline of osteology, it is customary to adopt a more cautious approach and to abstain from making sex estimation when the predicted probability approaches 50%, as this practice carries a high risk of misclassification and yields results that are nearly random. This approach has the potential to be misleading, if individuals with ambiguous sex estimates are included in the accuracy assessment [9,60]. There are no universally accepted thresholds for determining the probability required to confidently estimate sex in osteology. Consequently, an evaluation was conducted using five artificial post-hoc filtering thresholds (50%, 65%, 75%, 85%, 90%). Using post-hoc filtering as prediction thresholds minimizes prediction errors [61,62]. Individuals whose predicted probabilities did not exceed these thresholds were defined as “ambiguous” and excluded from further classification. For each threshold, classification accuracy was then recalculated based only on the remaining, non-ambiguous individuals. This approach enabled a more comprehensive evaluation of the models’ performance, with a particular focus on metrics such as balancing accuracy and confidence. Reliability curves were calculated for the Bayesian model and the ROC (Receiving Operating Characteristic) for the binary logistic regression models. Finally, we further validated the sex estimation models through the use of an independent test-set containing a sample from the Urnfield Culture in Austria. In contrast to the Italian sample, the availability of gender-specific grave goods was limited. The focus therefore shifted to a morphological sex estimation approach, which was based on the morphological expression of features at the cranium and pelvis as a proxy for sex [63]. The postcranial measurements obtained were then utilized for sex prediction purpose, employing sex estimation models that were developed based on the Italian sample.
Results
The distribution of the Italian and Austrian samples is presented in Table 1. The dataset, including measurements recorded for each individual is provided in S1 Data (.csv). Interobserver errors and technical error of measurements (TEM) of the variables have been published in Cavazzuti, Bresadola [25]. All used variables have low relative technical errors of measurements (<0.05). With the exception of the transverse diameter of the axis and the head-neck length of the talus, all variables showed significant differences between males and females and were included into the sex prediction models (see Table 2). Notably, the measurements of the mediolateral head width of the first metatarsal were found to be not normally distributed. In accordance with the assumptions underlying the Bayesian model, which requiresnormally distributed data, the measurements were log-transformed. The measurement of the humerus trochlea maximum diameter of case 149 (Tolle grave 116, male individual) was excluded from further analysis. This measurement was an outlier and the lowest of all cases. Given that other measurements of this individual were in the top range of the male sample, this value was suspected to be a measurement error.
Validation
The accuracies of the cut-off point method published by Cavazzuti, Bresadola [25] had been derived by dividing their dataset randomly in a training and a validation subset. The accuracy for the binary logistic regression models and the Bayesian model was estimated using a leave-one-out cross-validation (LOOCV). The accuracies obtained for each variable prior to post-hoc filtering (hereafter referred to as raw accuracies), of the cut-off points and the binary regression model, were similar. The cut-off point method yielded the highest raw accuracies for the variable radius head maximum diameter (88%). The maximum width of the patella (86%), and the width of the mandibular condyle (84%) also demonstrated high levels of accuracy. The anterior-posterior diameter of the axis, maximum height of the patella, and humerus trochlea minimum diameter exhibited the lowest raw accuracies (68%, 69%, and 72%, respectively). In the binary logistic regression models, the radius head maximum diameter, maximum height of the patella, and width of the mandibular condyle demonstrated good performance with respective accuracies of 88%, 86%, and 82%, as illustrated in Fig 2a-c. Conversely, the anterior-posterior diameter of the axis, trochlea width of the talus, and humerus trochlea maximum diameter exhibited the lowest raw accuracies with percentages of 68%, 70%, and 71%, respectively, as shown in Fig 2d-f. Detailed results of the accuracies per variable and method can be found in S1 File.
The presence of vertical lines at the bottom and top of the plots indicates the measurement values of individual cases, with gray areas denoting the 95% confidence intervals. a) Radius: maximum head diameter, b) Patella: maximum height, c) Mandible: condyle width, d) Axis: anterior-posterior diameter, e) Talus: trochlea width, f) Humerus: trochlea maximum diameter.
The cut-off-point method and the binary logistic regression method yielded comparable per-variable raw accuracies averaged across all univariable models (cut-off point: 75%, regression models: 77%). Multivariable accuracy estimates were not calculated for the binary logistic regression approach due to missing data constraints, whereas multivariable integration was performed using the Bayesian model. As expected, the Bayesian model, which employed combinations of variables for sex prediction, demonstrated superior performance compared to the sex prediction based on single variables (raw accuracy 86%). The sensitivity and specificity of the binary logistic regression models and the Bayesian model indicate a greater number of correctly classified females (higher specificity than sensitivity). Subsequent to the implementation of post-hoc filtering (75% threshold), a marginal enhancement in the accuracy of both models was observed (Table 3).
To this end, a series of post-hoc filtering thresholds were tested in order to exclude individuals with an ambiguous sex estimate from the model accuracy (thresholds: 65%, 75%, 85%, 90%). Overall higher thresholds did not result in a substantial increase in accuracy; however, they led to a significant number of cases being classified as ambiguous. For instance, in binary logistic regression models with a threshold of 75%, the exclusion of cases ranged from 9.5% to () and 57.5% depending on the variable (lowest rate of ambiguous estimated individuals: patella: maximum height; highest rate of ambiguous estimated individuals: dens axis: anterior-posterior diameter). This post-hoc filtering threshold of 75% resulted in an average increase in accuracy of 7% in the logistic regression models. In the Bayesian model, 19.6% of all cases were classified as ambiguous, with only a slight increase in accuracy of 3%. It has been demonstrated that alternative post-hoc filtering thresholds (65%, 85%, 90%) exhibit comparable classification performance, with accuracies fluctuating between 87 and 90%. Furthermore, calibration curves were calculated for different thresholds to assess the quality of the sex prediction in greater detail (Fig 3 and Table 4). Generally, the calibration lines indicated good calibration of the Bayesian model for sex prediction. The calibration line without post-hoc filtering indicated good calibration in the lower and middle regions of the plot. However, it exhibited an over proportional number of female estimates compared to the expected numbers. Post-hoc thresholds yielded comparable outcomes and exhibited better calibration, with enhanced prediction accuracy for male subjects.
The validation study exclusively included individuals with a predicted sex probability that exceeded the threshold. The red dotted line represents a technically perfect calibrated model. In practice, a model with a reliability curve below the red-dotted line in the area below the average probability of 0.5 and above the red-dotted line in the area above the average probability of 0.5 is ideal for distinguishing between females and males.
In conclusion, the accuracies after post-hoc filtering were found to be similar, with a negligible increase in accuracy in higher thresholds but a large increase of ambiguous classified cases from 9 to 36%. The 65% post-hoc filtering threshold had been determined to offer an optimal compromise, ensuring both high accuracy and a substantial number of cases with successful sex estimates. In other words, we suggest to interpret sex prediction probabilities obtained in CRISP lower than 65% as ambiguous and only assign a sex prediction to individuals with a prediction probability of 65% or more.
Application CRISP: Cremated Remains Inference of Sex Probabilities
The code for the Bayesian model and the application software CRISP can be obtained from GitHub (https://github.com/quadraBits/CRISP, published under software license GPL-3.0). The application’s functionality extends to an offline mode, accompanied by a user-friendly interface (Fig 4a). At present, two options are available: a single case input, and a multiple cases input. CRISP allows for direct inscription of measurements in millimeters for elements originating from a single case. CRISP is a flexible tool that allows the inclusion of up to 21 measurements per individual for sex prediction. When multiple cases are to be analyzed, an additional option allows users to record all measurements in an Excel spreadsheet and to automatically predict sex of all cases simultaneously. The spreadsheet template can be accessed within the application. For the single prediction, a “result page” is provided, which contains density plots for all variables (Fig 4b). A black dotted line is visible to highlight the location of the predicted single measurements within the distribution curves for both males and females. This allows a first assessment to identify suspected measurement errors and outliers. Furthermore, the probability of the subject being a male or female will be displayed. CRISP does not provide any interpretations of the results (e.g., the individual is “possible male/female”, “probably male/female”, or “male/female”) This discrimination into sex-specific groups are artificial, and definitions vary slightly between researchers, research groups, and countries.
a) Main page of the software for data recording.The available measurements for individual cases can be documented in the boxes located on the left side of this page. The estimation of multiple cases can be achieved through the upload of all relevant recordings into an Excel spreadsheet. Please use the provided template (button template). The file can be important into CRISP by selecting the Browse button. The calculation of the sex prediction is initiated by the act of pressing the designated button. b) Results page. The plots represent the data distribution (separated in females and males) used for the prediction model. The vertical dotted lines denote the individual measurements incorporated into the prediction. Right, you can find a box showing warnings, should they occur (see section “Warnings” for more details). Additionally, the results of the sex prediction are presented as probabilities, indicating whether the subject is male or female. The export of a report on the results for individual prediction is initiated by pressing the Export button. A detailed explanation can be found in the manual integrated into CRISP.
Recent research has underscored the challenges associated with the identification of multiple individuals interred within a single urn, despite the utilization of morphometric analysis (see [64]). Consequently, there is always some uncertainty the measurements obtained from different bone fragments may belong to the same or different individual. The combination of commingled elements of a male and female into a unified prediction may have an impact on the sex prediction. To avoid this, CRISP incorporates three distinct data control mechanisms. First, a warning will be issued if a measurement falls outside of three standard deviations of the variable distributions utilized for the Bayesian model. This will serve to underscore potential typographical errors and measurement inaccuracies. In instances where two to four measurements are employed for prediction, and the posterior prediction probabilities for at least two features deviate from each other by more than 33%, the warning “Inconsistency for few features!” is generated. This warning is accompanied by the discrepancy between the posterior prediction probabilities for the two features that deviate from each other. In the event that five or more measurements are utilized for sex prediction, and one or more observations yield contradictory results, the warning “Inconsistency for several features!” will be displayed. In this instance, the single posterior probabilities will be compared to the median posterior probability, and predictions with a deviation of more than 0.25 will be highlighted. In other words, when such warnings are triggered, they highlight substantial variation among the measurements, raising the possibility that the analyzed elements originate from more than one individual. We advise osteological practitioner to critically analyze the cremation burial a second time to conclude whether to keep all measurements for prediction or to reject single measurements due to a suspected second individual. It should be noted that bone morphology exhibits variation both between and within populations. Consequently, the aforementioned warning shall be regarded as mere indications, rather than definitive evidence, for the presence of multiple individuals. It is important to note that an alternative interpretation in certain cases is possible, if the sex prediction derived is affected by a deviating posterior prediction probabilities (e.g., a single posterior probability of 90% indicating male) and a second less discriminant measurement falling into the ambiguous range (e.g., a single posterior probability of 59% indicating male). In such a case the warning can be disregarded. In reality, those deviations usually have a negligible impact on the results of the sex prediction as the variables are weighted in the Bayesian model.
The results of single predicted cases can be exported as a PDF file containing all of the information that is presented on the results page. If the Excel spreadsheet containing data on multiple cases is used, the sex of each case will be predicted based on the information provided. The probabilities and warnings will be automatically documented in the provided Excel file for subsequent evaluation. Updated versions of CRISP will be made available on GitHub (https://github.com/quadraBits/CRISP). Detailed instructions to use CRISP can be found at the GitHub folder in the MANUAL.pdf. The manual is also directly accessible via the menu in CRISP software.
Test with an independent sample
In order to assess the adequacy of the sex prediction model in other populations, an independent sample from the Late Bronze Age of eastern Austria was utilized as a case study. The human cremated remains from this sample were predominantly highly fragmented, with a median fragment size between 5–10 mm and a bone weight of less than 200 g in adults. A comprehensive analysis of substantial case series was conducted in the Unlocking the Secrets of Cremated Human Remains project, encompassing over 700 cases, with a subset of 457 adult individuals. A meticulous examination of these cases revealed that a mere 43 cases yielded anatomical features conducive to morphological sex estimation, but also at least one anatomical structure allowing measurement and thus metric sex estimation. It is noteworthy that seven cases were estimated ambiguous by morphological sex estimation, necessitating further scrutiny to ascertain their precise classification. A comparison was made between the consistency of the results of the sex estimation based on morphological features and the three metric methods (complete results in S1 Table).
The findings (Table 5) indicated that the cut-off point method developed by Cavazzuti, Bresadola [25] exhibited the highest proportion of correctly classified individuals (69.4%), yet they also concomitantly demonstrated the greatest number of misclassified individuals (22.2%). The use of cut-off points always produced a classification, resulting in binary outcomes without associated probabilities. The classification of individuals of being “ambiguous” (three cases) was applied exclusively in instances where the evaluation of multiple features resulted in contrary results. The single binary logistic regression and Bayesian model exhibited 52.8% and 55.6% correctly classified individuals, along with notably higher rates for individuals categorized as “ambiguous”. In contrast, the rates for individuals showing inconsistencies between morphological and metric sex estimation were considerably lower than those observed using the cut-off point method. In summary, the Bayesian model and binary logistic regression models appeared to be more conservative with ambiguous cases, whereas the cut-off point method yielded higher classification rates but came with a four-fold increased risk of misclassifying individuals.
Table 6 provides information on cases that deviate from the established sex estimations based on morphological and metric variables. The findings indicated that most individual misclassifications by the cut-off point method involved measurements in proximity to the cut-off point or exhibiting variables of low accuracy. These individuals were also classified as “ambiguous” in both the logistic regression and Bayesian models. In addition, Inzersdorf grave 181 exhibited contradictory outcomes in the binary logistic regression. The radius head indicated a male individual (p = 98.9), while the humerus trochlea minimum diameter and the lunate width suggested a female individual (both: p > 99.99) suggesting the presence of a second individual or an admixture phenomenon. Furthermore, in cases where the Bayesian model or binary logistic regression models yielded inconsistent results for morphological sex estimation, the latter was typically performed on less reliable cranial features such as the supraorbital margin and ridge or nuchal crest. In such instances, we hypothesize that the reliability of morphological sex estimation is diminished, potentially leading to misclassification.
Discussion
Estimating the sex of individuals from cremated remains is a critical component of demographic studies of cemeteries, also contributing to the reconstruction of funerary behaviors and osteobiographies. Given the challenges posed by cremation, it is essential to employ all available methods, integrating commonly used morphological observations with metric approaches based on the measurement of skeletal elements. However, due to the effect of shrinkage and a high degree of fragmentation, highly diagnostic morphological features are rarely available, especially those from the fragile pelvic bones, and sex estimates are therefore often not feasible and, even when obtainable, not always reliable. Moreover, this analysis strongly relies on the observer’s experience and may implicitly entail inter-observer or inter-laboratory error. In contrast, metric sex assessments can be used on very fragmented individuals.
In archaeological contexts, the use of gender-associated grave goods as a proxy for biological sex has been adopted as a pragmatic and methodologically justifiable approach. Grave goods provide insights into an individual’s social role and presumed gender identity, which may, though not invariably, correspond to biological sex. However, there is a risk of conflating biological sex with socially constructed gender roles [65]. Furthermore, the correlation between specific grave goods and gender may vary across cultures, time periods, and even within the same society, thereby complicating the formulation of broad generalizations. This is particularly salient in cultures where gender is conceptualized as fluid, or where determinants such as social status and age appear to have exerted greater influence on grave goods assemblages than biological sex. Moreover, the specific funerary practices and the social organization underlying gender-specific grave goods are often not fully understood, which can result in biased interpretations and datasets. The selection of funerary rites, including grave goods, is typically influenced by a multifaceted array of factors, including gender, age, social status and, in some cases, the circumstances surrounding the death. This selection process is further influenced by the personal interests of the deceased individual [66,67]. Nonetheless, the validity of Cavazzuti et al.’s approach is further substantiated by numerous archaeological case studies from prehistoric Europe, which demonstrate the prevalence of gender-specific funerary practices across various regions [68,69].
These general associations between biological sex and gender expression in inhumed individuals have also been confirmed using methods beyond osteological analysis, such as DNA and amelogenin peptide analysis. The latter has recently emerged as a cost-effective, rapid, and minimally invasive approach, showing high of accuracy comparable to that of DNA analysis [65,70–72]. Together, these methods allow for a more nuanced evaluation of gender identity and social role. With a few notable exceptions (citations), numerous studies have documented a strong correspondence between chromosomal sex and presumed gender identity inferred from grave goods, particularly within well-structured burial traditions [65,68,73,74].In Austrian and Iberian prehistoric archaeological contexts, the biological sex of deceased children has been shown to consistently corresponded to a gender-specific burial placement [68,74]. Overall, these studies suggest that the use of gender as a proxy for sex in the development of sex estimation methods for human cremated remains is a valid strategy, provided that it is applied to societies characterized by high gender intensity and that sexing criteria are developed using robust statistical tools.
As outlined above, heat-induced bone shrinkage depends on several interacting factors, including the original morphology and mass of the skeletal elements as well as the temperature reached during cremation. These variables can introduce substantial variability in the degree of volumetric reduction observed both across cremations and within skeletal elements of the same individual [11,15]. To reduce the effect of random variation related to differential shrinkage, the present study focused exclusively on fully calcined fragments. As demonstrated in previous studies, sexual dimorphism in postcranial measurements is retained even after heat-induced bone shrinkage. The use of dry bone material for sex estimation has been shown to yield similar discriminatory power that is comparable to those obtained from cremated remains [18,25,75]. It is evident that postcranial skeletal dimensions demonstrate interpopulation variability, a phenomenon attributable to a combination of genetic, environmental, and lifestyle factors. It is important to note that methods developed on modern identified collections may not fully capture the morphological characteristics of prehistoric groups. This is due to secular changes, differential activity patterns, population-specific sexual dimorphism and different cremation techniques and post-depositional processes, which can influence bone size and shape [76–78]. Consequently, the application of metric sex estimation models, which have been trained on contemporary datasets, to archaeological samples necessitates caution, as systematic differences may affect prediction accuracy. Therefore, methods, such as CRISP needs to be developed based on archaeological material directly. The validation of the CRISP model with the Austrian sample suggests its applicability to prehistoric populations in Central and Southern Europe; however, further validation studies utilizing data from other populations are required to provide a more comprehensive evidence base.
The accuracy of metric sex estimation in human cremated remains has traditionally relied on cut-off points applied to single osteometric variables. While such univariate approaches offer simplicity and are frequently used in dry bone material (e.g., [79–81], they often suffer from reduced discriminatory power due to sex overlap. Research has demonstrated that classification based on solitary threshold per variable frequently yielded lower accuracies – typically ranging from 65 to 80% - and is particularly susceptible to misclassification in proximity to the cut-off point [82]. The employment of sophisticated machine learning algorithms, in conjunction with post-hoc filtering mechanisms, has led to a marked enhancement in the precision of the models in comparison to the utilization of conventional cut-off points. This enhancement can be attributed to the capacity of probabilities to facilitate a more nuanced interpretation of the outcomes. Multivariate statistical models and machine learning algorithms demonstrated superior performance by integrating multiple sexually dimorphic traits and weighting their contributions more appropriately [29,83,84]. Even if these variables exhibited minimal sexual dimorphism, resulting in suboptimal accuracy (e.g., the dens axis), their integration could potentially enhance accuracy significantly. The application of a Bayesian model resulted in a notably high accuracy of 87–90% after post-hoc filtering. The classification of individuals with posterior sex probabilities below a threshold, for example, 65%, as ambiguous was undertaken to reduce the risk of misclassification and ensure greater diagnostic reliability. Such probabilistic frameworks are particularly advantageous for fragmented cremated remains, where variable availability is inconsistent. These findings underscore the efficacy of flexible multivariate approaches over rigid univariate classification schemes in osteological sex estimation, particularly within the context of taphonomic constraints associated with cremations. A significant challenge is that multivariate methods and machine learning are intricate techniques that necessitate computer programming proficiency to process and interpret the code that implements sex estimation, which is frequently disseminated alongside the findings. This may discourage some researchers lacking familiarity with programming, and diminishes the acceptance and implementation of these methodologies in research projects and osteological analyses of cremation burials in archaeology. Consequently, we decided to develop a software application with a user-friendly accessible frontend.
The validation of metric sex estimation methods frequently relies on morphological sex assessments, which remain the standard approach in bioarchaeology and forensic anthropology [85]. In this study, we evaluated the performance of both cut-off-based and probabilistic metric models against morphological classifications in a sample of human cremated remains from Bronze Age Austrian necropolises. Our findings confirmed that the osteometric approach is a viable method for sex estimation and it can also be applied to highly fragmented individuals, where morphological traits are poorly preserved or difficult to assess. Cut-off point methods frequently yielded results that deviated from morphological assessments, particularly in cases where values clustered near the threshold. Conversely, multivariate approaches, including binary logistic regression and the Bayesian model, demonstrated a higher degree of concordance with morphological classifications. Notably, discrepancies between the Bayesian model and morphological assessments were predominantly observed in cases where posterior probabilities were low or when the morphological features employed for sex estimation were themselves less reliable. Features such as the supraorbital margin or nuchal crest, despite their frequent utilization, were recognized for their diminished interobserver agreement and augmented sexual overlap [9]. These discrepancies underscored a pivotal interplay between trait reliability and model confidence. When probabilistic models deviate from morphological classifications, they frequently do so with appropriately reduced certainty. This can serve as an internal indicator for ambiguous or borderline cases. Consequently, instead of perceiving these mismatches as errors, such discrepancies might be indicative of inherent biological ambiguity or limitations in the available osteological data, particularly in cases involving cremated and fragmentary remains [25]. For example, it may lead to the detection of multiple individuals deposited in a single cremation deposit.
A total of 457 adult individuals were identified in the Austrian sample. However, 260 (56.9%) had neither morphological nor metric traits available for sex estimation. A total of 130 individuals (28.4%) were sexed using morphological analysis. In 59 cases, the morphological criteria were not applicable. However, at least one measurement for the Bayesian model was obtained, resulting in an increase of 12.9% in successfully estimated sex. Consequently, the implementation of both approaches would facilitate the identification of the sex of approximately 40% of the individuals. This outcome is particularly noteworthy, considering the fragmentation of the skeletal material, which exhibited a median fragment size of less than 1 cm. In 20 cases, the incorporation of metric features enabled the validation of the morphological sex estimation through the implementation of the Bayesian model. In six cases, the morphology was ambiguous and metrics provided a more accurate sex estimation. The results of the metric and Bayesian sex estimation only deviated from each other in two cases. The morphological sex estimation was suspected to be less reliable due to features with a low degree of sexual dimorphism. Although the model was developed based on several Italian samples, which varied in terms of location and chronology, it demonstrated a high degree of accuracy. The findings from the Austrian sample indicate minimal interpopulation variations when compared from the Italian training sample. Further validation in different populations is necessary to ensure the findings can be generalized more widely.
We encourage our colleagues to use and critically validate our software further as well as to provide feedback, as we are planning to share updated versions in the future. It has been demonstrated that larger datasets, especially those containing data from various populations, may also lead to improved universal sex predictions over different time periods and geographical areas. We also encourage our colleagues to contact us if they have published metric data available that is ready to share. We are especially interested in receiving data that provides population-specific sex estimates.
Conclusion
In this study, we presented a Bayesian model for metric sex estimation in human cremated remains based on postcranial features derived from a previous work [25]. Tested on the same sample, the Bayesian model was found to be more flexible than binary logistic regression modeling and simple cut-off points in handling missing data. Furthermore, the accuracies of these methods were comparable to those of postcranial sex estimation in dry bone material. A validation of the model using an independent sample from Austria sexed by morphological observations suggested that, in cases of discrepancy between morphological and metric estimations, the reliability of the morphological features may be uncertain. The results of the Bayesian model were found to be more reliable. The Bayesian model was integrated into an open access software application to facilitate its use in osteological research.
Supporting information
S1 Data. Raw data used for the Bayesian model.
https://doi.org/10.1371/journal.pone.0346813.s001
(CSV)
S1 File. Supporting Information. Accuracies of the original cut-off point method and the binary logistic regression models.
https://doi.org/10.1371/journal.pone.0346813.s002
(PDF)
S1_Table. Comprehensive overview of individuals from prehistoric Austrian samples used for the comparative analysis of sex prediction methodologies by three distinct metric approaches.
Cut-off points (based on Cavazzuti, Bresadola [28]), single logistic regression models and Bayesian model) diagnosis are compared with the morphological sex determination, e.g., morphological features at the skull and pelvis. Single probabilities are provided for the regression model (in the event that multiple measurements were obtained, the probability for the most suitable model was utilized), as well as the Bayesian model. Furthermore, the table incorporates warnings derived from the Bayesian model to facilitate comprehension. The estimated of sex followed the following classification system; male/female (M/F, p ≥ 90), probable male/female (M?/F?, 80 ≤ p < 90), possible male/female (M??/F??, 65 ≤ p < 80), ambiguous (A, 50 ≤ p < 65).
https://doi.org/10.1371/journal.pone.0346813.s003
(PDF)
Acknowledgments
We would like to thank Michaela Fritzl, Hannah Skerjanz, and Domnika Verdianu for their support of the anthropological analysis in the framework of the Unlocking the Secrets of Cremated Human Remains project and Karin Wiltschke-Schrotta from the Natural History Museum of Vienna who granted us access to the osteological material of Getzersdorf.
References
- 1.
Rebay-Salisbury K. Sex, Gender and the Third Science Revolution. In: Matić U, Gaydarska B, Coltofean L, Díaz-Guardamino M, editors. Gender Trouble and Current Archaeological Debates, Themes in Contemporary Archaeology. International: Springer. 2024. p. 19–31.
- 2. de Boer HH, Roberts J, Delabarde T, Mundorff AZ, Blau S. Disaster victim identification operations with fragmented, burnt, or commingled remains: experience-based recommendations. Forensic Sci Res. 2020;5(3):191–201. pmid:33224550
- 3. McKinley JI. In the Heat of the Pyre. The Analysis of Burned Human Remains. Elsevier. 2015. p. 181–202.
- 4. Thompson T. The Assessment of Sex in Cremated Individuals: Some Cautionary Notes. Canadian Society of Forensic Science Journal. 2002;35(2):49–56.
- 5. Ochôa Rodrigues C, Ferreira MT, Matos V, Gonçalves D. “Sex change” in skeletal remains: Assessing how heat-induced changes interfere with sex estimation. Sci Justice. 2021;61(1):26–36. pmid:33357825
- 6. Gonçalves D, Pires AE. Cremation under fire: a review of bioarchaeological approaches from 1995 to 2015. Archaeol Anthropol Sci. 2016;9(8):1677–88.
- 7. Hlad M, Löffelmann T, Griffith JI, James HF, Vercauteren M, Snoeck C, et al. Practitioner preferences in the analysis of cremation deposits in archaeology and biological anthropology: An overview of current osteoarchaeological practices with a focus on sex estimation. PLoS One. 2024;19(12):e0310380. pmid:39621707
- 8. Chapman T, Lefevre P, Semal P, Moiseev F, Sholukha V, Louryan S, et al. Sex determination using the Probabilistic Sex Diagnosis (DSP: Diagnose Sexuelle Probabiliste) tool in a virtual environment. Forensic Sci Int. 2014;234:189.e1-8. pmid:24290894
- 9. Inskip S, Scheib CL, Wohns AW, Ge X, Kivisild T, Robb J. Evaluating macroscopic sex estimation methods using genetically sexed archaeological material: The medieval skeletal collection from St John’s Divinity School, Cambridge. Am J Phys Anthropol. 2019;168(2):340–51. pmid:30575013
- 10. Bruzek J. A method for visual determination of sex, using the human hip bone. Am J Phys Anthropol. 2002;117(2):157–68. pmid:11815949
- 11. Thompson TJU. Heat-induced dimensional changes in bone and their consequences for forensic anthropology. J Forensic Sci. 2005;50(5):1008–15. pmid:16225204
- 12. Thompson TJU. Recent advances in the study of burned bone and their implications for forensic anthropology. Forensic Science International. 2004;146:S203–5.
- 13.
McKinley JI. The analysis of cremated bone. In: Cox M, Mays S, editors. Human osteology: in archaeology and forensic science. London: Greenwich Medical Media Ltd. 2000. p. 403–21.
- 14.
Correia PM. Fire modification of bone: a review of the literature. In: Haglund WD, Sorg MH, editors. Forensic Taphonomy: The Postmortem Fate of Human Remains. Boca Raton: CRC Press. 1997. p. 275–93.
- 15. Gonçalves D, Vassalo AR, Makhoul C, Piga G, Mamede AP, Parker SF, et al. Chemosteometric regression models of heat exposed human bones to determine their pre-burnt metric dimensions. Am J Phys Anthropol. 2020;173(4):734–47. pmid:32722856
- 16. Rösing FW. Methoden und Aussagemöglichkeiten der anthropologischen Leichenbrandbearbeitung. Archäologie & Naturwissenschaft. 1977;1977(1):53–80.
- 17. Gonçalves D, Thompson TJU, Cunha E. Osteometric sex determination of burned human skeletal remains. J Forensic Leg Med. 2013;20(7):906–11. pmid:24112343
- 18. Hlad M, Veselka B, Steadman DW, Herregods B, Elskens M, Annaert R, et al. Revisiting metric sex estimation of burnt human remains via supervised learning using a reference collection of modern identified cremated individuals (Knoxville, USA). Am J Phys Anthropol. 2021;175(4):777–93. pmid:33720412
- 19. Dutch Furnace Works E. Cremation furnaces – cremation ovens. https://dfweurope.com/cremation-furnaces/. 2025. Accessed 2025 July 14.
- 20. Chatzikonstantinou I, Fülöp K, Chronaki S, Snoeck C, Kiriatzi E, Triantaphyllou S. Experimental insights into different funerary burning conditions on Crete island, Greece. Journal of Archaeological Science: Reports. 2025;62:105067.
- 21. Stamataki E, De Mulder G, Hermans R, Vercauteren M, Snoeck C. The seasonality experiment: Investigating how seasons affect the burning conditions of cremations. PLoS One. 2025;20(7):e0327478. pmid:40632747
- 22. Waltenberger L, Bosch MD, Fritzl M, Gahleitner A, Kurzmann C, Piniel M, et al. More than urns: A multi-method pipeline for analyzing cremation burials. PLoS One. 2023;18(8):e0289140. pmid:37647251
- 23. Waltenberger L, Heimel P, Skerjanz H, Tangl S, Verdianu D, Rebay-Salisbury K. Lateral angle: A landmark-based method for the sex estimation in human cremated remains and application to an Austrian prehistoric sample. Am J Biol Anthropol. 2024;184(1):e24874. pmid:37982358
- 24. Veselka B, Lemmers S. Deliberate selective deposition of Iron Age cremations from Oosterhout, the Netherlands: a ‘pars pro toto’ burial ritual. LUNULA Archaeologica Protohistorica. 2014;2014.
- 25. Cavazzuti C, Bresadola B, d’Innocenzo C, Interlando S, Sperduti A. Towards a new osteometric method for sexing ancient cremated human remains. Analysis of Late Bronze Age and Iron Age samples from Italy with gendered grave goods. PLoS One. 2019;14(1):e0209423. pmid:30699127
- 26.
Cavazzuti C. I resti umani della necropoli di Scalvinetto. In: Salzani L, editor. La necropoli dell’età del Bronzo di Scalvinetto di Legnago (Verona). Verona: Soprintendenza Belle Arti e Paesaggio per le Province di Verona, Rovigo e Vicenza. 2020. p. 372–420.
- 27. Esposito C, Gigante M, Lugli F, Miranda P, Cavazzuti C, Sperduti A, et al. Intense community dynamics in the pre-Roman frontier site of Fermo (ninth–fifth century BCE, Marche, central Italy) inferred from isotopic data Scientific Reports. 2023;13(1):3632.
- 28. Bischeri M, Alhaique F, Cavazzuti C, Sperduti A. Embodying identities. An integrated analysis of anthropomorphic urns, grave goods, and cremated remains from the Etruscan necropolis of Tolle. Journal of Mediterranean Archaeology.
- 29. Murail P, Brůžek J, Houët F, Cunha E. DSP: a tool for probabilistic sex diagnosis using worldwide variability in hip-bone measurements DSP: un outil de diagnose sexuelle probabiliste à partir des données métriques de l’os coxal. Bulletins et Mémoires de la Société d’anthropologie de Paris. 2005;17(3–4):167–76.
- 30.
Cardarelli A, Cavazzuti C, Quondam F, Salvadei L, Salzani L. Le necropoli delle Narde di Frattesina: proposta per una lettura delle evidenze demografiche, rituali e sociali a partire dai dati archeologici e antropologici. Firenze: Istituto Italiano di Preistoria e Protostoria, Soprintendenza Archeologia del Veneto, Università degli Studi di Padova. 2015.
- 31.
Salzani L, Colonna C. La fragilità dell’urna. I recenti scavi a Narde, necropoli di Frattesina (XII-IX sec. a.C.). Rovigo. 2010.
- 32.
Paltinieri S. La necropoli di Chiavari. Scavi Lamboglia (1959–1969). Chiavari: Instituto Internazionale di Studi Liguri. 2010.
- 33.
Tabolli J. Narce tra la prima età del Ferro e l’Orientalizzante antico: l’abitato I Tufi e La Petrina. Pisa, Roma: Fabrizio Serra Editore. 2012.
- 34.
D’Agostino B, Gastaldi P. Pontecagnano. II 7. La necropoli del Picentino. Tombe della Prima Età del Ferro dalla proprietà Colucci. Napoli: Collection du Centre Jean Bérard. 2016.
- 35. Maurer H. Ein urnenfelderzeitliches Brandgrab aus Getzersdorf, p. B. St. Pölten, NÖ. Archaeologia Austriaca. 1971;50:115–22.
- 36. Groiß F. Urnenfelderzeitliche Brandgräber aus Getzersdorf. Archaeologia Austriaca. 1976;59(60):99–126.
- 37.
Lochner M. Bestattungssitten auf Gräberfeldern der mitteldonauländischen Urnenfelderkultur. In: Lochner M, Ruppenstein F, editors. Brandbestattungen von der mittleren Donau bis zur Ägäis zwischen 1300 und 750 v Chr. Vienna: Verlag der Österreichischen Akademie der Wissenschaften. 2013. p. 11–31.
- 38. Fritzl M, Waltenberger L, Skerjanz H, Rebay-Salisbury K. The age of death and fire. dating burial customs by radiocarbon and correspondence analysis at the late bronze age cemetery of Inzersdorf, Austria. archaeologia. 2025;Band 109/2025:177–205.
- 39. Waltenberger L. Osteological report of the the buried individuals from the late bronze age cemetery of inzersdorf ob der traisen, austria. Archaeologia Austriaca. 2025;109.
- 40. Lochner M, Hellerschmid I. Sozialstrukturen im Gräberfeld Franzhausen-Kokoron, Niederösterreich. Eine Analyse anhand der Urnengrößen. archaeologia. 2012;Band 93/2009:23–32.
- 41.
Renhart S. Zur anthropologie des urnenfelderzeitlichen brandgräberfeldes von Franzhausen-Kokoron. In: Lochner M, Hellerschmid I, editors. Das urnenfelderzeitliche Gräberfeld von Franzhausen-Kokoron. Vienna: Österreichische Akademie der Wissenschaften. 2016.
- 42.
Canty A, Ripley BD. Boot: Bootstrap R (S-Plus) Functions. 1.3-30 ed. 2025.
- 43. Kuhn M. Building predictive models in R using the caret package. Journal of Statistical Software. 2008;28:1–26.
- 44.
Wickham H, Francois R, Henry L, Müller K. dplyr: A Grammar of Data Manipulation. 1.1.4 ed. 2025.
- 45.
Wickham H. ggplot2: Elegant Graphics for Data Analysis. New York: Springer-Verlag. 2016.
- 46.
Kooperberg C, Ruczinski I. LogicReg: Logic Regression. 1.6.6 ed. 2025.
- 47.
Kassambara A. rstatix: Pipe-friendly framework for basic statistical tests. 2025.
- 48. Wickham H, Averick M, Bryan J, Chang W, McGowan L, François R, et al. Welcome to the Tidyverse. JOSS. 2019;4(43):1686.
- 49.
Gilleland E, Pocernich M, Wahl S, Frenette R. Verification: Weather Forecast Verification Utilities. 1.44 ed. 2004.
- 50. Breheny P, Burchett W. Visualization of regression models using visreg. The R Journal. 2017;9(2):56–71.
- 51. Chakraborty R, Majumder PP. On Bennett’s measure of sex dimorphism. Am J Phys Anthropol. 1982;59(3):295–8. pmid:7158663
- 52. Hunter JD. Matplotlib: A 2D Graphics Environment. Computing in Science & Engineering. 2007;9(3):90–5.
- 53. Harris CR, Millman KJ, van der Walt SJ, Gommers R, Virtanen P, Cournapeau D, et al. Array programming with NumPy. Nature. 2020;585(7825):357–62. pmid:32939066
- 54.
Romano C. Openpyxl: A Python library to read/write Excel 2010 xlsx/xlsm/xltx/xltm files. 3.1.5 ed. 2024.
- 55. McKinney W. Data Structures for Statistical Computing in Python. In: Proceedings of the Python in Science Conference, 2010. 56–61.
- 56. Python Software Foundation. pathlib — Object-oriented filesystem paths. v1.0.1. 2025.
- 57. Buikstra JE, DeWitte SN, Agarwal SC, Baker BJ, Bartelink EJ, Berger E, et al. Twenty-first century bioarchaeology: Taking stock and moving forward. Am J Biol Anthropol. 2022;178 Suppl 74:54–114. pmid:36790761
- 58. Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods. 2020;17(3):261–72. pmid:32015543
- 59. Waskom M. seaborn: statistical data visualization. JOSS. 2021;6(60):3021.
- 60. McFadden C, Oxenham MF. Revisiting the Phenice technique sex classification results reported by MacLaughlin and Bruce (1990). Am J Phys Anthropol. 2016;159(1):182–3. pmid:26767498
- 61. Knecht S, Krüger G, Liebenberg L, Ardagna Y, Perrin M, Ouladsine M. Case-Specific Accuracy in Sex Estimation from Long Bones in Forensic Anthropology: An “Accuracy x-factors” Approach. Forensic Science International. 2026;:112820.
- 62. Pilmann Kotěrová A, Santos F, Bejdová Š, Rmoutilová R, Attia MH, Habiba A, et al. Prioritizing a high posterior probability threshold leading to low error rate over high classification accuracy: the validity of MorphoPASSE software for cranial morphological sex estimation in a contemporary population. Int J Legal Med. 2024;138(4):1759–68. pmid:38532206
- 63. Ferembach D, Schwidetzky I, Stloukal M. Empfehlungen für die Alters- und Geschlechtsdiagnose am Skelett. Homo. 1979;30:1–32.
- 64. Sabaux C, Snoeck C, Capuzzo G, Veselka B, Dalle S, Warmenbol E, et al. Novel multidisciplinary approach detects multiple individuals within the same Late Bronze–Early Iron Age cremation graves. Radiocarbon. 2024;66(4):761–73.
- 65. Rebay-Salisbury K, Berner M, Wiltschke-Schrotta K, Herrero Corral AM, Wolf M, Kanz F. More error than minority: Gendered burial practices align with peptide-based sex identification in early Bronze Age burials in Central Europe. Cambridge Archaeological Journal. 2025;:1–16.
- 66. Binford LR. Mortuary Practices: Their Study and Their Potential. Mem Soc Am Archaeol. 1971;25:6–29.
- 67.
Fahlander F, Oestigard T. The materiality of death: bodies, burials, beliefs. The materiality of death: bodies, burials, beliefs. Oxford: Archaeopress. 2008. p. 1–16.
- 68. Herrero-Corral AM, Garrido-Pena R, Rebay-Salisbury K, Flores Fernández R, Paulos-Bravo R, Wolf M, et al. Gendered burial practices in Copper Age non-adults of Iberia: Evidence from peptide-based sex identification at Humanejos (Madrid, Spain). Journal of Archaeological Science: Reports. 2025;66:105290.
- 69. Gaydarska B, Rebay-Salisbury K, Ramírez Valiente P, Fries JE, Hofmann D, Augereau A, et al. To Gender or not To Gender? Exploring Gender Variations through Time and Space. Eur j archaeol. 2023;26(3):271–98.
- 70. Rebay-Salisbury K, Janker L, Pany-Kucera D, Schuster D, Spannagl-Steiner M, Waltenberger L, et al. Child murder in the Early Bronze Age: proteomic sex identification of a cold case from Schleinbach, Austria. Archaeol Anthropol Sci. 2020;12(11):265. pmid:33123298
- 71. Stewart NA, Gerlach RF, Gowland RL, Gron KJ, Montgomery J. Sex determination of human remains from peptides in tooth enamel. Proc Natl Acad Sci U S A. 2017;114(52):13649–54. pmid:29229823
- 72. Lugli F, Figus C, Silvestrini S, Costa V, Bortolini E, Conti S, et al. Sex-related morbidity and mortality in non-adult individuals from the Early Medieval site of Valdaro (Italy): the contribution of dental enamel peptide analysis. J Archaeological Science: Reports. 2020;34:102625.
- 73. Mittnik A, Massy K, Knipper C, Wittenborn F, Friedrich R, Pfrengle S, et al. Kinship-based social inequality in Bronze Age Europe. Science. 2019;366(6466):731–4. pmid:31601705
- 74. Rebay-Salisbury K, Bortel P, Janker L, Bas M, Pany-Kucera D, Salisbury RB, et al. Gendered burial practices of early Bronze Age children align with peptide-based sex identification: A case study from Franzhausen I, Austria. Journal of Archaeological Science. 2022;139:105549.
- 75. Almeida BM, Silva FC, Santos AL. Biological sex estimation in experimentally burnt patellae: exploring sexual dimorphism through comparative analysis. Int J Legal Med. 2025;139(4):1971–82. pmid:40085240
- 76. McKinley JI. Complexities of the Ancient Mortuary Rite of Cremation: An Osteoarchaeological Conundrum. Isotopic Landscapes in Bioarchaeology. Springer Berlin Heidelberg. 2016. p. 17–41.
- 77. Ubelaker DH, DeGaglia CM. Population variation in skeletal sexual dimorphism. Forensic Sci Int. 2017;278:407.e1-407.e7. pmid:28698063
- 78. Gonçalves D. The reliability of osteometric techniques for the sex determination of burned human skeletal remains. Homo. 2011;62(5):351–8. pmid:21899836
- 79. Işcan MY, Miller-Shaivitz P. Discriminant function sexing of the tibia. J Forensic Sci. 1984;29(4):1087–93. pmid:6502108
- 80. Albanese J, Cardoso HFV, Saunders SR. Universal methodology for developing univariate sample-specific sex determination methods: an example using the epicondylar breadth of the humerus. Journal of Archaeological Science. 2005;32(1):143–52.
- 81. Alunni-Perret V, Staccini P, Quatrehomme G. Sex determination from the distal part of the femur in a French contemporary population. Forensic Sci Int. 2008;175(2–3):113–7. pmid:17624707
- 82.
Dirkmaat DC, Cabo LL. Forensic Anthropology: Embracing the New Paradigm. A Companion to Forensic Anthropology. 2012. p. 1–40.
- 83. Walker PL. Sexing skulls using discriminant function analysis of visually assessed traits. Am J Phys Anthropol. 2008;136(1):39–50. pmid:18324631
- 84. Navega D, Vicente R, Vieira DN, Ross AH, Cunha E. Sex estimation from the tarsal bones in a Portuguese sample: a machine learning approach. Int J Legal Med. 2015;129(3):651–9. pmid:25186617
- 85.
Buikstra JE, Ubelaker DH. Standards for Data Collection from Human Skeletal Remains. Davis HA, editor. Fayetteville: Arkansas Archaeological Survey. 1994.