Figures
Abstract
Finite element models built from quantitative computed tomography images rely on element-wise mapping of material properties starting from Hounsfield Units (HU), which can be converted into mineral densities upon calibration. While calibration is preferably carried out by scanning a phantom with known-density components, conducting phantom-based calibration may not always be possible. In such cases, a phantomless procedure, where the scanned subject’s tissues are used as a phantom, is an interesting alternative. The aim of this study was to compare a phantom-based and a phantomless calibration method on 41 postmenopausal women. The proposed phantomless calibration utilized air, adipose, and muscle tissues, with reference equivalent mineral density values of -797, -95, and 38 mg/cm3, extracted from a previously performed phantom-based calibration. A 9-slice volume of interest (VOI) centred between the femoral head and knee rotation centres was chosen. Reference HU values for air, adipose, and muscle tissues were extracted by identifying HU distribution peaks within the VOI, and patient-specific calibration was performed using linear regression. Comparison of FE models calibrated with the two methods showed average relative differences of 1.99% for Young’s modulus1.30% for tensile and 1.34% for compressive principal strains. Excellent correlations (R2 > 0.99) were identified for superficial maximum tensile and minimum compressive strains. Maximum normalised root mean square relative error (RMSRE) values settled at 4.02% for Young’s modulus, 2.99% for tensile, and 3.22% for compressive principal strains, respectively. The good agreement found between the two methods supports the adoption of the proposed methodology when phantomless calibration is needed.
Citation: Szyszko JA, Aldieri A, La Mattina AA, Viceconti M (2024) Phantomless calibration of CT scans for hip fracture risk prediction in silico: Comparison with phantom-based calibration. PLoS ONE 19(6): e0305474. https://doi.org/10.1371/journal.pone.0305474
Editor: Hesham M. H. Zakaly, Ural Federal University named after the first President of Russia B N Yeltsin Institute of Physics and Technology: Ural’skij federal’nyj universitet imeni pervogo Prezidenta Rossii B N El’cina Fiziko-tehnologiceskij institut, RUSSIAN FEDERATION
Received: January 11, 2024; Accepted: May 30, 2024; Published: June 14, 2024
Copyright: © 2024 Szyszko et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All files containing calibration lines and HU peaks obtained for the subjects in this study as well as the custom Matlab script are available from the AMSActa database (https://doi.org/10.6092/unibo/amsacta/7727).
Funding: This study was supported by the European Commission through the H2020 project “In Silico World: Lowering barriers to ubiquitous adoption of In Silico Trials” (topic SC1-DTH-06-2020, grant ID 101016503). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Osteoporosis is a systemic metabolic disease characterised by the disruption of bone microarchitecture and bone quality due to an imbalance in the bone remodelling physiological process. Osteoporosis mainly affects older adults, especially postmenopausal women [1–4], and it currently represents one of the most common bone pathologies. Its main clinical manifestation is represented by fragility fractures, most commonly at the hip or vertebrae [5]. Fragility fractures are very severe events, associated with increased morbidity and mortality, and represent a heavy burden for the healthcare system [4, 6, 7]. In addition to chronic pain, osteoporotic fractures also cause loss of mobility and loss of independence, leading to a decreased quality of life [5–7]. According to the International Osteoporosis Foundation, the pathology causes more than 9 million fractures worldwide per year [8], and in 2019, 32 million Europeans aged over 50 years were diagnosed with osteoporosis [2]. Due to the growing global life expectancy, the incidence of hip fractures is expected to increase up to 3.5 times by 2050, compared to 1990 [9, 10].
The dual-energy X-ray absorptiometry (DXA) imaging technique is currently considered the gold standard to diagnose osteoporosis [11, 12]. An average measurement of the bone density, called areal Bone Mineral Density (aBMD), is extracted from the DXA image and used to classify subjects as osteoporotic, osteopenic, or healthy. Nevertheless, the predictive accuracy of aBMD has been demonstrated to be limited [13–15], and there have been several instances where fractures occurred in patients not classified as high-risk by this parameter [16, 17]. In this context, patient-specific finite element (FE) models developed from quantitative computed tomography (QCT) data have demonstrated to represent a valid alternative, able to provide a more accurate fracture prediction [18–20]. In fact, FE models can more accurately account for personalised mechanical determinants of fracture, such as patient-specific bone 3D geometry and density distribution [13, 21–23]. In addition, QCT-based FE models can account for different loads and forces [24–26]. These models rely on a heterogeneous mapping of the material properties based on the Hounsfield Units (HU) of the QCT images, where each element of the FE model is assigned a mineral density value dependent on the image local HU [27]. The element-wise density is later employed to compute the corresponding Young’s modulus values from density-elasticity laws available in the literature [28, 29]. The extraction of densities from the CT HU values requires a calibration to be performed, i.e., the assessment of a mathematical relation between HU and mineral density. The preferred approach adopted to calibrate CT scans is a phantom-based calibration procedure, which involves either placing a calibration phantom in line with the patient during the scan or scanning a phantom offline using the same scanning parameters employed to image the patient [30, 31]. The phantom is composed of multiple components with different known concentrations of calcium hydroxyapatite (HA) or dipotassium phosphate (K2HPO4). As a result, a linear relationship can be established between HU and mineral density. However, this approach presents some limitations in the clinical practice. On one hand, the inline scanning is associated with increased costs [32, 33] and may lead to artifacts due to the patient’s position and anatomy, potentially causing issues such as beam hardening or partial volume averaging [32–34]. On the other hand, the offline scanning, which requires scanning the phantom separately, could represent a logistical and financial burden for the hospitals [35]. Moreover, many clinical studies carried out to assess the accuracy of these FE models often make use of opportunistic CT scans, i.e., CT scans that were originally acquired for other clinical purposes [13, 33]. In these cases, phantom scans are commonly not available.
Aiming to overcome this issue, so-called phantomless calibration approaches have been proposed, which allow the calibration of CT scans without a phantom. Usually, these approaches employ internal patient tissues in place of the components of the phantom. Different tissue combinations can be found in the literature for this purpose, such as air and adipose tissue [36]; muscle and adipose tissues [34, 37]; air, adipose, and muscle tissues [38, 39]; air, aortic blood, and muscle tissues [40]; air, aortic blood, cortical bone, and muscle [33]. Fixed reference density values are hypothesized a priori for the selected tissues, and the calibration is performed by extracting the corresponding HU from the scan. In the literature, different approaches identifying the reference values have been presented. Michalski et al. [33] proposed a phantomless calibration that used air, adipose tissue, aortic blood, skeletal muscle and cortical bone as references estimating their density values from their mass attenuation coefficients according to [41]. Winsor et al. [40], on the other hand, adopted air, muscle and aortic blood as reference tissues, assigning nominal density values provided by the National Institute of Standards and Technology [42] to them. While those studies explored different tissues combinations, including aortic blood, a common limitation of these methods was represented by the manual and therefore user-dependent selection of volumes of interest where the specific tissues were contained. In addition, given that the average HU values for aortic blood fall within the range of values for skeletal muscle [40], the choice of both these reference tissues poses reproducibility and robustness challenges in phantomless calibration. Other studies [36, 38, 39], instead, determined reference density values for the chosen reference tissues by taking advantage of an inline phantom-based calibration. More in detail, in [38, 39] air, adipose tissue and skeletal muscle were employed as references to calibrate femur CT scans of metastatic patients acquired with 120 kVp tube voltage, 220 mA tube current, and 3 mm slice thickness. In [36], instead, phantom-based determined reference values of air and adipose tissues were employed to carry out phantomless calibration used for the prediction of femoral strength. In those works, a phantomless calibration procedure was proposed and density values for the selected reference tissues were also provided. Herein, CT images of post-menopausal women cohort were available along with the corresponding offline phantom-based calibrations. In our efforts to replicate prior phantomless calibration methodologies on our postmenopausal women cohort though, including those outlined by Eggermont et al. [38], we came across significant differences in Young’s modulus values derived from the phantom-based calibration available compared to those derived from the phantomless calibration performed adopting the reference density values originally derived there for metastatic patients [36]. Recognizing the need for calibration parameters better suited to our cohort, and analogous phantomless calibration procedure is here proposed, where reference human tissues to be employed in the phantomless calibration are selected and their reference density values identified through phantom-based calibrations across multiple subjects and CT acquisition parameters. The aim of this study was to develop an automatic phantomless calibration methodology to be integrated into an existing FE pipeline for hip fracture risk prediction presented in [18, 43]. In this context, the comparison between the phantom-based and phantomless calibrated FE models is also presented. Our objective was in fact to ensure that our developed method could be robust and easily reproducible by other researchers in the field.
Materials and methods
QCT data
For the purpose of this study the QCT scans of 34 postmenopausal female subjects (age 67 ± 8 years, range: 56–84 years; height 160 ± 7 cm, range: 143–180 cm, weight 66 ± 13 kg, range: 39 ± 100 kg) were selected from the Hip-Fracture Validation Collection, freely available at [44]. The collection comprises the CT scans of the femurs of 101 women (age 69 ± 9 years, range: 56–86 years; height 160 ± 7 cm, range: 134–180 cm, weight 65 ± 12 kg, range: 39–100 kg) resident in Emilia Romagna region, collected at Rizzoli Orthopaedic Institute (IOR) from 1999 to 2016 for surgical planning of hip arthroplasty (with approval from Istituto Ortopedico Rizzoli Ethics Committee Area Vasta Emilia Centro CE AVEC 731/2020/Oss/IOR). No consent form for the processing of personal data used for clinical activity for research purposes was applicable, due to article 110 bis paragraph 4 of the Nuovo Codice Privacy D.lgs. 196/2003, updated to D.lgs. 101/2018 regarding the Italian Scientific Institute for Research, Hospitalization and Healthcare (IRCCS). These scans were obtained using GE HiSpeed CT/I (GE Healthcare, Milwaukee, WI) scanner with tube current variations in the 170–200 mA range and slice spacing variations in the 1–2 mm range. The remaining scan parameters were kept constant: tube voltage of 120 kVp, slice thickness of 3 mm, and "bone" reconstruction kernel. In addition, the QCT scans of further 7 female subjects (age 73 ± 6 years, range: 66–82 years; height: 163 ± 4 cm, range: 160–169 cm; weight: 63 ± 13 kg, range: 50–89 kg), collected at Rizzoli Orthopaedic Institute (IOR) in 2023, were also included in the present study. Ethical approval for these subjects was obtained from the Istituto Ortopedico Rizzoli Ethics Committee Area Vasta Emilia Centro CE AVEC 453/2023/Sper/IOR, and all subjects provided written informed consent. These scans were acquired using a GE Discovery CT scanner (GE Healthcare, Milwaukee, WI), with tube current variations in the 100–140 mA range. Other scan parameters remained constant: tube voltage of 120 kVp, slice thickness of 1.25 mm, slice spacing of 1.25 mm, and "bone" reconstruction kernel. Detailed CT scanning settings for all the subjects included in the study are provided in S1 File. The included 41 subjects were selected because scans of a QRM-ESP (QRM GmbH, Germany) calibration phantom were available acquired with matching CT acquisition parameters, as explained in more detail in the following. In order that the phantomless calibration pipeline proposed could be validated, the 41 subjects included were divided into two groups: Group 1, which consisted of 24 subjects representing the largest subgroup with uniform CT parameters, and Group 2, which comprised 17 subjects scanned with tube currents ranging from 100 to 190 mA and with two different CT scanners. Table 1 displays the scan parameter details.
Phantom-based calibration
A QRM-ESP (QRM GmbH, Germany) phantom was employed to carry out the phantom-based calibration. The phantom includes five inserts with different quantities of calcium hydroxyapatite measuring 50, 100, 200, 400, and 800 mg/cm3, that cover the range of spongious and cortical bone densities [45]. The phantom was scanned offline using GE HiSpeed CT/I and GE Discovery CT (GE Healthcare, Milwaukee, WI) scanners. A total of 6 CT scans of the phantom were acquired with CT acquisition parameters matching those adopted for the femurs’ CT scans: the tube current was varied in the 100–200 mA range and slice spacing adjusted between 1.25–4.5mm, with slice thickness of either 1.25mm or 3mm, while maintaining a constant “bone” kernel and tube voltage of 120 kVp. Detailed information about the ESP phantom scans is provided in Table 2. For each scan, the phantom inserts were manually segmented, and the corresponding average HU values were computed. Hence, linear regressions between the average HU values and the known density of the phantom inserts were performed to identify the coefficients of the calibration line for each scan. S1 File provides the phantom-based calibration information for each subject included in this study.
Phantomless calibration
The phantomless calibration procedure was based on the methodology reported in [38], where air, adipose, and muscle tissues were employed to carry out calibration. The main steps of the phantomless calibration procedure will be described in detail in the following. First, a reference point (RP) was identified as the average between the femur head and the knee centres (Fig 1A). The CT slice closest to the RP was then selected, and a 9-slice-wide volume of interest (VOI) was defined from that slice in the caudal direction. The selected VOI included air, adipose, and muscle tissues (Fig 1B). In order to eliminate any artifacts, the VOI was restricted by removing the portion extending in the posterior direction, starting from a point (OP) located 5 cm posteriorly to RP (Fig 1C). Since only one limb CT scan was used, no restriction in the medial direction was applied. The volume of interest resulting from this process will be referred to as VOIcut in the following.
(a) RP definition based on the Femur Head Centre and Knee Centre. (b) VOI definition based on RP and OP by removing the portion extending 5cm posteriorly from the RP. (c) VOIcut selection. The visualization of the CT scan was obtained from the data collected during the study, which is available at [42].
The voxels HU contained in the VOIcut were used to build a histogram, which was later fitted with a kernel distribution using a normal smoothing function based on a 5 HU bandwidth (Matlab, release R2022b, The Mathworks Inc) (Fig 2). The peaks corresponding to air, adipose tissue, and muscle HU reference values could thus be identified. Further details about the kernel distribution fitting procedure are provided in the S1 File.
The "pdf value" on the y-axis represents the probability density function value.
For each of the 24 subjects in Group 1 the equivalent mineral density values for air, adipose, and muscle tissues was computed by means of the available phantom-based calibration line. Subsequently, their equivalent mineral density values were averaged, resulting in -797, -95, and 38 mg/cm3 for air, adipose, and muscle, respectively. These reference values were then adopted in the phantomless calibration process performed on Group 2 subjects, where calibration was performed employing the above-mentioned reference density values previously obtained from Group 1 instead of employing the phantom scans. Aiming to assess whether different CT acquisition parameters could affect the reference density values identified for air, adipose and muscle tissues and obtained for Group 1 employing the phantom-based calibration, those reference density values were also extracted through phantom-based calibration on Group 2, in order that a comparison with the values obtained for Group 1 could be carried out. The overall overview of the procedure implemented is shown in Fig 3.
Upper panel: the calibration phantom was employed to calibrate the CT scans for Group 1 subjects so that reference density values for air, adipose, and muscle tissues could be computed. These reference density values were later employed to calibrate through a phantomless calibration procedure the CT images of Group 2 subjects. Phantom scans were also available to calibrate Group 2 subjects, so that the outcomes of the phantomless and phantom-based calibration methodologies could be compared. Lower panel: phantom-based calibrations of Group 2 subjects were also employed to compute reference values for air, adipose, and muscle tissues to assess potential effects of different CT acquisition parameters on the reference density values. In the figure, ρHA stands for the density of the phantom, f stands for the distribution of the HU in the VOIcut (pdf value), HUphantom and HUsubject stand for HU values extracted from the CT scan of the phantom and subject, respectively.
FE models
FE models of the femur were built for all 41 subjects included in this study according to the methodology presented in [43] and here briefly summarized. First, the CT scans were segmented using a semi-automatic methodology [46], and the femur geometry was extracted. Subsequently, a 2-mm-sized 10-nodes tetrahedral mesh was built using Ansys ICEM CFD (release 2019R3, Ansys Inc.). Bonemat (V3.2, Istituto Ortopedico Rizzoli, Bologna, Italy) was used to assign heterogeneous linear elastic material properties to the models, and specifically to map the HU-based Young’s modulus values to the mesh elements [28, 29]. For each subject tested, the material properties were mapped using both the phantom-based and the phantomless calibrations on the same mesh. Boundary conditions replicating a sideways fall loading scenario were applied to the models. Specifically, a 1000 N load was applied at the femur head center, and a rigid frictionless contact plane perpendicular to the load direction was created at the greater trochanter. A revolute joint was placed at the knee rotation centre (calculated as the mean location between most medial and most lateral epicondyle points), with the rotation axis along the antero-posterior direction, perpendicular to the load axis. A total of 28 different FE simulations were performed by varying the femur impact pose from 0° to +30° in the medio-lateral plane and -30° to +30° in the antero-posterior plane [26, 43]. Simulations were solved with Ansys Mechanical APDL (Ansys Inc., PA, USA). The principal strain values represented the biomechanical variables of interest. In addition, for each impact pose, the load to failure, i.e. the intensity of the force required to fracture the femur was computed based on principal strains. The region of interest (ROI) chosen for determining load to failure included surface nodes located at the proximal femur (representing 25% of its biomechanical length in the longitudinal direction) [41]. By comparing the so obtained loads to failure with forces acting on the femur upon falling computed with a stochastic analytical model [18, 25, 43] the absolute risk of hip fracture at time 0 (ARF0), where time 0 refers to the time of the QCT scan, was derived [41].
Comparison metrics
Young’s modulus values of each element of the mesh together with the superficial tensile (ε1) and the compressive (ε3) principal strains obtained on the ROI resulting from the two calibration procedures were compared, and the root mean square relative error (RMSRE) computed using the average values of phantom-based and phantomless calibrations as normalizing values:
(1)
where:
- ■ n is the number of data points.
- ■ xi represents the value obtained from phantom-based calibration.
- ■
represents the value obtained from phantomless calibration.
- ■
represents the average between phantom-based and phantomless calibration.
Point-to-point comparisons between the phantom-based and phantomless calibrated FE models were also performed for each subject considering principal strains and Young’s moduli. In addition, the differences in Minimum Side-Fall Strength (MSF), i.e., the lowest load to failure across the 28 femoral impact poses, and ARF0 resulting from the phantom-based and phantomless calibrated femur FE models were calculated for each subject.
Results
Table 3 presents the mean and standard deviation of the reference density values obtained from the phantom-based calibration for air, adipose, and muscle tissue, for both the Group 1 and for all 41 subjects averaged together. Small variations can be observed between the obtained reference density values.
The RMSRE computed on Young’s modulus values for Group 2 subjects settled to an average value of 2.86 ± 0.76%, with a maximum of 4.02%. The mean RMSRE computed on superficial principal strains for the same subjects turned out to be, on average, equal to 1.55 ± 0.74%, with a maximum of 2.99% for tensile and 1.61 ± 0.80%, with the maximum being 3.22% for compressive principal strains. Fig 4 shows the violin plots relative to the element-by-element relative differences in the Young’s modulus resulting from the two calibration methodologies for Group 2. There, outliers were excluded using the Interquartile Range (IQR) method, which calculates the range between the first quartile (Q1) and the third quartile (Q3) of the data distribution and removes data points falling beyond a threshold defined as 1.5 times the IQR. The outliers were identified as high errors associated with low Young’s modulus values, primarily attributed to the spongy bone region of the femur and by the femur head typically associated with the low cortical area. Violin plots containing outliers can be found in the S1 File. Element-wise relative differences in Young’s modulus averaged on each subject belonging to Group 2 ranged from 0.95% to 2.93%, with an average value of 1.99%.
Violin plots showing the distributions of element-wise relative differences in Young’s modulus values between phantom-based and phantomless calibration for Group 2 subjects. The solid black line represents the mean value, while the blue solid line represents the median. Outliers, identified using the inter-quartile range method, have been excluded.
Fig 5 shows the spatial distribution of the relative differences in Young’s modulus along with the Young’s modulus values obtained for both methods for one of the analysed subjects (S07). Spatial distributions of relative differences in Young’s modulus are shown for additional subjects in the S1 File.
Frontal plane is the mean frontal section from the anterior view of the femur.
The point-to-point relative differences in superficial principal strains averaged on each subject belonging to Group 2 ranged from 0.29% to 2.53%, with an average value of 1.30% for tensile principal strains and from 0.29% to 2.69%, with an average value of 1.34% for compressive principal strains. A more detailed comparison carried out on tensile and compressive principal strains, including all the simulated femur impact poses, can be found in the S1 File. Fig 6 shows the distribution of the point-to-point relative differences between the phantom-based and phantomless calibration for the superficial tensile (Fig 6A) and compressive (Fig 6B) principal strains, respectively. Outliers identified using the inter-quartile range method have been excluded. Violin plots containing outliers can be found in the S1 File.
Violin plots showing the distributions of the point-to-point relative differences computed on superficial tensile (a) and compressive (b) principal strain values between phantom-based and phantomless calibrations, considering all 28 simulated femur impact poses for each the 17 subjects in Group 2. The solid black line represents the mean, the blue solid line represents the median. Outliers, identified using the inter-quartile range, have been excluded.
The correlation between phantom-based and phantomless calibration-derived Young’s modulus and principal strains was excellent (R2 > 0.99). In Fig 7 the scatter plot comparing phantom-based and phantomless calibrations-derived highest superficial tensile and lowest superficial compressive strains is depicted. Specifically, the highest superficial tensile (a) and the lowest superficial compressive (b) principal strains for each simulated impact pose (28 angles) and Group 2 subjects are shown.
The highest superficial tensile (a) and the lowest superficial compressive (b) principal strains for each of the 28 simulated impact poses for each of 17 subjects contained in Group 2 (R2 > 0.99).
The relative differences computed for MSF ranged from 0.08% to 2.41%, with an average value of 0.98%. S1 File reports the MSF values along with the associated failure load angles for the phantom-based and phantomless calibrated FE models. Similarly, the absolute differences computed for ARF0, which can be found in S1 File for Group 2, ranged from 0.01% to 1.87% with an average of 0.59%.
Discussion
When building a patient-specific QCT-based FE model of the femur, heterogenous material properties are assigned to the mesh elements based on the local density information contained in the CT scan. In fact, a number of mathematical laws have been proposed and validated in the literature, which relate density and Young’s modulus values [28, 29, 47]. Nonetheless, in order for any of those relationships to be used, density needs to be estimated from the HU values. Calibration, establishing the HU-density relation, is in fact a crucial step in QCT-based FE modeling. Typically, calibration involves scanning a phantom with known density components, so that a mathematical relationship can be established between the average HU and the known density of the components themselves. However, the availability of a phantom to be scanned as well as the availability of phantom scanned with CT acquisition parameters matching exactly the CT scan of a subject are not straightforward and may be challenging. Therefore, a phantomless calibration procedure, i.e., an approach able to relate HU and density without the need for a phantom, but uniquely based on the subject’s CT scan, would be pivotal to extending the usability of these patient-specific models. In this context, the primary objective of this study was to develop and evaluate an automatic phantomless CT calibration approach according to the methodology proposed in [38]. The developed procedure only requires input data in the form of a CT scan and predefined anatomical landmarks such as head femur center and knee center. The output of the procedure is the calibration line, which is generated automatically. With this aim, in this work the outcomes of subject-specific QCT-based FE models built starting from phantom-based and phantomless calibration procedures have been compared to assess the reliability of the phantomless calibration approach.
Our study involved a cohort of 41 postmenopausal women whose CT scans were obtained with different scanners, tube current values (100–200 mA), slice thickness (1.25–3 mm) and spacing between slicing (1–2 mm). Density values for the air, adipose and muscle tissues selected as references to be adopted for future calibrations were defined based on the phantom-based procedure for 24 subjects whose scans had been acquired with the same CT parameters, and which formed so called Group 1. The phantomless calibration method was then validated on 17 subjects referred to as Group 2, for whom phantom-based calibration was also available. Nevertheless, aiming to assess whether the identified reference density values were robust enough, those were computed through phantom-based calibration also considering Group 2 subjects. Although some variations could be observed between air, adipose and muscle reference density values extracted considering Group 1 only or Group 1 and 2 together, these differences were minimal and did not significantly impact the calibration process and the reliability of the obtained results. The density values obtained for Group 1 were -797, -95, and 38 mg/cm3 for air, adipose, and muscle tissues, respectively. When including all subjects, these values shifted slightly to -801, -96, and 37 mg/cm3. In the studies presented by Eggermont et al. [38] and Ataei et al. [39], different reference values for the same tissues were reported: -840, -90, 30 and -838, -86, and 35 mg/cm3 for air, adipose, and muscle tissues, respectively. These differences in the reference density values may be attributed to several factors. Firstly, our study exclusively focused on postmenopausal women, whereas their cohorts included both elderly men and women, including premenopausal women, diagnosed with bone metastases. As literature suggest, the density of adipose, and muscle tissues may vary, depending on gender, age or health conditions [48–50], and therefore adjustments to the here proposed reference density values may be necessary for accurate results. Moreover, while in our study reference density values identification was based on an offline phantom-based calibration, in [38, 39] an inline calibration method was adopted by scanning the patient atop of the calibration phantom. Previous studies have shown that scanning the patient atop of the phantom might cause artifacts due to the patient’s position and anatomy, thus resulting in issues such as beam hardening and scatter, partial volume averaging, or a decrease in HU values in the phantom due to the gap between the patient and the phantom [33, 34, 36, 37]. Offline calibration, on the other hand, eliminates these variables, providing a calibration process free from the influence of patient-specific factors during the phantom scanning procedure. On the other hand, the scanning of the phantom separately from the patient may not fully account for beam hardening effects related to patient size and anatomy [51]. It is known that higher amounts of soft tissue tend to absorb lower energy X-ray photons, leading to beam hardening and variability in HU measurements [33].
Considering the Young’s modulus differences here obtained by comparing the phantom-based and the phantomless calibrated FE models for Group 2, limited point-to-point relative differences were observed. Notably, the most significant local point-to-point differences were associated with low-density values located in the spongy bone tissue (Fig 5). These variations did not substantially impact the FE predicted femur strength, which was determined based on superficial principal strains. Furthermore, the maximum RMSRE calculated on Young’s modulus was less than 5%, indicating small variations between both calibration methods. In a comparable study [39], the relative differences between the Young’s modulus obtained from the mean bone density values for the cortical and trabecular bone for two calibration methods resulted in 5.47% and 4.6%, respectively. The point-to-point relative differences in superficial tensile and compressive principal strains exhibited low errors, with the highest mean differences of 2.53% and 2.69%, respectively. Importantly, in our study, the correlation between the principal strains resulting from the two calibration methods was notably high, with R2 > 0.99, thus affirming a strong agreement in the strains prediction. Additionally, the mean relative differences in the MSF resulted in 0.97% with the highest being 2.41%. These findings agree with similar studies [38, 39], where relative differences between the mean failure load turned out to be 0.16% and 6.96%, respectively. There, a high correlation emerged between failure loads of phantom-based and phantomless calibrated FE models, with R2 = 0.94 and R2 = 0.83 for [38, 39], respectively. Similarly, our study showed an excellent correlation between failure loads, with R2 > 0.99. Winsor et al. [40] reported absolute differences of 6.9% and 3.9% for femur strength and femoral bone density, respectively, while comparing the inline phantom-based and phantomless method. Nevertheless, the differences computed on femur strength and femoral bone density increased significantly when comparing FE models calibrated with offline phantom-based and phantomless calibration methodologies (22% and 17% relative differences for femur strength and femoral bone density, respectively). In our simulation outcomes the absolute differences in ARF0 for all subjects remain below 2%. Crucially, these differences would not significantly impact the classification of the subjects as at high or low fracture risk, as the ARF0 threshold for classification previously obtained for the full cohort was set at 39.98%. As a whole, despite the different FE pipelines and reference tissue combinations adopted, the outcomes obtained in our study closely agreed with those reported in studies where CT phantomless calibration was also carried out.
There are some limitations to this study, which are reported in the following. First of all, the assumption of consistent muscle and adipose tissues density among all subjects was made, which is a strong hypothesis required when a phantomless calibration procedure is developed. While we acknowledge the potential impact of sarcopenia on muscle density for instance, we tentatively assumed similar muscle states across our study participants, given the narrow age range of postmenopausal women included in our sample. However, we recognize that different pathologies may introduce variability in muscle density, and future studies might provide valuable insights into this aspect of our research. Notably, despite these potential discrepancies, the average relative differences observed on Young’s modulus and principal strains for all subjects in Group 2 remained below 4%. This suggests that while individual variations in effective density might be present, they may not substantially impact the overall outcomes. Moreover, while our cohort is small and focused, it was intentionally selected to address research dedicated to fracture risk prediction in postmenopausal osteoporotic subjects. Nonetheless, adjustments to the proposed reference density values may be necessary for accurate results in different populations. From a general perspective, our study also provides the framework to recalculate such reference density values for different populations if needed.
In addition, when applying the here presented methodology to very skinny subjects with minimal adipose tissue, issues might arise in detecting the adipose tissue peak due to its reduced presence in CT images. Although our methodology employs a histogram analysis with a narrow bin width (1 HU) to identify subtle tissue peaks, accuracy may be compromised in instances of extremely low BMI. On the other hand, one of the subjects considered in this study actually had a BMI value of 14.31, which is classified as underweight. In that case tissue peaks could be successfully identified. However, further investigations about the impact of subject anatomy and composition on the calibration process would be of interest, especially across different populations. Moreover, although the subjects analysed here were scanned with two different CT scanners and with varying CT acquisition parameters (e.g. tube current, slice spacing, and slice thickness), it is important to point out that tube voltage never varied. Previous studies have shown that alterations in tube voltage have the most significant impact on HU [39, 52]. Slightly higher errors were observed for subjects S11-S17 compared to other subjects in Group 2. While these errors indicate minimal impact on our overall results, further studies including more subjects and CT scanners would be beneficial to assess inter-scanner and cohort variability and its impact on the phantomless calibration. Therefore, a more comprehensive analysis involving diverse tube voltage settings and additional CT scanners would be essential to enhancing the robustness and consistency of the here achieved results. Another aspect to be considered is also the time elapsed between the phantom and the subjects scans, which in our case varied from a few months to up to a few years: this variability in timing may have impacted the stability of HU values among scans, and potentially have affected calibration accuracy.
In conclusion, the here presented study has shown promising results regarding the adoption of phantomless calibration methodologies and provides reference density values for postmenopausal women to the scientific community which could be employed to carry out phantomless calibration when needed. These findings would not only offer potential benefits in enhancing the current clinical practice but would also allow the scientific community to take advantage of a higher number of CT scans, including opportunistic CT scans, to reliably build FE models. Such methodology could therefore support the adoption of digital twin technologies in clinics, fostering, inter alia, the creation of validation collections to increase the credibility evidence of CT-based digital twins.
Supporting information
S1 File. Detailed information about CT scans, phantom-based calibration, kernel distribution fitting and results.
https://doi.org/10.1371/journal.pone.0305474.s001
(PDF)
Acknowledgments
We acknowledge the CINECA award under the ISCRA initiative, for the availability of high performance computing resources and support.
References
- 1. Tu KN, Lie JD, Wan CKV, Cameron M, Austel AG, Nguyen JK, et al. Osteoporosis: A Review of Treatment Options. Pharm Ther. 2018 Feb;43(2):92–104. pmid:29386866
- 2. Pouresmaeili F, Kamalidehghan B, Kamarehei M, Goh YM. A comprehensive overview on osteoporosis and its risk factors. Ther Clin Risk Manag. 2018 Nov;14:2029–49. pmid:30464484
- 3. Jeremiah MP, Unwin BK, Greenawald MH, Casiano VE. Diagnosis and Management of Osteoporosis. Am Fam Physician. 2015 Aug;92(4):261–8. pmid:26280231
- 4. Kanis JA, Cooper C, Rizzoli R, Reginster JY. European guidance for the diagnosis and management of osteoporosis in postmenopausal women. Osteoporos Int. 2019;30(1):3–44. Erratum in: Osteoporos Int. 2020 Jan;31(1):209. Erratum in: Osteoporos Int. 2020 Apr;31(4):801. pmid:30324412
- 5. Cauley JA, Chalhoub D, Kassem AM, Fuleihan GEH. Geographic and ethnic disparities in osteoporotic fractures. Nat Rev Endocrinol. 2014 Jun;10(6):338–51. pmid:24751883
- 6. Rizkallah M, Bachour F, Khoury ME, Sebaaly A, Finianos B, Hage RE, et al. Comparison of morbidity and mortality of hip and vertebral fragility fractures: Which one has the highest burden? Osteoporos Sarcopenia. 2020 Sep;6(3):146–50. pmid:33102809
- 7. Borgström F, Karlsson L, Ortsäter G, Norton N, Halbout P, Cooper C, et al. Fragility fractures in Europe: burden, management and opportunities. Arch Osteoporos. 2020 Apr;15(1):59. pmid:32306163
- 8. Johnell O, Kanis JA. An estimate of the worldwide prevalence and disability associated with osteoporotic fractures. Osteoporos Int. 2006 Dec;17(12):1726–33. pmid:16983459
- 9. Khandelwal S, Lane NE. Osteoporosis: Review of Etiology, Mechanisms, and Approach to Management in the Aging Population. Endocrinol Metab Clin North Am. 2023 Jun;52(2):259–75. pmid:36948779
- 10. Cooper C, Campion G, Melton LJ. Hip fractures in the elderly: A world-wide projection. Osteoporos Int. 1992 Nov;2(6):285–9. pmid:1421796
- 11. Dimai HP. Use of dual-energy X-ray absorptiometry (DXA) for diagnosis and fracture risk assessment; WHO-criteria, T- and Z-score, and reference databases. Bone. 2017 Nov;104:39–43. pmid:28041872
- 12. Cosman F, de Beur SJ, LeBoff MS, Lewiecki EM, Tanner B, Randall S, et al. Clinician’s Guide to Prevention and Treatment of Osteoporosis. Osteoporos Int. 2014 Oct;25(10):2359–81. Erratum in: Osteoporos Int. 2015 Jul;26(7):2045–7.
- 13. Keaveny TM, Clarke BL, Cosman F, Orwoll ES, Siris ES, Khosla S, et al. Biomechanical Computed Tomography analysis (BCT) for clinical assessment of osteoporosis. Osteoporos Int. 2020 Jun;31(6):1025–48.
- 14. Schuit SCE, van der Klift M, Weel AEAM, de Laet CEDH, Burger H, Seeman E, et al. Fracture incidence and association with bone mineral density in elderly men and women: the Rotterdam Study. Bone. 2004 Jan;34(1):195–202. Erratum in: Bone. 2006 Apr;38(4):603. pmid:14751578
- 15. Wainwright SA, Marshall LM, Ensrud KE, Cauley JA, Black DM, Hillier TA, et al. Hip Fracture in Women without Osteoporosis. J Clin Endocrinol Metab. 2005 May;90(5):2787–93. pmid:15728213
- 16. Siris ES, Chen YT, Abbott TA, Barrett-Connor E, Miller PD, Wehren LE, et al. Bone Mineral Density Thresholds for Pharmacological Intervention to Prevent Fractures. Arch Intern Med. 2004 May;164(10):1108–12. pmid:15159268
- 17. Unnanuntana A, Gladnick BP, Donnelly E, Lane JM. The Assessment of Fracture Risk. J Bone Joint Surg Am. 2010 Mar;92(3):743–53.
- 18. Bhattacharya P, Altai Z, Qasim M, Viceconti M. A multiscale model to predict current absolute risk of femoral fracture in a postmenopausal population. Biomech Model Mechanobiol. 2019 Apr;18(2):301–18. pmid:30276488
- 19. Qasim M, Farinella G, Zhang J, Li X, Yang L, Eastell R, et al. Patient-specific finite element estimated femur strength as a predictor of the risk of hip fracture: the effect of methodological determinants. Osteoporos Int. 2016 Sep;27(9):2815–22. pmid:27108118
- 20. Aldieri A, Bhattacharya P, Paggiosi M, Eastell R, Audenino AL, Bignardi C, et al. Improving the Hip Fracture Risk Prediction with a Statistical Shape-and-Intensity Model of the Proximal Femur. Ann Biomed Eng. 2022 Feb;50(2):211–21. pmid:35044572
- 21. Viceconti M, Qasim M, Bhattacharya P, Li X. Are CT-Based Finite Element Model Predictions of Femoral Bone Strength Clinically Useful? Curr Osteoporos Rep. 2018 Jun;16(3):216–23. Erratum in: Curr Osteoporos Rep. 2019 Dec;17(6):580. pmid:29656377
- 22. Aldieri A, Terzini M, Osella G, Priola AM, Angeli A, Veltri A, et al. Osteoporotic Hip Fracture Prediction: Is T-Score-Based Criterion Enough? A Hip Structural Analysis-Based Model. J Biomech Eng. 2018 Nov;140(11):111004. pmid:30029233
- 23. Garavelli C, Curreli C, Palanca M, Aldieri A, Cristofolini L, Viceconti M. Experimental validation of a subject-specific finite element model of lumbar spine segment using digital image correlation. PLoS One. 2022 Sept;17(9):e0272529. pmid:36084092
- 24. Keyak JH, Rossi SA, Jones KA, Skinner HB. Prediction of femoral fracture load using automated finite element modeling. J Biomech. 1997 May;31(2):125–33.
- 25. Aldieri A, Terzini M, Audenino AL, Bignardi C, Paggiosi M, Eastell R, et al. Personalised 3D Assessment of Trochanteric Soft Tissues Improves HIP Fracture Classification Accuracy. Ann Biomed Eng. 2022 Mar;50(3):303–13. pmid:35103867
- 26. Altai Z, Qasim M, Li X, Viceconti M. The effect of boundary and loading conditions on patient classification using finite element predicted risk of fracture. Clin Biomech. 2019 Aug;68:137–43. pmid:31202100
- 27. Knowles NK, Reeves JM, Ferreira LM. Quantitative Computed Tomography (QCT) derived Bone Mineral Density (BMD) in finite element studies: a review of the literature. J Exp Orthop. 2016 Dec;3(1):36. pmid:27943224
- 28. Morgan EF, Bayraktar HH, Keaveny TM. Trabecular bone modulus–density relationships depend on anatomic site. J Biomech. 2003 Jul;36(7):897–904. pmid:12757797
- 29. Schileo E, Dall’Ara E, Taddei F, Malandrino A, Schotkamp T, Baleani M, et al. An accurate estimation of bone density improves the accuracy of subject-specific finite element models. J Biomech. 2008 Aug;41(11):2483–91. pmid:18606417
- 30. Skornitzke S, Vats N, Kopytova T, Tong EWY, Hofbauer T, Weber TF, et al. Asynchronous calibration of quantitative computed tomography bone mineral density assessment for opportunistic osteoporosis screening: phantom-based validation and parameter influence evaluation. Sci Rep. 2022 Dec;12(1):20729. pmid:36456574
- 31. Brown JK, Timm W, Bodeen G, Chason A, Perry M, Vernacchia F, et al. Asynchronously Calibrated Quantitative Bone Densitometry. J Clin Densitom. 2017 Apr;20(2):216–25. pmid:26781430
- 32. Eggermont F, Derikx LC, Free J, van Leeuwen R, van der Linden YM, Verdonschot N, et al. Effect of different CT scanners and settings on femoral failure loads calculated by finite element models. J Orthop Res. 2018 Aug;36(8):2288–95. pmid:29508905
- 33. Michalski AS, Besler BA, Michalak GJ, Boyd SK. CT-based internal density calibration for opportunistic skeletal assessment using abdominal CT scans. Med Eng Phys. 2020 Apr;78:55–63. pmid:32059948
- 34. Boden SD, Goodenough DJ, Stockham CD, Jacobs E, Dina T, Allman RM. Precise measurement of vertebral bone density using computed tomography without the use of an external reference phantom. J Digit Imaging. 1989 Feb;2(1):31–8. pmid:2488150
- 35. Prado M, Khosla S, Chaput C, Giambini H. Opportunistic application of phantom-less calibration methods for fracture risk prediction using QCT/FEA. Eur Radiol. 2021 Dec;31(12):9428–35. pmid:34047849
- 36. Lee DC, Hoffmann PF, Kopperdahl DL, Keaveny TM. Phantomless calibration of CT scans for measurement of BMD and bone strength—Inter-operator reanalysis precision. Bone. 2017 Oct;103:325–33. pmid:28778598
- 37. Mueller DK, Kutscherenko A, Bartel H, Vlassenbroek A, Ourednicek P, Erckenbrecht J. Phantom-less QCT BMD system as screening tool for osteoporosis without additional radiation. Eur J Radiol. 2011 Sep;79(3):375–81. pmid:20223609
- 38. Eggermont F, Verdonschot N, van der Linden Y, Tanck E. Calibration with or without phantom for fracture risk prediction in cancer patients with femoral bone metastases using CT-based finite element models. PLoS One. 2019 Jul;14(7):e0220564. pmid:31361790
- 39. Ataei A, Eikhout J, Leeuwen RGH van, Tanck E, Eggermont F. The effect of variations in CT scan protocol on femoral finite element failure load assessment using phantomless calibration. PLoS One. 2022 Mar;17(3):e0265524. pmid:35303026
- 40. Winsor C, Li X, Qasim M, Henak CR, Pickhardt PJ, Ploeg H, et al. Evaluation of patient tissue selection methods for deriving equivalent density calibration for femoral bone quantitative CT analyses. Bone. 2021 Feb;143:115759. pmid:33212317
- 41. White DR, Wilson IJ, Griffith RV. Report 46. J Int Comm Radiat Units Meas. 2016;os24.
- 42.
X-Ray Mass Attenuation Coefficients. NIST [Internet]. 2009 Sep [cited 2024 May 16]; Available from: https://www.nist.gov/pml/x-ray-mass-attenuation-coefficients
- 43. Aldieri A, Curreli C, Szyszko JA, La Mattina AA, Viceconti M. Credibility assessment of computational models according to ASME V&V40: Application to the Bologna Biomechanical Computed Tomography solution. Comput Methods Programs Biomed. 2023 Oct;240:107727.
- 44.
Aldieri A, La Mattina AA, Szyszko JA, Baruffaldi F, Viceconti M. HFValid collection: Hip-Fracture validation collection [Internet]. University of Bologna; 2023 Jun [cited 2024 Jan 3]. Available from: https://amsacta.unibo.it/id/eprint/7277/
- 45. Kalender WA, Felsenberg D, Genant HK, Fischer M, Dequeker J, Reeve J. The European Spine Phantom—a tool for standardization and quality control in spinal bone mineral measurements by DXA and QCT. Eur J Radiol. 1995 Jul;20(2):83–92. pmid:7588873
- 46. Aldieri A, Biondi R, La Mattina AA, Szyszko JA, Polizzi S, Dall’Olio D, et al. Development and validation of a semi-automated and unsupervised method for femur segmentation from CT. Sci Rep. 2024 Mar;14(1):7403. pmid:38548805
- 47. Keyak JH, Lee IY, Skinner HB. Correlations between orthogonal mechanical properties and density of trabecular bone: Use of different densitometric measures. J Biomed Mater Res. 1994 Nov;28(11):1329–36. pmid:7829563
- 48. Aubrey J, Esfandiari N, Baracos VE, Buteau FA, Frenette J, Putman CT, et al. Measurement of skeletal muscle radiation attenuation and basis of its biological variation. Acta Physiol Oxf Engl. 2014 Mar;210(3):489–97. pmid:24393306
- 49. Feliciano EMC, Winkels RM, Meyerhardt JA, Prado CM, Afman LA, Caan BJ. Abdominal adipose tissue radiodensity is associated with survival after colorectal cancer. Am J Clin Nutr. 2021 Sep;114(6):1917–24. pmid:34510172
- 50. Monirujjaman M, Martin L, Stretch C, Mazurak VC. Adipose Tissue Radiodensity in Chronic Diseases: A Literature Review of the Applied Methodologies. Immunometabolism 202 Oct;3(4).
- 51. Brunnquell CL, Winsor C, Aaltonen HL, Telfer S. Sources of error in bone mineral density estimates from quantitative CT. Eur J Radiol. 2021 Nov 1;144:110001. pmid:34700093
- 52. Irsal M, Nurbaiti , Mukhtar AN, Jauhari A, Winarno G. Variation kVp and mAs on CT scan image quality using standard phantom. AIP Conf Proc. 2020 Jul;2296.