Figures
Abstract
The peculiarities of older individuals related to osteoporosis and hyperostosis may lead to a higher rate of misdiagnosis of rib fractures on computed tomography (CT) images in older than in middle-aged/young people when using radiologist-only reading. However, none of these studies on rib fracture computer-aided diagnostic (CAD) systems grouped patients by age or evaluated the value of using CAD in older patients. To address these gaps, we divided 1,012 blunt chest trauma emergency patients who underwent chest CT into middle-aged/young and older groups with a cutoff age of 60 years. CT images were read by six radiologists from three institutions (each with 7 years of experience in thoracic CT diagnosis) using two reading methods, radiologist-only and radiologist-CAD reading, to explore the value of a deep learning (DL)-based CAD system for detecting rib fractures in emergency older patients. The final findings of the independent panel consisting of two senior radiologists with or without a thoracic surgeon were set as the reference standard. The sensitivity was calculated by dividing the number of true positives by the overall number of fractures, as confirmed by an expert panel. The false positives per patient (FPPP) was calculated by dividing the number of false positives by the overall number of patients. Sensitivity and FPPP were used to evaluate the diagnostic efficiency. Sensitivity, FPPP, and reading time were compared between the two groups, as well as reading methods. The results showed the following: (1) Sensitivity for detecting fresh fractures using radiologist-only reading was lower in the older than in the middle-aged/young group (86.7% [95% confidence interval (CI): 86.3%, 91.7%] vs. 91.5% [95% CI: 89.9%, 92.9%], p < 0.05). With the assistance of the CAD system, the sensitivity increased in the older group to the same level as that in the middle-aged/young group using radiologist-only reading (92.5% [95% CI: 90.4%, 94.2%] vs. 91.5% [95% CI: 89.9%, 92.9%], p > 0.05). (2) The FPPP of fresh fractures with radiologist-only reading was higher in the older than in the middle-aged/young group (0.47 vs. 0.37, p < 0.05). With the assistance of the CAD system, the FPPP in the older group decreased to the same level as that in the middle-aged/young group when using radiologist-only or radiologist-CAD reading (0.37 vs. 0.37/0.39, p > 0.05). (3) The reading time of fresh fractures when using radiologist-only reading was longer in the older than in the middle-aged/young group (6.1 vs 5.4 min, p < 0.05). With the assistance of the CAD system, the reading time in the older group was reduced by approximately 36% (p < 0.05). We conclude that the efficiency of intermediate-level radiologists in diagnosing fresh rib fractures by radiologist-only reading in older emergency patients was lower than that in middle-aged/young patients. When a DL-based CAD system assists radiologists, the diagnostic efficiency of identifying fresh fractures in older patients improves to the same level as independent radiologist-only reading in middle-aged/young patients while reducing the reading time.
Citation: Xiong S, Wu W, Liu S, Cheng J, Wan B (2026) Computer-aided diagnosis system for thoracic computed tomography of rib fractures in older emergency patients: A preliminary study. PLoS One 21(6): e0351988. https://doi.org/10.1371/journal.pone.0351988
Editor: Christopher Njeh, Indiana University School of Medicine, UNITED STATES OF AMERICA
Received: November 24, 2023; Accepted: June 3, 2026; Published: June 17, 2026
Copyright: © 2026 Xiong et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: This work was supported by the Hubei Provincial Department of Education of China (Grant Number B2022032). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Rib fractures commonly occur in patients with blunt chest trauma [1]. While minor fractures have minimal health consequences, rib fractures can be life-threatening as the number of fractures increases and complications worsen [2]. Hence, accurate diagnosis of patients with rib fractures is essential. Multirow spiral thin-slice computed tomography (CT) with a slice thickness of < 2 mm is now the most sensitive method for detecting rib fractures [3]. Nevertheless, rib-by-rib and side-by-side comparisons of many thin-slice images obtained during the diagnostic process are time consuming and cumbersome for radiologists [3,4]. Moreover, the number of radiologists does not keep pace with the required examinations [5].
Furthermore, CT-based rib fracture diagnosis in older emergency patients has certain limitations. The prevalence of osteoporosis is higher in older than middle-aged/young people [6–8]. The decrease in bone density due to osteoporosis [9–12] may cause thinning of the rib cortex and reduction of bone trabeculae in CT images [9, 13–16], which may interfere with the radiologist’s observation of a lucent fracture line in the images. In contrast, the proportion of hyperostosis [17,18] is higher in older people than in middle-aged/young people. This finding primarily reflects degenerative spondylotic changes, where loss of disc height and altered biomechanics at the costovertebral junctions stimulate reactive subperiosteal bone formation to compensate for joint instability [19]. Decades of accumulated mechanical stress on these articulations further promotes aberrant ossification, leading to cortical thickening and irregularity [20]. Such rib cortical thickening and irregular morphology on CT images may interfere with radiologists’ diagnosis of fractures. These peculiarities may lead to a higher rate of misdiagnosis (including false negatives and positives) of rib fractures in older than in middle-aged/young people when using radiologist-only reading. Additionally, the inaccurate diagnosis of rib fractures may lead to legal disputes [21,22]. These issues may overwhelm radiologists who diagnose rib fractures using manual reading alone in older patients.
In recent years, several studies on computer-aided diagnostic (CAD) systems for rib fracture diagnosis based on deep learning (DL) have been published [23–31], as these systems can improve rib fracture diagnostic efficiency [32,33]. For example, some studies have reported [23] the use of a DL algorithm for detecting rib fractures, which achieved a sensitivity of 87.4%, with 0.16 false positives per examination in approximately 500 patients, suggesting that the potential clinical value of the DL algorithm may be of value in assisting radiologists in screening fractures and reducing missed diagnoses. Jin et al. [24] developed a DL model for rib fracture detection. They obtained a sensitivity of 92.9% and false positives per scan of 5.27 in a test cohort that included 900 patients, suggesting that the model would help radiologists achieve a higher sensitivity than that achieved by manual reading alone. Tan et al. [31] found that the sensitivity of rib fracture diagnosis by attending radiologists assisted by DL-based CAD increased from 94.56% to 95.67% compared with radiologist reading alone.
However, none of these studies on rib fracture CAD systems grouped patients by age or evaluated the value of using CAD in older patients. In addition to the peculiarities of older individuals related to osteoporosis and hyperostosis, some radiologists, especially intermediate-level radiologists who have recently assumed responsibility for reviewing reports in our institution, report that they have less confidence in CT-based rib fracture diagnosis in older individuals than in middle-aged/young people. Therefore, we hypothesized that the diagnostic efficiency of radiologists with a certain level of experience in performing independent reading of CT images for rib fractures in older individuals would be lower than that in middle-aged/young people. This study used sensitivity at the lesion level and false positives per patient (FPPP) to evaluate diagnostic efficiency. We first assessed the difference in diagnostic efficiency between two different age groups with a cutoff age of 60 years read independently by radiologists with 7 years of experience; second, we applied a DL-based CAD system to diagnosis in two different age groups to assess its application value in emergency older patients, if a difference in diagnostic efficiency was found between the two different age groups.
Materials and methods
Ethics statement
All procedures involving human participants were performed per the ethical standards of the Institutional Research Committee and the 1964 Helsinki Declaration. The authors/radiologists did not have access to information that could identify the individual participants during or after data collection. The Institutional Review Board of Jingzhou Hospital Affiliated to Yangtze University approved the study, and the need for informed consent was waived owing to its retrospective nature.
Dataset and classification criteria
Clinical and imaging data from 1,047 blunt chest trauma were selected using a picture archiving and communication system. These patients underwent emergency chest CT scans at Jingzhou Hospital Affiliated to Yangtze University, between January 2022 and December 2022. The inclusion criteria were as follows: age ≥ 20 years, chest trauma that occurred within 3 weeks, and a follow-up CT scan performed between 28 and 42 days (median time, 31 days) after chest trauma. The exclusion criteria were as follows: images disturbed by internal fixation artifacts (n = 20) or severe respiratory or motion artifacts (n = 15). Finally, 1,012 patients were included in the analysis (Table 1). The diagnoses of rib fractures in different age groups using different reading methods are summarized in Fig 1.
R1, R2, R3, R4, R5, and R6 represent radiologists 1, 2, 3, 4, 5, and 6, respectively.
In this study, we classified fresh fractures as either complete or incomplete. A fresh fracture was defined as a fracture occurring within 3 weeks of trauma, characterized by sharp margins and no periosteal reaction or callus formation [34]. Complete fracture was defined as corresponding inner and outer cortex disruption. An incomplete fracture was described as a disruption in the inner or outer cortex, such as a buckle rib fracture, characterized by a kink-like inner cortical disruption and a smooth outer cortex [35,36].
Computed tomography and image processing
Images were acquired using a GE Optima CT660 scanner (GE Healthcare, Milwaukee, WI, USA) and a Philips Incisive CT scanner (Philips Healthcare, Cleveland, OH, USA). The standardized scanning parameters were as follows: tube voltage, 120 kV; tube current, automated dose modulation (1. ASIR, GE Optima CT660, and 2. iDose4, Philips Incisive CT); matrix size, 512 × 512; pitch, 0.516 (GE Optima CT660) or 0.85 (Philips). Bone reconstruction was performed on all images; the reconstruction slice thickness was 1.25 mm. The scanning range was from the root of the neck to the lower edge of the 12th rib.
CAD system
Our CAD system (Care.Ai Release 2023.1, Deep Wise Healthcare, Beijing, China) used the DL algorithm (S1 File) and included three main models: rib counting, fracture detection, and fracture type classification (Fig 2). No scans from the current dataset for the 8:1:1 training, validation, and testing of the DL algorithm were included. The diagnostic efficiency of the internal test dataset was 92.8% sensitivity for fresh rib fractures at the lesion level with a FPPP of 1.25. Furthermore, the model had a sensitivity of 85.1% in distinguishing between incomplete and complete fractures.
Model 1: ResNet-34 was used to distinguish a nonfresh fracture from a fresh fracture. Model 2: ResNet-34 was used to determine an incomplete fracture from a complete fracture among fresh fractures.
Image interpretation
Six radiologists, each with 7 years of experience in thoracic CT diagnosis from three institutions, were blinded to the patient’s age, sex, and diagnostic efficiency of the CAD system in the internal test dataset. They used various viewing techniques—including axial view, multiplanar reconstruction, curved planar reconstruction, and volume rendering—to diagnose and categorize fresh rib fractures. They recorded the fresh rib fracture locations, types (complete/incomplete fractures), and reading time for each reading in a structured spreadsheet. Three rounds of reading were conducted (Fig 1). In the first round, to assess consistency in rib fracture diagnosis among the six radiologists, each radiologist independently read the CT images of all 1,012 patients. A 1-month washout interval was implemented between the rounds to minimize recall bias. Using R software (version 3.5.2), the 1,012 patients were then stratified by age (668 middle-aged/young vs. 344 older patients) and randomly allocated into six balanced subgroups (168–169 patients per group), with each maintaining the representative age ratio (approximately 1.94:1). In the second round, each of these six subgroups were assigned to each of the six radiologists for independent reading (defined as radiologist-only reading). In the third round, the radiologists were assigned subgroups identical to those in the second round and read the CT images jointly with the DL-based CAD system (defined as radiologist-CAD reading). In this round, each radiologist reviewed the CT images with each data point (location and fracture type) marked by the CAD system (Fig 3). Subsequently, they determine whether to accept or reject each data point. Following three rounds of reading, the reason for each misdiagnosis using the radiologist-only reading was determined by the same intermediate-level radiologist who performed the reading and by consultation with two senior radiologists.
(A) In the third round, each fresh fracture location was automatically identified and marked with a rectangular box by the CAD system. The fracture type was provided to the radiologist for reference. (B) The fracture location was partially magnified.
Reference standard for rib fractures
The initial and follow-up CT images were independently read by two senior radiologists, each with 15 years of experience in thoracic CT diagnosis, who represented the reference standard, including the locations and types of fresh rib fractures. The expert panel asked a thoracic surgeon to participate if the findings were ambiguous. The findings of the expert panel’s final discussions were used as reference standards [37].
Statistical analysis
An independent data expert not involved in the readings counted the results in three rounds according to the reference standards. A fracture corresponding to the reference standard’s location and type was considered a true positive. A fracture that did not align with the reference standard in either location or type was considered a false positive. A fracture confirmed by the reference standard but regarded as normal was a false negative. Owing to the infinite number of normal positions in one rib, assessing the number of true negatives was impossible. Sensitivity was calculated by dividing the number of true positives by the overall number of fractures, as confirmed by an expert panel. In addition, the FPPP was calculated by dividing the number of false positives by the overall number of patients [26]. The primary endpoint was the sensitivity of radiologist-only reading for fresh fractures in the two different age groups and radiologist-CAD reading in the older group.
The R software version 3.5.2 (R Foundation for Statistical Computing, Vienna, Austria) was used to process the data. The χ2 test was used to compare the sex distribution, sensitivity, and FPPP between two age groups. McNemar’s test was used to compare the sensitivity and FPPP of the two reading methods. Kendall’s coefficient of concordance was used to assess the consistency of diagnostic findings among the six radiologists. An independent-sample t-test was used to compare reading time between the two age groups. A paired-sample t-test was used to compare the reading times of the two reading methods. Statistical significance was defined as a p-value of 0.05.
Results
Consistency in diagnosis of fresh fractures among six radiologists
Sensitivities for fresh fracture diagnosis among the six radiologists ranged from 87.9% to 88.2%. FPPPs for fresh fracture diagnosis among the six radiologists ranged from 0.42 to 0.44 (Table 1 in S2 File). Kendall’s W analysis revealed significant inter-observer consistency, with high agreement for sensitivities (W = 0.834, p < 0.001) and FPPPs (W = 0.855, p < 0.001).
Number of true fractures in two different age groups
Each patient was represented by a single scan in this study. Overall, 1012 patients had 2,085 fresh fractures diagnosed per the reference standard (Table 2). Each of 163 patients had no fresh fracture. Each of 278 patients had one fresh fracture. Each of 571 patients had two or more fresh fractures.
Comparison of sensitivity using two reading methods in two different age groups
The sensitivity of radiologist-only reading method for fresh fractures in the older group was significantly lower than that of the middle-aged/young group (86.7% vs. 91.5%, p = 0.001). With CAD assistance, the sensitivity in the older group increased to 92.5% (p = 0.001), which did not differ significantly from the radiologist-only sensitivity in the middle-aged/young group (92.5% vs. 91.5%, p = 0.417; Table 3). The results of true positive detection in the two different age groups are shown in Tables 2 and 3 of S2 File.
Comparison of sensitivity for diagnosing each fracture type using two reading methods in two different age groups
The sensitivities of radiologist-only method for complete and incomplete fractures in the older group were significantly lower than those in the middle-aged/young group (89.3% vs. 92.7%, p = 0.024 for complete fractures; 80.8% vs. 87.8%, p = 0.023 for incomplete fractures). With CAD assistance, the sensitivities in the older group increased to 93.5% (p = 0.003) and 90.6% (p = 0.001), respectively, which did not differ significantly from the sensitivities of radiologist-only method in the middle-aged/young group (93.5% vs. 92.7%, p = 0.558; 90.6% vs. 87.8%, p = 0.304, respectively; Table 4). The increase in sensitivity was greater for incomplete fractures than for complete fractures (9.8% vs. 4.2%, p = 0.004; Fig 4 and Table 4).
(A) A 63-year-old male with a car accident injury showed only a cortical fold of the bone in the right sixth anterior rib. (B) The incomplete fracture (rectangular box marked) was missed in the radiologist-only reading, whereas it was accurately diagnosed in the radiologist-CAD reading. (C) Four weeks later, a follow-up computed tomography review showed the formation of a callus at the location, confirmed as an incomplete fracture by the panel.
Comparison of FPPP using two reading methods in two different age groups
The FPPP of radiologist-only reading method for fresh fractures in the older group was significantly higher than that of the middle-aged/young group (0.47 vs. 0.37, p = 0.003). With CAD assistance, the FPPP in the older group decreased to 0.37 (p = 0.001), which did not differ significantly from the radiologist-only FPPP in the middle-aged/young group (0.37 vs. 0.37, p = 0.948; Table 5). The results of false positive detection in the two different age groups are shown in Tables 4 and 5 of S2 File. The reasons for misdiagnosis, including false negatives and false positives, with radiologist-only reading in the two different age groups are shown in Fig 5.
Group 1 and Group 2 represent the middle-aged/young group and the older group, respectively.
Comparison of the reading time using two reading methods in two different age groups
The reading time of radiologist-only method for fresh fractures in the older group was significantly longer than that in the middle-aged/young group (6.1 min vs. 5.4 min, p < 0.001). With CAD assistance, the reading time in the older group decreased significantly compared with that of radiologist-only reading (3.9 min vs. 6.1 min, p < 0.001), corresponding to a reduction of approximately 36% (Table 6 and Fig 6).
Group 1 and Group 2 represent the middle-aged/young group and the older group, respectively. * p < 0.05.
Discussion
Recently, DL-based CAD systems have been used to diagnose rib fractures. Meng et al. [25] detected and classified rib fractures in 300 CT images and found that radiologists assisted by a DL model obtained a higher F1-score (0.960) and recall rate (0.978) than those who read independently. Kaiume et al. [26] stated that their DL-based CAD system for CT diagnosis yielded a sensitivity of 64.5% for identifying rib fractures, which was higher than reading the two interns alone. However, CT-based rib fracture diagnosis in older emergency patients has certain limitations. The CT imaging manifestations in older patients, which may include thinning of the rib cortex, reduction of bone trabeculae, rib cortical thickening or irregular morphology, due to the higher proportion of osteoporosis [6–8] and hyperostosis [17,18] in this age group, may hamper radiologists’ diagnosis of rib fractures and lead to a higher rate of misdiagnosis. In this study, using the radiologist-only reading, misdiagnosis occurred in both middle-aged/young and older groups. For instance, some false positives due to anatomical variations (e.g., costochondral junctions) could be mistaken for fractures. Additionally, minor, nondisplaced fractures in areas of high complexity, such as the posterior ribs, could be missed as false negatives. Therefore, we used a DL-based CAD system for the diagnosis of rib fractures. We found that the fresh rib fracture diagnostic efficiency in the older group using the radiologist-only reading was lower than that in the middle-aged/young group and that with CAD assistance, the diagnostic efficiency could be optimized to the same level as that achieved in the middle-aged/young group when using the radiologist-only reading, with reduced reading time.
Our study indicated that radiologist-only reading for fresh fractures yielded lower sensitivity in the older group than in the middle-aged/young group. This may be due to the imaging manifestation characteristics of CT in the older group due to osteoporosis and hyperostosis, which may have interfered with the radiologists’ diagnosis of rib fractures. Furthermore, in recent years, some reports have indicated that the diagnostic efficiency of CT for rib fractures can be improved when CAD systems assist radiologists. For example, Jin et al. [24] developed a DL-based CAD system for detecting rib fractures in a test cohort of 120 patients. They achieved a higher detection sensitivity (93.4% and 94.4% for two radiologists, respectively) when using a collaborative human-computer reading approach than when using a computer or human reading only. Zhang et al. [27] indicated that using a DL model to assist two radiologists in diagnosing rib fractures in 198 chest CT images achieved higher sensitivity (88.9% and 88.7% for the two radiologists, respectively) than when the radiologists read independently. However, these studies have yet to consider the potential impact of age on fracture diagnostic efficiency or explore the value of CAD in diagnosing rib fractures in different age groups. We utilized the CAD system in two different age groups. We found that the sensitivity of diagnosis of fresh fractures when using the radiologist-CAD reading in the older group could be improved to the same level as that of the radiologist-only reading in the middle-aged/young groups. This suggests that using the DL-based rib fracture CAD system to assist radiologists in reading CT images of older patients may help reduce missed diagnoses of fresh fractures.
For each fracture type, this study indicated that using radiologist-only reading for incomplete and complete fractures in the older group exhibited lower sensitivity than in the middle-aged/young group. This suggests that the reduction in fresh rib fracture diagnostic sensitivity when using radiologist-only reading in the older group may be specific to missed diagnoses of incomplete or complete fractures. Furthermore, Zhou et al. [28] indicated that rib fractures could be classified into three categories (fresh, healing, and old). They reported that DL model II had a higher sensitivity for diagnosing fresh and healing fractures (89.6% and 97.6%, respectively) than five radiologists with 7–9 years of CT diagnostic experience. Inspired by their report, we used the CAD system in two age groups. We classified fresh fractures in emergency patients with blunt chest trauma that occurred within 3 weeks into incomplete and complete fractures. Our results indicate that with the assistance of the CAD system, the sensitivity of both incomplete and complete fractures in the older group increased to the same level as the radiologist-only or radiologist-CAD reading in the middle-aged/young group, in which the increase in incomplete fractures was 9.8%, which was higher than the 4.2% increase in complete fractures. Using the DL-based rib fracture CAD system to assist radiologists in reading CT images in an older patient group could help reduce missed diagnoses of incomplete and complete fractures. It may be beneficial in decreasing missed diagnoses of incomplete fractures.
Regarding FPPP, this study indicated that using radiologist-only reading to detect fresh fractures in the older group exhibited a higher FPPP than the middle-aged/young group. Zhang et al. [27] indicated that using a DL model to assist two radiologists in diagnosing rib fractures in 198 chest CT images achieved a lower false positives per scan of 0.13 for one of the radiologists than when the radiologists read independently. Castro–Zunti et al. [29] found that a DL model exhibited fewer false positive diagnoses than radiologists alone in detecting old fractures and normal ribs. We utilized the CAD system in two different age groups, and the results indicated that, with the assistance of the CAD system, the FPPP of fresh fractures in the older group decreased to the same level as that achieved with radiologist-only or radiologist-CAD reading in the middle-aged/young group. This suggests that the CAD system can help radiologists reduce the false positives of fresh fractures in older patients.
Regarding reading time, Liu et al. [38] reported that junior radiologists could reduce their reading time by 28% with the assistance of DL-based rib fracture artificial intelligence software. Sun et al. [39] reported that radiologists with less experience could reduce their reading time by 47%. In contrast, senior radiologists could reduce their reading time by 9% when assisted by a DL-based rib fracture CAD system. The results of our study are generally consistent with these reports, showing that radiologists’ reading time for images of an older patient group was longer than that of the middle-aged/young group. Moreover, we showed that CAD could assist radiologists in reducing the reading time in the older group by approximately 36%.
The CAD system has important implications when utilized in emergency settings, where radiologists are often overloaded with trauma cases. The CAD system can serve as a second reader, reducing missed diagnoses and false positives, and improving the speed of patient care, especially for older high-risk patients. Saved time with faster diagnosis of rib fractures may lead to quicker intervention, reducing complications such as pneumonia or respiratory failure, and potentially improving the prognosis of emergency patients.
However, to ensure a balanced discussion, it is important to acknowledge that CAD may mislead radiologists in certain scenarios. As shown in Tables 2–5 of S2 File, CAD may mislead radiologists, resulting in both false positives (misclassifying nonfracture structures) and false negatives (overlooking fractures). These errors may stem from model “hallucinations” or training data biases, highlighting the need for radiologists to maintain critical thinking and utilize CAD as an auxiliary tool rather than as definitive diagnostic basis.
Currently, the development of the CAD system faces several challenges, including ethical concerns, complex rib anatomy, the subtle nature of some fractures, and the need for high-quality imaging data. Variability in CT scan protocols and patient positioning can affect the accuracy of fracture detection. The CAD system should be validated on a diverse dataset to ensure its generalizability to different patient populations. Despite the challenges, we hypothesize that advanced CAD systems can accurately diagnose not only rib fractures but also fractures in other parts of the thorax (e.g., thoracic vertebral fractures) and incorporate clinical data such as patient age, history of osteoporosis, and other comorbidities to provide a comprehensive risk assessment. Also, further development of the CAD system can be done to recognize fracture healing stages and assess the potential for complications (e.g., rib displacement affecting lung function).
This study has some limitations. First, the six intermediate-level radiologists focused solely on diagnosing rib fractures, which is inconsistent with clinical practice. Second, we only included six radiologists with the same experience level, each possessing 7 years of experience, as readers to diagnose rib fractures from CT images. Further increase of radiologist number and inclusion of those with different experience levels would contribute to richer and more objective results. Third, it was difficult to quantify the contribution of the cumulative learning experience from repeated readings.
Conclusions
In summary, intermediate-level radiologists’ efficiency in diagnosing fresh rib fractures by radiologist-only reading was lower in older emergency patients than in middle-aged/young patients. Moreover, the potential value of using a DL-based CAD system to assist radiologists in diagnosing rib fractures in older emergency patients may be higher than in middle-aged/young patients.
Supporting information
S2 File. Sensitivity and false positives per patient (FPPP) of six radiologists for fresh fracture diagnosis, and true positive and false positive detection results in the middle-aged/young and older groups.
https://doi.org/10.1371/journal.pone.0351988.s002
(DOCX)
Acknowledgments
We would like to thank Editage (www.editage.cn) for English language editing. We also gratefully acknowledge the six intermediate-level radiologists including Hai Hu and Xiaohua Zhou from Jingzhou Hospital Affiliated to Yangtze University, Xue Sun and Bo Chen from the Second Affiliated Hospital and Yuying Children’s Hospital of Wenzhou Medical University, and Jia Ren and Li Zhu from Affiliated Renhe Hospital of China Three Gorges University for reading the CT images.
References
- 1. Peek J, Ochen Y, Saillant N, Groenwold RHH, Leenen LPH, Uribe-Leitz T, et al. Traumatic rib fractures: A marker of severe injury. A nationwide study using the National Trauma Data Bank. Trauma Surg Acute Care Open. 2020;5(1):e000441. pmid:32550267
- 2. Talbot BS, Gange CP, Chaturvedi A, Klionsky N, Hobbs SK, Chaturvedi A. Traumatic rib injury: Patterns, imaging pitfalls, complications, and treatment. Radiographics. 2017;37(2):628–51.
- 3. Ringl H, Lazar M, Töpker M, Woitek R, Prosch H, Asenbaum U, et al. The ribs unfolded - a CT visualization algorithm for fast detection of rib fractures: Effect on sensitivity and specificity in trauma patients. Eur Radiol. 2015;25(7):1865–74. pmid:25680714
- 4. Urbaneja A, De Verbizier J, Formery A-S, Tobon-Gomez C, Nace L, Blum A, et al. Automatic rib cage unfolding with CT cylindrical projection reformat in polytraumatized patients for rib fracture detection and characterization: Feasibility and clinical application. Eur J Radiol. 2019;110:121–7. pmid:30599847
- 5.
Radiologists RCo. Clinical radiology UK workforce census 2015 report. London: The Royal College of Radiologists. 2016.
- 6. Schreiber JJ, Anderson PA, Rosas HG, Buchholz AL, Au AG. Hounsfield units for assessing bone mineral density and strength: A tool for osteoporosis management. J Bone Joint Surg Am. 2011;93(11):1057–63. pmid:21655899
- 7. Sebro R, De la Garza-Ramos C. Machine learning for opportunistic screening for osteoporosis from CT scans of the wrist and forearm. Diagnostics (Basel). 2022;12(3):691. pmid:35328244
- 8. Lee S, Chung CK, Oh SH, Park SB. Correlation between bone mineral density measured by dual-energy X-ray absorptiometry and hounsfield units measured by diagnostic CT in lumbar spine. J Korean Neurosurg Soc. 2013;54(5):384–9. pmid:24379944
- 9. Cheng X, Yuan H, Cheng J, Weng X, Xu H, Gao J, et al. Chinese expert consensus on the diagnosis of osteoporosis by imaging and bone mineral density. Quant Imaging Med Surg. 2020;10(10):2066–77. pmid:33014734
- 10. Prins JTH, Van Lieshout EMM, Reijnders MRL, Verhofstad MHJ, Wijffels MME. Rib fractures after blunt thoracic trauma in patients with normal versus diminished bone mineral density: A retrospective cohort study. Osteoporos Int. 2020;31(2):225–31. pmid:31828365
- 11. Jang M, Kim M, Bae SJ, Lee SH, Koh J-M, Kim N. Opportunistic osteoporosis screening using chest radiographs with deep learning: Development and external validation with a cohort dataset. J Bone Miner Res. 2022;37(2):369–77. pmid:34812546
- 12. Link TM. Osteoporosis imaging: State of the art and advanced imaging. Radiology. 2012;263(1):3–17. pmid:22438439
- 13. Gausden EB, Nwachukwu BU, Schreiber JJ, Lorich DG, Lane JM. Opportunistic use of CT imaging for osteoporosis screening and bone density assessment: A qualitative systematic review. J Bone Joint Surg Am. 2017;99(18):1580–90. pmid:28926388
- 14. Holcombe SA, Hwang E, Derstine BA, Wang SC. Measuring rib cortical bone thickness and cross section from CT. Med Image Anal. 2018;49:27–34. pmid:30031288
- 15. Martel D, Monga A, Chang G. Osteoporosis Imaging. Radiol Clin North Am. 2022;60(4):537–45.
- 16. Smith AD. Screening of bone density at CT: An overlooked opportunity. Radiology. 2019;291(2):368–9. pmid:30917293
- 17. Wang C, Tian F, Zhou Y, He W, Cai Z. The incidence of cervical spondylosis decreases with aging in the elderly, and increases with aging in the young and adult population: A hospital-based clinical analysis. Clin Interv Aging. 2016;11:47–53. pmid:26834465
- 18.
Yuan J, Zhou X, Cheng Q, Wang X, Feng T, Zhan H. Analysis of photoacoustic signals of hyperosteogeny and osteoporosis. Photons Plus Ultrasound: Imaging and Sensing 2019, 2019. 189. https://doi.org/10.1117/12.2506671
- 19. Resnick D. Degenerative diseases of the vertebral column. Radiology. 1985;156(1):3–14. pmid:3923556
- 20. Hunter DJ, Bierma-Zeinstra S. Osteoarthritis. Lancet. 2019;393(10182):1745–59.
- 21. Pinto A, Berritto D, Russo A, Riccitiello F, Caruso M, Belfiore MP, et al. Traumatic fractures in adults: Missed diagnosis on plain radiographs in the Emergency Department. Acta Biomed. 2018;89(1-S):111–23. pmid:29350641
- 22. Kolopp M, Douis N, Urbaneja A, Baumann C, Gondim Teixeira PA, Blum A, et al. Automatic rib unfolding in postmortem computed tomography: Diagnostic evaluation of the OpenRib software compared with the autopsy in the detection of rib fractures. Int J Legal Med. 2020;134(1):339–46. pmid:31734725
- 23. Weikert T, Noordtzij LA, Bremerich J, Stieltjes B, Parmar V, Cyriac J, et al. Assessment of a deep learning algorithm for the detection of rib fractures on whole-body trauma computed tomography. Korean J Radiol. 2020;21(7):891–9. pmid:32524789
- 24. Jin L, Yang J, Kuang K, Ni B, Gao Y, Sun Y, et al. Deep-learning-assisted detection and segmentation of rib fractures from CT scans: Development and validation of FracNet. EBioMedicine. 2020;62:103106. pmid:33186809
- 25. Meng XH, Wu DJ, Wang Z, Ma XL, Dong XM, Liu AE, et al. A fully automated rib fracture detection system on chest CT images and its impact on radiologist performance. Skeletal Radiol. 2021;50(9):1821–8. pmid:33599801
- 26. Kaiume M, Suzuki S, Yasaka K, Sugawara H, Shen Y, Katada Y, et al. Rib fracture detection in computed tomography images using deep convolutional neural networks. Medicine (Baltimore). 2021;100(20):e26024. pmid:34011107
- 27. Zhang B, Jia C, Wu R, Lv B, Li B, Li F, et al. Improving rib fracture detection accuracy and reading efficiency with deep learning-based detection software: A clinical evaluation. Br J Radiol. 2021;94(1118):20200870. pmid:33332979
- 28. Zhou Q-Q, Hu Z-C, Tang W, Xia Z-Y, Wang J, Zhang R, et al. Precise anatomical localization and classification of rib fractures on CT using a convolutional neural network. Clin Imaging. 2022;81:24–32. pmid:34598000
- 29. Castro-Zunti R, Chae KJ, Choi Y, Jin GY, Ko S-B. Assessing the speed-accuracy trade-offs of popular convolutional neural networks for single-crop rib fracture classification. Comput Med Imaging Graph. 2021;91:101937. pmid:34087611
- 30. Yang C, Wang J, Xu J, Huang C, Liu F, Sun W, et al. Development and assessment of deep learning system for the location and classification of rib fractures via computed tomography. Eur J Radiol. 2022;154:110434. pmid:35797792
- 31. Tan H, Xu H, Yu N, Yu Y, Duan H, Fan Q, et al. The value of deep learning-based computer aided diagnostic system in improving diagnostic performance of rib fractures in acute blunt trauma. BMC Med Imaging. 2023;23(1):55. pmid:37055752
- 32. Topol EJ. High-performance medicine: The convergence of human and artificial intelligence. Nat Med. 2019;25(1):44–56. pmid:30617339
- 33. Guermazi A, Tannoury C, Kompel AJ, Murakami AM, Ducarouge A, Gillibert A. Improving radiographic fracture recognition performance and efficiency using artificial intelligence. Radiology. 2022;302(3):627–36.
- 34. Zura R, Xu ZJ, Della Rocca GJ, Mehta S, Steen RG. When Is a fracture not “Fresh”? Aligning reimbursement with patient outcome after treatment with low-intensity pulsed ultrasound. J Orthop Trauma. 2017;31(5):248–51. pmid:28134628
- 35. Yang K-M, Lynch M, O’Donnell C. “Buckle” rib fracture: An artifact following cardio-pulmonary resuscitation detected on postmortem CT. Leg Med (Tokyo). 2011;13(5):233–9. pmid:21795096
- 36. Schulze C, Hoppe H, Schweitzer W, Schwendener N, Grabherr S, Jackowski C. Rib fractures at postmortem computed tomography (PMCT) validated against the autopsy. Forensic Sci Int. 2013;233(1–3):90–8. pmid:24314506
- 37. Zhou QQ, Wang J, Tang W, Hu ZC, Xia ZY, Li XS, et al. Automatic detection and classification of rib fractures on thoracic CT using convolutional neural network: accuracy and feasibility. Korean J Radiol. 2020;21(7):869–79. pmid:32524787
- 38. Liu X, Wu D, Xie H, Xu Y, Liu L, Tao X, et al. Clinical evaluation of AI software for rib fracture detection and its impact on junior radiologist performance. Acta Radiol. 2022;63(11):1535–45. pmid:34617809
- 39. Hongbiao S, Shaochun X, Xiang W, YuRun T, Yang L, Mingzi Z, et al. Comparison and verification of two deep learning models for the detection of chest CT rib fractures. Acta Radiol. 2023;64(2):542–51. pmid:35300519