Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The impact of deep learning reconstruction in low dose computed tomography on the evaluation of interstitial lung disease

  • Chu hyun Kim,

    Roles Conceptualization, Data curation, Formal analysis, Project administration, Writing – original draft, Writing – review & editing

    Affiliations Center for Health Promotion, Samsung Medical Center, Seoul, Republic of Korea, Department of Radiology and AI Research Center, Samsung Medical Center, Sungkyunkwan University, Seoul, Korea

  • Myung Jin Chung ,

    Roles Conceptualization, Project administration, Resources, Software

    mj1.chung@samsung.com

    Affiliations Department of Radiology and AI Research Center, Samsung Medical Center, Sungkyunkwan University, Seoul, Korea, Department of Data Convergence and Future Medicine, Sungkyunkwan University School of Medicine, Seoul, Korea

  • Yoon Ki Cha,

    Roles Supervision, Validation

    Affiliation Department of Radiology and AI Research Center, Samsung Medical Center, Sungkyunkwan University, Seoul, Korea

  • Seok Oh,

    Roles Data curation, Formal analysis

    Affiliation Gil Medical Center, Department of Biomedical Engineering, Gachon University College of Medicine, Incheon, Korea

  • Kwang gi Kim,

    Roles Conceptualization, Project administration, Resources, Supervision, Validation

    Affiliation Gil Medical Center, Department of Biomedical Engineering, Gachon University College of Medicine, Incheon, Korea

  • Hongseok Yoo

    Roles Conceptualization, Supervision, Validation

    Affiliation Division of Pulmonary and Critical Care Medicine, Samsung Medical Center, School of Medicine, Sungkyunkwan University, Seoul, South Korea

Abstract

To evaluate the effect of the deep learning model reconstruction (DLM) method in terms of image quality and diagnostic agreement in low-dose computed tomography (LDCT) for interstitial lung disease (ILD), 193 patients who underwent LDCT for suspected ILD were retrospectively reviewed. Datasets were reconstructed using filtered back projection (FBP), adaptive statistical iterative reconstruction Veo (ASiR-V), and DLM. For image quality analysis, the signal, noise, signal-to-noise ratio (SNR), blind/referenceless image spatial quality evaluator (BRISQUE), and visual scoring were evaluated. Also, CT patterns of usual interstitial pneumonia (UIP) were classified according to the 2022 idiopathic pulmonary fibrosis (IPF) diagnostic criteria. The differences between CT images subjected to FBP, ASiR-V 30%, and DLM were evaluated. The image noise and BRISQUE scores of DLM images was lower and SNR was higher than that of the ASiR-V and FBP images (ASiR-V vs. DLM, p < 0.001 and FBP vs. DLR-M, p < 0.001, respectively). The agreement of the diagnostic categorization of IPF between the three reconstruction methods was almost perfect (κ  =  0.992, CI 0.990–0.994). Image quality was improved with DLM compared to ASiR-V and FBP.

Introduction

A precise diagnosis of interstitial lung disease (ILD) is crucial for selecting appropriate treatment candidates [1, 2]. High spatial resolution computed tomography (CT) with thin sections is ideal for evaluating ILD [3, 4]; however, due to concerns about the malignancy risk associated with cumulative exposure to radiation, low-dose chest CT (LDCT) is the preferred modality for conducting follow-up examinations and determining disease progression. However, reducing the radiation dose compromises image quality, which may affect the detection of subtle parenchymal changes [5, 6] and thus, further improvements are needed.

To overcome the shortcomings of LDCT, image reconstruction methods after image acquisition can be used to reduce noise, thereby improving image quality and potentially enhancing the diagnostic value [7]. Iterative reconstruction (IR) algorithms are the most widely used CT noise-reduction methods as an alternative to the conventional reconstruction mode based on filtered back projection (FBP) [810]. However, IR algorithms are typically nonlinear and task specific, which may modify the spatial resolution and image noise texture [11].

Deep learning technology has been reported to exhibit excellent performance in various fields of medical imaging [10, 1215]. Deep learning model reconstruction (DLM) has strength in that it does not require simplification of parameters and can handle millions of parameters [16]. A few studies have evaluated image quality and noise in DLM [10, 1315, 17]; however, to the best of our knowledge, none has specifically investigated its diagnostic impact on ILD.

This study evaluated the effect of the DLM method in terms of quantitative and qualitative image quality and diagnostic impact of DLM in terms of classifying CT pattern in ILD.

Materials and methods

Patients

The institutional review board (IRB) of Samsung Medical Center approved this retrospective study (IRB, file number 2021-06-092), and the requirement of informed consent was waived for the use of patient medical data. In total, 369 consecutive patients were included between August 2021 and September 2021. A CT scan was performed because of clinical or radiological suspicion of ILD on chest radiography. All patients underwent routine CT evaluation for ILD, which included helical LDCT and standard-dose non-helical high-resolution CT (HRCT). A total of 170 patients without imaging findings suggestive of ILD (for example, non-dependent ground-glass opacity or reticular abnormalities, non-emphysematous cysts, honeycombing, and traction bronchiectasis) [18] were excluded. Six patients were excluded due to suboptimal image quality. Finally, 193 patients were included in this study.

CT acquisition and image reconstruction

All CT images were obtained using a multidetector CT scanner (Revolution Frontier, GE Healthcare) under the LDCT protocol without the use of contrast. The protocols consisted of a fixed tube current of 20 mAs per slice (40 mA with a half-second rotation and 0.984 pitch). Slice thickness of 1.25 mm and high-spatial-frequency algorithm were applied. The chest CT protocol used the helical mode with the following parameter: 1.25 mm × 64 detector configuration. The other parameters were as follows: peak tube voltage of 120 kVp, 40-mm table feed per gantry rotation, pitch of 0.984:1, and z-axis tube current modulation. Protocol of LDCT was appropriate according to the guideline of the Korean Society of Thoracic Radiology [19].

Supine inspiratory HRCT scans of all patients were also obtained without intravenous contrast using the same CT scanner for the reference standard. The protocol consisted of sections reconstructed with a high-spatial-frequency algorithm at 1- or 2-cm intervals under automatic exposure control (142–275 mA with dose modulation) with a slice thickness of 1.25 mm, from apex to base.

Three different reconstructions of the LDCT images were obtained: conventional FBP, adaptive statistical iterative reconstruction-Veo at a level of 30% (ASiR-V), and DLM. Images acquired were reconstructed using a FBP algorithm with a sharp convolution kernel and IR algorithms available with vendor’s CT scanner for analysis (ASIR; GE Healthcare). For deep learning model reconstruction, FBP images were used and denoised with a dedicated software package (ClariCT.AI, ClariPi). In this study, ASiR-V with blending ratios of 30% was adopted as a suitable compromise between denoising power and sharpness preservation [9]. And for DLM, since ClariCT.AI provided two tuning parameters; noise blending between 0.0 and 1.0, and edge blending between 0.0 and 1.0, after a pilot visual evaluation, we selected a noise blending value of 0.0 and the edge blending value was set to 0.0. The criteria was two readers’ judgement regarding the best balance between image noise and sharpness. All scan data were directly displayed on the picture archiving and communication systems (PACS) (Centricity 3.0, GE Healthcare) workstation monitors. Images were viewed on monitors in lung (width, 1500 HU; level, -700 HU) window settings. To assess radiation exposure in LDCT, we reviewed the CT dose index (CTDIvol) and dose-length product (DLP) recorded as digital imaging and communications in medicine data. The estimated effective dose was calculated as the DLP multiplied by a k-factor of 0.014 mSv·mGy–1·cm–1 [20].

Deep learning reconstruction model

The deep learning model reconstruction (DLM; ClariCT.AI, ClariPi) [13] was developed as a denoising solution using a U-Net-based convolutional neural network. All methods were developed by ClariPi and ClariCT.AI is a commercial product of ClariPi. Contribution of authors was to determine its adequate denoising strength of LDCT for use in ILD assessment. Details are summarized in S1 and S2 Figs in S1 File.

Image quality analysis

The performance of the image reconstruction methods was evaluated both quantitatively and qualitatively for each case, reconstructed using three different methods (FBP, ASiR-V, and DLM). Quantitative analysis was performed using signal, noise, signal-to-noise ratio (SNR), and blind/referenceless image spatial quality evaluator (BRISQUE) score. For qualitative analysis, a thoracic radiologist visually scored the images.

Signal, noise, and signal-to noise ratio.

Standardized 20-mm-diameter circular regions of interest were used to record signal and noise, which represented the mean pixel intensity value and standard deviation of pixels for the lung parenchyma, and background air for LDCT scans in FBP, ASiR-V, and DLM image sets [21]. Lung measurements were obtained from the lower lobes towards the periphery to avoid parenchymal lesions. Background air was obtained from the air external and anterior to the patient at the sternomanubrial junction [22]. The signal-to-noise ratio (SNR) was calculated for all three image sets. SNR was calculated as follows [23]:

BRISQUE.

BRISQUE is a no-reference image quality assessment model that uses natural scene statistics in the spatial domain [24]. This model is composed of three steps: 1) extraction of natural scene statistics, 2) calculation of feature vectors, and 3) prediction of the image quality score. We utilized a pre-trained prediction model provided by Mittal et al. [24] to predict the image quality score. The minimum and maximum image scores are 0 and 100, respectively, with a lower image score indicating a better image quality. The potential of BRISQUE as an indicator of medical image quality has been reported previously [2426].

Visual scoring.

Two thoracic radiologists (reader 1 had 6 years of experience; reader 2 had 15 years of experience) independently assessed CT images and performed a qualitative image analysis on a chest CT scan. The radiologists were blinded to the patients’ data and the image reconstruction techniques and examined the images in a random order using PACS. CT scans were graded on axial images with datasets displayed on standard windows, and windowing was allowed, as in routine reporting conditions. The readers randomly assessed the subjective image contrast, noise, and image quality using a five-point visual scoring system, and score of two readers were subsequently averaged [27] (Table 1).

thumbnail
Table 1. Visual scoring system used to evaluate image quality.

https://doi.org/10.1371/journal.pone.0291745.t001

Evaluating diagnostic agreement on CT pattern diagnosis of ILD

Two thoracic radiologists (reader 1 had 6 years of experience; reader 2 had 15 years of experience) who were blinded to clinical data and image reconstruction techniques independently assessed CT images. Also, images were randomly displayed using PACS to reduce bias. Radiologists determined the radiologic features of usual interstitial pneumonia (UIP), which is a hallmark of idiopathic pulmonary fibrosis (IPF). A classification of ‘UIP’, ‘Probable UIP’, ‘Indeterminate for UIP’, and ‘Alternative diagnosis’ was assigned for each case using the 2022 American thoracic society and Fleischner Society guidelines [4, 28, 29]. The UIP pattern was defined as subpleural, basal predominance of reticular abnormalities, honeycombing with, or without traction bronchiectasis; the absence of findings was suggestive of alternative diagnosis, including extensive ground-glass opacity, micronodules, discrete cysts, mosaic attenuation, or segmental/lobar consolidation [28]. After diagnosis was made using the three reconstruction methods, cases showing discrepant diagnoses were selected and compared with HRCT findings of the patient. And two radiologists made consensus diagnosis which was considered a reference standard.

Statistical analysis

Image quality of the three reconstruction methods (FBP, ASiR-V, and DLM) were compared using one-way analysis of variance, and post-hoc pairwise comparisons were adjusted for multiple comparisons using the Bonferroni correction. Average Cohen’s kappa statistics were used to evaluate the agreement between the two readers and the diagnostic agreement on the three reconstruction methods. A kappa statistic of 0.81–1.00 indicates an excellent agreement; 0.61–0.80, substantial agreement; 0.41–0.60, moderate agreement; 0.21–0.40, fair agreement; and 0.00–0.20, poor agreement [30]. Statistical significance was set at p < 0.05. All statistical calculations were performed using SAS (version 9.4; SAS Institute, Cary, NC, USA) and R (version 3.3.1; R Foundation for Statistical Computing http://www.R-project.org/ [31]) software.

Results

Basic characteristics of the participants and radiation dose

Of the 193 patients included in the current study, 141 were men and 52 were women, with a mean age of 68.95 ± 9.39 years (range 36–88 years). A total of 93 patients (48.2%) were diagnosed with IPF based on the diagnostic criteria of the American Thoracic Society and European Respiratory Society [29], 55 patients (28.5%) were diagnosed with connective tissue disease related ILD, and 19 patients (9.8%) were diagnosed with interstitial lung abnormality. Organizing pneumonia was diagnosed in six patients (3.1%), followed by smoking related ILD in six (3.1%), nonspecific interstitial pneumonia in four (2.1%), and pleuroparenchymal fibroelastosis in four patients (2.1%). Remaining six patients (3.1%) had other diagnoses (for example, chronic hypersensitivity pneumonitis, post inflammatory fibrosis, sarcoidosis, hemosiderosis). Lung biopsy was performed on 38 patients, either wedge resection (34 patients) or transbronchial lung biopsy (4 patients). Of these, 17 were pathologically confirmed to have UIP on surgical lung biopsy.

As for the radiation dosage in LDCT, the mean CTDIvol, DLP, and effective dose were 1.96 ± 0.03 mGy, 70.32 ± 5.82 mGy*cm, and 0.98 ± 0.08 mSv, respectively.

Comparison of imaging reconstruction methods

Signal, noise, and SNR results.

A comparison of the image signal, noise, and SNR measurements is shown in Fig 1A–1F. The image signal of the lung parenchyma did not significantly differ across FBP, ASiR-V, and DLM; however, the mean background air signal of ASiR-V images was lower than that of FBP and DLM images (FBP vs. ASiR-V, ASiR-V vs. DLM, all p < 0.001). The other parameters, including noise and SNR, differed significantly (Fig 2). Image noise was significantly higher in FBP and significantly lower in DLM, both in the lung parenchyma and background air (FBP vs. ASiR-V, ASiR-V vs. DLM, FBP vs. DLM, all p < 0.001). The SNR in the lung parenchyma and background air significantly differed across the three different reconstructions, and it was higher in the DLM and lower in the FBP (FBP vs. ASiR-V, ASiR-V vs. DLM, FBP vs. DLM, all p < 0.001).

thumbnail
Fig 1.

Signal (A, B), noise (C, D), signal-to-noise ratio (E, F), and blind/referenceless image spatial quality evaluator score (G) results of three different imaging reconstruction methods (filtered back projection [FBP], adaptive statistical iterative reconstruction-Veo at a level of 30% [ASiR-V], and deep learning image reconstruction [DLM]) at the lung parenchyma and background air. Paired t test results with *** p < 0.01; ns, not significant.

https://doi.org/10.1371/journal.pone.0291745.g001

thumbnail
Fig 2. Comparison of low-dose chest computed tomography scan in axial lung window images of the lung of a 50-year-old man.

Reconstruction performed using filtered back projection (FBP) (A), adaptive statistical iterative reconstruction-Veo at a level of 30% (ASiR-V) (B), and deep learning image reconstruction (DLM) (C). The signal of the lung parenchyma did not vary significantly across the different reconstructions. However, the image noise of the DLM images was lower than that of the ASiR-V and FBP images (ASiR-V vs. DLM, p <0.001 and FBP vs. DLR-M, p < 0.001, respectively). The signal-to-noise ratio was significantly higher in DLM in both the lung parenchyma and background air than in other reconstruction methods. (FBP vs. ASiR-V, ASiR-V vs. DLM, and FBP vs. DLM, all p < 0.001).

https://doi.org/10.1371/journal.pone.0291745.g002

BRISQUE score results.

The BRISQUE scores of the three reconstructed images are depicted in Fig 1G. The BRISQUE scores of both ASiR-V and DLM were lower than those of the FBP images (all p < 0.001). The difference between ASiR-V and DLM was statistically significant (p < 0.001), and DLM showed a lower BRISQUE score. The difference between the ASiR-V and FBP images was approximately 1 point, whereas the difference between the DLM and FBP images was approximately 10 points.

Visual scoring by two thoracic radiologists.

The results of the qualitative image analysis are presented in Table 2. In the evaluation of the image contrast, DLM had highest mean points (4.47 ± 0.52), showing above average to excellent contrast, followed by ASiR-V (3.85 ± 0.56) and FBP (3.62 ± 0.48). The scores significantly differed among the three methods (FBP vs. ASiR-V, ASiR-V vs. DLM, FBP vs. DLM, all p < 0.001). In terms of image noise, DLM had highest mean points (3.86 ± 0.45), showing average to below average noise, followed by ASiR-V (3.12 ± 0.78), FBP (2.43 ± 0.78), and scores significantly differed between three methods (FBP vs. ASiR-V, ASiR-V vs. DLM, FBP vs. DLM, all p < 0.001). For overall image quality, the DLM images yielded the highest score, which was slightly inferior to the best image quality. The scores between ASiR-V and DLM, ASiR-V and FBP, and DLM and FBP significantly differed (FBP vs. ASiR-V, ASiR-V vs. DLM, and FBP vs. DLM; all p < 0.001).

Diagnostic categorization of UIP based on CT patterns

On the CT pattern diagnosis of UIP, there was substantial agreement between the two readers (κ  =  0.617). According to the reference standard that consisted of a consensus panel of two radiologists, the UIP was the most common diagnosis (56.5% in FBP [109 of 193], 56.0% in ASiR-V [108 of 193], and 56.0% in DLM [108 of 193]), followed by probable UIP (19.2% in FBP [37 of 193], 17.6% in ASiR-V [34 of 193], and 18.1% in DLM [35 of 193]), alternative diagnosis (16.6% in all FBP, ASiR-V, and DLM [32 of 193]), and indeterminate for UIP pattern (7.8% in FBP [15 of 193], 9.8% in ASiR-V [34 of 193], and 9.3% in DLM [18 of 193]) (Table 3).

thumbnail
Table 3. Diagnostic categorization of usual interstitial pneumonia based on computed tomography patterns.

https://doi.org/10.1371/journal.pone.0291745.t003

The agreement in diagnostic categorization between the three reconstruction methods was almost perfect (κ  =  0.992, CI 0.990–0.994). In the ASiR-V and DLM images, probable UIP was less diagnosed compared to that of FBP (19.2% in FBP [37 of 193], 17.6% in AsiR-V [34 of 193], and 17.6% in DLM [34 of 193]), and indeterminate for UIP was frequently diagnosed (7.8% in FBP [15 of 194], 9.8% in AsiR-V [19 of 193], and 9.8% in DLM [19 of 193]) compared to that of the FBP images (Figs 3 and 4). One case was diagnosed as UIP in the FBP image but categorized as probable UIP in both the ASiR-V and DLM images (Fig 5). There were no discrepant cases in the category of ‘alternative diagnosis’ between the three reconstruction methods. Cases with discrepant diagnoses among the three reconstruction methods are presented in Table 4.

thumbnail
Fig 3. A discordant case between filtered back projection (FBP), adaptive statistical iterative reconstruction-Veo at a level of 30% (ASiR-V), and deep learning image reconstruction (DLM) in probable usual interstitial pneumonia (UIP) and indeterminate UIP pattern.

CT images of a 66-year-old man showing ground-glass opacity and reticulation in the subpleural areas with a basal predominance (A-D). In the displayed FBP (A) and AsiR-V images (B), readers made the diagnosis of a probable UIP pattern because tubular structures in the periphery of the right lower lobe (arrow) were interpreted as peripheral traction bronchiolectasis. However, in the DLM image (C), the presence of traction bronchiolectasis is not definite. In addition, reticulation is less prominent, and ground-glass opacity is the dominant finding. Therefore, it was categorized as indeterminate for the UIP. In high-resolution CT, (D) probable UIP was suggested because traction bronchiolectasis can be observed (arrow). The patient underwent surgical biopsy, which revealed UIP; therefore, IPF was diagnosed.

https://doi.org/10.1371/journal.pone.0291745.g003

thumbnail
Fig 4. A discordant case between filtered back projection (FBP), adaptive statistical iterative reconstruction-Veo at a level of 30% (AsiR-V), and deep learning image reconstruction (DLM) in probable usual interstitial pneumonia (UIP) and indeterminate UIP pattern.

CT images of a 61-year-old man showing ground-glass opacity in the basal subpleural areas of the left lower lobe (A-D). In the FBP image (A), reader 1 diagnosed probable UIP pattern, since the observable tubular structures in the periphery (arrows) were interpreted as peripheral traction bronchiolectasis and reticulation was thought to be present. However, reader 2 made the diagnosis of indeterminate for UIP. In the ASiR-V (B) and DLM images (C), two readers diagnosed indeterminate for UIP, since ground-glass opacity with subtle reticulation is dominant without traction bronchiolectasis. In high-resolution CT (D), minimal traction bronchiolectasis can be seen (arrows), and probable UIP was suggested. The final diagnosis of this patient was established using a multidisciplinary approach.

https://doi.org/10.1371/journal.pone.0291745.g004

thumbnail
Fig 5. A discordant case between filtered back projection (FBP), adaptive statistical iterative reconstruction-Veo at a level of 30% (ASiR-V), and deep learning image reconstruction (DLM) in usual interstitial pneumonia (UIP) and probable UIP pattern.

CT images of a 74-year-old man showing subpleural predominant reticular abnormality with traction bronchiectasis (A-D). Clustered, thick-walled cystic spaces are observed in the basal portion of the left lower lobe (arrow), which was interpreted as honeycombing. Therefore, the UIP pattern was diagnosed by two readers using the FBP image (A). However, in the ASiR-V and DLM images, the case was classified as probable UIP by readers because the findings suggesting honeycombing were unclear. On high-resolution CT (D), probable UIP was suggested. The final diagnosis of this patient was established using a multidisciplinary approach.

https://doi.org/10.1371/journal.pone.0291745.g005

thumbnail
Table 4. Cases with discrepant diagnosis between three reconstruction methods.

https://doi.org/10.1371/journal.pone.0291745.t004

Discussion

This study demonstrated that DLM showed favorable results in terms of image noise, SNR, BRISQUE scores, and visual scoring compared to that of the ASiR-V and FBP images. Furthermore, DLM maintained diagnostic agreement in CT pattern diagnosis of UIP.

The FBP and IR algorithms are widely used in CT image reconstruction. However, both methods have several drawbacks. FBP can severely degrade image quality by increasing image noise, while IR can modify the spatial resolution and noise texture in different regions of CT images [11, 32]. IR often requires long reconstruction times [11, 27]. With recent advances in artificial intelligence technology, DLM has been introduced to overcome the limitations of FBP and IR approaches. The DLM incorporates convolutional neural networks into the image reconstruction process, which can be used to generate high-quality images from low-dose projection data in a short reconstruction time in a clinical environment [12]. Compared to IR, DLM is a cost effective technology that takes up less space and time, that does not require complex high-level calculations [33].

In this study, we applied the BRISQUE score to assess the image quality of the DLM images with LDCT. Unlike SNR, BRISQUE is a no-reference image quality assessment model based on the principle that natural images possess certain regular statistical properties that are measurably modified by distortions [24]. In our study, DLM images yielded significantly better image quality in the BRISQUE score in concordance with other image parameters, including noise, SNR, and visual scoring.

Recently, several studies have reported that DLM yields a favorable noise texture with superior image quality on low-dose chest CT [15, 17, 22, 23]. However their clinical impact was questionable, because most of studies were performed with normal groups who had clear lung parenchyma without pathology. Less is known whether DLM really works in actual clinical settings.

ILD is a rare condition characterized by extensive inflammation and fibrosis mainly involving the lungs and IPF, the most common type of ILD, is associated with poor prognosis [2, 28]. Since interstitial lung disease (ILD) is characterized by subtle parenchymal changes (e.g. honeycombing, traction bronchiectasis, ground-glass opacities and reticulation) it may be influenced by image quality largely than other lung etiologies [29]. Diagnosing ILD is challenging for chest radiology specialists and reported inter-/intra- reader variability on ILD is relatively high [4, 9]. Subtle changes in image quality may alter the radiologist’s decision. In addition, since ILD is mostly progressive disease, the patients with ILD need continuous monitoring of the disease and need cancer surveillance by performing routine follow-up CTs. LDCT is widely used to monitor disease progression in ILD patients [5, 6]. Maintaining image quality while using LDCT, thereby minimizing radiation accumulation is paramount for these patients. Therefore, our study aimed to evaluate the image quality by applying DLM to patients with ILD and evaluate whether it maintains an adequate level of diagnostic power compared to other classic imaging reconstruction methods. As a result, in our study, DLM in LDCT maintained diagnostic agreement in terms of CT pattern diagnosis of UIP compared to that of IR and FBP. To the best of our knowledge, this study specifically investigated the diagnostic impact of DLM on ILD, which has never been tried in previous literature.

Although diagnostic agreement between three reconstruction methods was almost perfect, there were six cases (3.1%, 6 of 193) of discrepant diagnoses between the three reconstruction methods (Table 4). Four patients were diagnosed with indeterminate UIP in DLM, while probable UIP was diagnosed on HRCT. Of these, three patients were finally diagnosed with IPF based on multidisciplinary diagnosis. While DLM showed better image quality than that of other reconstruction images, the denoising process may have produced some degradation in the small peripheral airway dilatations or subtle reticulation, which may have caused underestimation of lung fibrosis. Similar results were also observed in IR, with four cases being diagnosed as indeterminate for UIP but probable for UIP on HRCT. Although IR has shown favorable results in the detection and assessment of ILD [8, 9] it is known to have limitations in that the noise texture often differs from that of traditional FBP images and can negatively affect subjective acceptance and diagnostic confidence, altering noise texture [21, 27]. In our study, there were cases in which both IR and DLM were suggestive of such results (Figs 2 and 3). Another possible reason for the discrepant diagnoses is that the reported interobserver agreement in diagnosing UIP pattern is not high ranging 0.40–0.69 [3, 6, 34]. The diagnostic agreement between the two radiologists in this study was substantial; however, variability may have influenced the diagnosis.

It is noteworthy that the CT pattern diagnostic agreement of ‘UIP pattern’ between the three different reconstructions was almost perfect, except in one case (Fig 4). Diagnosing the UIP pattern by CT is important, since the radiologic diagnosis is sufficient to secure a diagnosis without lung biopsy in an appropriate clinical contest [35, 36]. One discrepant case was found between FBP and other two reconstruction method, and it is thought to have result from substantial noise of the FBP image. Actually, for this reason, the FBP image is seldom used in real clinical practice and considered as raw images when reconstructing CT images. In addition, there were no discrepant cases of alternative diagnoses that may require further diagnostic interventions other than IPF. Most discrepant cases showed differences between ‘probable UIP’ and ‘indeterminate UIP’. CT alone usually is not sufficient to make a definitive diagnosis in these two categories, and additional biopsies or multidisciplinary approaches are needed in most cases. In summary, our study show that diagnostic agreement is maintained in the large framework of DLM in diagnosing UIP pattern by CT, but has not actually shown results that may replace or exceed preexisting methods.

The current study has some limitations. First, our investigation was retrospective. Moreover, the study was conducted at a single institution, which may have caused selection bias. Second, complete blinding of the image reconstruction method was not possible because of the unique visual appearance of the FBP, ASiR-V, and DLM images. Despite the radiologists being blinded to the reconstruction methods and images being randomly displayed in PACS system, it is anticipated that the radiologists were familiar with FBP and ASiR-V images but not with DLM images. Preliminary sessions to let readers get familiar with image appearance before formal analysis could have been helpful to reduce bias. Third, the method we used in our study is original version BRSIQUE which was used for natural images not CT images. A study using self-developed modified BRISQUE for MRI image evaluation has been published [26]. However, it was difficult to apply in our study because the modified BRISQUE they used were not open to public and our study used different image modality. In our future research, we are planning to develop and apply modified BRISQUE based on pre-trained CT images to increase accuracy.

Conclusions

In conclusion, the image noise, SNR, BRISQUE, and visual scoring of chest LDCT scan images improved with DLM compared to that with ASiR-V and FBP. DLM may be feasible in clinical practice for evaluating ILD, since diagnostic agreement is maintained in CT pattern diagnosis of UIP compared to that of ASiR-V and FBP.

Acknowledgments

Samsung medical center is run by the Samsung life public interest foundation, a social welfare corporation, and is a non-profit medical center.

References

  1. 1. Petnak T, Lertjitbanjong P, Thongprayoon C, Moua T. Impact of Antifibrotic Therapy on Mortality and Acute Exacerbation in Idiopathic Pulmonary Fibrosis: A Systematic Review and Meta-Analysis. Chest 2021;160:1751–1763 pmid:34217681
  2. 2. Canestaro WJ, Forrester SH, Raghu G, Ho L, Devine BE. Drug Treatment of Idiopathic Pulmonary Fibrosis: Systematic Review and Network Meta-Analysis. Chest 2016;149:756–766 pmid:26836914
  3. 3. Mayo JR. CT evaluation of diffuse infiltrative lung disease: dose considerations and optimal technique. J Thorac Imaging 2009;24:252–259 pmid:19935222
  4. 4. Lynch DA, Sverzellati N, Travis WD, Brown KK, Colby TV, Galvin JR, et al. Diagnostic criteria for idiopathic pulmonary fibrosis: a Fleischner Society White Paper. Lancet Respir Med 2018;6:138–153 pmid:29154106
  5. 5. Zwirewich CV, Mayo JR, Muller NL. Low-dose high-resolution CT of lung parenchyma. Radiology 1991;180:413–417 pmid:2068303
  6. 6. Christe A, Charimo-Torrente J, Roychoudhury K, Vock P, Roos JE. Accuracy of low-dose computed tomography (CT) for detecting and characterizing the most common CT-patterns of pulmonary disease. Eur J Radiol 2013;82:e142–150 pmid:23122673
  7. 7. Padole A, Ali Khawaja RD, Kalra MK, Singh S. CT radiation dose and iterative reconstruction techniques. AJR Am J Roentgenol 2015;204:W384–392 pmid:25794087
  8. 8. Pontana F, Billard AS, Duhamel A, Schmidt B, Faivre JB, Hachulla E, et al. Effect of Iterative Reconstruction on the Detection of Systemic Sclerosis-related Interstitial Lung Disease: Clinical Experience in 55 Patients. Radiology 2016;279:297–305 pmid:26583761
  9. 9. Lim HJ, Chung MJ, Shin KE, Hwang HS, Lee KS. The Impact of Iterative Reconstruction in Low-Dose Computed Tomography on the Evaluation of Diffuse Interstitial Lung Disease. Korean J Radiol 2016;17:950–960 pmid:27833411
  10. 10. Park S, Yoon JH, Joo I, Yu MH, Kim JH, Park J, et al. Image quality in liver CT: low-dose deep learning vs standard-dose model-based iterative reconstructions. Eur Radiol 2022;32:2865–2874 pmid:34821967
  11. 11. Mohammadinejad P, Mileto A, Yu L, Leng S, Guimaraes LS, Missert AD, et al. CT Noise-Reduction Methods for Lower-Dose Scanning: Strengths and Weaknesses of Iterative Reconstruction Algorithms and New Techniques. Radiographics 2021;41:1493–1508 pmid:34469209
  12. 12. Hosny A, Parmar C, Quackenbush J, Schwartz LH, Aerts H. Artificial intelligence in radiology. Nat Rev Cancer 2018;18:500–510 pmid:29777175
  13. 13. Choi H, Chang W, Kim JH, Ahn C, Lee H, Kim HY, et al. Dose reduction potential of vendor-agnostic deep learning model in comparison with deep learning-based image reconstruction algorithm on CT: a phantom study. Eur Radiol 2022;32:1247–1255 pmid:34390372
  14. 14. Kolb M, Storz C, Kim JH, Weiss J, Afat S, Nikolaou K, et al. Effect of a novel denoising technique on image quality and diagnostic accuracy in low-dose CT in patients with suspected appendicitis. Eur J Radiol 2019;116:198–204 pmid:31153565
  15. 15. Nam JG, Ahn C, Choi H, Hong W, Park J, Kim JH, et al. Image quality of ultralow-dose chest CT using deep learning techniques: potential superiority of vendor-agnostic post-processing over vendor-specific techniques. Eur Radiol 2021;31:5139–5147 pmid:33415436
  16. 16. Kambadakone A. Artificial Intelligence and CT Image Reconstruction: Potential of a New Era in Radiation Dose Reduction. J Am Coll Radiol 2020;17:649–651 pmid:32004484
  17. 17. Kim JH, Yoon HJ, Lee E, Kim I, Cha YK, Bak SH. Validation of Deep-Learning Image Reconstruction for Low-Dose Chest Computed Tomography Scan: Emphasis on Image Quality and Noise. Korean J Radiol 2021;22:131–138 pmid:32729277
  18. 18. Putman RK, Gudmundsson G, Axelsson GT, Hida T, Honda O, Araki T, et al. Imaging Patterns Are Associated with Interstitial Lung Abnormality Progression and Mortality. Am J Respir Crit Care Med 2019;200:175–183 pmid:30673508
  19. 19. KSTR. KSTR Chest CT Practice Guidelines and Technical Standards 2007. https://kstr.radiology.or.kr/reference/guide.php. Accessed 23th, May, 2023
  20. 20. Trattner S, Halliburton S, Thompson CM, Xu Y, Chelliah A, Jambawalikar SR, et al. Cardiac-Specific Conversion Factors to Estimate Radiation Effective Dose From Dose-Length Product in Computed Tomography. JACC Cardiovasc Imaging 2018;11:64–74 pmid:28823748
  21. 21. Lin S, Lin M, Lau KK. Image quality comparison between model-based iterative reconstruction and adaptive statistical iterative reconstruction chest computed tomography in cystic fibrosis patients. J Med Imaging Radiat Oncol 2019;63:602–609 pmid:31090256
  22. 22. Winklehner A, Karlo C, Puippe G, Schmidt B, Flohr T, Goetti R, et al. Raw data-based iterative reconstruction in body CTA: evaluation of radiation dose saving potential. Eur Radiol 2011;21:2521–2526 pmid:21822785
  23. 23. Kuo Y, Lin YY, Lee RC, Lin CJ, Chiou YY, Guo WY. Comparison of image quality from filtered back projection, statistical iterative reconstruction, and model-based iterative reconstruction algorithms in abdominal computed tomography. Medicine (Baltimore) 2016;95:e4456 pmid:27495078
  24. 24. Mittal A, Moorthy AK, Bovik AC. No-reference image quality assessment in the spatial domain. IEEE Trans Image Process 2012;21:4695–4708 pmid:22910118
  25. 25. Zhang Z, Dai G, Liang X, Yu S, Li L, Xie Y. Can signal-to-noise ratio perform as a baseline indicator for medical image quality assessment. IEEE Access 2018;6:11534–11543
  26. 26. Chow LS, Rajagopal H. Modified-BRISQUE as no reference image quality assessment for structural MR images. Magn Reson Imaging 2017;43:74–87 pmid:28716679
  27. 27. Goodenberger MH, Wagner-Bartak NA, Gupta S, Liu X, Yap RQ, Sun J, et al. Computed Tomography Image Quality Evaluation of a New Iterative Reconstruction Algorithm in the Abdomen (Adaptive Statistical Iterative Reconstruction-V) a Comparison With Model-Based Iterative Reconstruction, Adaptive Statistical Iterative Reconstruction, and Filtered Back Projection Reconstructions. J Comput Assist Tomogr 2018;42:184–190 pmid:28806318
  28. 28. Raghu G, Remy-Jardin M, Myers JL, Richeldi L, Ryerson CJ, Lederer DJ, et al. Diagnosis of Idiopathic Pulmonary Fibrosis. An Official ATS/ERS/JRS/ALAT Clinical Practice Guideline. Am J Respir Crit Care Med 2018;198:e44–e68 pmid:30168753
  29. 29. Raghu G, Remy-Jardin M, Richeldi L, Thomson CC, Inoue Y, Johkoh T, et al. Idiopathic Pulmonary Fibrosis (an Update) and Progressive Pulmonary Fibrosis in Adults: An Official ATS/ERS/JRS/ALAT Clinical Practice Guideline. Am J Respir Crit Care Med 2022;205:e18–e47
  30. 30. Svanholm H, Starklint H, Gundersen HJ, Fabricius J, Barlebo H, Olsen S. Reproducibility of histomorphologic diagnoses with special reference to the kappa statistic. APMIS 1989;97:689–698 pmid:2669853
  31. 31. Ripley BD. The R project in statistical computing. MSOR Connections 2001;1:23–25
  32. 32. Nagayama Y, Sakabe D, Goto M, Emoto T, Oda S, Nakaura T, et al. Deep Learning-based Reconstruction for Lower-Dose Pediatric CT: Technical Principles, Image Characteristics, and Clinical Implementations. Radiographics 2021;41:1936–1953 pmid:34597178
  33. 33. Koetzier LR, Mastrodicasa D, Szczykutowicz TP, van der Werf NR, Wang AS, Sandfort V, et al. Deep Learning Image Reconstruction for CT: Technical Principles and Clinical Prospects. Radiology 2023;306:e221257 pmid:36719287
  34. 34. Choe J, Kwon BS, Do KH, Hwang HS, Song JW, Chae EJ. Diagnostic and prognostic implications of 2018 guideline for the diagnosis of idiopathic pulmonary fibrosis in clinical practice. Sci Rep 2021;11:16481 pmid:34389774
  35. 35. Hunninghake GW, Lynch DA, Galvin JR, Gross BH, Muller N, Schwartz DA, et al. Radiologic findings are strongly associated with a pathologic diagnosis of usual interstitial pneumonia. Chest 2003;124:1215–1223 pmid:14555549
  36. 36. Hunninghake GW, Zimmerman MB, Schwartz DA, King TE Jr, Lynch J, Hegele R, et al. Utility of a lung biopsy for the diagnosis of idiopathic pulmonary fibrosis. Am J Respir Crit Care Med 2001;164:193–196 pmid:11463586