Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Quantitative ultrasound, elastography, and machine learning for assessment of steatosis, inflammation, and fibrosis in chronic liver disease

  • François Destrempes,

    Roles Formal analysis, Investigation, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Laboratory of Biorheology and Medical Ultrasonics, University of Montreal Hospital Research Center (CRCHUM), Montréal, Québec, Canada

  • Marc Gesnik,

    Roles Data curation, Formal analysis, Investigation, Methodology, Writing – review & editing

    Affiliation Laboratory of Biorheology and Medical Ultrasonics, University of Montreal Hospital Research Center (CRCHUM), Montréal, Québec, Canada

  • Boris Chayer,

    Roles Data curation, Formal analysis, Investigation, Methodology, Resources, Software, Writing – review & editing

    Affiliation Laboratory of Biorheology and Medical Ultrasonics, University of Montreal Hospital Research Center (CRCHUM), Montréal, Québec, Canada

  • Marie-Hélène Roy-Cardinal,

    Roles Formal analysis, Investigation, Methodology, Writing – review & editing

    Affiliation Laboratory of Biorheology and Medical Ultrasonics, University of Montreal Hospital Research Center (CRCHUM), Montréal, Québec, Canada

  • Damien Olivié,

    Roles Methodology, Project administration, Resources, Supervision, Writing – review & editing

    Affiliations Department of Radiology, Radiation oncology and Nuclear Medicine, Université de Montréal, Montréal, Québec, Canada, Department of Radiology, Centre hospitalier de l’Université de Montréal (CHUM), Montréal, Québec, Canada

  • Jeanne-Marie Giard,

    Roles Investigation, Methodology, Project administration, Supervision, Validation, Visualization, Writing – review & editing

    Affiliation Department of Medicine, Division of Hepatology and Liver Transplantation, Université de Montréal, Montréal, Québec, Canada

  • Giada Sebastiani,

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Writing – review & editing

    Affiliation Department of Medicine, Division of Gastroenterology and Hepatology, McGill University Health Centre (MUHC), Montréal, Québec, Canada

  • Bich N. Nguyen,

    Roles Investigation, Methodology, Resources, Supervision, Validation, Writing – review & editing

    Affiliations Department of Pathology, Centre hospitalier de l’Université de Montréal (CHUM), Montréal, Québec, Canada, Department of Pathology and Cellular Biology, Université de Montréal, Montréal, Québec, Canada

  • Guy Cloutier ,

    Contributed equally to this work with: Guy Cloutier, An Tang

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    guy.cloutier@umontreal.ca (GC); an.tang@umontreal.ca (AT)

    Affiliations Laboratory of Biorheology and Medical Ultrasonics, University of Montreal Hospital Research Center (CRCHUM), Montréal, Québec, Canada, Department of Radiology, Radiation oncology and Nuclear Medicine, Université de Montréal, Montréal, Québec, Canada, Institute of Biomedical Engineering, University of Montreal, Montréal, Québec, Canada

  • An Tang

    Contributed equally to this work with: Guy Cloutier, An Tang

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    guy.cloutier@umontreal.ca (GC); an.tang@umontreal.ca (AT)

    Affiliations Department of Radiology, Radiation oncology and Nuclear Medicine, Université de Montréal, Montréal, Québec, Canada, Department of Radiology, Centre hospitalier de l’Université de Montréal (CHUM), Montréal, Québec, Canada, Laboratory of Medical Image Analysis, Centre de recherche du Centre hospitalier de l’Université de Montréal (CRCHUM), Montréal, Québec, Canada

Abstract

Objective

To develop a quantitative ultrasound (QUS)- and elastography-based model to improve classification of steatosis grade, inflammation grade, and fibrosis stage in patients with chronic liver disease in comparison with shear wave elastography alone, using histopathology as the reference standard.

Methods

This ancillary study to a prospective institutional review-board approved study included 82 patients with non-alcoholic fatty liver disease, chronic hepatitis B or C virus, or autoimmune hepatitis. Elastography measurements, homodyned K-distribution parametric maps, and total attenuation coefficient slope were recorded. Random forests classification and bootstrapping were used to identify combinations of parameters that provided the highest diagnostic accuracy. Receiver operating characteristic (ROC) curves were computed.

Results

For classification of steatosis grade S0 vs. S1-3, S0-1 vs. S2-3, S0-2 vs. S3, area under the receiver operating characteristic curve (AUC) were respectively 0.60, 0.63, and 0.62 with elasticity alone, and 0.90, 0.81, and 0.78 with the best tested model combining QUS and elastography features. For classification of inflammation grade A0 vs. A1-3, A0-1 vs. A2-3, A0-2 vs. A3, AUCs were respectively 0.56, 0.62, and 0.64 with elasticity alone, and 0.75, 0.68, and 0.69 with the best model. For classification of liver fibrosis stage F0 vs. F1-4, F0-1 vs. F2-4, F0-2 vs. F3-4, F0-3 vs. F4, AUCs were respectively 0.66, 0.77, 0.72, and 0.74 with elasticity alone, and 0.72, 0.77, 0.77, and 0.75 with the best model.

Conclusion

Random forest models incorporating QUS and shear wave elastography increased the classification accuracy of liver steatosis, inflammation, and fibrosis when compared to shear wave elastography alone.

Introduction

Chronic liver disease (CLD) is one of the top ten leading causes of death in the United States [1]. Nonalcoholic fatty liver disease (NAFLD) is the most common cause of CLD, affecting up to one third of the adult Western population [2]. It has a substantial burden due to the incidence and prevalence, and its impact on longevity and quality of life [3]. NAFLD is characterized by vacuoles of fat and may lead to nonalcoholic steatohepatitis (NASH), which is characterized by inflammation. All causes of CLD may evolve to liver fibrosis, a scarring process, which may progress to cirrhosis and liver failure. Although liver biopsy is the established reference standard for classification of steatosis grade, inflammation grade, and fibrosis stage, it has several limitations including cost, sampling error, and procedure-related morbidity and mortality [4].

The noninvasiveness, wide availability, innovative technical developments, and cost-effectiveness of ultrasound constitute key advantages for management of patients with CLD. Medical ultrasound has traditionally been used to diagnose diseases based on B-mode (structures) and Doppler-mode (flow) images. More recently, elastography techniques, which assess stiffness, tissue strain or viscoelasticity, provide the highest diagnostic performance for staging of liver fibrosis [57]. Over the past 5 decades, quantitative ultrasound (QUS) imaging techniques have been developed to analyze constructive and destructive interferences between echoes to characterize tissue microstructure below the typical resolution of ultrasound scanners, based on spectral analysis of radiofrequency signals or statistical properties of the echo envelope [8]. In earlier clinical studies, QUS controlled attenuation [911], which is related to ultrasound energy loss in tissues, and backscatter coefficient [12] provided promising results for grading liver steatosis. More recently, several studies have proposed traditional machine learning [1316] or deep learning [1719] techniques combined with shear wave elastography to improve the staging of liver fibrosis. The homodyned-K distribution has also been proposed to model the ultrasound echo envelope in the context of liver steatosis and steatohepatitis [12, 2022], since its parameters are related to cell density, tissue echogenicity, and tissue microstructure.

Considering the high disease prevalence, there is a need for noninvasive imaging and classification of CLD. The purpose of this study was to develop a combined QUS and elastography-based model to improve classification of steatosis grade, inflammation grade, and fibrosis stage in comparison with elastography alone, using histopathology as the reference standard. We privileged the random forests machine learning strategy to identify combination of imaging features providing the highest classification accuracy for a given training dataset [21].

Materials and methods

Study design and subjects

This is a retrospective, cross-sectional ancillary study to an imaging trial (ClinicalTrials.gov Identifier No. NCT02044523). This study was approved by the institutional review board of the two participating institutions, Centre hospitalier de l’Université de Montréal and McGill University Health Centre. All patients gave written informed consent.

The prior cross-sectional imaging trial compared diagnostic accuracy of ultrasound-based and magnetic resonance-based elastography techniques using liver biopsy as the reference standard [6]. For this ancillary study, QUS and elastography measurements were used to develop models for classification of steatosis grade, inflammation grade, and fibrosis stage.

Between October 2014 and September 2018, consecutive adult subjects were enrolled if a liver biopsy was scheduled as part of their clinical standard of care. Ninety-one subjects recruited at hepatology clinics of the two participating institutions underwent QUS. Subjects were excluded from this ancillary study if: (a) images were not acquired with the probe assigned to the study protocol (n = 2); (b) the underlying pathology did not meet eligibility criteria due to ethanol consumption (n = 2), drug-induced hepatitis (n = 1), sarcoidosis (n = 1), cholestasis (n = 1), or no biopsy was performed (n = 2). Characteristics of 82 subjects included in this study are described in Table 1. Among the 82 included subjects, the underlying pathology was CLD attributable to NAFLD (n = 6), NASH (n = 38), hepatitis B virus (n = 2), hepatitis C virus (n = 13), autoimmune hepatitis (n = 16), or mixed causes (n = 7). Consecutive eligible participants were included.

For each of steatosis or inflammation grade and fibrosis stage, classification tasks obtained from splitting the dataset into two classes based on the grade (4 values) or stage (5 values), viewed as ordinal variables, were assessed. Thus, we considered 3 steatosis and inflammation classification tasks, and 4 fibrosis classification tasks.

Ultrasound imaging

Ultrasound images were acquired with a clinical scanner (Acuson S2000 or S3000, Siemens Healthineers) using a convex probe (4C1). Patients were required to fast at least 4 hours prior to the examination. They were scanned in dorsal decubitus with their right arm in maximal abduction. Required standard intercostal B-mode images included: (1) the right liver lobe and right kidney, (2) right liver lobe at the level of the right portal vein, and (3) hepatic veins. A 3-second cine-loop of radiofrequency (RF) signals of the right liver lobe in a plane without major vessels was recorded for subsequent QUS post-processing.

Ultrasound shear wave elastography

Point shear wave elastography (pSWE) was performed with the same convex probe according to guidelines [23]. The median shear wave velocity (expressed in m/s) of 10 valid measurements (obtained in 20 repetitions or less) was used as a surrogate biomarker of liver stiffness. Reliability of measurements was based on the success rate and an interquartile range to the median (IQR/M) < 30% as per European Federation of Societies for Ultrasound in Medicine and Biology (EFSUMB) guidelines and Recommendations on the Clinical Use of Liver Ultrasound Elastography [23].

Preliminary post-processing

Echo envelope of RF data was computed based on Hilbert’s transform, after having compensated for time gain compensation settings. The first 30 images of uncompressed and unfiltered echo envelopes of RF signals were analyzed. A region of interest (ROI) was manually delineated in the first image of the cineloop and then propagated automatically to subsequent images. A pre-classification of pixels (pixels’ labeling) within the ROI of all frames was estimated [24], according to the statistical distribution of the echo envelope.

Quantitative ultrasound analysis

A sliding window of 78 x 12 pixels, corresponding to about 3 mm along both axial and lateral axes, was swept across the ROI by steps of 4 x 1 pixels. For each window, only pixels with same label as the central pixel were used for homodyned K-distribution (HKD) parameters’ estimation [24], so that the hypothesis of a single distribution was met locally (as opposed to a mixture of a few distributions), thus allowing for application of the XU estimation method of the HKD parameters [25]. Notice that the HK distribution was proposed as a general model for echo envelope distribution, with the advantage of a physical interpretation of its parameters [24]. These QUS maps comprised the normalized mean intensity μn (mean intensity divided by the maximal intensity within ROI) [26]; the reciprocal 1/α of the scatterer clustering parameter α, which is an indicator of density and homogeneity in fluctuations of acoustical impedance [1]; the coherent-to-diffuse signal ratio k [27] and the diffuse-to-total signal power ratio 1/(k +1) [1], both of which are QUS biomarkers of structure within scatterers’ spatial organization [24]. Notice that k represents the coherent-to-diffuse signal power ratio, not to be confused with parameter k above. For each parametric map, mean value and inter-quartile range (IQR) within the ROI were computed, and median values over the 30 frames yielded two QUS features per map [21]. Thus, 8 HKD features were produced for each acquisition: (mean and IQR features) x (4 parametric maps). Means and maxima were winsorized to avoid spurious outliers [28].

The total acoustical attenuation due to layers of fat that could be present in patients with obesity might be a confounder for HKD parameters. Therefore, the total attenuation coefficient slope (ACS) was estimated, based on spectral Gaussian fit method [8], and its median value over 30 frames was output as an QUS feature, in addition to HKD parameters. Combination of total attenuation with statistical features was shown to improve classification performance [29], and this is an alternative to estimating HKD parameters directly on the echo envelope compensated for total attenuation.

Lastly, the local ACS was estimated as a further QUS biomarker. Estimation was based on a hybrid state-of-the-art method [30]. For both total and local attenuation estimation methods, a calibrated reference phantom (model 117GU-101, CIRS, Norfolk, VA) was used to compensate for acquisition settings [8]. In total, 11 features were extracted from each acquisition of radiofrequency data.

B-mode images with overlaid parametric maps of QUS features in 4 representative patients are shown in Fig 1. Point shear wave elastography images in the same patients are shown in Fig 2. Custom programs for QUS computations were implemented in C++ language and Matlab R2018a software (MathWorks).

thumbnail
Fig 1. Examples of histopathology-proven steatosis grades.

From left to right with histopathology-proven steatosis grades: 1) 62-year-old man with hepatitis B virus with steatosis grade 0; 2) 62-year-old man with NASH and hepatitis B virus with steatosis grade S1; 3) 38-year-old man with NASH with steatosis grade S2; and 4) 69-year-old woman with NASH with steatosis grade S3. Displayed images: (A) Representative B-mode acquired in the right liver lobe and corresponding QUS parametric maps (zoomed in on ROIs) illustrating (B) reciprocal of scatterer clustering parameter (log scale), (C) coherent-to-diffuse signal ratio, and (D) diffuse-to-total signal power ratio. Yellow indicates higher values, whilst dark blue values indicate lower values. In the table are reported the inflammation grade and the fibrosis stage, as well as the corresponding point shear wave elasticity (pSWE), and local and total attenuation coefficient slopes (ACS).

https://doi.org/10.1371/journal.pone.0262291.g001

thumbnail
Fig 2. Examples of point shear wave elastography (pSWE) measurements.

pSWE measurements in 4 different patients: (A) steatosis grade S0, (B) steatosis grade S1, (C) steatosis grade S2, and (D) steatosis grade S3. Green rectangles indicate regions of interest where measurements are performed.

https://doi.org/10.1371/journal.pone.0262291.g002

Histopathology analysis.

Liver biopsies were performed by percutaneous approach with 16-G or 18-G core needles. Specimens were stained with hematoxylin and eosin and centrally scored by a hepatopathologist. Steatosis was graded according to the proportion of hepatocytes with macrovesicles of fat according to the NASH Clinical Research Network (CRN) scoring system, where S0: < 5%, S1: 5–33%, S2: 33–66%, and S3 > 66%. Inflammation was graded according to the level of lobular inflammation with the NASH CRN system for patients with NAFLD or NASH, and according to the severity of inflammation with the METAVIR scoring system for patients with autoimmune or chronic hepatitis, where A0: no foci, A1: < 2 foci per 200x field, A2: 2–4 foci per 200x field, and A3: > 4 foci per 200x field. Fibrosis was staged according to the distribution and severity of fibrosis and level of architectural modeling with the METAVIR scoring system for patients with autoimmune hepatitis or hepatitis B or C infection, and according to the NASH CRN for patients with NAFLD or NASH. Fibrosis stages F1A, F1B, and F1C in the NASH CRN system were pooled into stage F1.

Machine learning model and features selection

Classification tasks were performed with random forests [31]. This statistical learning approach is suitable in the case of relatively small data sets, as random forests avoid over-fitting (i.e., that would be based on too many trees—see Theorem 1.2 and the remark thereafter in [31]), and have very few hyper-parameters to be tuned; i.e., input features and maximum number of terminal nodes (MNTNs) in each tree. For classification purposes, we used 1000 trees and let MNTN vary from 2 to 20 by steps of 2. Selection of features was performed with G-mean as a recommended evaluation index in the case of imbalanced data set [32]. Random forests used for feature selection comprised 3000 trees. The 10 combinations of 4 features or less among all 11 features that obtained highest G-mean were selected, together with any other combination with same G-mean values in case of ties. Notice that random forests do not perform pruning on trees [33] and that Gini index is used at each node to determine the best split [34]. The number of variables randomly selected at each node is the default one; i.e., the square root of the total number of variables (one or two candidates per node in the context of this study since we used one to four features for classification).

Features selection was performed by testing all possible combinations of 1 to 4 features (there are 561 such combinations) as follows. Each combination of features was considered as the input of its own random forest consisting of 3000 trees. A random forest was trained for each of these combinations of features on the whole dataset. As a measure of potential quality of a given combination of features that was quick to compute (unlike an AUC obtained from several bootstraps), the G-mean was computed based on the sensitivity and specificity obtained on the whole dataset. The 10 combinations of 4 features or less among all 11 features that obtained highest G-mean were selected, together with any other combination with same G-mean values in case of ties. Further assessment of the 10 (or more) selected combinations of features was then performed based on AUCs estimated with the 0.638+ bootstrap method. See S1 Table for AUCs obtained with 0.638+ bootstrap method for each of the 11 features taken individually. For inflammation grade 0 vs. ≥ 1, the AUC of total attenuation was within the AUC’s 95% CI corresponding to the best tested combination of features. For inflammation grade A0-1 vs. A2- = 3, the AUC of parameter k was within the corresponding AUC’s 95% CI. For fibrosis stage F0 vs. F1-4, F0-1 vs. F2-4, or F0-3 vs. 4, the AUCs of parameter pSWE were within the corresponding 95% CIs corresponding to the best combinations of features. For all other classification tasks, none of the best performing individual features were within the corresponding AUC’s 95% CI of the best tested combination of features. S2 Table presents ANOVA tests and post-hoc multiple comparisons when applicable for each of these features. AUCs obtained with 0.638+ bootstrap method in the case where all features were taken as input of random forests are presented in S3 Table. The resulting AUCs all fell off below the AUC’s 95% CI of the corresponding best tested combinations of features.

ROC curves were obtained using stratified sampling, akin to varying the loss function used at the learning phase according to the sensitivity aimed at [35]. This may explain the non-monotonicity appearing in some of the reported ROC curves, as the dataset was imbalanced for some of the dichotomous classification tasks. We did not apply post-processing algorithms on the reported ROC curves.

Evaluation of classification performance

Classification performance was evaluated with the 0.632+ bootstrap method [36]. This method, which is recommended when sample size is too small for allowing training-validation-test set split, combines the leave-one-out bootstrap error and the training error with adaptive weights, thus avoiding over-estimation of the generalization error by the former and under-estimation by the latter. Receiver operating characteristic (ROC) curves were estimated by considering stratified resampling of the dataset with proportion of one class varying from 0 to 1 by steps of 1/40. One thousand bootstraps were generated for each stratification proportion, and for each bootstrap, classification errors were evaluated only on data not belonging to the bootstrap. The area under receiver operating characteristic curves (AUC-ROC) were then computed with the trapezoidal method for each previously selected combination of features and each value of MNTN. Combination of features and corresponding MNTN with highest AUC-ROC was selected as best model among tested ones for a given classification task. The jackknife method [37] was used to generate a sample of ROC-AUCs, from which a 95% confidence interval (CI) could be estimated based on percentiles. For each classification task, the point with maximal Youden index on the ROC curve of its best model was computed, and corresponding sensitivity and specificity were reported. The flow chart of the full post-processing pipeline is presented in Fig 3.

thumbnail
Fig 3. Flow chart of the post-processing pipeline.

(A) Clinical examination, recruitment with signed informed consent, research data acquisitions, biopsy and histology. (B) Calculation of the echo envelope of radiofrequency (RF) cineloops, manual delineation of a region-of-interest (ROI) contours in one frame, and propagation of contours along all frames of the cineloop. (C) Calculation of quantitative ultrasound (QUS) features. (D) Machine learning with random forests based on 11 features and gold standards: selection of 10 combinations of 4 features or less with highest G-mean (features selection), calculation of AUC-ROC on these combinations (0.632+ bootstrap), and estimation of the 95% CI for the features combination with highest AUC-ROC.

https://doi.org/10.1371/journal.pone.0262291.g003

Blinding

Sonographers performing ultrasound examinations were blinded to histopathological results. The hepatopathologist was blinded to ultrasound results.

Statistical analysis

To evaluate relationships between single features and liver steatosis, inflammation, and fibrosis, features were further analyzed with univariate ordinal logistic regressions. Steatosis grade, inflammation grade, and fibrosis stage were considered in turn as response, and any of the features appearing within the best combinations for the corresponding classification task was considered as single predictor. Coefficients of these regressions were reported, together with P-values, after Holm-Bonferroni correction.

To test for association between histopathological components (steatosis, inflammation, and fibrosis), viewed as categorical variables, contingency tables for pairs of these variables were computed, and Pearson’s chi-square tests were performed based on corresponding tables. P-values were reported, after Holm-Bonferroni correction.

Statistical analyses were performed with software R (version 3.2.5, 2016; R Foundation for Statistical Computing). For proportional odds logistic regressions (also called ordinal logistic regressions), we have used the function “polr” of the package “MASS” [38, 39] usable with the software R. The raw p-values were obtained with the function “pnorm” based on the statistics t output by the function“polr”. The Holm-Bonferroni correction on p-values was then computed according to the algorithm introduced in [40]. Machine learning and regression analyses used the package “randomForest” (version 4.6–12, 2015) [41] and “MASS” (version 7.3–45, 2015) [42], respectively.

Results

Study database

Table 1 indicates the sample size of each group determined by histopathology: 29, 22, 15, and 16 patients for steatosis grades S0, S1, S2 and S3, respectively; 8, 39, 27, and 8 patients for inflammation grades A0, A1, A2 and A3, respectively; and 12, 13, 18, 13, and 26 patients for fibrosis stages F0, F1, F2, F3, and F4, respectively. Examples of representative histopathology images are presented in Fig 4. Dataset is provided in S4 Table.

thumbnail
Fig 4. Examples of histopathology slides.

Hematoxylin and eosin-stained images (10x magnification) in 4 different patients corresponding to those shown in Fig 1 above: (A) steatosis grade S0, (B) steatosis grade S1, (C) steatosis grade S2, and (D) steatosis grade S3. Representative vacuoles of macrovesicular steatosis are indicated by arrows.

https://doi.org/10.1371/journal.pone.0262291.g004

Machine learning model and features selection

In Table 2, combination of QUS features and/or elasticity providing the highest AUC-ROC is reported for each classification task. AUCs obtained with elasticity alone were improved by combining QUS and elastography features, for each classification task, except for fibrosis stage ≤ 1 vs. ≥ 2. Improvements were most substantial for steatosis grade (25%-50% in AUC) and for classification of inflammation 0 vs. ≥ 1 (34% in AUC). For the six other dichotomous tasks, improvement was less than 10% in AUC, despite the fact that for five of them, QUS features were combined with shear-wave elasticity.

thumbnail
Table 2. Accuracy of shear wave elasticity alone and in combination with quantitative ultrasound (QUS) features for classification of steatosis, inflammation, and fibrosis.

https://doi.org/10.1371/journal.pone.0262291.t002

Classification tasks performance

Table 2 also provides AUC-ROC and its 95% confidence interval, for each classification task obtained with shear wave elasticity only, or with best combination of QUS and elastography features. For classification of steatosis grades S0 vs. S1-3, S0-1 vs. S2-3, S0-2 vs. S3, AUCs were 0.60, 0.63, 0.62, respectively, with elasticity alone, and were 0.90, 0.81, and 0.78, respectively, with the best tested combination of features. For classification of inflammation grades A0 vs. A1-3, A0-1 vs. A2-3, A0-2 vs. A3, AUCs were respectively 0.56, 0.62, and 0.64 with elasticity alone, and raised up to 0.75, 0.68, and 0.69, respectively, with the best selected combination of features. For classification of liver fibrosis stages F0 vs. F1-4, F0-1 vs. F2-4, F0-2 vs. F3-4, F0-3 vs. F4, elasticity alone yielded AUCs of 0.66, 0.77, 0.72, and 0.74, respectively, whereas best tested combinations of features gave AUCs of 0.72, 0.77, 0.77, and 0.75.

From Table 2, one observes that local ACS is the only common parameter in the best combinations of parameters for the three dichotomous classifications tasks of steatosis grade. Moreover, the three dichotomous classification tasks of the inflammation grade have no common parameters. For the four dichotomous classification tasks of the fibrosis stage, point shear wave elasticity is the only common parameter. However, except for the task F0 vs. F1-4, for which the best combination of parameters was pSWE, μn mean and 1/(k + 1) mean, there were three common parameters: pSWE, μn IQR and 1/(k + 1) IQR.

The importance values, viewed as mean decrease in accuracy, of each of the parameters within the combinations of parameters appearing in Table 2 are reported in S5 Table, based on the R package “randomForest”.

ROC curves for combination of elastography and QUS features with highest ROC-AUC for classification of steatosis, inflammation, and fibrosis are shown in Fig 5.

thumbnail
Fig 5. Receiver operating characteristic (ROC) curves.

ROC curves obtained with elastography (dashed lines) and the combinations of QUS and elasticity features (solid lines) with highest AUC-ROC for the classification of (A) steatosis grade S0 vs. S1-3, S0-1 vs. S2-3, S0-2 vs. S3, (B) inflammation grade A0 vs. A1-3, A0-1 vs. A2-3, A0-2 vs. A3, and (C) fibrosis stage F0 vs. F1-4, F0-1 vs. F2-4, F0-2 vs. F3-4, F0-3 vs. F4. Sensitivity and specificity at optimal Youden index are displayed for each ROC curve.

https://doi.org/10.1371/journal.pone.0262291.g005

For sake of comparison, we present in S5 Table in Supplemental Materials the ROC-ACUs obtained with the support vector machine (SVM) model on the same combinations of parameters that are appearing in Table 2. For this purpose, we used the R package “e1071” (version 1.7–9, 2021) [43]. The kernel for projection of data in higher dimensional space was the radial basis function (RBF) of degree 3 with parameter “γ” equal to the reciprocal of the number of parameters (default values in the package). The ROC curves were produced with class weights of the form (p/N(0), (1-p)/N(1)), where N(0) and N(1) represent the number of elements in the two groups of a given dichotomous classification task, letting p vary from 0 to 1. As the cost C related to the soft margin of SVM has an impact on accuracy, powers of 2 (with exponent varying from 0 to 10) were tested with the 0.632+ bootstrap method, and the best obtained values are reported in S6 Table. In cases where the best AUC was obtained with C = 210, we tested further powers of 2 until decrease in ACU. Finer tuning may have been desirable for some classification tasks, notably for inflammation grade, but we did not pursue this task. As the number of hyper-parameters to be tuned for SVM is larger than for RF, further tuning of the SVM model might require a grid search on a validation set in the case of a larger sample size.

Ordinal relation between steatosis, inflammation or fibrosis and QUS/pSWE features.

Ordinal logistic regressions revealed significant ordinal relations (adopting a confidence level of 0.05) between steatosis grade and mean value (within the segmented ROI) of the normalized mean intensity μn (p = 0.0039), mean value and IQR of the coherent-to-diffuse signal ratio k (p = 0.0029 and 0.029, respectively), and the local ACS (p < 0.001). There was also a significant relation between the fibrosis stage and shear wave elasticity (p = 0.0011). See Table 3 for standard deviations of regression coefficients.

thumbnail
Table 3. Coefficients (with 95% confidence intervals in parentheses) of ordinal logistic regressions with the steatosis grade, or the inflammation grade, or the fibrosis stage as responses, and with features appearing in Table 1, as single predictors.

https://doi.org/10.1371/journal.pone.0262291.t003

Categorical association between steatosis, inflammation, and fibrosis

There was an association between inflammation grade and fibrosis stage (p = 0.010) but not between steatosis and inflammation grades (p = 0.57), nor between steatosis grade and fibrosis stage (p = 0.97).

Discussion

This study revealed that a machine learning approach combining QUS features with elasticity provides a higher accuracy than elasticity alone for the classification of steatosis, inflammation, and fibrosis in patients with CLD. Ultrasound-based elastography techniques provide high accuracy for staging liver fibrosis [44, 45]. However, performance is limited in the case of steatosis and inflammation, which may coexist with fibrosis and confound liver stiffness [21, 46]. It has been reported that liver steatosis decreases liver stiffness in mixed steatohepatitis condition [6, 46, 47], and that inflammation, through a combination of hepatocellular ballooning, aggregation of macrophages, and edema, increases liver stiffness [6, 4851].

Steatosis grades were best classified by a combination of QUS parameters (including coherent-to-diffuse signal ratio, local attenuation coefficient slope, mean intensity normalized by its maximal value, and shear wave elasticity). Multi-parametric approaches provided good to excellent classification accuracy: 0.90 for steatosis grades S0 vs. 1–3, 0.81 for grades S0-1 vs. S2-3, and 0.78 for grades S0-2 vs. S3. Based on coefficients of proportional odds logistic regressions, the local ACS tended to increase with steatosis grade, as expected to occur with an increase in fat content, in agreement with Goshal et al. [12]. Related to the increase in local ACS, a trend of decrease in normalized intensity (μn) was observed. On the other hand, the backscatter coefficient, which is related to the intensity itself (μ), has been reported to be positively correlated with steatosis grade [52]. Moreover, as noted in Zhou et al. [22], an increase in fat content (i.e., fat-infiltrated hepatocytes with nuclei pushed to the cell periphery) is expected to yield an increase in the coherent component of the analytic ultrasound signal. The observed trend was indeed an increase in the average coherent-to-diffuse signal ratio with steatosis grade. There was also a corresponding trend for the IQR of this parameter. Features that did not perform well as stand-alone features could nonetheless improve classification performance when combined with other features, although this makes physical interpretation of their contribution more difficult to analyze. Our proposed method compares favorably with recent results reported by Moret et al. for two state-of-the-art techniques: hepatorenal index ratio with AUCs of 0.90 for S0 vs. S1-3, 0.78 for S0-1 vs. S2-3, and 0.73 for S0-2 vs. S3 and controlled attenuation parameter (CAP) with AUCs of 0.93 for S0 vs. S1-3, 0.76 for S0-1 vs. S2-3, and 0.70 for S0-2 vs. S3 [53].

Inflammation grades were best classified by QUS and elastography features (including total attenuation coefficient slope, coherent-to-diffuse signal ratio, point shear wave elasticity, and reciprocal of the scatterer clustering parameter). Multi-parametric approaches provided AUCs ranging from 0.68 to 0.75, significantly higher than elastography alone, which provided AUCs ranging from 0.56 to 0.64. There was no significant ordinal relation between inflammation grade and any features retained in the 3 binary inflammation classification tasks. However, pSWE was somewhat related to inflammation grade with a P value of 0.083. This trend of increase in stiffness with inflammation is consistent with reported results in an animal study [47].

Fibrosis staging with pSWE alone provided rather low performance for classification of fibrosis stages F0 vs. F1-4 and moderate performance ranging from 0.72 to 0.77 for fibrosis stages F0-1 vs. F2-4, F0-2 vs. F3-4, and F0-3 vs. 4. We believe that the coexistence of confounding factors in a heterogeneous disease population may explain the lower AUCs of pSWE in detecting advanced fibrosis, whereas many prior studies on elastography focused on populations with only one disease. Nonetheless, the combination with QUS parameters (coherent-to-diffuse signal ratio, diffuse-to-total signal power ratio, mean intensity normalized by its maximal value, and total attenuation) could further improve the classification accuracy, leading to AUCs in the range of 0.72 to 0.77 for all dichotomization schemes. Among features that were retained for the 4 binary fibrosis classification tasks, pSWE presented an ordinal relation with fibrosis stage. The observed trend of increase in stiffness with fibrosis stage is consistent with the multivariate analysis reported in Kazemirad et al. [47]. Recent works by Brattain et al, Durot et al., Kagadis et al., showed that fibrosis classification accuracy could be further improved, with AUCs in the range of 0.93 to 0.99, by using machine learning techniques such as automated image quality assessment and ROI selection [54], support vector machines [16] and deep learning techniques applied on shear wave elastography data [19].

These results confirm that using machine learning to select the best combination of QUS and elastography parameters may significantly improve classification of steatosis, inflammation, and fibrosis in patients with CLD in comparison with elastography alone. Such a comprehensive approach is critical to reduce or alleviate the need for liver biopsy because these histological components coexist and may have confounding effects on liver stiffness. Of note, our study found associations between inflammation grade and fibrosis stage. Therefore, a multi-parametric approach was required to account for the stiffness-increasing effect of inflammation and fibrosis.

Notice that for some of the classification tasks, distinct combinations of features yielded the same AUC, up to two decimals: steatosis grade S0 vs. S1-3, with QUS features combined with pSWE; steatosis grade S0-2 vs. S3, also with QUS features only; fibrosis stage F0-1 vs. ≥ F2-4, fibrosis stage F0-2 vs. F3-4 and fibrosis stage F0-3 vs. F4, also with QUS features combined with pSWE. In particular, in all cases where the best combination of features comprised pSWE, none of the tested combinations of QUS features only were as good.

Limitations

The sample size was limited to 82 patients with various causes of CLD. However, this pilot study constitutes a proof of concept indicating that incorporating QUS and elastography increases the classification study, a line of work that should be investigated in larger studies with a single etiology such as NAFLD. We did not assess iron overload, another histological feature encountered in some causes of CLD. While there is currently no ultrasound-based technique for assessment of iron overload, this remains a relatively uncommon clinical problem encountered in select populations with hemochromatosis and transfusional hemosiderosis that can be detected by blood tests and quantitated by magnetic resonance imaging [55]. Another limitation was the lack of assessment of biliary disease. Similarly, biliary disease can be suspected on the basis of cholestatic enzymes and assessed by magnetic resonance cholangiopancreatography when relevant. Ultrasound images were acquired with two clinical scanner models which could have introduced variability. Finally, our dataset was imbalanced with few positive cases for steatosis grades S0-2 vs. S3 (66 vs. 16) and inflammation grades A0-2 vs. A3 (74 vs. 8), and with few negative cases for inflammation grades A0 vs. A1-3 (8 vs. 74) and fibrosis stages F0 vs. F1-4 (12 vs. 70). However, we have used the G-mean as metric for features selection, which is recommended in the case of imbalanced datasets. Moreover, in that context, AUC under the Precision-Recall curve (PRC) has been suggested as alternative metric to AUC-ROC [56]. We have verified that AUC-PRCs were higher for combinations of features than with pSWE stand-alone, albeit further assessment in future studies would require larger sample size. Further assessment would be required on a larger data set in a target population of patients with NAFLD or NASH, in particular to assess more precisely the impact of each of the proposed QUS features in the considered dichotomous tasks. Future work could include clinical variables, such as body mass index, to take into account variability in patient’s health conditions, at the classifier’s training step. Furthermore, other machine learning models than random forests could be assessed on a larger database.

Conclusions

In conclusion, this ancillary study to a prospective imaging trial revealed that a machine learning model based on random forests could select combinations of QUS features and shear wave elastography stiffness that improved the classification accuracy of three key histological features of CLD (steatosis, inflammation, and fibrosis) over that of elastography alone. Future research should validate this approach in human cohorts with NAFLD, a highly prevalent cause of CLD.

Supporting information

S1 Table. Accuracy of pSWE and QUS features.

Accuracy of each of the eleven features alone and for the best combination of features for classification of steatosis, inflammation, and fibrosis. The best combination depends on the classification task; see Table 2.

https://doi.org/10.1371/journal.pone.0262291.s001

(DOCX)

S2 Table. ANOVA tests and post-hoc multiple comparisons when applicable for each of these features.

Anova (AOV) test if Shapiro-Wilk (S-W) test succeeded (p-value > 1) or else Kruskal-Wallis (K-S) tests with post-hoc multiple comparison tests with Bonferroni-Holm adjustment of p-values, based on t-test in the former case, or Wilcoxon rank sum test in the latter case.

https://doi.org/10.1371/journal.pone.0262291.s002

(DOCX)

S3 Table. Accuracy of all eleven features obtained with 0.638+ bootstrap method.

All features taken as input to random forests and of the best combination of features for classification of steatosis, inflammation, and fibrosis. The best combination of features depends on the classification task; see Table 2.

https://doi.org/10.1371/journal.pone.0262291.s003

(DOCX)

S4 Table. Dataset.

Dataset contains patient identification from 1 to 82 (ID), steatosis grade (Steatosis), inflammation grade (Inflammation), fibrosis stage (Fibrosis), point shear wave elasticity (pSWE), μn = mean intensity normalized by its maximal value (munMean), 1/α = reciprocal of the scatterer clustering parameter (ialphaMean), k = coherent-to-diffuse signal ratio (kMean), 1/(k + 1) = diffuse-to-total signal power ratio (ikappaMean), mean intensity normalized by its maximal value inter-quartile range (munIQR), reciprocal of the scatterer clustering parameter inter-quartile range (ialphaIQR), coherent-to-diffuse signal ratio inter-quartile range (kIQR), diffuse-to-total signal power ratio inter-quartile range (ikappaIQR),total attenuation coefficient slope (TotalACS), local attenuation coefficient slope (LocalACS).

https://doi.org/10.1371/journal.pone.0262291.s004

(XLSX)

S5 Table. Importance of each parameter within the multi-parameter combinations appearing in Table 2 for the various dichotomous classification tasks.

The reported importance is represented by the mean decrease in accuracy, where a higher value corresponds to a higher importance.

https://doi.org/10.1371/journal.pone.0262291.s005

(DOCX)

S6 Table. Accuracy of shear wave elasticity alone and in combination with quantitative ultrasound (QUS) features for classification of steatosis, inflammation, and fibrosis, based on support vector machine (SVM) models.

The same combinations of parameters as in Table 2 were tested with SVM. The parameter γ of radial basis functions (RBF) was set to the reciprocal of the number of parameters (default value with R package “e1071”); the degree of RBFs was the default value 3; the cost C was varied according to powers of 2 for each combination of parameters and the best resulting AUC-ROC is reported.

https://doi.org/10.1371/journal.pone.0262291.s006

(DOCX)

Acknowledgments

We thank Mrs. Assia Belblidia, Mrs. Catherine Huet, and Mr. Walid El Abyad for their assistance in patient enrollment. ACUSON S2000 and S3000 ultrasound systems were lent by Siemens Healthineers.

References

  1. 1. Kim WR, Brown RS Jr., Terrault NA, El-Serag H. Burden of liver disease in the United States: summary of a workshop. Hepatology (Baltimore, Md. 2002;36(1):227–42. Epub 2002/06/27. doi: S0270913902000265 [pii] pmid:12085369.
  2. 2. Younossi ZM, Koenig AB, Abdelatif D, Fazel Y, Henry L, Wymer M. Global epidemiology of nonalcoholic fatty liver disease-Meta-analytic assessment of prevalence, incidence, and outcomes. Hepatology (Baltimore, Md. 2016;64(1):73–84. Epub 2015/12/29. pmid:26707365.
  3. 3. Lima PH, Fan B, Berube J, Cerny M, Olivie D, Giard JM, et al. Cost-Utility Analysis of Imaging for Surveillance and Diagnosis of Hepatocellular Carcinoma. AJR Am J Roentgenol. 2019:1–9. Epub 2019/04/18. pmid:30995098.
  4. 4. Ratziu V, Charlotte F, Heurtier A, Gombert S, Giral P, Bruckert E, et al. Sampling variability of liver biopsy in nonalcoholic fatty liver disease. Gastroenterology. 2005;128(7):1898–906. pmid:15940625.
  5. 5. Petitclerc L, Sebastiani G, Gilbert G, Cloutier G, Tang A. Liver fibrosis: Review of current imaging and MRI quantification techniques. J Magn Reson Imaging. 2017;45(5):1276–95. pmid:27981751.
  6. 6. Lefebvre T, Wartelle-Bladou C, Wong P, Sebastiani G, Giard JM, Castel H, et al. Prospective comparison of transient, point shear wave, and magnetic resonance elastography for staging liver fibrosis. Eur Radiol. 2019;29(12):6477–88. Epub 2019/07/07. pmid:31278577.
  7. 7. Gatos I, Drazinos P, Yarmenitis S, Theotokas I, Zoumpoulis PS. Comparison of Sound Touch Elastography, Shear Wave Elastography and Vibration-Controlled Transient Elastography in Chronic Liver Disease Assessment using Liver Biopsy as the "Reference Standard". Ultrasound Med Biol. 2020;46(4):959–71. Epub 2020/01/28. pmid:31983484.
  8. 8. Mamou J, Oelze ML. Quantitative Ultrasound in Soft Tissues: Springer; 2013 Oct. 9 2013.
  9. 9. Myers RP, Pollett A, Kirsch R, Pomier-Layrargues G, Beaton M, Levstik M, et al. Controlled Attenuation Parameter (CAP): a noninvasive method for the detection of hepatic steatosis based on transient elastography. Liver Int. 2012;32(6):902–10. pmid:22435761.
  10. 10. Sasso M, Miette V, Sandrin L, Beaugrand M. The controlled attenuation parameter (CAP): a novel tool for the non-invasive evaluation of steatosis using Fibroscan. Clinics and research in hepatology and gastroenterology. 2012;36(1):13–20. pmid:21920839.
  11. 11. Yilmaz Y, Yesil A, Gerin F, Ergelen R, Akin H, Celikel CA, et al. Detection of hepatic steatosis using the controlled attenuation parameter: a comparative study with liver biopsy. Scand J Gastroenterol. 2014;49(5):611–6. pmid:24611771.
  12. 12. Ghoshal G, Lavarello RJ, Kemmerer JP, Miller RJ, Oelze ML. Ex vivo study of quantitative ultrasound parameters in fatty rabbit livers. Ultrasound Med Biol. 2012;38(12):2238–48. pmid:23062376; PubMed Central PMCID: PMC3640816.
  13. 13. Stoean R, Stoean C, Lupsor M, Stefanescu H, Badea R. Evolutionary-driven support vector machines for determining the degree of liver fibrosis in chronic hepatitis C. Artif Intell Med. 2011;51(1):53–65. Epub 2010/08/03. pmid:20675106.
  14. 14. Gatos I, Tsantis S, Spiliopoulos S, Karnabatidis D, Theotokas I, Zoumpoulis P, et al. A new computer aided diagnosis system for evaluation of chronic liver disease with ultrasound shear wave elastography imaging. Med Phys. 2016;43(3):1428–36. Epub 2016/03/05. pmid:26936727.
  15. 15. Gatos I, Tsantis S, Spiliopoulos S, Karnabatidis D, Theotokas I, Zoumpoulis P, et al. A Machine-Learning Algorithm Toward Color Analysis for Chronic Liver Disease Classification, Employing Ultrasound Shear Wave Elastography. Ultrasound Med Biol. 2017;43(9):1797–810. Epub 2017/06/22. pmid:28634041.
  16. 16. Durot I, Akhbardeh A, Sagreiya H, Loening AM, Rubin DL. A New Multimodel Machine Learning Framework to Improve Hepatic Fibrosis Grading Using Ultrasound Elastography Systems from Different Vendors. Ultrasound Med Biol. 2020;46(1):26–33. Epub 2019/10/16. pmid:31611074; PubMed Central PMCID: PMC6879839.
  17. 17. Wang K, Lu X, Zhou H, Gao Y, Zheng J, Tong M, et al. Deep learning Radiomics of shear wave elastography significantly improved diagnostic performance for assessing liver fibrosis in chronic hepatitis B: a prospective multicentre study. Gut. 2019;68(4):729–41. Epub 2018/05/08. pmid:29730602; PubMed Central PMCID: PMC6580779.
  18. 18. Gatos I, Tsantis S, Spiliopoulos S, Karnabatidis D, Theotokas I, Zoumpoulis P, et al. Temporal stability assessment in shear wave elasticity images validated by deep learning neural network for chronic liver disease fibrosis stage assessment. Med Phys. 2019;46(5):2298–309. Epub 2019/04/01. pmid:30929260.
  19. 19. Kagadis GC, Drazinos P, Gatos I, Tsantis S, Papadimitroulas P, Spiliopoulos S, et al. Deep learning networks on chronic liver disease assessment with fine-tuning of shear wave elastography image sequences. Phys Med Biol. 2020;65(21):215027. Epub 2020/10/02. pmid:32998480.
  20. 20. Fang J, Zhou Z, Chang NF, Wan YL, Tsui PH. Ultrasound parametric imaging of hepatic steatosis using the homodyned-K distribution: An animal study. Ultrasonics. 2018;87:91–102. Epub 2018/02/25. pmid:29476945.
  21. 21. Tang A, Destrempes F, Kazemirad S, Garcia-Duitama J, Nguyen BN, Cloutier G. Quantitative ultrasound and machine learning for assessment of steatohepatitis in a rat model. Eur Radiol. 2019;29(5):2175–84. Epub 2018/12/19. pmid:30560362.
  22. 22. Zhou Z, Fang J, Cristea A, Lin YH, Tsai YW, Wan YL, et al. Value of homodyned K distribution in ultrasound parametric imaging of hepatic steatosis: An animal study. Ultrasonics. 2019;101:106001. Epub 2019/09/11. pmid:31505328.
  23. 23. Dietrich CF, Bamber J, Berzigotti A, Bota S, Cantisani V, Castera L, et al. EFSUMB Guidelines and Recommendations on the Clinical Use of Liver Ultrasound Elastography, Update 2017 (Long Version). Ultraschall Med. 2017;38(4):e16–e47. Epub 2017/04/14. pmid:28407655.
  24. 24. Destrempes F, Emilie F, Yu F, Cloutier G. Unifying concepts of statistical and spectralquantitative ultrasound techniques. IEEE Trans Med Imaging. 2016;35:488–500. Epub 2015/09/29. pmid:26415165.
  25. 25. Destrempes F, Poree J, Cloutier G. Estimation Method of the Homodyned K-Distribution Based on the Mean Intensity and Two Log-Moments. SIAM journal on imaging sciences. 2013;6(3):1499–530. Epub 2014/05/06. pmid:24795788; PubMed Central PMCID: PMC4008482.
  26. 26. Trop I, Destrempes F, El Khoury M, Robidoux A, Gaboury L, Allard L, et al. The Added Value of Statistical Modeling of Backscatter Properties in the Management of Breast Lesions at US. Radiology. 2015;275(3):666–74. pmid:25496215.
  27. 27. Dutt V, Greenleaf JF. Ultrasound echo envelope analysis using a homodyned K distribution signal model. Ultrason Imaging. 1994;16(4):265–87. Epub 1994/10/01. pmid:7785128
  28. 28. Wilcox RR. Introduction to Robust Estimation and Hypothesis Testing. Oxford, UK: Academic Press; 2011.
  29. 29. Sheet D, Karamalis A, Eslami A, Noel P, Chatterjee J, Ray AK, et al. Joint learning of ultrasonic backscattering statistical physics and signal confidence primal for characterizing atherosclerotic plaques using intravascular ultrasound. Med Image Anal. 2014;18(1):103–17. Epub 2013/11/05. pmid:24184434.
  30. 30. Bigelow TA, Labyed Y. Attenuation compensation and estimation. In: J. M, Oelze ML, editors. Quantitative ultrasound in soft tissues. Springer Dordrecht Heildelberg New York London 2013. p. 75–6.
  31. 31. Breiman L. Random forests. Machine learning. 2001;45(1):5–32.
  32. 32. He H, Garcia EA. Learning from imbalanced data. IEEE Transactions on knowledge and data engineering. 2009;21(9):1263–84.
  33. 33. Breiman L. Manual on setting up, using, and understanding random forests v3. 1. Statistics Department University of California Berkeley, CA, USA. 2002;1:58.
  34. 34. Cutler A, Cutler DR, Stevens JR. Random forests. Springer, editor. Boston, MA2012. 157–75 p.
  35. 35. Woods K, Bowyer KW. Generating ROC curves for artificial neural networks. IEEE Trans Med Imaging. 1997;16(3):329–37. Epub 1997/06/01. pmid:9184895.
  36. 36. Hastie T, Tibshirani R, Friedman J. The elements of statistical learning: data mining, inference and prediction. Springer-Velag New York Berlin Heldelberg. 2001;27(2):83–5.
  37. 37. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44(3):837–45. pmid:3203132
  38. 38. Venables WN, Ripley BD. Modern applied statistics with S-PLUS: Springer Science & Business Media 2013.
  39. 39. https://cran.r-project.org/web/packages/MASS/MASS.pdf [last accessed 14/12/2020].
  40. 40. Holm S. A simple sequentially rejective multiple test procedure. Scandinavian journal of statistics. 1979:65–70.
  41. 41. Breiman L, Cutler A. Package ‘randomForest”. Version 4.6–12. October 7, 2015. Available from: https://www.stat.berkeley.edu/~breiman/RandomForests/.
  42. 42. Venables W, Ripley B. Modern applied statistics (Fourth S., editor) New York. ISBN 0-387-95457-0.: Springer; 2002.
  43. 43. Meyer D, Dimitriadou E, Hornik K, Weingessel A, Leisch F, Chang CC, et al. Package ‘e1071’. Version 1.7–9. September 16, 2021.
  44. 44. Tang A, Cloutier G, Szeverenyi NM, Sirlin CB. Ultrasound Elastography and MR Elastography for Assessing Liver Fibrosis: Part 1, Principles and Techniques. AJR Am J Roentgenol. 2015;205(1):22–32. pmid:25905647.
  45. 45. Tang A, Cloutier G, Szeverenyi NM, Sirlin CB. Ultrasound Elastography and MR Elastography for Assessing Liver Fibrosis: Part 2, Diagnostic Performance, Confounders, and Future Directions. AJR Am J Roentgenol. 2015;205(1):33–40. pmid:25905762.
  46. 46. Yoneda M, Suzuki K, Kato S, Fujita K, Nozaki Y, Hosono K, et al. Nonalcoholic fatty liver disease: US-based acoustic radiation force impulse elastography. Radiology. 2010;256(2):640–7. Epub 2010/06/10. doi: radiol.10091662 [pii] pmid:20529989.
  47. 47. Kazemirad S, Zhang E, Nguyen BN, Bodson-Clermont P, Destrempes F, Trudel D, et al. Detection of Steatohepatitis in a Rat Model by Using Spectroscopic Shear-Wave US Elastography. Radiology. 2017;282(3):726–33. Epub 2016/08/12. pmid:27513850.
  48. 48. Iredale JP. Models of liver fibrosis: exploring the dynamic nature of inflammation and repair in a solid organ. Journal of Clinical Investigation. 2007;117(3):539–48. PubMed PMID: PMC1804370. pmid:17332881
  49. 49. Zeng X, Xu C, He D, Zhang H, Xia J, Shi D, et al. Influence of Hepatic Inflammation on FibroScan Findings in Diagnosing Fibrosis in Patients with Chronic Hepatitis B. Ultrasound in medicine & biology. 2015;41(6):1538–44. Epub 2015/03/01. pmid:25724309.
  50. 50. Dong D-R, Hao M-N, Li C, Peng ZE, Liu XIA, Wang G-P, et al. Acoustic radiation force impulse elastography, FibroScan(®), Forns’ index and their combination in the assessment of liver fibrosis in patients with chronic hepatitis B, and the impact of inflammatory activity and steatosis on these diagnostic methods. Molecular Medicine Reports. 2015;11(6):4174–82. PubMed PMID: PMC4394958. pmid:25651500
  51. 51. Chen J, Talwalkar JA, Yin M, Glaser KJ, Sanderson SO, Ehman RL. Early detection of nonalcoholic steatohepatitis in patients with nonalcoholic fatty liver disease by using MR elastography. Radiology. 2011;259(3):749–56. PubMed PMID: PMC3099044. pmid:21460032
  52. 52. Han A, Zhang YN, Boehringer AS, Montes V, Andre MP, Erdman JW Jr., et al. Assessment of Hepatic Steatosis in Nonalcoholic Fatty Liver Disease by Using Quantitative US. Radiology. 2020;295(1):106–13. Epub 2020/02/06. pmid:32013792; PubMed Central PMCID: PMC7104700.
  53. 53. Moret A, Boursier J, Houssel Debry P, Riou J, Crouan A, Dubois M, et al. Evaluation of the Hepatorenal B-Mode Ratio and the "Controlled Attenuation Parameter" for the Detection and Grading of Steatosis. Ultraschall Med. 2020. Epub 2020/09/30. pmid:32992377.
  54. 54. Brattain LJ, Ozturk A, Telfer BA, Dhyani M, Grajo JR, Samir AE. Image Processing Pipeline for Liver Fibrosis Classification Using Ultrasound Shear Wave Elastography. Ultrasound Med Biol. 2020;46(10):2667–76. Epub 2020/07/06. pmid:32622685; PubMed Central PMCID: PMC7483774.
  55. 55. Labranche R, Gilbert G, Cerny M, Vu K-N, Soulières D, Olivié D, et al. Liver Iron Quantification with MR Imaging: A Primer for Radiologists. RadioGraphics. 2018;38(2):392–412. Epub 2018/03/13. pmid:29528818.
  56. 56. Davis J, Goadrich M, editors. The relationship between Precision-Recall and ROC curves. Proceedings of the 23rd international conference on Machine learning; 2006: ACM.