Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Non-invasive assessment of NAFLD as systemic disease—A machine learning perspective

  • Ali Canbay ,

    Contributed equally to this work with: Ali Canbay, Julia Kälsch

    Roles Conceptualization, Project administration, Supervision, Writing – review & editing

    Affiliation Department of Gastroenterology, Hepatology, and Infectious Diseases, Otto-von-Guericke University Magdeburg, Magdeburg, Germany

  • Julia Kälsch ,

    Contributed equally to this work with: Ali Canbay, Julia Kälsch

    Roles Data curation, Formal analysis, Funding acquisition, Writing – review & editing

    Affiliations Department of Gastroenterology and Hepatology, University Hospital, University Duisburg-Essen, Essen, Germany, Institute for Pathology, University Hospital, University Duisburg-Essen, Essen, Germany

  • Ursula Neumann,

    Roles Formal analysis, Software

    Affiliation Department of Mathematics and Computer Science, University of Marburg, Marburg, Germany

  • Monika Rau,

    Roles Data curation, Writing – review & editing

    Affiliation Division of Hepatology, Department of Internal Medicine II, University Hospital Würzburg, Würzburg, Germany

  • Simon Hohenester,

    Roles Data curation, Funding acquisition, Investigation, Resources, Writing – review & editing

    Affiliation Department of Medicine II, University Hospital, LMU Munich, Munich, Germany

  • Hideo A. Baba,

    Roles Data curation, Investigation, Writing – review & editing

    Affiliation Institute for Pathology, University Hospital, University Duisburg-Essen, Essen, Germany

  • Christian Rust,

    Roles Data curation, Funding acquisition, Writing – review & editing

    Affiliation Center for Nutritional Medicine and Prevention, Department of Medicine I, Hospital Barmherzige Brüder, Munich, Germany

  • Andreas Geier,

    Roles Data curation, Investigation, Validation, Writing – review & editing

    Affiliation Division of Hepatology, Department of Internal Medicine II, University Hospital Würzburg, Würzburg, Germany

  • Dominik Heider,

    Roles Conceptualization, Formal analysis, Software, Supervision, Writing – review & editing

    Affiliation Department of Mathematics and Computer Science, University of Marburg, Marburg, Germany

  • Jan-Peter Sowa

    Roles Formal analysis, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliations Department of Gastroenterology, Hepatology, and Infectious Diseases, Otto-von-Guericke University Magdeburg, Magdeburg, Germany, Department of Gastroenterology and Hepatology, University Hospital, University Duisburg-Essen, Essen, Germany

Non-invasive assessment of NAFLD as systemic disease—A machine learning perspective

  • Ali Canbay, 
  • Julia Kälsch, 
  • Ursula Neumann, 
  • Monika Rau, 
  • Simon Hohenester, 
  • Hideo A. Baba, 
  • Christian Rust, 
  • Andreas Geier, 
  • Dominik Heider, 
  • Jan-Peter Sowa


Background & aims

Current non-invasive scores for the assessment of severity of non-alcoholic fatty liver disease (NAFLD) and identification of patients with non-alcoholic steatohepatitis (NASH) have insufficient performance to be included in clinical routine. In the current study, we developed a novel machine learning approach to overcome the caveats of existing approaches.


Non-invasive parameters were selected by an ensemble feature selection (EFS) from a retrospectively collected training cohort of 164 obese individuals (age: 43.5±10.3y; BMI: 54.1±10.1kg/m2) to develop a model able to predict the histological assessed NAFLD activity score (NAS). The model was evaluated in an independent validation cohort (122 patients, age: 45.2±11.75y, BMI: 50.8±8.61kg/m2).


EFS identified age, γGT, HbA1c, adiponectin, and M30 as being highly associated with NAFLD. The model reached a Spearman correlation coefficient with the NAS of 0.46 in the training cohort and was able to differentiate between NAFL (NAS≤4) and NASH (NAS>4) with an AUC of 0.73. In the independent validation cohort, an AUC of 0.7 was achieved for this separation. We further analyzed the potential of the new model for disease monitoring in an obese cohort of 38 patients under lifestyle intervention for one year. While all patients lost weight under intervention, increasing scores were observed in 15 patients. Increasing scores were associated with significantly lower absolute weight loss, lower reduction of waist circumference and basal metabolic rate.


A newly developed model ( can predict presence or absence of NASH with reasonable performance. The new score could be used to detect NASH and monitor disease progression or therapy response to weight loss interventions.


Non-alcoholic fatty liver disease (NAFLD) is a growing public health problem worldwide [1]. NAFLD represents the liver manifestation of the metabolic syndrome and the underlying origin seems to be excess lipid accumulation. Development and mechanisms are complex and not completely clear [24]. By now, NAFLD is a major cause of liver-related morbidity and mortality [1], without apparent serological signs of injury [5]. NAFLD represents a serious health concern and will amount to significant burden on health care systems [1,6]. One problem among many regarding NAFLD is stratification of patients by risk of progression, as substantial morbidity and mortality arise from cardiovascular and other causes [7,3,8]. Another problem is that serum parameters used to assess severity of other chronic liver diseases remain well within normal ranges for most NAFLD patients [9,10].

The diagnostic gold standard for NAFLD assessment remains liver biopsy, a potentially painful and harmful procedure for the patient. In a liver biopsy 5% lipid content can be detected and, more importantly, inflammation, hepatocellular ballooning, and fibrotic alterations can be assessed [11]. These are required to diagnose non-alcoholic steatohepatitis (NASH). While accumulation of lipids in the liver can be assessed by imaging techniques [12] non-invasive tests cannot accurately detect inflammation and hepatocellular ballooning. Due to the invasive nature and the required effort to take a liver biopsy, it is not suited to monitor disease progression or regression. Thus, development of non-invasive scoring systems to predict severity of NAFLD, either based on (advanced) fibrosis or NASH, is a vital field of clinical research [1320]. Despite several available scores and biomarkers, no ideal set of markers has been agreed upon. Many scores fail validation in independent cohorts, require a relatively large number of markers, too complicated to handle in clinical routine, or are nontransparent, making validation in independent cohorts impossible [19,21,22]. Objective biomarker panels for assessment and monitoring of NAFLD or NASH are still urgently needed to improve clinical routine and research [23].

In the current study, we initially aimed at comparing currently available scores [1316,20] for the non-invasive detection of NASH in morbidly obese patients. Histological separation of NAFL and NASH was based on the non-alcoholic steatohepatitis activity score (NAS) [24], as this is still the more widely applied and known system. As most tested scores were unable to predict NASH or fibrosis with reasonable accuracy, we aimed for a new non-invasive score for separation of NAFL from NASH. An ensemble feature selection approach (EFS) [25] identified biomarkers associated with NAS from routine markers, adiponectin and the apoptosis marker M30 [26,27]. A logistic regression model based on these biomarkers was built and validated in an independent study cohort.

Patients and methods

Ethics statement

The study protocol conformed to the ethical guidelines of 1975 Declaration of Helsinki and was approved by the Institutional Review Board (IRB; Ethik-Kommission der Medizinischen Fakultät der Universität Duisburg-Essen; Germany; 15-6356-BO). Due to the retrospective nature of the training study the IRB waived the need for written informed consent. All procedures adhered to the Declaration of Helsinki and the requirements of the IRB.

The protocol of the prospectively recruited validation study cohort was approved by the local Ethics Committee and the Review Board of the University Hospital Würzburg (Ethik-Kommission der Medizinischen Fakultät der Universität Würzburg; AZ96/12). Written informed consent was obtained from all participants.

All individuals participating in the (non-invasive) lifestyle intervention study were prospectively collected and gave written informed consent to the study protocol.

All authors had access to the study data and reviewed and approved the final manuscript.

Study design and sample acquisition

The training cohort (University Hospital Essen) consisted of 164 obese patients with histologically confirmed NAFLD (NAS 1–8; Table 1), who underwent bariatric surgery. Patients were eligible for the retrospective study, when liver histology was available and a NAS of at least 1 was diagnosed by a pathologist. Patients received dietary and exercise counselling for 6 months prior surgery, no calorie restriction was imposed. A blood sample was collected for assessment of serum derived factors on the day of surgery (prior surgery) and liver tissue was sampled during bariatric surgery. Detailed demographic and clinical information of this cohort is given in Table 1. All data shown were recorded on the day of surgery.

Table 1. Basic clinical and demographic data of the training and evaluation cohorts.

The validation cohort was recruited prospectively at the University Hospital Würzburg (Division of Hepatology), including 122 patients. The majority of these patients underwent bariatric surgery (n = 105). Detailed demographic and clinical information of this cohort is given in Table 1. Patients were eligible when all parameters used in the newly generated score were complete and histological assessment of the NAS was available.

Pathologists assessed the NAS prior development of the new score and were thus blinded to the results of the new score. While during training the NAS as reference was required to be unblinded, calculation of the new score in the validation cohort was performed blinded to the NAS.

To assess a possible use of the new score for monitoring of treatment, data from a cohort recruited for lifestyle intervention at the University of Munich (Department of Medicine II) were applied [28]. The whole study cohort consisted of 152 patients. For the present analysis 38 patients with complete datasets to generate the new score at start and end of the study were selected.

Dataset and statistics

The training dataset included the socio-demographic parameters sex, age, height, weight, and BMI, as well as the serum parameters ALT, AST, AST/ALT ratio, GGT, albumin, triacylglycerols, total cholesterol, fasting blood sugar, HbA1c, thrombocyte count, total (M65) and caspase-cleaved serum CK-18 (M30), and adiponectin. In liver tissue steatosis, ballooning, lobular inflammation, and fibrosis were assessed by two independent pathological reviewers (JK, HAB). From these available parameters, the presence of NAFL or NASH was determined and the NAS, the APRI score ( [20], the BARD score ( [16], the NAFLD fibrosis score ( [14], the Palekar Score [13], and the Gholam score [15] were calculated. Calculation of the Sumida Score [18], SteatoTest, and ActiTest / NASHTest [19] could not be performed based on the available data.

The validation dataset included age, HbA1c, GGT, M30, adiponectin, sex, BMI, and NAS. Statistical data analyses were performed with R ( and Prism Version 5 (Graphpad Inc., La Jolla, CA, USA). All data are presented as mean ± standard error of the mean (SEM) unless specified otherwise. Correlation analysis was performed using Spearman’s rank correlation coefficient.

Importance analysis and predictive modeling

Importance analysis of the biomarkers were carried out with EFS with default settings [25] using the web-interface ( EFS aggregates eight different feature selection methods and provides a quantitative ranking of the features [29]. EFS has been shown to compensate biases of single feature selection methods, overcoming instability and unreliability of biomarker discovery approaches.

The biomarkers with the highest ranks identified by EFS, were used for subsequent model development by logistic regression. The stats package of R with standard settings was used. For evaluation of the model performance a 10-fold cross-validation scheme was used and the Receiver Operation Characteristics (ROC) curve and the corresponding Area under the Curve (AUC) were calculated with pROC [30]. The 95% CI was computed with 2000 stratified bootstrap replicates.


Proposed non-invasive scores do not correlate with histological assessment of NAFLD severity

To assess performance of existing non-invasive scores, we calculated correlations between APRI, BARD, Gholam’s score, NAFLD fibrosis score (NFS), and Palekar’s score and NAS, and fibrosis, respectively. Of note, APRI and NFS were developed to assess severity of fibrosis/cirrhosis, but have been tested for assessment of NASH by other groups previously. Only the APRI (r = 0.3269, p < 0.001) and Gholam’s scores (r = 0.3670, p < 0.0001) were significantly, positively correlated with NAS (S1A and S1B Fig), while other scores were not (S1C–S1E Fig). ROC curves were built for each score to differentiate between NAFL (NAS ≤ 4) or NASH (> 4) as additional measure of performance. APRI (AUC: 0.6523, p = 0.001) and Gholam (AUC: 0.7001, p < 0.0001; S2A and S2B Fig) were able to separate between NAFL and NASH. No significant correlations were found for the scores with the grade of fibrosis (S3 Fig). Demographic and clinical data of this cohort are given in Table 1.

A new score with unbiased parameter selection reaches high accuracy to predict NASH in the training cohort

To improve non-invasive classification of NAFL versus NASH, all available markers were evaluated by EFS. The markers age, HbA1c, GGT, adiponectin, and M30, were used as input for a logistic regression model. Logistic regression models are easy to interpret, have been widely applied and have very high acceptance among physicians in clinical practice [31]. The model was evaluated using a 10-fold cross-validation. The model is publicly available at

The predicted scores are strongly correlated with NAS (r = 0.4473; p < 0.0001; Fig 1A) and the model reached an AUC of 0.7339 (95% CI: 0.6567–8111; p < 0.0001; Fig 1B). The AUC of the model was significantly higher compared to all other tested scores. While no correlation was found for the new score and fibrosis stage, the AUC of a ROC curve to separate moderate from advanced fibrosis reached 0.8117 (95% CI: 0.6964 to 0.9271; p = 0.03; advanced fibrosis n = 4).

Fig 1. Performance of the new score within the training cohort.

Based on available non-invasive parameters in an obese cohort of 164 patients a new score was generated. This score correlated strongly with the NAS (A), assessed by two independent pathologists, in this cohort. For separation between NAFL (NAS ≤ 4) and NASH (NAS > 4) an AUC of 0.73 was achieved (B).

The new score achieves high accuracy in an independent validation cohort

In the validation cohort (n = 122, see Table 1) the new model showed performance similar to the training set. The score is significantly correlated with NAS (r = 0.2792; p = 0.002; Fig 2A) and the model reached an AUC of 0.7028 (p = 0.0007; Fig 2B). In addition, the model was applied to separate NAFL and NASH according to the classification by SAF, achieving slightly lower AUC in the training cohort (S4 Fig). The new model can predict severity of NAFLD with reasonable performance.

Fig 2. The new score achieves reasonable performance in a validation cohort.

The newly generated score was evaluated in an independent cohort from the University Hospital Würzburg (A, B) with 122 patients. For the validation cohort the new score achieved a Spearman correlation coefficient with the NAS of 0.28 (A) and was able to differentiate between NAFL (NAS ≤ 4) and NASH (NAS > 4) with an AUC of 0.7 (B).

The new score can be applied to monitor therapy response in patients with metabolic alterations

Due to the fact that the new model was able to separate NAFL and NASH in an independent cohort with good accuracy, we explored, whether it would be applicable for monitoring response to weight loss therapy. To this end, a cohort of patients participating in a lifestyle intervention program to treat morbid obesity for one year, based on the current guideline of the German Association of Obesity [32], was analyzed (Table 2) [28]. No liver biopsies were available in this cohort. Complete datasets for calculating the new score at start and end of the study were available for 38 patients. The treatment regime led to weight loss in all patients and an overall improvement of metabolic parameters and health indices in the majority of patients. In parallel, a drop of the new score was observed in 60% of the patients (26 with reduced score, 17 with increased score). Mean reduction of the new score was 1.53 points. This would correspond to a reduction in NAS by 1 point for seven patients, including one case of hypothetical resolution from NASH (NAS = 5) to NAFL (NAS = 4).

Table 2. Basic clinical and demographic data of a subgroup of the lifestyle intervention study cohort.

The new score indicates efficacy of weight loss therapy to restore metabolic health.

All treated patients exhibited weight loss and at least mild improvements of the clinical situation. Since the new score was reduced in only 60% of the patients, correlation analyses were performed to identify markers which might explain this discrepancy. Significant correlations of the new score at start (T0) and end (T52) of the study are given in S1 and S2 Tables. Interestingly, the new score is correlated to almost all metabolic and liver related health indicators, including body composition and basal metabolic rate at both time points. We then grouped patients by either a decrease of—3.8 [(-10.32)—(-0.05)] points or increase of 2.5 (0.13–9.7) points of the score during the treatment duration. Basic parameters of these subgroups did not differ at T0 (Table 3) but suggested a slightly better liver related health condition for individuals with an increase of the new score. At T0 the new score was significantly higher in patients with stronger reduction of the new score. Patients with decreased score exhibited significantly higher absolute weight loss (Fig 3A), and stronger reduction of waist circumference (Fig 3B) at the end of the study duration. While reduction of corrected body fat proportion (NutriPlus Version 5.4.1, Data input, Poecking, Germany) was not significantly different (Fig 3C) the change of basal metabolic rate was higher in individuals with reduced score (Fig 3D). In addition, serum albumin and the gastrointestinal hormone ghrelin were significantly higher in patients with reduced score at both time points (T0 and T52; Fig 4A and 4B). Conversely, serum palmitic acid (C16:0; Fig 4C) was significantly lower in patients with reduced score at T0 and T52. Fatty liver index or NAFLD fibrosis score did not differ between patients with decrease or increase of the new score (not shown). When applied as monitoring tool the new score can indicate efficacy of weight loss treatment for restoration of metabolic health.

Fig 3. Differences in response to weight loss therapy indicated by the new score.

In a subgroup of 38 patients receiving weight loss therapy for 1 year (Tables 2 and 3) only 23 patients (60%) exhibited a reduction of the newly generated score, while weight and most other metabolic health indicators were improved. Individuals with an increase of the score (15 patients, 40%) during study duration had significantly lower absolute weight loss (A), lower reduction of waist circumference (B), nominally lower reduction in body fat content (C), and a significantly less pronounced impairment of basal metabolic rate (D).

Fig 4. The new score indicates liver and adipose tissue function as possible determinants for weight loss efficiency.

38 patients receiving weight loss therapy for 1 year (Tables 2 and 3) were grouped by either reduction (23 patients) or increase (15 patients) of the new score. In the group with decreased score values at end of the study duration we found significantly higher serum albumin (A) and ghrelin (B) concentrations at start and end of the study. Conversely, concentration of the unsaturated fatty acid palmitic acid (C16:0) was significantly lower at both time points in patients with decrease of the score during weight loss therapy.

Table 3. Basic clinical and demographic data of a subgroup of the lifestyle intervention study cohort separated by CHeK alterations during study course (data from T0).


The non-invasive assessment and monitoring of NAFLD is a complex problem and has been reviewed outstandingly [23,33,34]. In brief, histological evaluation still remains the gold standard, but is limited by sampling error and inter- as well as intra-observer variance in assessment [35]. Moreover, liver biopsy procedures needed for histological assessment are time-consuming, require trained experts, and pose a risk for the patient [36,37]. Magnetic resonance (MR) imaging or MR elastography might be more exact and less dangerous compared to biopsy, but are also costly and time-consuming, making a screening or monitoring inefficient [38]. Other non-invasive methods, as ultrasound or transient elastography lack sensitivity and accuracy. No known single non-invasive parameter achieves sufficient sensitivity and specificity to discern NAFL from NASH, let alone reflect nuanced NAS-based grading [39]. The current consensus seems that more exact and easy accessible biomarker panels are warranted for research and disease monitoring [23,34].

Due to the described diagnostic dilemma many scores and methods for non-invasive assessment of NAFLD have been proposed in various settings. In the present study, we evaluated previously published scoring systems in a specific cohort, to identify the ideal one for our purposes. Unfortunately, few of the scores correlate with NAS or fibrosis score in our cohort. The best performing Gholam score includes presence of diabetes as a dichotomous variable, prohibiting disease or therapy monitoring, in particular for short time frames. Separation of NAFL versus NASH is only possible with the APRI score, initially developed to predict advanced fibrosis in NASH, with a suboptimal AUC of 0.65. Another disadvantage of many scores is a dichotomous separation of disease entities: non-NASH vs. NASH, no/mild fibrosis vs. advanced fibrosis. This is sufficient in clinical settings to decide further diagnostics or therapy for an individual patient. However, it does not reflect the underlying biology, where a more nuanced spectrum exists. Non-invasive scores to predict severity of NAFLD or fibrosis have limited performance in independent settings, as shown in the current study.

The unsatisfactory performance of existing scores in our setting, prompted us to find an optimized score from biomarkers available in the present cohort in an unsupervised manner. An ensemble biomarker discovery approach, which included several machine learning algorithms, identified age, γGT, HbA1c, M30, and adiponectin as ideal available parameters. The logistic regression model generated from these parameters has a strong correlation with NAS in the training and in the independent validation cohort. In addition, it demonstrated reasonable performance to discriminate between NAFL and NASH in both cohorts. Notably individual components of the new score are not limited to classic liver related markers. The selected markers reflect an overall metabolic health state:

  • age as surrogate for the duration of metabolic derangements or obesity;
  • γGT as classic liver injury marker;
  • HbA1c as surrogate marker for impaired glucose metabolism;
  • M30 as marker for epithelial / hepatocyte apoptosis;
  • Adiponectin as marker for adipose tissue function.

We believe that this combination of factors from adipose tissue, glucose metabolism, and liver health to detect NASH represent processes influencing severity of NAFLD by altered hepatic metabolism in obesity. Of note, these markers have been selected by an unbiased biomarker discovery method and seem to reflect an organism wide metabolic situation, as shown in the cohort under weight loss therapy. One advantage of the new score is a continuous distribution of values. We currently refrain from giving a fixed cut off to exclude or definitely confirm NASH, as these may depend on the tested cohort. However, the score reflects a biological spectrum from low to high liver injury due to NAFLD.

In a cohort with treatment for obesity, the new score seems to allow assessment of the metabolic status of a patient. Higher scores demonstrate an impaired situation regarding metabolic syndrome and associated diseases, while negative values suggest a low risk to suffer from metabolic alterations. For an individual patient reduction of the score would demonstrate an improved situation and therapeutic benefit beyond mere weight loss. In parallel to weight loss and improvement of many health indicators, the new score dropped in 60% of the tested patients. Upon analysis of the remaining 40%, relatively low absolute weight loss, lower reduction of waist circumference, and only mildly impaired metabolic rate were associated with increased score values. This suggests that the new score is reflective of an overall metabolic health and could indicate objective efficiency of weight loss programs for metabolic health. This particular finding is of course limited by lack of liver biopsies as confirmation of the situation in the liver itself. Moreover, the baseline score differed significantly between the groups, suggesting that part of the study cohort did only have mild NAFLD related liver injury, which could not improve during the weight loss treatment. These and other possible confounders for monitoring with the new score need to be evaluated in appropriate study settings.

There are some limitations of the presented score. The training cohort for score generation was extremely obese, which might influence distribution of values for most of the included parameters. This includes a very low proportion of advanced fibrosis (2.4%), a common observation in extremely obese NAFLD patients [19,35]. Advanced fibrosis is the central determining factor for hepatic outcomes, including those in NAFLD, and for further clinical and therapeutic management [23,40,41]. Thus, non-invasive detection of advanced fibrosis would be more important from a mere hepatological point of view. As the new score was able to separate no or mild fibrosis from advanced fibrosis in this cohort, despite the low proportion of advanced fibrosis, it would be interesting to test the score in cohorts with higher prevalence. As limitation could also be seen that categorization of NAFL vs. NASH was performed by NAS [24], since this is still widely applied in clinical practice. Separation of NAFL vs. NASH by the SAF [11] would result in a slightly different score, nevertheless performance of the new score in the training cohort when SAF was applied was still reasonable. Another limitation is the population background that is mostly Caucasian/European. Data from other cohorts will be required to test whether the newly generated score can reach similar performance in other populations. To this end, this new score is freely available and open to use for testing in any cohort. Integration of new data into the score will hopefully refine it and further improve performance for specific cohorts and populations. One addition could be ultrasonographic or elastographic measurements, when available in a sufficiently large cohort. Finally, the use of adiponectin as parameter could be seen as another limitation, as it is not routinely measured, not even in settings of metabolic alterations, diabetes, or suspected NAFLD. It has been consistently shown that reduced adiponectin is closely associated to the status of the adipose tissue and the liver in obesity and metabolic syndrome [9,10,4246]. Adiponectin also seems to indicate severity of glucose intolerance or insulin resistance and could possibly replace the BMI in screening approaches for diabetes [9,47,48]. In the present study, an unsupervised selection of parameters again resulted in adiponectin as one important factor indicating severity of liver injury in NAFLD. Thus, we deem it is about time to integrate adiponectin quantification into clinical routine for metabolic syndrome, insulin resistance, and associated diseases.

In summary, we present a new score for non-invasive assessment of the severity of NAFLD. Advantages of this score are a continuous distribution allowing disease assessment apart from a dichotomous classification as NAFL or NASH. Additional parameters, i.e., transient elastography or controlled attenuation parameter, could be added, given sufficiently large reference datasets. The score could possibly be used to monitor disease progression or resolution over time. We invite the scientific community and in particular all hepatologists examining patients suspected with NAFLD to test and apply the new score to assess patient health at

Supporting information

S1 Fig. Current non-invasive scores for assessment of NAFLD severity do not correlate with NAS.

In a cohort of 164 obese individuals with NAFLD known non-invasive scores were tested for correlation with the NAS. The APRI (A) and Gholam’s score (B) achieved a reasonable Spearman r of 0.34 and 0.37, respectively. Though, neither the BARD score (C) nor the NAFLD fibrosis score (D), nor Palekars Score (E) correlated with the NAS in this cohort.


S2 Fig. Current non-invasive scores cannot differentiate histological NASH from NAFL.

In a cohort of 164 obese individuals with NAFLD AUCs were calculated to assess classification into NAFL or NASH by known non-invasive scores. The APRI score reached an AUC of 0.65 (A) and Gholam’s Score an AUC of 0.7 (B), both significantly better than random guessing. This was not the case for the BARD (C), the NAFLD fibrosis score (D), or Palekars Score (E).


S3 Fig. Current non-invasive scores for assessment of NAFLD severity do not correlate with fibrosis stage.

In a cohort of 164 obese individuals with NAFLD known non-invasive scores were tested for correlation with the fibrosis stage. None of the tested scores, APRI (A), Gholam’s Score (B), BARD (C), NAFLD fibrosis score (D), or Palekars score (E) were correlated to the histological fibrosis stage. ROC calculations did not show separation between no or mild fibrosis (grade 0–2) and advanced fibrosis (grades 3–4), that would have been better than random guessing (not shown). This lack of performance might be due to the very low number of individuals with advanced fibrosis (Grade 3: n = 3; grade 4: n = 1).


S4 Fig. The new score significantly correlates with SAF score.

Using NAS-based classification of NAFLD to generate the new score might be seen as limitation. Thus the score was correlated to the SAF in the training (A) and validation cohort (C), resulting in significant robust correlations. In addition the score was tested to separate NAFL from NASH according to the Bedossa algorithm (at least 1 point in steatosis, ballooning and lobular inflammation each to diagnose NASH). The new score achieved reasonable performance in the training cohort (B) but insufficient performance in the validation cohort (D), though still significant versus random guessing.


S1 Table. Significant correlations of the new score with available parameters in a subgroup of the lifestyle intervention cohort at T0.


S2 Table. Significant correlations of the new score with available parameters in a subgroup of the lifestyle intervention cohort at T52.


S3 Table. Raw data tables for all datasets.



The authors thank Dorothe Möllmann, Martin Schlattjan, and Karl-Heinz Strucksberg for their skillful technical assistance.


  1. 1. Younossi ZM, Koenig AB, Abdelatif D, Fazel Y, Henry L, Wymer M. Global epidemiology of nonalcoholic fatty liver disease-Meta-analytic assessment of prevalence, incidence, and outcomes. Hepatology. 2016;64: 73–84. pmid:26707365
  2. 2. Kim CH, Younossi ZM. Nonalcoholic fatty liver disease: a manifestation of the metabolic syndrome. Cleve Clin J Med. 2008;75: 721–728. pmid:18939388
  3. 3. Byrne CD, Targher G. NAFLD: a multisystem disease. J Hepatol. 2015;62: S47–64. pmid:25920090
  4. 4. Marengo A, Jouness RIK, Bugianesi E. Progression and Natural History of Nonalcoholic Fatty Liver Disease in Adults. Clin Liver Dis. 2016;20: 313–324. pmid:27063271
  5. 5. Kälsch J, Bechmann LP, Kälsch H, Schlattjan M, Erhard J, Gerken G, et al. Evaluation of Biomarkers of NAFLD in a Cohort of Morbidly Obese Patients. J Nutr Metab. 2011;2011: 369168. pmid:21773018
  6. 6. Starley BQ, Calcagno CJ, Harrison SA. Nonalcoholic fatty liver disease and hepatocellular carcinoma: a weighty connection. Hepatology. 2010;51: 1820–1832. pmid:20432259
  7. 7. Siddiqui MS, Sterling RK, Luketic VA, Puri P, Stravitz RT, Bouneva I, et al. Association between high-normal levels of alanine aminotransferase and risk factors for atherogenesis. Gastroenterology. 2013;145: 1271–1279.e1–3. pmid:23973920
  8. 8. Anstee QM, Targher G, Day CP. Progression of NAFLD to diabetes mellitus, cardiovascular disease or cirrhosis. Nat Rev Gastroenterol Hepatol. 2013;10: 330–344. pmid:23507799
  9. 9. Kälsch J, Bechmann LP, Heider D, Best J, Manka P, Kälsch H, et al. Normal liver enzymes are correlated with severity of metabolic syndrome in a large population based cohort. Sci Rep. 2015;5: 13058. pmid:26269425
  10. 10. Kälsch J, Keskin H, Schütte A, Baars T, Baba H, Bechmann L, et al. Patients with ultrasound diagnosis of hepatic steatosis are at high metabolic risk. Z Für Gastroenterol. 2016;54: 1312–1319. pmid:27936481
  11. 11. Bedossa P, Poitou C, Veyrie N, Bouillot J-L, Basdevant A, Paradis V, et al. Histopathological algorithm and scoring system for evaluation of liver lesions in morbidly obese patients. Hepatology. 2012;56: 1751–1759. pmid:22707395
  12. 12. Saadeh S, Younossi ZM, Remer EM, Gramlich T, Ong JP, Hurley M, et al. The utility of radiological imaging in nonalcoholic fatty liver disease. Gastroenterology. 2002;123: 745–750. pmid:12198701
  13. 13. Palekar NA, Naus R, Larson SP, Ward J, Harrison SA. Clinical model for distinguishing nonalcoholic steatohepatitis from simple steatosis in patients with nonalcoholic fatty liver disease. Liver Int. 2006;26: 151–156. pmid:16448452
  14. 14. Angulo P, Hui JM, Marchesini G, Bugianesi E, George J, Farrell GC, et al. The NAFLD fibrosis score: A noninvasive system that identifies liver fibrosis in patients with NAFLD. Hepatology. 2007;45: 846–854. pmid:17393509
  15. 15. Gholam PM, Flancbaum L, Machan JT, Charney DA, Kotler DP. Nonalcoholic fatty liver disease in severely obese subjects. Am J Gastroenterol. 2007;102: 399–408. pmid:17311652
  16. 16. Harrison SA, Oliver D, Arnold HL, Gogia S, Neuschwander-Tetri BA. Development and validation of a simple NAFLD clinical scoring system for identifying patients without advanced disease. Gut. 2008;57: 1441–1447. pmid:18390575
  17. 17. Shah AG, Lydecker A, Murray K, Tetri BN, Contos MJ, Sanyal AJ, et al. Comparison of noninvasive markers of fibrosis in patients with nonalcoholic fatty liver disease. Clin Gastroenterol Hepatol. 2009;7: 1104–1112. pmid:19523535
  18. 18. Sumida Y, Yoneda M, Hyogo H, Yamaguchi K, Ono M, Fujii H, et al. A simple clinical scoring system using ferritin, fasting insulin, and type IV collagen 7S for predicting steatohepatitis in nonalcoholic fatty liver disease. J Gastroenterol. 2011;46: 257–268. pmid:20842510
  19. 19. Poynard T, Lassailly G, Diaz E, Clement K, Caïazzo R, Tordjman J, et al. Performance of biomarkers FibroTest, ActiTest, SteatoTest, and NashTest in patients with severe obesity: meta analysis of individual patient data. PloS One. 2012;7: e30325. pmid:22431959
  20. 20. Wai C-T, Greenson JK, Fontana RJ, Kalbfleisch JD, Marrero JA, Conjeevaram HS, et al. A simple noninvasive index can predict both significant fibrosis and cirrhosis in patients with chronic hepatitis C. Hepatology. 2003;38: 518–526. pmid:12883497
  21. 21. Younossi ZM, Page S, Rafiq N, Birerdinc A, Stepanova M, Hossain N, et al. A biomarker panel for non-alcoholic steatohepatitis (NASH) and NASH-related fibrosis. Obes Surg. 2011;21: 431–439. pmid:20532833
  22. 22. Machado MV, Cortez-Pinto H. Non-invasive diagnosis of non-alcoholic fatty liver disease. A critical appraisal. J Hepatol. 2013;58: 1007–1019. pmid:23183525
  23. 23. Younossi ZM, Loomba R, Anstee QM, Rinella ME, Bugianesi E, Marchesini G, et al. Diagnostic Modalities for Non-alcoholic Fatty Liver Disease (NAFLD), Non-alcoholic Steatohepatitis (NASH) and Associated Fibrosis. Hepatology. 2017; pmid:29222917
  24. 24. Kleiner DE, Brunt EM, Van Natta M, Behling C, Contos MJ, Cummings OW, et al. Design and validation of a histological scoring system for nonalcoholic fatty liver disease. Hepatology. 2005;41: 1313–1321. pmid:15915461
  25. 25. Neumann U, Genze N, Heider D. EFS: an ensemble feature selection tool implemented as R-package and web-application. BioData Min. 2017;10: 21. pmid:28674556
  26. 26. Bissonnette J, Altamirano J, Devue C, Roux O, Payancé A, Lebrec D, et al. A prospective study of the utility of plasma biomarkers to diagnose alcoholic hepatitis. Hepatology. 2017;66: 555–563. pmid:28120471
  27. 27. Joka D, Wahl K, Moeller S, Schlue J, Vaske B, Bahr MJ, et al. Prospective biopsy-controlled evaluation of cell death biomarkers for prediction of liver fibrosis and nonalcoholic steatohepatitis. Hepatology. 2012;55: 455–464. pmid:21993925
  28. 28. Hohenester S, Christiansen S, Nagel J, Wimmer R, Artmann R, Denk G, et al. Lifestyle intervention for morbid obesity: effects on liver steatosis, inflammation, and fibrosis. Am J Physiol Gastrointest Liver Physiol. 2018;315: G329–G338. pmid:29878845
  29. 29. Neumann U, Riemenschneider M, Sowa J-P, Baars T, Kälsch J, Canbay A, et al. Compensation of feature selection biases accompanied with improved predictive performance for binary classification by using a novel ensemble feature selection approach. BioData Min. 2016;9: 36. pmid:27891179
  30. 30. Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J-C, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12: 77. pmid:21414208
  31. 31. Dechêne A, Jochum C, Fingas C, Paul A, Heider D, Syn W-K, et al. Endoscopic management is the treatment of choice for bile leaks after liver resection. Gastrointest Endosc. 2014; pmid:24796959
  32. 32. AWMF: Detail [Internet]. [cited 10 Oct 2017].
  33. 33. Tapper EB, Lok AS-F. Use of Liver Imaging and Biopsy in Clinical Practice. N Engl J Med. 2017;377: 756–768. pmid:28834467
  34. 34. Alkhouri N, Feldstein AE. Noninvasive diagnosis of nonalcoholic fatty liver disease: Are we there yet? Metabolism. 2016;65: 1087–1095. pmid:26972222
  35. 35. Ratziu V, Charlotte F, Heurtier A, Gombert S, Giral P, Bruckert E, et al. Sampling variability of liver biopsy in nonalcoholic fatty liver disease. Gastroenterology. 2005;128: 1898–1906. pmid:15940625
  36. 36. Rockey DC, Caldwell SH, Goodman ZD, Nelson RC, Smith AD, American Association for the Study of Liver Diseases. Liver biopsy. Hepatology. 2009;49: 1017–1044. pmid:19243014
  37. 37. Tapper EB, Hunink MGM, Afdhal NH, Lai M, Sengupta N. Cost-Effectiveness Analysis: Risk Stratification of Nonalcoholic Fatty Liver Disease (NAFLD) by the Primary Care Physician Using the NAFLD Fibrosis Score. PloS One. 2016;11: e0147237. pmid:26905872
  38. 38. Karlas T, Petroff D, Wiegand J. Collaboration, Not Competition: The Role of Magnetic Resonance, Transient Elastography, and Liver Biopsy in the Diagnosis of Nonalcoholic Fatty Liver Disease. Gastroenterology. 2017;152: 479–481. pmid:28038929
  39. 39. Guha IN, Parkes J, Roderick P, Chattopadhyay D, Cross R, Harris S, et al. Noninvasive markers of fibrosis in nonalcoholic fatty liver disease: Validating the European Liver Fibrosis Panel and exploring simple markers. Hepatology. 2008;47: 455–460. pmid:18038452
  40. 40. Angulo P, Kleiner DE, Dam-Larsen S, Adams LA, Bjornsson ES, Charatcharoenwitthaya P, et al. Liver Fibrosis, but No Other Histologic Features, Is Associated With Long-term Outcomes of Patients With Nonalcoholic Fatty Liver Disease. Gastroenterology. 2015;149: 389–397.e10. pmid:25935633
  41. 41. Kleiner DE, Makhlouf HR. Histology of Nonalcoholic Fatty Liver Disease and Nonalcoholic Steatohepatitis in Adults and Children. Clin Liver Dis. 2016;20: 293–312. pmid:27063270
  42. 42. Wree A, Schlattjan M, Bechmann LP, Claudel T, Sowa J-P, Stojakovic T, et al. Adipocyte cell size, free fatty acids and apolipoproteins are associated with non-alcoholic liver injury progression in severely obese patients. Metabolism. 2014;63: 1542–1552. pmid:25267016
  43. 43. Berg AH, Combs TP, Du X, Brownlee M, Scherer PE. The adipocyte-secreted protein Acrp30 enhances hepatic insulin action. Nat Med. 2001;7: 947–953. pmid:11479628
  44. 44. Arita Y, Kihara S, Ouchi N, Takahashi M, Maeda K, Miyagawa J, et al. Paradoxical decrease of an adipose-specific protein, adiponectin, in obesity. Biochem Biophys Res Commun. 1999;257: 79–83. pmid:10092513
  45. 45. Silva TE, Colombo G, Schiavon LL. Adiponectin: A multitasking player in the field of liver diseases. Diabetes Metab. 2014;40: 95–107. pmid:24486145
  46. 46. Graßmann S, Wirsching J, Eichelmann F, Aleksandrova K. Association Between Peripheral Adipokines and Inflammation Markers: A Systematic Review and Meta-Analysis. Obesity. 2017; pmid:28834421
  47. 47. de Abreu VG, Martins CJ de M, de Oliveira PAC, Francischetti EA. High-molecular weight adiponectin/HOMA-IR ratio as a biomarker of metabolic syndrome in urban multiethnic Brazilian subjects. PloS One. 2017;12: e0180947. pmid:28746378
  48. 48. Owei I, Umekwe N, Provo C, Wan J, Dagogo-Jack S. Insulin-sensitive and insulin-resistant obese and non-obese phenotypes: role in prediction of incident pre-diabetes in a longitudinal biracial cohort. BMJ Open Diabetes Res Care. 2017;5: e000415. pmid:28878939