Evaluating reproducibility of AI algorithms in digital pathology with DAPPER
Table 4
Matthew correlation coefficient values for each experiment, and classifier head pairs on HINT dataset.
The average cross validation MCC with 95% CI (H-MCCt), and MCC on the external validation set (H-MCCv) are reported. Best-performing backend network, and classifier head combination on each dataset are reported in bold.