Accurate multi-population imputation of MICA, MICB, HLA-E, HLA-F and HLA-G alleles from genome SNP data
Fig 5
Evaluation of the quality of the 1000 Genomes whole exome short-read sequencing-based allele calling.
Model VII was trained using the 1000 Genomes reference and applied to the Finnish reference with clinical-grade typing quality. Confusion matrices for MICA, MICB, HLA-E and HLA-F show the alleles that are common to both references and the amount of correctly and wrongly predicted alleles. Empty lines represent the alleles present in the model but absent from the Finnish reference that could not be validated. Overall imputation accuracies were 99.6%, 99.8%, 100.0% and 99.8% for MICA, MICB, HLA-E and HLA-F, respectively.