Per-sample immunoglobulin germline inference from B cell receptor deep sequencing data
Fig 2
Fraction of true alleles missing (left) and alleles spuriously inferred (right) by partis on simplified “sparse” repertoires as a function of the number of sequences in the sample.
Each point represents the mean performance (± standard error) on 50 independent simulation samples of the indicated sample size varying the following variables. Top: Nsnp with multiple new alleles, where, e.g. “1 + 3” indicates two new alleles, separated by 1 and 3 SNPs from the same existing allele. Middle: mean number of leaves per clonal family. Bottom: tree balance.