Skip to main content
Advertisement

< Back to Article

Inference of B cell clonal families using heavy/light chain pairing information

Fig 9

Naive sequence inference accuracy on simulation as a function of tree imbalance.

Accuracy is measured as the Hamming distance separating the true and inferred naive sequences. Tree imbalance (the standard deviation of root-to-tip distances) is shown for samples with precisely zero imbalance (“n/a”, since it uses a different simulation framework), and samples generated from the imbalanced trees from [47]. Several partis versions are shown: “1-seq.” uses only information from each single sequence for its annotation (rather than from all family members); “star” approximates each family as a star tree (as in partis versions before May 2021 https://bit.ly/3KIjFiF); “full” is the current default, including subcluster annotation, which improves on the star tree assumption by iteratively annotating small subtrees. Each point is the mean (± standard error) over two samples with 15% mean nucleotide SHM, each consisting of 50 simulated rearrangement events with sizes of around 50 (drawn from geometric for zero imbalance; otherwise taken from the trees from [47]).

Fig 9

doi: https://doi.org/10.1371/journal.pcbi.1010723.g009