Inference of B cell clonal families using heavy/light chain pairing information
Fig 5
Effectiveness on simulation of the approximate bulk data pairing method as a function of true family size, shown as the fraction of sequences paired with a sequence from the correct family (but not necessarily the correct sequence, left) and the fraction paired with a family similar to the correct family (right).
The method merges together a single cell and a bulk sample drawn from the same pool of B cells, and the “bulk data fraction” shows the fraction of this merged sample that stems from the bulk sample. “Similar” families are defined as families with true naive sequences separated by nucleotide Hamming distances of 3 or less. The fraction of sequences correctly paired, mispaired, and left unpaired are shown in S9 Fig. Other details same as in Fig 4.