Inference of B cell clonal families using heavy/light chain pairing information
Fig 2
Schematic representation of our paired clustering method on two families in the heavy chain (left) and light chain (right), with droplet identifiers represented as single letters.
The naive sequences (green crosses) and trees (dashed black lines) represent the collision (i.e. occurrence of very similar VDJ rearrangements) of the two light chain families, while the heavy families are easily distinguishable. The first step of our method groups together apparently-clonal sequences using only single chain information, and would thus merge together the two light chain families. The second step, which refines the single chain partitions using information on which heavy and light chain sequences pair together (“pairing information” or “pair info”), would then use the difference in heavy clusters to split apart the light families.