Inference of B cell clonal families using heavy/light chain pairing information
Fig 6
Time required for clustering for a variety of methods for both single chain clustering (top, for IgH [left] and IgK [right]) and paired clustering (including single chain clustering, bottom left).
Note that for partis the actual paired clustering takes only around five minutes on the 100k samples, so for partis the bottom plot time is roughly equal to the sum of the top two plots (at the moment the single chain clustering for heavy and light are not run concurrently). Each point is the mean (± standard error, often smaller than points) over two samples with the indicated size, run on a single desktop with a 14-core Intel i-99940X processor and 128GB memory (maximum memory usage for partis on the 100k samples was around 9GB). The size of each family is drawn from a geometric distribution with mean 10. Note that because enclone by design discards sequences with high SHM, here it is clustering only the ≃80% of sequences that it passes.