Somatic hypermutation analysis for improved identification of B cell clonal families from next-generation sequencing data
Fig 2
Overview of the SCOPer workflow.
First, AIRR-seq data are partitioned into VJ(ℓ)-groups which contain sequences with the same IGHV gene annotation, IGHJ gene annotation, and junction length. Next, each VJ(ℓ)-group is subject to a recombination-based and a SHM-based distance calculation. Finally, the outputs of these calculations are combined into an integrated distance function that is used as the basis for inferring the BCR clonal relationships using a spectral clustering-based approach.