Comparison of sequence- and structure-based antibody clustering approaches on simulated repertoire sequencing data
Fig 2
Performance comparison of different clustering strategies.
The three approaches, clonotyping, SAAB+ and SPACE2, were applied to the repertoire. Their performance on the annotated set of 213 functionally similar antibody pairs was evaluated. A: Euler diagram showing the overlap of correctly clustered antibody pairs between methods regarding. B: A scatter plot shows the CDRH3 sequence identity and epitope overlap of each antibody pair. Marker style indicates the antigen species. Color indicates which methods correctly grouped each antibody pair (see Euler diagram for color code). The majority of antibody pairs have not been identified together by any methods (gray, 184 antibody pairs). C: All three strategies cluster antibody pairs with a significantly higher CDRH3 sequence identity compared to the full antibody pair set. D: The epitope overlap of clustered antibody pairs is similar between the methods, albeit SPACE2 identified antibody pairs with a slightly higher epitope overlap compared to the full set. Statistical significance was tested using the Wilcoxon rank-sum test.