Skip to main content

Advertisement

PLOS Computational Biology

Browse
Publish
- Submissions
- Policies
- Manuscript Review and Publication
About

Search Search

advanced search

< Back to Article

Fig 1 — Fig 1.

Flowchart of MobiLLe.
MobiLLe requires IGH annotated sequences (IGHV, IGHJ, and CDR3 region were previously identified) to form initial clusters (pre-clustering panel), we first group sequences with the same IGHV, IGHJ, and same CDR3 (AA) length; then, we separate sequences with less than s% CDR3 identity (default 70%). Refinement has two steps: ‘resolve inconsistencies’ and ‘merge singletons.’ The first one detects and resolves inconsistencies until no improvement is observed in cluster cohesion or separation. The second one tries to merge singletons into higher-density clusters to improve their uniformity. Final groups (output) represent clonal lineages with low intra-clonal diversity, high inter-clonal diversity, and a minimum number of singletons.

More »

Table 1 — Table 1.

Simulated repertoire properties.

More »

Table 2 — Table 2.

Properties of nine experimental repertoires.
For each sample, we show clonality status, the individual label, the total number of sequences and unique sequences (clonotypes).

More »

Table 3 — Table 3.

Properties of patients and healthy donor repertoires.

More »

Table 4 — Table 4.

Clinical and repertoire characteristics of healthy donors and patients with moderate/severe COVID-19.

More »

Fig 2 — Fig 2.

Clonal distribution comparisons.
Five “events” describe the differences between two clonal distributions (d₁ and d₂). The identical event counts the number of identical clonal lineages found in both distributions (a). The “join” event reports the number of clonal lineages in d₁ found merged in d₂ (b), while the split counts the number of clonal lineages in d₁ found separated in d₂ (c). The “mix” event is a mixture of these two later events (d), while “not found” reports the number of clonal lineages in d₂ not found in d₁ (e).

More »

Fig 3 — Fig 3.

Performance of different parameter configurations.
We computed the closeness F-score distribution for all simulated repertoire. Each distribution contains 4480 values, one for each parameter configuration. Samples are sorted by repertoire types and SHM rates.

More »

Fig 4 — Fig 4.

Effect of pre-clustering threshold on MobiLLe’s performance.
The pre-clustering threshold varied from 50% to 90%. We computed the closeness F-score (A), precision (B), and recall (C) distribution by considering all simulated repertoires (53760 parameter configurations).

More »

Fig 5 — Fig 5.

Importance of using refinement and ‘merge singletons’ parameters.
A) Scatter plot of MobiLLe F-scores with refinement (ordinate) and without refinement (abscissa) parameter. B) Scatter plot of MobiLLe F-scores with ‘merge singletons’ (ordinate) and without ‘merge singletons’ (abscissa) parameter.

More »

Fig 6 — Fig 6.

Impact of refinement parameters in the best and worst performance.
We averaged F-scores of 12 simulated repertoires and ranked them to form two sets of parameter configurations: those with the best performance (F-score = 1) and those with the worst performance (lowest F-score < 0.7). The ordinate shows parameter frequency and abscissa parameter type. (A, B, and C) show IGHV, IGHJ, and CDR3 distances, while (D) shows coefficient variations. Note that d––– indicates the coefficient values for α, β, and λ respectively, while ‘mean’ represents the arithmetic mean.

More »

Fig 7 — Fig 7.

Comparison of clustering accuracy on simulated repertoires.
Performance evaluation of five different BCR lineage grouping methods on 12 simulated repertoires.

More »

Fig 8 — Fig 8.

Performance comparison on artificial monoclonal repertoires.
We generated three artificial monoclonal repertoires (AMR1, AMR2, and AMR3) by sampling sequences from a pure B cell lineage (10%) and a polyclonal background (90%). Each benchmark contained 10000 sequences. Accurate tools might group sequences from the pure B cell lineage and separate those from the polyclonal background. We measured the performance of BCR lineage grouping methods by computing the number of splits (SC) and false positives (FP) of the most abundant group. To better visualize and compare clustering results, we show alluvial diagrams for AMR1 (a), AMR2 (b), and AMR3 (c), where blue blocks represent the pure B cell lineage and pink or orange inferred groups. Pink blocks contain only sequences belonging to the pure B cell lineage (true positives), while the orange blocks contain sequences from the polyclonal background (false positives). SONAR and BRILIA did not produce results for the AMR3 benchmarks since they do not deal with non-productive sequences.

More »

Fig 9 — Fig 9.

Clonal distribution comparisons on three experimental repertoires.
We compared the inferred clonal lineages of each BCR lineage grouping tool with MobiLLe’s clustering results. For that, we defined five events: identical, join, split, mix, and not found, representing the (dis)similarities between two clonal distributions: d₁ (MobiLLe) and d₂ (another tool). The “identical” event accounts for the percentage of identical clonal lineages found in both distributions; the “join” event reports the percentage of d₁ clonal lineages found merged in d₂ while “split” the percentage of d₁ clonal lineages found separated in d₂. The “mix” event accounts for a mixture of “join” and “split” events while “not found” reports the percentage of clonal lineages in d₂ not found in d₁; see an illustration in Fig 2.

More »

Fig 10 — Fig 10.

Comparing running times of clonal grouping tools.
The running times for MobiLLe and other tools were measured for three experimental repertoires with different clonal compositions. To a better visualisation, we used log scale, S16 Table shows the time in second for each considered tool.

More »

Fig 11 — Fig 11.

Clonal distribution/density of nine experimental repertoires.
We plotted the 20 most abundant clonal lineages for each repertoire; Circles represent clonal groups, while their areas are proportional to the clonal group abundance.

More »

Fig 12 — Fig 12.

Clonal distributions of a healthy donor and individuals with different lymphoproliferative diseases.
We plotted the 20 most abundant clonal lineages for each repertoire; Circles represent clonal groups, while their areas are proportional to the clonal group abundance. Report to Table 3 for repertoire properties and individuals’ labels.

More »

Fig 13 — Fig 13.

Clonal distribution of healthy donors and patients with moderate/severe COVID-19.
A) Abundance of top 20 ranked clonal groups stratified by clinical status. We plotted each individual’s most abundant clonal groups until achieving 20 samples. B) Abundance of top 20 ranked clonal groups stratified by individuals. C) CDR3 nucleotide lengths of the top 20 clonal groups, stratified by clinical status. In panels A and B, circles represent clonal lineages, while their areas are proportional to the clonal group abundance. Each color represents an individual. Report to Table 4 for repertoire properties and individuals’ labels. Abbreviations: S-severe and M-moderate COVID-19; H-healthy donors.

More »

Publications
PLOS Aging and Health
PLOS Biology
PLOS Climate
PLOS Complex Systems
PLOS Computational Biology
PLOS Digital Health
PLOS Ecosystems
PLOS Genetics

PLOS Global Public Health
PLOS Medicine
PLOS Mental Health
PLOS Neglected Tropical Diseases
PLOS One
PLOS Pathogens
PLOS Sustainability and Transformation
PLOS Water

Home
Blogs
Collections
Give feedback
LOCKSS

Privacy Policy
Terms of Use
Advertise
Media Inquiries
Contact

PLOS is a nonprofit 501(c)(3) corporation, #C2354500, based in California, US