Interpreting and de-noising genetically engineered barcodes in a DNA virus
Fig 5
Comparison of barcode counts between raw reads and different clustering distances.
The number of barcodes called decreases as the clustering distance increases; clustering with L = 3 substantially reduces the number of barcodes called relative to the raw sequencing reads. Note that the highest-count cluster is ranked 1. Only the 10,000 most abundant barcodes are shown, to focus on the “elbow” where the number of barcodes called drops off steeply.
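As a rough illustration of how ranked counts of this kind could be produced, the sketch below merges barcodes greedily into clusters and returns the cluster counts ordered by rank. It is a minimal sketch under stated assumptions, not the pipeline used for the figure: the distance metric is assumed to be Hamming distance between equal-length barcodes, the greedy merging rule is an assumption, and the names `hamming`, `cluster_counts`, and `rank_abundance` are hypothetical.

```python
from collections import Counter


def hamming(a, b):
    """Number of mismatched positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def cluster_counts(reads, max_dist):
    """Greedily merge barcodes into clusters (assumed rule, for illustration).

    Barcodes are visited from most to least abundant. Each barcode joins the
    first existing centroid within max_dist mismatches, where centroids are
    checked in order of their raw abundance; otherwise it starts a new cluster.
    With max_dist = 0 the result equals the raw barcode counts.
    """
    raw = Counter(reads)
    clusters = {}  # centroid sequence -> total reads assigned to it
    ordered = sorted(raw.items(), key=lambda kv: (-kv[1], kv[0]))
    for barcode, n in ordered:
        for centroid in clusters:  # dict order = order centroids were created
            if len(centroid) == len(barcode) and hamming(centroid, barcode) <= max_dist:
                clusters[centroid] += n
                break
        else:
            clusters[barcode] = n
    return clusters


def rank_abundance(clusters, top_n=10_000):
    """Cluster counts sorted so that rank 1 is the highest-count cluster."""
    return sorted(clusters.values(), reverse=True)[:top_n]
```

Calling `rank_abundance(cluster_counts(reads, max_dist))` for several values of `max_dist` gives one ranked curve per clustering distance; the pairwise comparison against every centroid is quadratic in the worst case, so this form is only suitable for small inputs.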