RESCRIPt: Reproducible sequence taxonomy reference database management
Fig 10
Comparison of sequence information from BOLD COI gene database for available arthropod and chordate sequences.
Datasets differ in whether sequences were trimmed to a particular primer region (boldANML) or left untrimmed (boldFull), and in whether sequences were dereplicated (100) or clustered at a given percent identity (97, 98, 99). A, Number of unique sequences. B, Entropy of full sequences and of k-mers of different lengths.
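To make the quantity in panel B concrete, the following is a minimal sketch of how an entropy over unique sequences or k-mers could be computed. It assumes Shannon entropy in bits over the observed frequencies; the function names (`shannon_entropy`, `kmers`, `kmer_entropy`) and the toy sequences are illustrative and are not taken from RESCRIPt, whose implementation may differ in log base and other details.

```python
from collections import Counter
from math import log2


def shannon_entropy(items):
    """Shannon entropy (bits) of the frequency distribution of `items`."""
    counts = Counter(items)
    total = sum(counts.values())
    return -sum((c / total) * log2(c / total) for c in counts.values())


def kmers(seq, k):
    """All overlapping substrings of length k in `seq`."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def kmer_entropy(seqs, k):
    """Entropy of the pooled k-mer distribution across all sequences."""
    return shannon_entropy(km for s in seqs for km in kmers(s, k))


# Toy data (hypothetical): two identical sequences and one variant.
seqs = ["ACGTACGT", "ACGTACGA", "ACGTACGT"]

full_entropy = shannon_entropy(seqs)  # entropy over whole sequences
by_k = {k: kmer_entropy(seqs, k) for k in (2, 4)}  # entropy per k-mer length
```

Under this definition, a dataset with more distinct sequences or k-mers at more even frequencies scores higher, which is why dereplication or clustering at a lower percent identity would be expected to reduce the value.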