Quantification of HTLV-1 Clonality and TCR Diversity
Figure 2
Outline of DivE distribution generation algorithm.
A Truncated species frequency distribution with x individuals distributed among y species. The frequency of species Si after sampling x individuals is denoted Fx(Si). B Species accumulation data generated from frequency distribution. C An aggregate of the best performing models as returned by DivE is used to extrapolate to point (x+a, y+1), where the next species is predicted. D Species Sy+1 is assigned a frequency of (1 - pmax)(x+a), where pmax is the maximum-likelihood proportion of individuals occupied by the y previously observed species. The remaining pmax(x+a) individuals are distributed among species S1, …, Sy in proportion to their observed relative frequencies at x. Steps C and D are repeated until the predicted species richness is reached. See Text S1 for further details.