Impact of phylogeny on the inference of functional sectors from protein sequence data
Fig 5
Impact of earliest mutation generation G on eigenvector components.
Violin plots of the absolute values of the components of the key eigenvectors of the ICOD, covariance C and SCA matrices are shown versus the earliest mutation generation G, i.e. the generation at which the associated site first mutates in the phylogeny. Results are shown for data sets generated with μ = 50 (top panels) and μ = 5 (bottom panels). Data sets of M = 2048 sequences of length L = 200 were generated along a perfect binary tree with 11 generations, using two different numbers μ of accepted mutations per branch. As in Fig 3, we employed the mutation acceptance criterion in Eq 2, with τ* = 90. We used the same vector of mutational effects as in Figs 2 and 3. Violin plots are obtained over 100 realisations of data generation.
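To make the definition of G concrete, the following minimal Python sketch evolves sequences along a perfect binary tree and records, for each site, the earliest generation at which it mutates on any branch. It is a simplified illustration rather than the procedure used for the figure: it assumes binary (±1) sites, accepts every proposed mutation instead of applying the criterion in Eq 2, and draws mutated sites uniformly at random with replacement. The function name `simulate_earliest_generation` and its parameters are hypothetical.

```python
import numpy as np


def simulate_earliest_generation(L=200, n_gen=11, mu=5, seed=0):
    """Evolve +/-1 sequences along a perfect binary tree (no selection).

    Returns the leaf sequences, shape (2**n_gen, L), and an array G giving,
    for each site, the earliest generation at which it mutated on any branch.
    Sites that never mutate keep the sentinel value n_gen + 1.
    """
    rng = np.random.default_rng(seed)
    seqs = np.ones((1, L), dtype=np.int8)    # single ancestral sequence
    G = np.full(L, n_gen + 1, dtype=int)     # sentinel: site never mutated

    for g in range(1, n_gen + 1):
        # Each sequence is duplicated: one copy per child branch.
        seqs = np.repeat(seqs, 2, axis=0)
        for row in seqs:                     # rows are views, edited in place
            # Assumption: every proposed mutation is accepted (Eq 2 not applied).
            sites = rng.integers(0, L, size=mu)
            for s in sites:
                row[s] = -row[s]             # flip the state of the site
            G[sites] = np.minimum(G[sites], g)
    return seqs, G


if __name__ == "__main__":
    leaves, G = simulate_earliest_generation(mu=5)
    print(leaves.shape)                      # (2048, 200), i.e. M = 2**11
    print(np.bincount(G, minlength=13)[1:])  # number of sites per value of G
```

With larger μ, most sites mutate in the first few generations, so G takes a narrower range of values than with small μ.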