Exploring Sequence Characteristics Related to High-Level Production of Secreted Proteins in Aspergillus niger

Figure 5

Feature selection.

For the first three feature selection iterations (-axis), the bar plots in panels (A) and (B) show how often each feature was selected across the 10 CV-loops. Features shown in different shades of the same color are correlated. The letters in brackets in the legend are amino acids: they denote either the members of an amino acid cluster, e.g. the basic cluster contains R, K, and H, or the amino acid that a codon encodes, e.g. codon TAC encodes Y.
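The bar heights described in this caption are selection counts per feature across the CV-loops. A minimal sketch of that tally is given here, assuming each loop yields a list of selected feature names. The function name `selection_frequency`, the feature labels, and the example selections are hypothetical illustrations, not the authors' code or data.

```python
from collections import Counter


def selection_frequency(selected_per_loop):
    """Count in how many CV-loops each feature was selected."""
    counts = Counter()
    for selected in selected_per_loop:
        # Use a set so a feature counts at most once per loop.
        counts.update(set(selected))
    return counts


# Hypothetical selections from 10 CV-loops in one feature selection iteration.
loops = [["basic (RKH)", "TAC (Y)"]] * 6 + [["basic (RKH)"]] * 4
freq = selection_frequency(loops)

for feature, n in freq.most_common():
    print(f"{feature}: selected in {n} of {len(loops)} CV-loops")
```

A feature selected in all 10 loops would reach the maximum bar height. Features chosen in only a few loops point to a less stable selection.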

