Exploring Sequence Characteristics Related to High-Level Production of Secreted Proteins in Aspergillus niger
The heat maps show the combined result of the best performing clusters obtained in 10 CV-loops for both (A) and (B). The values on the diagonals denote how often an amino acid ended up in a cluster (due to selecting the optimal clusters, amino acids might not be selected at all). The colors on the non-diagonal places denote how often two amino acids ended up in the same cluster. Complete linkage hierarchical clustering was used to cluster the heat map, using the euclidean distance as distance measure. The color of the amino acid letters indicates if the amino acid has a positive (green) or negative (red) contribution in Figure 3A.