A Systematic Computational Analysis of Biosynthetic Gene Cluster Evolution: Lessons for Engineering Biosynthesis
Figure 2
Complex BGC architectures evolve through new combinations of sub-clusters that are shared between multiple gene cluster types.
a, Network of sub-clusters shared among 34 known BGCs. Nodes represent BGCs, and node size indicates the number of sub-clusters present in the gene cluster that are shared with other BGCs within the network. Edges represent shared sub-clusters, coded by color. The pattern of sharing indicates that many sub-clusters are regularly transferred between BGCs of different types. In the interpretation of this analysis, it should be kept in mind that in rare cases different biosynthetic routes (and hence, different sub-clusters) exist towards the same moiety. b, A sub-network from a showing the shared sub-clusters among the BGCs for rubradirin, rifamycin, simocyclinone, everninomicin, and polyketomycin, as well as the chemical moieties encoded by the sub-clusters.