Inference and design of antibody specificity: From experiments to models and back
Fig 2
The model accurately predicts the evolution of sequence variant abundances in response to multiple selective pressures.
We considered tasks of increasing difficulty, defined by the training set used: A. Model trained on the experiments with Black complexes, Blue complexes, and naked Beads, with predictions evaluated on the experiment with a mixture of Black and Blue complexes; B. Model trained on experiments with a mixture of Black and Blue complexes, Blue complexes only, and naked Beads, with predictions evaluated on the experiment with Black complexes only; C. Model trained on experiments with Blue complexes only, with predictions evaluated on experiments with naked Beads; D. Model trained on experiments with a mixture of Black and Blue complexes and with naked Beads, with predictions evaluated on experiments with Black complexes only. The panels show scatter plots of observed (x-axis) versus predicted (y-axis) sequence frequencies, with the initial library abundances shown in gray for comparison. The Pearson correlations between the empirical and model-predicted enrichments for each task are given in the legend and in Table B in S1 Text. In all cases, p-values (from Student's t-test) are < 10⁻⁹⁰. See SI Mathematical supplement for details about model training.
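The correlation statistic quoted above can be illustrated with a minimal sketch. It computes the Pearson correlation between two lists of enrichments, together with the Student's t statistic that underlies the reported p-values (n − 2 degrees of freedom under the null hypothesis of no correlation). The function names and the toy enrichment values are hypothetical and are not taken from the paper; converting the t statistic to a p-value would additionally require the t-distribution's tail function, which is not included here.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 3:
        raise ValueError("need two sequences of equal length >= 3")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of centered cross-products and squares
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)

def t_statistic(r, n):
    """Student's t statistic for H0: zero correlation, with n - 2 degrees of freedom."""
    if abs(r) >= 1.0:
        return math.copysign(math.inf, r)
    return r * math.sqrt((n - 2) / (1.0 - r * r))

# Toy values for illustration only (not data from the paper)
observed_enrichment = [0.2, 1.1, -0.5, 2.3, 0.9, -1.2]
predicted_enrichment = [0.1, 1.3, -0.2, 2.0, 1.1, -0.9]

r = pearson_r(observed_enrichment, predicted_enrichment)
t = t_statistic(r, len(observed_enrichment))
print(f"Pearson r = {r:.3f}, t = {t:.2f}")
```

Because the t statistic scales with the square root of n − 2, p-values as small as those quoted in the caption are reachable only with a strong correlation computed over a very large number of sequence variants.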