Strain design optimization using reinforcement learning

doi:10.1371/journal.pcbi.1010177

Strain design optimization using reinforcement learning

Fig 1

Strain design optimization loop using reinforcement learning.

Enzyme levels corresponding to the strain i are denoted as e_i, and y_i and s_i, correspond to the response (used in reward) and output concentrations (used as state), respectively. The action (a_i), corresponding to the difference of the enzyme levels in the two consecutive iterations, is given by the policy learned with MMR.

doi: https://doi.org/10.1371/journal.pcbi.1010177.g001