Reinforcement learning to develop policies for fair and productive employment: A case study on wage theft within the day-laborer community

doi:10.1371/journal.pcsy.0000079

Fig 8. Multi-Agent Q Values.

Q-values for the employee’s decision of whether to report a theft (left) and for the employer’s decision of whether to steal (right), shown for various values of the probability that a report succeeds.

doi: https://doi.org/10.1371/journal.pcsy.0000079.g008