Reinforcement learning to develop policies for fair and productive employment: A case study on wage theft within the day-laborer community
Fig 8
Q-values for the employee’s decision on whether to report a theft (left) and for the employer’s decision on whether to steal (right), shown for various values of the probability of reporting success.