Figures
Abstract
Eating disorders are associated with one of the highest mortality rates among all mental disorders, yet there is very little research about them within the newly emerging and promising field of computational psychiatry. As such, we focus on investigating a previously unexplored, yet core aspect of eating disorders–body image dissatisfaction. We continue a freshly opened debate about model-based learning and its trade-off against model-free learning–a proxy for goal-directed and habitual behaviour. We perform a behavioural study that utilises a two-step decision-making task and a reinforcement learning model to understand the effect of body image dissatisfaction on model-based learning in a population characterised by high scores of disordered eating and negative appearance beliefs, as recruited using Prolific. We find a significantly reduced model-based contribution in the body image dissatisfaction task condition in the population of interest as compared to a healthy control. This finding suggests general deficits in deliberate control in this population, leading to habitual, compulsive-like behaviours (body checking) dominating the experience. Importantly, the results may inform treatment approaches, which could focus on enhancing the reliance on goal-directed decision making to help cope with unwanted behaviours.
Citation: Onysk J, Seriès P (2022) The effect of body image dissatisfaction on goal-directed decision making in a population marked by negative appearance beliefs and disordered eating. PLoS ONE 17(11): e0276750. https://doi.org/10.1371/journal.pone.0276750
Editor: Kamila Czepczor-Bernat, Medical University of Silesia: Slaski Uniwersytet Medyczny w Katowicach, POLAND
Received: February 21, 2022; Accepted: October 13, 2022; Published: November 28, 2022
Copyright: © 2022 Onysk, Seriès. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Decision-making task behavioural data, questionnaire scores, and fitted model parameters are available at: https://github.com/onyskj/mbl_edbi_manuscript_v2.
Funding: The funding for the online participants participation was provided by the Institute for Adaptive and Neural Computation, School of Informatics, University of Edinburgh and was not covered by a specific grant. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Eating disorders (ED) form a group of different conditions that, in a detrimental way affects a person’s relationship with food. This usually leads to physical and psychological problems, which severely decrease the quality of life. In fact, these problems result in many deaths, as eating disorders are reported to have one of the highest, if not the highest, mortality rate among all mental disorders [1–4].
Generally, ED involve heightened preoccupation with food, such as restriction, or consumption of unusually large amounts of food, which in some cases is followed by compensatory behaviours such as vomiting, use of laxatives, or overexercising [5]. In extreme cases, eating disorders result in death either due to severe malnutrition, morbid obesity or suicide [4, 6]. Moreover, the recovery from ED can be as low as 24% even after 10 years [7], which in combination with high mortality and a significant decrease in quality of life, calls for extended research into the roots and treatments of ED.
In addition to eating related behaviours, one of the core symptoms of ED is body image disturbance, which can be understood as a negative misrepresentation of one’s body, body image preoccupation, usually involving disgust, shame and dissatisfaction [8]. We here aim to investigate how body image dissatisfaction in eating disorders affects basic mechanisms of decision making, namely, habitual (repeating of the same action as a response to a stimulus) and goal-directed (intentional and deliberate decision-making) behaviour [9]. On top of logistic regression analysis of the behavioural data [10], the mechanism will be captured in a computational, reinforcement learning (RL) framework.
Background
Eating disorders
There are three main subtypes of eating disorders–Anorexia Nervosa (AN), Bulimia Nervosa (BN) and Binge-Eating Disorder (BED) as in 5th edition of The Diagnostic and Statistical Manual of Mental Disorders [5]. All share low life satisfaction, greatly impaired quality of life, and increased mortality or suicide risk, and sometimes body image issues. A brief distinction between the types, but nowhere close to accurately describing the nuance and lived-experience of each subtype, would characterise: i) AN by extreme restriction of food and pursuit of weight loss; ii) BN by undergoing recurrent binge eating episode followed by compensatory behaviours; and iii) BED by the recurrence of uncontrollable binges, but usually without compensatory behaviours [5]. In addition, there exists an array of atypical eating disorders that do not fit the description of the main (clinical) types [11], but nevertheless constitutes a majority of cases [12]. In this paper, we do not focus on any particular type, but rather aim to explore and quantify the effect of body image dissatisfaction on decision making in broadly understood eating disorders (see Methods).
Computational psychiatry
One possible avenue to understand and help devise treatments for eating disorders, alongside the traditional approach of psychotherapy, psychiatry and neuroimaging [13–15] is the emerging field of computational psychiatry (CP).This framework is based in the assumption that the brain’s characteristic function is one of computation and information processing. As such, it offers an understanding of mental illness whereby the differences and/or errors within these computations may result in maladaptive behaviours and mental states [16]. In an attempt to describe these mechanisms, some researchers have focused on multiple modes of decision making that can be gauged with a decision-making task and quantified with a computational model that captures individual and group differences [17, 18]. Mental disorders such as schizophrenia, anxiety, and obsessive-compulsive disorder (OCD) have been linked to various significant changes in decision-making processes [19–21]. Moreover, employing computational methods to understand mental disorders promises bridging the neurobiological and psychiatric levels of descriptions of populations of interest, for example the role of dopaminergic signals in reward learning and it implications in addiction [22]. Such a link can potentially inform better and more advanced theories of mental disorders, as well as inspire new treatment approaches, and promises improved early detection leading to prevention [23].
Computational psychiatry of eating disorders
As mentioned above, depression, OCD, anxiety, and schizophrenia have received a lot attention in the field of CP, with very promising results and theories. Eating disorders have received less attention, with only eleven papers in the last nine years [24–34]. To illustrate the difference a Google Scholar search for eating- disorder related papers:
("computational psychiatry" "OR" "computational" "OR" "reinforcement learning" "OR" "reinforcement" "OR" "bayesian" "OR" "decision making" "OR" "decision-making") "AND" ("anorexia" "OR" "anorexia nervosa" "OR" “bulimia nervosa" "OR" "binge" "OR" "binge eating” OR" "eating" "OR" "bulimia") [35],
returns 11 results, while a search for schizophrenia related papers:
("computational psychiatry" "OR" "computational" "OR" "reinforcement learning" "OR" "reinforcement" "OR" "bayesian" "OR" "decision making" "OR" "decision-making") "AND" ("schizophrenia") [36],
returns 466 results.
The main focus has been on two aspects of decision making in eating disorders. One is related to how sensitive to punishments individuals with AN are, that is how well they learn from negative feedback from the environment (in this case a decision-making task). Unfortunately, the results prove to be quite contradictory [24, 33].
The second aspect of ED that has been of interest to computational psychiatrists is the trade-off between goal-directed and habitual system utilisation in decision-making. As mentioned in the introduction, goal-directed decision-making is related to acting in the environment with a goal in mind, intentionally and deliberately. It is usually characterised by forming an internal model of the environment that describes which states and actions will bring about the best results over prolonged time [9], an idea originally put forward by Tolman [37] who considered animals forming a ‘field map of the environment” while in a maze, later reframed as a ‘cognitive map’ [38]. On the other hand, habitual decision-making is associated with responding to stimuli in the environment in an automatic manner, usually repeating those actions that immediately yield the best results. In this case, an agent does not create a model of the environment. As such, goal-directed behaviour relies on model-based learning, which allows building an accurate model of the states and actions and their associated values that takes into account a hidden probabilistic structure of the environment. However, this kind of learning is computationally more demanding, using more resources to support the process. The habitual behaviour employs model-free learning, which updates a running score of possible states and actions, based on previous experience, without registering any hidden structures in the environment. Such a process is computationally efficient as it relies mainly on the memory of the last events [9]. To make the best of accuracy and computational efficiency, the two systems may in fact interact with each other, rather than compete for cognitive control, where the model-based system trains the model-free system by simulating the environment. Such a simulation produces transitions and rewards from which the model-free system can then learn [39].
The two systems can be captured using a well-established two-step decision-making task based on a reinforcement learning paradigm, which shows that healthy participants employ both model-free and model-based learning that trade off against each other [10]. Several studies attempted to see how this trade-off is different within eating disorders. The results are converging and suggest that reduced model based learning is associated with high eating disorder questionnaire scores in a general population [30]. The reduction is also evident in clinical AN and BED groups [27, 34]. Moreover, a recent study by Foerde et al. [27] suggests even further reduced model-based learning contribution in AN when the task is strictly food related as compared to a monetary task. This is of particular significance for two reasons. Firstly, AN is characterised by extreme pursuit of weight-loss, which could intuitively be understood as extreme goal-directed behaviour. As such, one would expect to see increased model-based learning in AN, yet the results suggest otherwise–such “goal-pursuit” of weight-loss in AN is actually habitual, almost compulsive. Secondly, the study includes an additional food-related task, which aims to test whether the goal-directed deficits observed during the monetary task were due to monetary rewards not being perceived as motivating enough for the AN participants to employ model-based strategy (domain-specific). The results from this study show that such deficits are in fact domain-general, where model-based learning is even further reduced during the food-related task. Another study of potential interest is that on the effect of hunger on the two systems by Van Swieten et al. [40]. The authors provide evidence that hunger improves healthy participants’ overall performance in the two-step decision task by increasing reliance on model-free control, without affecting model-based learning.
Problem statement, objectives and hypothesis
Evidently, there is a need for more computational psychiatry research about eating disorders, given the high mortality. A particular aspect that has not been previously investigated is that of body image disturbance.
As a first step in this direction, we aim to explore the effect of body image dissatisfaction on decision making in a population marked by negative appearance beliefs and disordered eating by implementing a two-step decision-making task that captures both model-based and model-free contribution [10]. The task is given to a population that scores high on an eating disorder, and body image disturbance questionnaires. For comparison, a healthy control group (HC) is also recruited. We employ two task conditions: first, a standard monetary version [10], with a changed narrative to achieve higher engagement; second, a body-image related version, which aims to target dissatisfaction with body image and its manifestation in decision making. The introduction of a body image dissatisfaction condition aims to create a context similar to that in which ED finds themselves during everyday struggle—excessive worry about body image and how to change it through dieting. Secondly, our design intends to test whether the goal-directed deficit may extend beyond the domain of purely monetary tasks, into that more specific to negative appearance beliefs. This partly mirrors Foerde et al. [27], but unlike the authors’ objective ours may not necessarily capture whether the said deficits are domain-general.
We hypothesise that: 1) In the monetary (neutral) condition and as described by previous studies, the group with ED will report significantly decreased model-based learning as compared to HC, and no difference in model-free learning as previously shown [27, 30]; 2) Model-based learning will be further decreased in the body image dissatisfaction condition in the group with ED. HC will not report any significant difference between conditions.
Materials and methods
Participants
An online study was performed. Participants for the study were recruited using Prolific, which is an online participant recruitment service used mainly for research and academic purposes [41]. The platform provides a pool of participants that are reliable and allows to custom screen for different groups before recruitment.
Since two groups were needed–a healthy control (HC) and an eating disorder (ED) group—we applied two separate sets of pre-screening criteria on Prolific (pre-set filters that are available in the study designer). For HC we looked for people who:
- “Have never gone on a diet in the past.”
- “Do no currently for at least one week restrict food intake to manage weight.”
- “Have no diagnosed mental health condition that is uncontrolled (by medication or intervention) and which has a significant impact on your daily life/activities.”
- “Do not have or have not had a diagnosed, on-going mental health illness or condition.”
For ED, participants had to select that they have gone on a diet, as in criterion (a), and that they restrict food intake to either lose or maintain weight, as in criterion (b). Moreover, to ensure data of satisfactory quality both groups had additional criteria to meet:
- age between 18–38
- have normal or corrected-to-normal vision
- female as an assigned sex at birth
- an approval rate of 98%
- a minimum of 20 previous submissions on Prolific.
Participants first completed a range of questionnaires (sub-study 1) to be then further selected to complete a decision-making task (sub-study 2). The questionnaires were hosted on Qualtrics (an online survey software) [42], while the decision-making task was hosted partly on Qualtrics and partly on Pavlovia (an online behavioural experiment platform) [43], designed using PsychoPy3 v. 2020.1.2 software [44]. For each sub-study, participants were paid at a rate of £6.25 per hour. The questionnaire took on average 7 minutes to complete, while the task took on average 27 minutes. Subjects were based all around the world. The study was approved according to the University of Edinburgh’s Informatics Research Ethics Process, with an RT number 2019/48215. After being presented with a participant information sheet, participants gave explicit consent to take part in the study using an online consent form hosted on Qualtrics.
After further selection based on questionnaires and application of exclusion criteria for the decision-making task (see S1 File), 35 (ED group) and 32 (HC group) participants were selected for data analysis.
Power analysis
A prior power analysis was performed to determine the sample size needed to detect a significant difference (at the 5% level) between groups in the neutral condition. Following a similarly designed study by Foerde et al. [27], for calculation 1, we aimed for a medium effect size of d = 0.75 (two-tailed t-test between two independent means), and the power (1−β) = 0.8. The sample size calculation using G*Power software [45] suggested recruiting 29 participants per group. Moreover, we calculated (calculation 2) the sample size needed to detect the difference between conditions in the ED group (two-tailed t-test between two dependent means). Setting the effect size at d = 0.75 and the power at (1−β) = 0.8, resulted in n = 16 for the ED group.
Additional post hoc power analysis was performed as above. Using the results from this study with 35 participants in the ED group and 32 in the HC group, calculation 1 revealed effect size of d = 0.88 and power (1−β) = 0.94, while calculation 2 revealed effect size of d = 1.04 and power (1−β) = 0.99.
Self-report questionnaires
In the first part of the study, participants answered questions about their age, gender, weight, and height. They also completed three questionnaires—Eating Attitudes Test (EAT-26) [46], Appearance Anxiety Inventory (AAI) [47] and The Obsessive–Compulsive Inventory (OCI-R) [48]–to assess the spectrum of: an eating disorder, body image disturbance, and obsessive-compulsive behaviours, respectively.
In order to divide participants into HC and ED groups, cut-off points were applied for the questionnaires as in the Table 1. For EAT-26, it is suggested to use a cut-off of 11 for subclinical populations [49] who display disordered eating behaviours that could warrant further clinical diagnosis, while a score of 20 and above is considered to be within a clinical range [46]. For AAI, a few cut-off points have been suggested: a score above 6 may suggest some issues with body image [50], while scores between 15–20 is reported in an appearance-concerned adult population, and a score above 20 indicates a high-risk group for body image disturbance [51]. Lastly, an additional criterion for HC group was applied so that they do not display worrying obsessive-compulsive behaviours as these have also been associated with reduced model-based learning [30]. This can be measured with OCI-R, where an optimal cut-off of 21 was suggested [48]. Given such varied recommendations, we decided to follow an approach based on sensitivity and specificity measures, where high sensitivity should ensure that participants are very unlikely to be diagnosed with a condition, while high specificity provides more confidence that participants would be diagnosed appropriately [52]. Hence, for EAT-26 [49] and AAI [53] we set a threshold of ≤10 for HC and ≥14 for ED; for OCI-R [48] we set a cut-off for HC at ≤10. The achieved specificity and sensitivity measures for each cut-off are given in Table 1.
Two-step decision making task
Selected participants were asked to complete an established two-step decision-making task shown to quantify model-based and model-free learning contribution [10]. Before proceeding with the task, participants were asked to select a body type that is most similar to their own from a diverse range of body types in S1 Fig. Furthermore, they were instructed on how to complete the task by reading through an illustrated tutorial. After reading the instructions, they completed 25 trials of the task as a practice.
The task reproduces that of Daw et al. [10] but was conceived as a treasure game to make it as engaging and easy to understand as possible (Fig 1). In the first stage (grey stage), participants are presented with a choice between two stimuli—on the left, a ship against a sunset, while on the right, a pirate ship near a lighthouse. They are asked to play a role of a treasure hunter and decide which ship to board. Each ship can sail to two destinations—a blue island and a purple island. The game has a hidden structure, which participants tend to learn with time, such that the normal ship sails to the blue island (left) 70% of the time (common transition) and to the purple island (right) 30% of the time (rare transition), while the pirate ship sails to the blue island 30% of the time and to the purple island 70% of the time.
In the first stage (grey), a dashed arrow corresponds to a rare transition with the probability of 30%, while the bold arrow corresponds to a common transition with the probability of 70%. In the second stage, the speckled arrow is associated with the probability of receiving a reward. The duration of each stage is noted to the right (stage one, stage two, reward).
Once the participant boards the chosen ship, they are taken to one of the islands (stage two), where two chests await them. Here, they are asked to quickly select the chest to try their chances at finding a treasure (a pirate coin). However, each chest is assigned a probability of containing the coin, which evolves over time according to a Gaussian random walk with a standard deviation σ = 0.0275, as in the Fig 2 (more details in the S1 File). One of the strategies is to track, over time, which chest is the most favourable, i.e. has the highest chance of yielding the coin. After the chest is opened and the participant receives the coin (or not), they are taken back to the first stage to repeat the trial.
The probabilities (in the order as in Fig 1) evolve over 150 trials, according to a Gaussian random walk with σ = 0.0275. One set of two evolutions always starts randomly in a range [0.58, 0.72], while the other set of two starts in [0.31, 0.45]. The values of probabilities are bounded in [0.25, 0.75].
There are two conditions, 150 trials each. Each condition consists of two series of 75 trials, with a short break in between to counteract a possible loss of attention or tiredness. The neutral (NT) condition is exactly like in Fig 1, with a pirate coin as a reward (Fig 3A), and, aside from the narrative change, follows exactly from Daw et al. [10] and Gillan et al. [30]. The other condition, the body image dissatisfaction (BID) condition, aims to gauge the effects of body image dissatisfaction on decision-making. The BID condition is different from the neutral condition in that the reward is a pirate coin next to a body type that the participant selected as most similar to their own (Fig 3C).
In the neutral condition, participants could receive either (A) or (B), while in the BID condition they could receive either (C) or (B). An empty box (B) indicates no reward. The body type was selected by the participant before the task, more body type examples can be found in S1 Fig.
For a balanced design half of the participants in each group completed the neutral condition first, followed by the BID condition. The other half completed the task in a reverse order. All results are then based on the average across two subgroups.
Model-based and model-free learning in the task
The task allows to capture the contribution of model-based and model-free learning. Thanks to its stage-like structure and probabilistic nature, we can easily distinguish between two strategies a participant can have, usually a mix of the two.
For the purpose of this example, we focus on two extreme cases [30]. On one hand, an agent could utilise only model-free learning strategy. This means that they would make their decision as to which ship and chest to select purely based on the reward they received (or lack thereof), usually repeating the rewarded action, regardless of the transition type. On the other hand, a participant who uses only model-based strategy, takes into account the learnt transition structure of the task, on top of the knowledge about previous rewards. Moreover, they track the probability of receiving the reward to know which chest is the most favourable. In other words, they chose the same ship if the previous trial had a common transition with a reward or a rare transition and no reward.
Mixed-effects logistic regression of raw choice data
First, a mixed-effects logistic regression analysis of raw choice data was performed to quantify the model-based and mode-free learning effects [10]. The analysis focuses only on the choices made in the first stage of the task and how these are influenced by received a reward and transition type in the previous trial. The regression models the probability of repeating the same choice, p(stay) and how it is influenced by reward (rewarded = 1, unrewarded = -1) and transition (common = 1, rare = -1), and their interaction. The main effect of reward is interpreted as model-free learning contribution and the reward x transition interaction as model-based learning contribution. These effects were regressed alongside with group, condition, and age (z-scored) as fixed effects as below:
The estimates were obtained with the lme4 package in R, using Bound Optimization by Quadratic Approximation (bobyqa) with 1e5 functional evaluations [30].
Stay probabilities calculation (frequency-based)
As an additional measure, for each participant we calculated the following probabilities p(stay|rewarded, common), p(stay|unrewarded, common), p(stay|rewarded, rare), p(stay|unrewarded, rare) based on the frequency of repeated choice in each of the four (reward, transition) cases. These were used for additional regression analyses and figures.
Model-based score calculation (frequency-based)
Yet another measure that helps to capture model-based learning contribution that is independent of the reinforcement learning is the model-based score (MB score) [54]. This is calculated as follows:
Model-based and model free learning—an RL model
Finally, the full computational RL model, described and quantified below [27, 30, 55], incorporates choice data from two stages of the task, For simplicity, we begin with the update equation of the state-action value function, QII(s2, c2) for the second stage (II) states and actions. There are two possible stage two states s2: 1—the blue island; 2—the purple island, such that s2∈{1,2}. On each island, an agent can make two separate choices c2: L—open the left chest; R—open the right chest, such that c2∈{L, R}. Moreover, after opening the chest, the agent can receive the reward, r∈{0,1}, where 0 corresponds to an empty chest, and 1 corresponds to the pirate coin. At the start of the task all value functions are initialised at 0.5. On any trial, t, we update the value function QII(s2, c2) of the visited state and action taken as in the Eq 1. The subscript, t, indicates the current trial values, whereas t+1 indicates the values at the following trial. α is the learning rate.
To calculate the probability of making a choice c∈{L, R} at stage two, we use the softmax function as in Eq 2, with an inverse temperature parameter β2 quantifying the influence of the value function on making the choice.
In stage one (I), we directly see how model-free and model-based learning play their part. Here, we have two sets of update equations. The first set, Eqs 3–5 is model-based, where the agent uses their knowledge about the environment—which stage two state-action is the best, as well as the structure of the task (common vs. rare transition) to update the values of the ships in stage one. In this case, the value functions for both ships are updated simultaneously, such that the value of choosing a ship is the weighted sum (by transition probability) of the maximums over stage two actions values.
The second, model-free update in Eq 6, updates the values of the ships based only on the reward received at the end of the trial.
Model-free and model-based contributions are joined together in a weighted value function over two ship choices, QI, as in Eq 7.
The contribution of each system is captured with βMB (model-based) and βMF (model-free) parameters. An additional indicator I(c = c1,t−1) tells if the choice made on the current trial is repeated as in the previous one, with a parameter ρ describing how much switching or staying is done regardless of the feedback. Finally, the probability of choosing either ship is calculated as in Eq 8, analogously to the second stage.
The model has a total of five parameters: βMB, βMF, β2, α, ρ.
Model fitting
Model parameters for each group and condition were estimated using hierarchical Bayesian approaches, which provides the best test-retest reliability for this particular model, and follows the procedure as in Brown et al. [56]. The estimation was performed using RStan package (v.2.21.0) [57] in R (v. 4.0.2) based on Markov Chain Monte Carlo techniques (No-U-Turn Hamiltonian Monte Carlo). Each parameter was estimated with a mean, scale, and individual error estimates. The learning rate α was constrained to (0,1) with an inverse logit transformation, while the means of βMB, βMF, were bounded below by 0. We used weakly informative and uninformative priors for the means: α~N(0,2.5), βMB~N(0,100), βMF~N(0,100), β2~N(0,100), ρ~N(0,100). For the scales of each parameter, we used Cauchy(0,2.5) distribution, all constrained to be greater than 0. Individual error terms were all N(0,1), where for βMB, βMF these were constrained to be greater than 0. For each of the four chains we ran 2000 samples (after discarding 2000 warm-up ones). Chains were inspected for convergence and their values were all around 1 (below 1.1). The mean value (across chains) of each parameter for each participant were used in the analysis.
Mixed-effects linear regression of parameter estimates
The estimates for each parameter were analysed using mixed-effects linear regression to compare them between groups and conditions, with group, condition, and age (z-scored) as fixed effects per subject:
Parameter recovery
To quantify the reliability of the parameter estimates, parameter recovery was performed after data collection (see S1 File). For the parameters range compatible with the collected data, the model and fitting procedure described above provide fair to excellent reliability, with average (across parameters, groups and conditions) parameter recovery Pearson correlation coefficient (PCC), r = 0.834.
Results
Demographic and questionnaires summary
The screening procedures resulted in recruiting two significantly different groups—healthy control (HC) and eating disorder (ED). The summarised information, along with two-sample t-tests, can be found in Table 2. The groups are mainly characterised by average total scores on the three questionnaires, with scores significantly different between groups across all questionnaires (p<0.001). As the groups also significantly differ in age, the variable (z-scored) was included as a covariate in the regression analysis.
Task performance—rewards and reaction times
The performance in the task was first analysed independently of the reinforcement learning model. Average characteristics were calculated for each group (detailed results in the S2 Table in S1 File). These measures include: total reward in the neutral and BID condition, after completing the full task, as well as mean reaction times (RT) during the neutral, BID, and across both conditions. There were no differences in the above measures between groups.
Collected raw choice data analysis
Mixed-effects logistic regression analysis–collected data.
To assess the model-based and model-free contribution during the task, raw choice data was analysed with mixed-effects logistic regression of the probability of repeating the same choice in stage one (Table 3). HC group showed a significant contribution of both model-free system as indicated by the reward effect (p-value<0.001), and model-based system as indicated by the reward x transition interaction effect (p-value = 0.018). Our hypothesis predicted a significantly lower estimate for the reward × transition × groupED effect to detect a difference between groups in the neutral condition; as well as a significantly lower estimate for the reward × transition × groupED × conditionBID effect to detect a difference between condition in the ED group. However, the analysis revealed no differences between groups or conditions in terms of model-based and model-free learning. The estimated probabilities (frequency-based) for each case in each group and condition are depicted in Fig 4.
(A) Neutral condition. (B) BID condition.
We further verified whether model-based and model-free learning strategies are used in our sample, regardless of group and condition. As such, these fixed effects were removed from the regression analysis (Table 4). The simpler model revealed significant contribution of both learning systems in the joint population (Reward effect p-value<0.001, Reward x Transition effect p-value = 0.008). Moreover, there was a significant effect of transition (p-value <0.001).
Frequency based-probability regression–collected data
Furthermore, we estimated the probabilities of sticking to the same stage 1 choice after a common transition on the previously rewarded trial using the frequency calculation and used them in mixed-effects linear regression analysis (Table 5). This simpler comparison revealed reduced probability of staying after common, rewarded trials between groups in the neutral condition, which was just short of significance (p-value = 0.054), indicating a potentially weaker model-based learning capacity in ED. There were no differences between conditions in either group.
MB score analysis–collected raw choice data
Lastly, we calculated MB scores in the collected data (Fig 5). These were regressed against group and condition variables (Table 6). There were no differences in MB scores between groups and conditions.
RL parameter estimates—model-based and model-free learning
Next, we performed a more sensitive analysis, using a RL model, which, unlike the previous analyses which are based on trial averages, takes into account incremental learning over many trials. As it is not uncommon for the results from model-agnostic analysis to differ from computational modelling results [58–60], we fit parameters for the reinforcement learning model as described in the Methods. A comparison of average parameter estimates with standard errors (SE) of βMB, βMF between groups and conditions can be found in Fig 6.
Mean standard error (SE) of the estimated model parameters in HC (blue) and ED (red) during each condition (neutral and BID). (A) βMB, (B) βMF.
The results from the mixed effects linear regression model for each parameter, showing a significant difference between groups and conditions can be found in Tables 7–10.
As hypothesised, the model-based contribution (as quantified with βMB parameter) in the neutral condition is decreased in ED as compared to HC (Table 7; p-value = 0.016). Moreover, there is a further reduction in model-based learning in the BID condition in ED (p-value = 0.003), which is not present in the HC group (p-value = 0.060).
Surprisingly, model-free learning is significantly reduced in the neutral condition in the ED group (compared to HC p-value <0.001, Table 8). Furthermore, model-free learning is slightly attenuated in the BID condition in HC (p-values = 0.011).
In addition to model-based and model-free learning, there was an increase in the learning rate between conditions for HC (p-value = 0.004; Table 9), as well as a reduction in the β2 inverse temperature parameter estimate in the ED group in the neutral condition (p-value = 0.001; Table 10), potentially indicating a more exploratory choice strategy in the second stage.
Correlations of ΔβMB with other covariates
To check how the difference in model-based learning between condition correlates with questionnaire scores and demographics (age, EAT-26, AAI, and OCI-R scores all z-scored), we introduced ΔβMB that captures the difference in βMB between conditions:
when negative, this would indicate the BID condition had a reducing effect on model-based learning.
The correlation plots with Pearson correlation coefficient (PCC), r, and p-values for the hypothesis of no relationship between ΔβMB and the covariates can be found in Fig 7. We found significant negative correlations with EAT-26 and AAI scores and OCI-R scores as in Table 11 (r = -0.312, r = -0.314, r = -0.316 with p-value = 0.01, p-value = 0.01, p-value = 0.009). Correlation between questionnaire scores can be found in the S7 Table in S1 File.
Correlation is across both groups. Pearson’s correlation coefficient (r) and p-values are included.
Discussion
The current study focused on a strikingly missing element in computational psychiatry research on eating disorders—body image dissatisfaction. In particular, model-based learning was explored to identify whether deficits in goal-directed learning manifest alongside negative appearance beliefs, with the hypotheses of decreased model-based learning between groups (ED vs. HC) and between conditions in ED.
First of all, the findings from a two-step decision-making task show a significantly smaller contribution of model-based learning in a population characterised by high body image dissatisfaction and disordered eating as compared to the HC group. This replicates previous findings that suggest a decreased model-based learning, in a monetary condition of the two-step decision-making task, in groups characterised by disordered eating and compulsive behaviours [27, 30, 34]. We found a further model-based learning reduction in the population of interest in the condition that was meant to involve body image dissatisfaction (as implemented in the BID condition). We did not find such an effect in the healthy control. These results support the set hypotheses regarding model-based learning, suggesting that goal-directed deficits extend beyond purely monetary tasks and are more pronounced in the domain of body image preoccupation. This mirrors previously established results of further goal-direct deficits in AN when the task is food-related [27], which demonstrate that such deficits affect multiple domains of decision making. Additionally, the reduction in model-based learning between groups, and between conditions in the ED group was not associated with age.
Surprisingly, contrary to one of our hypotheses, we found decreased (BID vs Neutral) model-free learning in the HC group, as well as reduced (ED vs HC) model-free learning between groups in the neutral condition. This is also contradictory with previous studies that find no difference in model-free learning between AN and HC groups in the monetary two-step task [27, 30] or increased reliance on model-free learning in a BED group [34]. One potential reason for the difference of results is the high heterogeneity in the estimates of model-free learning parameter both in ED and HC group, which may have skewed the results. In fact, HC group, in general, has more heterogeneous estimates of parameters than ED. This may be due to the HC group being in fact composed of different subgroups, possibly even along the eating disorder continuum, for example if they were not entirely sincere when answering the questionnaires. Alternatively, ED may exhibit a generally reduced learning capability (see below for further discussion), while the difference with Voon et al. [34] study may also stem from a different parameterisation of the two systems. Unlike in Voon et al. [34], our computational model does not include the parameter w that forces the two systems to trade-off against each other. In their case, the reported reduction in model-based learning necessarily results in an increase of model-free learning, while this is not necessarily the case for us.
Furthermore, we found a significant correlation of the difference between the measures of model-based learning in the neutral and BID condition with EAT-26, AAI scores and OCI-R scores, with a similar correlation coefficient. This may suggest that the model-based difference between conditions captures a similar psychiatric dimension common to all three questionnaires, which would also be in line with a significant correlation between EAT-26, AAI, and OCI-R scores, as well as with previously reported results [61]. As such, our study may highlight dimensionally-grounded approach to mental illness, similarly to a previous online behavioural study implicating reduced goal-directed learning in compulsive behaviour and intrusive thought dimension [30].
Lastly, the logistic regression of the raw choice data showed the two populations employ both learning systems. However, to detect the differences between groups and conditions we used a more sensitive RL model analysis that takes into account the incremental learning from many trials, as well as stage two choices, which are not part of the logistic regression model. This followed the same approach as in previous studies on goal-directed deficits in anorexia nervosa [27] and in a population marked by compulsive behaviour and intrusive thought [30].
Implications
First of all, as far as we know, this is the first study that looked at the body image dissatisfaction from a computational perspective. The fact this phenomenon is associated with decreased contribution of model-based learning provides some support for the mechanism of extreme habitual body preoccupation. The fear of gaining weight and a goal of either losing or not gaining weight, evolves over time into a rigid and habitual body checking, which may be based in a shift away from the model-based system, in a similar fashion as in addiction [62] where, overtime, such goal becomes overvalued [63]. When an individual’s body type is displayed onto a screen it may act as a trigger for the habitual behaviour of body comparison and concern. As such, a body-preoccupied state leads to significant reduction in the model-based capabilities that could be allocated towards task completion in a more goal-directed manner, as is done in the neutral condition. Our result adds to the discussion reopened by Foerde et al. [27] about domain-general/specific deficits in goal-directed learning. It provides support for the view that goal-directed control impairments are not specific to purely monetary tasks but suggests that it could be aggravated in conditions that trigger body image preoccupations.
The high heterogeneity of parameter estimates (model-based/free) in the healthy control may suggest that some healthy controls share some traits with the ED group. This might be related to traits that we have not directly explored in our study but that have been associated with deficits in model-based learning, e.g. alcohol addiction, or impulsivity [30], despite partially controlling for the effect of compulsivity on model-based learning [30] by excluding participants with high OCI-R scores in the healthy control group. Alternatively, it is possible that some participants from a healthy control group do in fact have body image preoccupation issues. A potential factor in group misclassification could be a wide-spread and widely accepted societal preoccupation with dieting, looks as well as the phenomenon of body shaming that could render the recruitment of a ‘healthy’ population difficult [64, 65]. This could manifest in participants as selecting the option of “no past diet experience” in the screening stage despite ‘clean eating’ they might engage in as a widely accepted ‘health standard’, which is actually emotionally distressing and linked with functional impairments [66]. As such, the heterogeneity in the parameter estimates in the healthy control may highlight a blurred boundary between health and dieting, and indeed call into question the possibility of a truly healthy control group within this field of research.
Limitations
Group differences are here only detectable with a sensitive RL model. As such, it would be beneficial to perform a similar study on a larger online population and/or in the lab on a clinical group to strengthen the evidence for the hypothesis of this paper and test its replicability.
A possible limitation presents itself in the selection of body types/silhouettes by the participants. These may be seen as overly expressive and caricature-like, failing to accurately capture their body type, which may cast doubt on the source of the reduction in model-based learning. As such, we are currently working on a paradigm that offers a more rigorously defined range of body types for the participants to select from, where a selected image is compared to the participant’s basic body parameters (e.g. weight, height), capturing the discrepancy between reality and perception [67–69]. That being said, the results of this study offer a new point for further examination: the significant change in goal-directed decision-making exclusively in the ED group, caused by the mere presence of said silhouettes.
An alternative explanation of the results may expose potential issues with the design of the BID condition. Since the silhouette is displayed with the reward, this may induce aversive behaviour (due to high-body dissatisfaction), whereby valuation of (sensitivity to) the reward is reduced. This could result in lower values of inverse temperature parameters both for model-free and model-based learning, as reward sensitivity has been shown indistinguishable from the inverse temperature [23]. Though a valid concern, we did not observe any reduction in model-free learning in the BID condition (vs. neutral) in the ED group. This suggests that the silhouettes did not have an aversive effect on reward learning, at least in the simple temporal difference learning. However, we recognise that the BID condition may have been distracting to the ED participants, due to body image preoccupation, and lead them to ruminate about their body, or to allocate more mental resources to the displayed silhouettes. As such, the BID condition could be interpreted as a modulator of the availability of cognitive resources, which has been shown to control the extent to which healthy participants employ model-based strategy [70]. As shown by Otto et al. [70], when performance of the two-step task is systematically disrupted with a high working-memory load component (a numerical stroop task [71]), participants reduce their reliance on the model-based system. An additional interpretation of the reduced inverse temperature parameters may indicate that the ED group employs a slightly different strategy where they rely more on action exploration rather than exploitation to perform well in the task [19]. To further test those possible concerns, it would be useful to verify the findings by re-running the study using a different design.
Despite employing the most reliable fitting method for this particular task and model [56], as well as applying strict exclusion criteria to participants’ behavioural data and questionnaire attention checks, our parameter recovery resulted only in fair reliability of the model-based parameter. However, this is not uncommon, as studies using similar models achieve comparable measures for the model-based parameters [72, 73]. Having informally tested various models that allow for constraining of the βMB, βMF parameters to be greater than 0, the only way to improve the parameter recovery would be to increase the sample size or obtain higher quality lab-based data.
Kool et al. [73] suggest that the structure of the two step decision making task used here does not accurately estimate the trade-off between model-based and model-free learning. The authors propose certain modifications to the task to increase that accuracy such as changing the drift rate in the Gaussian random walk of reward probabilities, reducing the number of stage two choices to one per state, or introducing a deterministic transition structure. However, it has been shown that in some cases of slight variations to the task structure, the reinforcement learning model will not be able to distinguish between model-based and model-free actions as efficiently as in the standard task [74], while the task in the current form has proven to yield reliable and consistent results [27, 30].
Lastly, there are a couple of steps that could be taken in order to further explore the mechanisms and effects of body image dissatisfaction on decision-making in eating disorders. A similar study could be performed on a clinical population, expanding to fMRI data collection and analysis. This could allow to correlate the behavioural-computational changes in the ED group with neural signatures providing neurobiological basis of some of the decision-making mechanisms associated with body image dissatisfaction. For example, following Wunderlich et al. [75], it would be worth exploring whether reduced model-based learning in ED would be associated with reduced planning value representation in the anterior caudate, previously implicated in goal-directed behaviour [76]. Alternatively, focusing on correlating body-image related goal-directed deficits with the activity in the orbitofrontal cortex, could provide further support to experimental theta burst stimulation treatment approaches for eating disorders [77, 78].
Conclusions
Given the high mortality of eating disorders, this study expands the sparse field of computational psychiatry of eating disorders that so far has focused on the general perception of reward [24, 33], and quantification of model-based learning in a neutral setting [30, 34] or in relation to food choices [27]. Since one of the prevalent aspects of eating disorders is body image preoccupation, we explored its effect on model-based learning in comparison with a healthy control. The results from the online study on a population characterised by high scores on eating disorder and body image dissatisfaction questionnaires show a significantly negative effect of body image dissatisfaction on model-based learning that is not present in the healthy control. This finding offers additional insight into the mechanisms of the disorder and the effect that the core element of the disorder, such as body image dissatisfaction, has on decision-making. Directed by the above result, further inquiry can be performed into more nuanced treatment strategies that could help to break from rigid habits and strengthen model-based capabilities of patients related to the perception of their body through well-tailored therapeutic activities.
References
- 1. Arcelus J, Mitchell AJ, Wales J, Nielsen S. Mortality Rates in Patients With Anorexia Nervosa and Other Eating Disorders: A Meta-analysis of 36 Studies. Arch Gen Psychiatry. 2011;68(7):724–31.
- 2. Fichter MM, Quadflieg N. Mortality in eating disorders—results of a large prospective clinical longitudinal study. Int J Eat Disord. 2016;49(4):391–401. pmid:26767344
- 3. Godier LR, Park RJ. Compulsivity in anorexia nervosa: a transdiagnostic concept. Front Psychol. 2014;5.
- 4. Smink FRE, van Hoeken D, Hoek HW. Epidemiology of Eating Disorders: Incidence, Prevalence and Mortality Rates. Curr Psychiatry Rep. 2012;14(4):406–14. pmid:22644309
- 5.
American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders (DSM-5®). American Psychiatric Pub; 2013.
- 6. Morris J, Twaddle S. Anorexia nervosa. BMJ. 2007;334(7599):894–8. pmid:17463461
- 7. Berkman ND, Lohr KN, Bulik CM. Outcomes of eating disorders: A systematic review of the literature. International Journal of Eating Disorders. 2007;40(4):293–309. pmid:17370291
- 8. Henn AT, Taube CO, Vocks S, Hartmann AS. Body Image as Well as Eating Disorder and Body Dysmorphic Disorder Symptoms in Heterosexual, Homosexual, and Bisexual Women. Front Psychiatry. 2019;10:531. pmid:31427996
- 9. Dolan RJ, Dayan P. Goals and Habits in the Brain. Neuron. 2013;80(2):312–25. pmid:24139036
- 10. Daw ND, Gershman SJ, Seymour B, Dayan P, Dolan RJ. Model-Based Influences on Humans’ Choices and Striatal Prediction Errors. Neuron. 2011;69(6):1204–15. pmid:21435563
- 11. Garcia F, Delavenne H, Déchelotte P. Atypical eating disorders: a review. NDS. 2011;67.
- 12. Fairburn CG, Bohn K. Eating disorder NOS (EDNOS): an example of the troublesome “not otherwise specified” (NOS) category in DSM-IV. Behav Res Ther. 2005;43(6):691–701. pmid:15890163
- 13. Kessler RM, Hutson PH, Herman BK, Potenza MN. The neurobiological basis of binge-eating disorder. Neuroscience & Biobehavioral Reviews. 2016;63:223–38. pmid:26850211
- 14. Linardon J, Kothe EJ, Fuller Tyszkiewicz M. Efficacy of psychotherapy for bulimia nervosa and binge-eating disorder on self-esteem improvement: Meta-analysis. European Eating Disorders Review. 2019;27(2):109–23. pmid:30623519
- 15. Simon JJ, Skunde M, Walther S, Bendszus M, Herzog W, Friederich HC. Neural signature of food reward processing in bulimic-type eating disorders. Soc Cogn Affect Neurosci. 2016;11(9):1393–401. pmid:27056455
- 16. Huys QJM, Browning M, Paulus MP, Frank MJ. Advances in the computational understanding of mental illness. Neuropsychopharmacology. 2021;46(1):3–19. pmid:32620005
- 17. Huys QJM, Moutoussis M, Williams J. Are computational models of any use to psychiatry? Neural Networks. 2011;24(6):544–51. pmid:21459554
- 18.
Seriès P, editor. Computational psychiatry: a primer. Cambridge, Massachusetts: The MIT Press; 2020.
- 19. Raymond JG, Steele JD, Seriès P. Modeling Trait Anxiety: From Computational Processes to Personality. Front Psychiatry. 2017;8.
- 20. Rocha FF da, Alvarenga NB, Malloy-Diniz L, Corrêa H. Decision-making impairment in obsessive-compulsive disorder as measured by the Iowa Gambling Task. Arq Neuro-Psiquiatr. 2011;69(4):642–7. pmid:21877034
- 21. Valton V, Romaniuk L, Douglas Steele J, Lawrie S, Seriès P. Comprehensive review: Computational modelling of schizophrenia. Neuroscience & Biobehavioral Reviews. 2017;83:631–46. pmid:28867653
- 22.
Huys QJM, Tobler PN, Hasler G, Flagel SB. Chapter 3—The role of learning-related dopamine signals in addiction vulnerability. In: Diana M, Di Chiara G, Spano P, editors. Progress in Brain Research. Elsevier; 2014. p. 31–77. (Dopamine; vol. 211).
- 23. Huys QJ, Pizzagalli DA, Bogdan R, Dayan P. Mapping anhedonia onto reinforcement learning: a behavioural meta-analysis. Biology of Mood & Anxiety Disorders. 2013;3(1):12. pmid:23782813
- 24. Bernardoni F, Geisler D, King JA, Javadi AH, Ritschel F, Murr J, et al. Altered Medial Frontal Feedback Learning Signals in Anorexia Nervosa. Biological Psychiatry. 2018;83(3):235–43. pmid:29025688
- 25. Chan TWS, Ahn WY, Bates JE, Busemeyer JR, Guillaume S, Redgrave GW, et al. Differential impairments underlying decision making in anorexia nervosa and bulimia nervosa: A cognitive modeling analysis: Decision-Maing In Anorexia And Bulimia. Int J Eat Disord. 2014;47(2):157–67.
- 26. DeGuzman M, Shott ME, Yang TT, Riederer J, Frank GKW. Association of Elevated Reward Prediction Error Response With Weight Gain in Adolescent Anorexia Nervosa. AJP. 2017;174(6):557–65. pmid:28231717
- 27. Foerde K, Daw ND, Rufin T, Walsh BT, Shohamy D, Steinglass JE. Deficient Goal-Directed Control in a Population Characterized by Extreme Goal Pursuit. Journal of Cognitive Neuroscience. 2021;33(3):463–81. pmid:33284076
- 28. Frank GKW, Reynolds JR, Shott ME, O’Reilly RC. Altered Temporal Difference Learning in Bulimia Nervosa. Biological Psychiatry. 2011;70(8):728–35. pmid:21718969
- 29. Giannunzio V, Degortes D, Tenconi E, Collantoni E, Solmi M, Santonastaso P, et al. Decision-making impairment in anorexia nervosa: New insights into the role of age and decision-making style. European Eating Disorders Review. 2018;26(4):302–14. pmid:29665149
- 30. Gillan CM, Kosinski M, Whelan R, Phelps EA, Daw ND. Characterizing a psychiatric symptom dimension related to deficits in goal-directed control. eLife. 2016;5:e11305. pmid:26928075
- 31. Patzelt EH, Kool W, Millner AJ, Gershman SJ. Incentives Boost Model-Based Control Across a Range of Severity on Several Psychiatric Constructs. Biological Psychiatry. 2019;85(5):425–33. pmid:30077331
- 32. Reiter AMF, Heinze HJ, Schlagenhauf F, Deserno L. Impaired Flexible Reward-Based Decision-Making in Binge Eating Disorder: Evidence from Computational Modeling and Functional Neuroimaging. Neuropsychopharmacol. 2017;42(3):628–37. pmid:27301429
- 33. Verharen JPH, Danner UN, Schröder S, Aarts E, van Elburg AA, Adan RAH. Insensitivity to Losses: A Core Feature in Patients With Anorexia Nervosa? Biological Psychiatry: Cognitive Neuroscience and Neuroimaging. 2019;1–9. pmid:31262707
- 34. Voon V, Derbyshire K, Rück C, Irvine MA, Worbe Y, Enander J, et al. Disorders of compulsivity: a common bias towards learning habits. Mol Psychiatry. 2015;20(3):345–52. pmid:24840709
- 35.
Google Scholar. Computational Psychiatry of Eating Disorders Search—Google Scholar [Internet]. [cited 2021 Sep 12]. Available from: https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=%28%22computational+psychiatry%22+%22OR%22+%22computational%22+%22OR%22+%22reinforcement+learning%22+%22OR%22+%22reinforcement%22+%22OR%22+%22bayesian%22+%22OR%22+%22decision+making%22+%22OR%22+%22decision-making%22%29+%22AND%22+%28%22anorexia%22+%22OR%22+%22anorexia+nervosa%22+%22OR%22+%E2%80%9Cbulimia+nervosa%22+%22OR%22+%22binge%22+%22OR%22+%22binge+eating%E2%80%9D+OR%22+%22eating%22+%22OR%22+%22bulimia%22%29&btnG=
- 36.
Google Scholar. Computational Psychiatry of Schizophrenia Search—Google Scholar [Internet]. [cited 2021 Sep 12]. Available from: https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=%28%22computational+psychiatry%22+%22OR%22+%22computational%22+%22OR%22+%22reinforcement+learning%22+%22OR%22+%22reinforcement%22+%22OR%22+%22bayesian%22+%22OR%22+%22decision+making%22+%22OR%22+%22decision-making%22%29+%22AND%22+%28%E2%80%9Cschizophrenia%E2%80%9D%29&btnG=
- 37. Tolman EC. Cognitive maps in rats and men. Psychological Review. 1948;55(4):189–208. pmid:18870876
- 38.
O’Keefe J, Nadel L. The hippocampus as a cognitive map. Oxford: New York: Clarendon Press; Oxford University Press; 1978.
- 39. Gershman SJ, Markman AB, Otto AR. Retrospective revaluation in sequential decision making: A tale of two systems. Journal of Experimental Psychology: General. 2014;143(1):182–94. pmid:23230992
- 40. van Swieten MMH, Bogacz R, Manohar SG. Hunger improves reinforcement-driven but not planned action. Cogn Affect Behav Neurosci. 2021;21(6):1196–206. pmid:34652602
- 41.
Prolific. Prolific | Online participant recruitment for surveys and market research [Internet]. [cited 2020 Aug 4]. Available from: https://www.prolific.co/
- 42.
Qualtrics. Qualtrics—Leading Experience Management & Survey Software [Internet]. Qualtrics. [cited 2020 Aug 5]. Available from: https://www.qualtrics.com/uk/
- 43.
Pavlovia. Pavlovia [Internet]. [cited 2020 Aug 5]. Available from: https://pavlovia.org/
- 44.
PsychoPy. Home—PsychoPy v3.0 [Internet]. [cited 2020 Apr 14]. Available from: https://www.psychopy.org/
- 45. Faul F, Erdfelder E, Lang AG, Buchner A. G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behavior Research Methods. 2007;39(2):175–91. pmid:17695343
- 46. Garner DM, Olmsted MP, Bohr Y, Garfinkel PE. The Eating Attitudes Test: psychometric features and clinical correlates. Psychological Medicine. 1982;12(4):871–8. pmid:6961471
- 47. Veale D, Eshkevari E, Kanakam N, Ellison N, Costa A, Werner T. The Appearance Anxiety Inventory: Validation of a Process Measure in the Treatment of Body Dysmorphic Disorder. Behavioural and Cognitive Psychotherapy. 2014;42(5):605–16. pmid:23823485
- 48. Foa EB, Huppert JD, Leiberg S, Langner R, Kichic R, Hajcak G, et al. The Obsessive-Compulsive Inventory: Development and validation of a short version. Psychological Assessment. 2002;14(4):485–96. pmid:12501574
- 49. Orbitello B, Ciano R, Corsaro M, Rocco PL, Taboga C, Tonutti L, et al. The EAT-26 as screening instrument for clinical nutrition unit attenders. International Journal of Obesity. 2006;30(6):977–81. pmid:16432540
- 50. Bjureberg J, Enander J, Andersson E, Ivanov VZ, Rück C, Fernández de la Cruz L. Sudden Gains in Internet-Based Cognitive Behavior Therapy for Body Dysmorphic Disorder. Behavior Therapy. 2019.
- 51. Mastro S, Zimmer-Gembeck MJ, Webb HJ, Farrell L, Waters A. Young adolescents’ appearance anxiety and body dysmorphic symptoms: Social problems, self-perceptions and comorbidities. Journal of Obsessive-Compulsive and Related Disorders. 2016;8:50–5.
- 52. Sensitivity Trevethan R., Specificity, and Predictive Values: Foundations, Pliabilities, and Pitfalls in Research and Practice. Frontiers in Public Health [Internet]. 2017 [cited 2022 Feb 1];5. Available from: https://www.frontiersin.org/article/10.3389/fpubh.2017.00307
- 53. Flygare O, Chen LL, Fernández de la Cruz L, Rück C, Andersson E, Enander J, et al. Empirically Defining Treatment Response and Remission in Body Dysmorphic Disorder Using a Short Self-Report Instrument. Behavior Therapy. 2021;52(4):821–9. pmid:34134823
- 54. Chen H, Mojtahedzadeh N, Belanger MJ, Nebe S, Kuitunen-Paul S, Sebold M, et al. Model-Based and Model-Free Control Predicts Alcohol Consumption Developmental Trajectory in Young Adults: A 3-Year Prospective Study. Biological Psychiatry. 2021;89(10):980–9. pmid:33771349
- 55. Decker JH, Otto AR, Daw ND, Hartley CA. From Creatures of Habit to Goal-Directed Learners: Tracking the Developmental Emergence of Model-Based Reinforcement Learning. Psychol Sci. 2016;27(6):848–58. pmid:27084852
- 56. Brown VM, Chen J, Gillan CM, Price RB. Improving the Reliability of Computational Analyses: Model-Based Planning and Its Relationship With Compulsivity. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging. 2020. pmid:32249207
- 57. Stan Development Team. RStan: the R interface to Stan [Internet]. 2020. Available from: http://mc-stan.org/
- 58. Vikbladh O, Meager MR, King J, Blackmon K, Devinsky O, Shohamy D, et al. Two Sides of the Same Coin: The Hippocampus as a Common Neural Substrate for Model-Based Planning and Spatial Memory [Internet]. Neuroscience; 2018 Jul [cited 2022 Jan 30]. Available from: http://biorxiv.org/lookup/doi/10.1101/359232
- 59. Nussenbaum K, Scheuplein M, Phaneuf CV, Evans MD, Hartley CA. Moving Developmental Research Online: Comparing In-Lab and Web-Based Studies of Model-Based Reinforcement Learning. Collabra: Psychology. 2020;6(1):17213.
- 60. Hunter LE, Meer EA, Gillan CM, Hsu M, Daw ND. Increased and biased deliberation in social anxiety. Nat Hum Behav. 2022;6(1):146–54. pmid:34400815
- 61. Roberts ME. Disordered eating and obsessive-compulsive symptoms in a sub-clinical student population. New Zealand journal of psychology. 2006;35(1):45–54.
- 62. Everitt BJ, Robbins TW. Drug Addiction: Updating Actions to Habits to Compulsions Ten Years On. Annu Rev Psychol. 2016;67(1):23–50.
- 63. Barbarich-Marsteller N C., Foltin R W., Timothy Walsh B. Does Anorexia Nervosa Resemble an Addiction? CDAR. 2011;4(3):197–200. pmid:21999694
- 64. Kar P. Partha Kar: Dieting and body shaming. BMJ [Internet]. 2019 [cited 2020 Aug 16];364. Available from: https://www.bmj.com/content/364/bmj.l1222 pmid:30898783
- 65. Sujoldzić A, De Lucia A. A cross-cultural study of adolescents—BMI, body image and psychological well-being. Coll Antropol. 2007;31(1):123–30. pmid:17598390
- 66. Ambwani S, Shippe M, Gao Z, Austin SB. Is #cleaneating a healthy or harmful dietary strategy? Perceptions of clean eating and associations with disordered eating among young adults. Journal of Eating Disorders. 2019;7(1):17.
- 67. Alexi J, Dommisse K, Cleary D, Palermo R, Kloth N, Bell J. An Assessment of Computer-Generated Stimuli for Use in Studies of Body Size Estimation and Bias. Frontiers in Psychology. 2019;10:2390. pmid:31695661
- 68. Ralph-Nearman C, Filik R. Development and validation of new figural scales for female body dissatisfaction assessment on two dimensions: thin-ideal and muscularity-ideal. BMC Public Health. 2020;20(1):1114. pmid:32677919
- 69. Hudson GM, Lu Y, Zhang X, Hahn J, Zabal JE, Latif F, et al. The Development of a BMI-Guided Shape Morphing Technique and the Effects of an Individualized Figure Rating Scale on Self-Perception of Body Size. European Journal of Investigation in Health, Psychology and Education. 2020;10(2):579–94. pmid:34434689
- 70. Otto AR, Gershman SJ, Markman AB, Daw ND. The Curse of Planning: Dissecting Multiple Reinforcement-Learning Systems by Taxing the Central Executive. Psychol Sci. 2013;24(5):751–61. pmid:23558545
- 71. Waldron EM, Ashby FG. The effects of concurrent task interference on category learning: Evidence for multiple category learning systems. Psychonomic Bulletin & Review. 2001;8(1):168–76. pmid:11340863
- 72. Shahar N, Hauser TU, Moutoussis M, Moran R, Keramati M, NSPN consortium, et al. Improving the reliability of model-based decision-making estimates in the two-stage decision task with reaction-times and drift-diffusion modeling. Gershman SJ, editor. PLoS Comput Biol. 2019;15(2):e1006803. pmid:30759077
- 73. Kool W, Cushman FA, Gershman SJ. When Does Model-Based Control Pay Off? O’Reilly JX, editor. PLoS Comput Biol. 2016;12(8):e1005090. pmid:27564094
- 74. Silva CF da Hare TA. A note on the analysis of two-stage task results: How changes in task structure affect what model-free and model-based strategies predict about the effects of reward and transition on the stay probability. PLOS ONE. 2018;13(4):e0195328. pmid:29614130
- 75. Wunderlich K, Dayan P, Dolan RJ. Mapping value based planning and extensively trained choice in the human brain. Nat Neurosci. 2012;15(5):786–91. pmid:22406551
- 76. Tanaka SC, Balleine BW, O’Doherty JP. Calculating Consequences: Brain Systems That Encode the Causal Effects of Actions. Journal of Neuroscience. 2008;28(26):6750–5. pmid:18579749
- 77. Price RB, Gillan CM, Hanlon C, Ferrarelli F, Kim T, Karim HT, et al. Effect of Experimental Manipulation of the Orbitofrontal Cortex on Short-Term Markers of Compulsive Behavior: A Theta Burst Stimulation Study. AJP. 2021;178(5):459–68. pmid:33726523
- 78. Hall PA, Vincent CM, Burhan AM. Non-invasive brain stimulation for food cravings, consumption, and disorders of eating: A review of methods, findings and controversies. Appetite. 2018;124:78–88. pmid:28288802