
Using Fiction to Assess Mental State Understanding: A New Task for Assessing Theory of Mind in Adults


Social functioning depends on the ability to attribute and reason about the mental states of others – an ability known as theory of mind (ToM). Research in this field is limited by the use of tasks in which ceiling effects are ubiquitous, rendering them insensitive to individual differences in ToM ability and instances of subtle ToM impairment. Here, we present data from a new ToM task – the Short Story Task (SST) – intended to improve upon many aspects of existing ToM measures. More specifically, the SST was designed to: (a) assess the full range of individual differences in ToM ability without suffering from ceiling effects; (b) incorporate a range of mental states of differing complexity, including epistemic states, affective states, and intentions to be inferred from a first- and second-order level; (c) use ToM stimuli representative of real-world social interactions; (d) require participants to utilize social context when making mental state inferences; (e) exhibit adequate psychometric properties; and (f) be quick and easy to administer and score. In the task, participants read a short story and were asked questions that assessed explicit mental state reasoning, spontaneous mental state inference, and comprehension of the non-mental aspects of the story. Responses were scored according to a rubric that assigned greater points for accurate mental state attributions that included multiple characters’ mental states. Results demonstrate that the SST is sensitive to variation in ToM ability, can be accurately scored by multiple raters, and exhibits concurrent validity with other social cognitive tasks. The results support the effectiveness of this new measure of ToM in the study of social cognition. The findings are also consistent with studies demonstrating significant relationships among narrative transportation, ToM, and the reading of fiction. Together, the data indicate that reading fiction may be an avenue for improving ToM ability.


Navigation of the social world depends on one’s ability to make inferences about the mental life of others. Accurate understanding of another individual’s beliefs, emotions, intentions, and desires allows for the prediction of future mental states, associated actions, and engagement in appropriate social behavior. The importance of the mechanism that allows for mental state attribution, known as theory of mind (ToM), is perhaps best illustrated by cases in which ToM is impaired, as in schizophrenia and autism spectrum disorders [1-3]. In both of these disorders, ToM impairment carries functional and clinical significance in that the extent of ToM impairment is associated with the extent of dysfunction in social behavior [4-6]. Furthermore, in schizophrenia, improving ToM ability through targeted intervention is associated with improvements in aspects of real-world functioning [7-12]. In addition to its obvious clinical relevance, ToM underlies myriad social processes including compassion, sympathy, and empathy [13-15], moral judgment [16-21], negotiation [22], and marital/romantic relationship adjustment [23,24], among others.

One challenge confronting researchers studying ToM in adults is how to assess ToM accurately and reliably in a way that is sensitive to both subtle individual differences and clinical impairment. The most commonly used or “classic” ToM tasks [25], including the False-Belief Task [26-28], Hinting Task [29], Strange Stories Task [30], Faux Pas Task [31,32], Cartoon-Sequencing tasks [26,33,34], variations on the Heider and Simmel task [35-37], and the Reading the Mind in the Eyes Task (Eyes Task) [38] have been used successfully to distinguish clinical populations, such as individuals with schizophrenia [1-3], autism spectrum disorders [3], bipolar disorder [39-41], and individuals with brain damage to prefrontal cortex [31,42] and temporo-parietal junction [43,44], from healthy control participants. However, except in these aforementioned cases of severe ToM impairment, these tasks are insensitive to more subtle ToM deficits, let alone normal variation in ToM ability (although see below for a discussion of the Eyes Task, which does appear to be more sensitive to individual differences). Ceiling effects – in which participants perform at 100% or near 100% accuracy – are ubiquitously observed with these tasks, and their variants, in healthy control participants as well as patient groups (although less often; e.g., [29,31,32,34,36,39,45-59]). For example, in studies investigating ToM in schizophrenia, using papers identified in two meta-analyses [1,3], the comparison group of healthy control participants scored >90% accuracy in 6 of 7 studies using the Hinting Task [29,54,60-63] and 5 of 7 studies using the Faux Pas Task [64-68]. Clearly, these tasks are inadequate for addressing questions related to individual differences and normal variation in ToM ability. This inevitably limits the scope of questions researchers can ask about ToM ability and social behavior in adults. 
For example, ToM deficits have been observed in unaffected first-degree relatives of individuals with schizophrenia [58,69-72] and autism spectrum disorders [73-76], as well as in individuals who exhibit attenuated symptoms of schizophrenia but do not meet diagnostic criteria for a psychotic disorder [77-79]. Deficits in these “at-risk” groups have led researchers to propose ToM impairment as a vulnerability marker for these disorders [41,72], specifically that the presence of ToM deficits may reflect dysfunction in underlying neural circuitry associated with liability for the disorder. The negative consequences of ToM deficits, such as social conflict and social isolation, might also indirectly contribute to the development and onset of illness in at-risk populations. Tasks that do not adequately assess the full range of ToM abilities limit the potential to test ToM in these populations, in which deficits, when they do exist, are subtle, hard to detect, and yet may carry important implications regarding risk for psychopathology [72]. The ability to detect subtle impairment would bolster early identification and prevention efforts, and make ToM assessment a very useful clinical tool.

There are several reasons why extant ToM measures lack sensitivity. For one, many of these tasks are adaptations of measures used to assess ToM skills in children [32,80-82]. As a consequence, the stimuli used may not be challenging enough for older individuals with more developed conceptual knowledge, reasoning skills, and social experience. Researchers increase the difficulty of ToM tasks by increasing the complexity of the mental state information, for example, by asking participants to make second-order (and higher) mental state inferences where mental states are embedded within other mental states (e.g., “Barbara thought that Hank knew where she thought her Yiddish dictionary was.”). This approach does indeed make tasks more challenging [83], but with greater complexity comes greater demands on non-social aspects of cognition including executive function, working memory, and verbal ability [84]. With these greater non-social demands, it becomes difficult to determine whether performance reflects ToM ability or non-social cognitive ability. Another important consideration is the context in which the participant is asked to make mental state inferences [25]. Are participants asked about the mental state of a single character who has a false belief regarding the location of their chocolate bar? Or are participants asked questions about the mental state of characters involved in an ongoing dynamic social interaction embedded within a social context that requires the participant to apply their knowledge of social rules and contingencies? The latter is clearly more representative of mental state attributions made during real-world social interactions, and yet not at all representative of the stimuli used in “classic” ToM tasks. One final consideration is the distinction between implicit and spontaneous (i.e., considering mental state information without being prompted to do so) versus explicit and evoked mental state attributions [85]. 
Just about all of the standard ToM tasks ask participants to make explicit, reasoned mental state attributions that require considerable effort. Variations on the Heider and Simmel task, in which participants are asked to watch animated geometric figures move with or without ostensible intent and answer simply “What happened in the cartoon?” may be the exception [35,36]. The dissociation between implicit and explicit processes has been demonstrated elegantly in young infants, who seem capable of spontaneously attributing mental states to agents [86-88], and individuals with autism spectrum disorders who seem to have preserved explicit mental state reasoning, but impaired spontaneous mental state reasoning [89,90]. Though the relative consequences of implicit versus explicit ToM ability for social functioning are unknown, these data suggest these processes can be dissociated and studied separately.

Given these considerations, the goal of this study was to design a new ToM task – the Short Story Task (SST) – that improved upon the limitations of existing ToM measures. More specifically, we aimed to create a task that (a) was sensitive to individual differences in ToM ability and did not suffer from ceiling effects, (b) incorporated a range of mental states of differing complexity, including epistemic states, affective states, and intentions to be inferred from a first- and second-order level, (c) used ToM stimuli representative of real-world social interactions, (d) required participants to utilize social context when making mental state inferences, (e) exhibited adequate psychometric properties, and (f) was quick and easy to administer and score.

In considering appropriate stimuli for the task, literary fiction seemed like an ideal venue to test ToM ability. Fiction offers the opportunity to engage in simulated social experiences by transporting the reader into the social and mental life of story characters [91]. To make sense of story events and character actions, the reader is required to make inferences about the characters’ beliefs, emotions, desires, and intentions in the context of dynamically unfolding social scenarios. This idea is supported by several lines of research demonstrating that exposure to fiction is positively associated with greater ToM ability [92-95], the tendency to become emotionally transported into fictional stories is positively associated with an increase in empathy [96], and that the neural network recruited for ToM is largely overlapping with the network recruited during narrative comprehension [97].

Thus, in consultation with a Boston-based novelist, we used The End of Something [98], a short story by Ernest Hemingway, to test ToM ability. This story presents a nuanced interaction between a romantic couple (spoiler alert) that has a conflict and subsequently breaks up. As is typical of Hemingway’s fiction, the mental lives of the characters are not explicitly described, requiring readers to make a series of first- and second-order mental state inferences regarding epistemic states, affective states, and intentions, to understand story events and character actions. The prose is direct and easy to understand, reducing the potential impact of verbal ability on ToM reasoning. After reading the story, participants were asked a series of questions to gauge explicit mental state reasoning ability, spontaneous mental state inference, and, finally, comprehension of the non-mental story content to ensure adequate understanding of the prose. Performance on the mental state reasoning questions was evaluated with a scoring rubric completed by the experimenter. Points were assigned depending on the accuracy of the mental state inference and number of mental states taken into account. Spontaneous mental state reasoning was assessed with a single question that simply asked participants to summarize the story. The unprompted mention of mental states here theoretically reflects the salience of mental state information, and the propensity to think about mental states by the participants.

Towards the goal of assessing the concurrent validity of the SST as a measure of ToM ability, we employed two additional measures of social cognition: the Interpersonal Reactivity Index (IRI) [99,100] and the Eyes Task [38]. By testing for concurrent validity, we aimed to evaluate the extent to which SST performance is associated with these other well-established measures of social cognition, which were administered concurrently with the SST. We chose these particular measures for several reasons. First, both are ubiquitously employed in the social cognition and social neuroscience literature in studies of neurotypical and clinical populations. Second, both tasks have excellent psychometric properties [38,99,100], show concurrent validity with a range of other behavioral and neural measures of ToM [101-106], and distinguish clinical populations with established ToM deficits from non-clinical populations [1-3,38,107-109]. Furthermore, the Eyes Task is one of the few ToM tasks in which healthy adults show substantial variation in performance, and ceiling effects are not observed. Lastly, these two measures index different aspects of ToM than that tested by the SST. The IRI provides a self-reported measure of transportation into the mental and emotional lives of story characters, and an individual’s tendency to engage in different facets of perspective-taking and empathy in their own life. The Eyes Task provides an index of mental state decoding ability, which is the ability to identify mental states based on immediately available information (eyes in this case). This is different from the mental state reasoning demands of the SST which requires attributing mental states and then using that information to predict other mental states and actions [110]. Additionally, the Eyes Task requires analysis of visual images and thus tests ToM ability in a different sensory modality than the SST. 
Converging associations between the SST and these measures would provide strong support for the concurrent validity of the SST as a measure of ToM. We included the comprehension questions to provide further evidence regarding task validity. More specifically, the comprehension questions required similar verbal skills as the ToM questions, but did not test ToM ability. If the SST ToM scores are indexing some aspect of ToM ability, only these scores, and not the comprehension score, should be associated with the IRI and Eyes Task.

We tested for the following: (a) general psychometric properties of the SST including inter-rater reliability between independent judges scoring the mental state reasoning and spontaneous mental state inference question, and internal consistency, (b) relationships between mental state reasoning, spontaneous mental state inference, and comprehension of non-mental state information, (c) relationships between ToM ability as measured with the SST and demographic variables, as well as general intelligence, and, finally, (d) concurrent validity of the SST by examining the relationship between SST performance and scores on the IRI and Eyes Task.

Materials and Methods


Seventy-four individuals (27 males, 47 females) were recruited from the greater Boston area via online advertisements and participated for monetary compensation. Participants ranged in age from 18 to 58 years (M = 27.8, SD = 9.6) and completed between 12 and 20 years of education (M = 15.7, SD = 1.9). As is typical of study samples in the Boston area, average IQ was quite high (M = 120.4, SD = 9.1) and ranged between 94 and 138 (IQ data were not collected for five participants who terminated their participation prior to the experiment being completed).

Inclusion criteria included being a native English speaker, IQ>70, and none of the following: neurological or major medical illness, lifetime Axis I/II DSM disorder, or current substance abuse problem. Of the 82 individuals who came to the lab to participate, six were excluded for meeting criteria for an Axis I DSM disorder and two were excluded for having a neurological abnormality. Lifetime psychopathology was assessed with the Mini-International Neuropsychiatric Interview (MINI) [111]. IQ was assessed using either the vocabulary and matrix reasoning subtests of the Wechsler Abbreviated Scale of Intelligence (WASI) [112] or the North American Adult Reading Test (NAART) [113]. Trained PhD students in clinical psychology administered these assessments.

Ethics Statement.

This study was approved by Harvard University’s Internal Review Board. All participants gave informed written consent before beginning the experiment.

Short Story Task


In the Short Story Task (SST), participants read The End of Something, a short story by Ernest Hemingway [98], which presents a nuanced interaction between a romantic couple in which the male protagonist, Nick, starts an argument and breaks up with his girlfriend, Marjorie. Through the course of the story, the characters display sarcasm, non-verbal and indirect communication, higher-order emotions like guilt, and attempts to hide their intentions and feelings from one another. As is often the case in Hemingway’s fiction, the mental lives of the characters are not explicitly described. Thus, the reader is forced to make a series of first-order (i.e., inferring the belief or emotion of a single character) and second-order (i.e., inferring what one character thinks about another character’s belief, emotion, or action) mental state inferences in order to understand the ostensible mental lives of, and social interactions between the characters. Additionally, Hemingway’s prose is direct and easy to understand, reducing the potential impact of verbal ability on mental state reasoning. Hemingway, and this short story in particular, was chosen as the stimulus for this task for these aforementioned reasons with the consultation of a Boston-based novelist with a PhD in English and expertise in 20th Century American Literature (JPC).

The Flesch Reading Ease Score (FRES) [114], which denotes the ease of reading comprehension on a 0-100 scale (higher scores indicate easier text), and the Flesch-Kincaid Grade Level (FKGL), which estimates the grade level at which text should be understood, indicated that The End of Something contained highly readable text (FRES = 92.7; for reference, the FRES of this manuscript’s abstract is 31.2) that should be understood by the average individual at a 3rd grade reading level (FKGL = 2.8). The text is 1,427 words in length.
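As a rough illustration of how the two readability indices are computed, the sketch below applies the standard published Flesch Reading Ease and Flesch-Kincaid Grade Level formulas to word, sentence, and syllable counts. The counts in the usage line are hypothetical and are not taken from The End of Something; how words, sentences, and syllables were counted for the reported values is not stated here, so the function simply takes the counts as inputs.

```python
def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Standard Flesch Reading Ease formula (higher = easier text)."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(words: int, sentences: int, syllables: int) -> float:
    """Standard Flesch-Kincaid Grade Level formula (approximate US grade)."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


# Hypothetical counts for illustration only (not the story's actual counts).
example_fres = flesch_reading_ease(words=100, sentences=10, syllables=130)
example_fkgl = flesch_kincaid_grade(words=100, sentences=10, syllables=130)
```

Short sentences and short words both push the ease score up and the grade level down, which is the pattern expected for direct prose of the kind described above.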


Before reading The End of Something, participants were given the following instructions:

“You are going to read a short story called The End of Something. The story is only a few pages, but take your time reading it. Try to get a sense of what happens and what the relationships are between the characters. After you’re finished, I’m going to ask you some questions and tape-record your responses. Do you have any questions before we begin?”

After reading the story, the experimenter asked a series of open-ended questions in a structured format. Participants were allowed to refer back to the story as needed, and were given a copy of the questions the experimenter asked in order to eliminate memory demands. First, the experimenter asked a set of questions regarding familiarity with the story to ensure that participants had no prior knowledge that might affect their responses. Four participants reported being familiar with the book that contained the short story – In Our Time – however, no participants reported having read The End of Something prior to the experiment. Participants were then given the following instructions:

“Now I’m going to ask you some questions about the story. Here is a copy of the questions I’ll be asking so you can read along. For most of the questions, there are no right or wrong answers and the questions can be answered with short responses. We’re also interested in the character’s thoughts, feelings and intentions when it applies to the question.”

We included this last sentence based on pilot data, which suggested that unless explicitly prompted, many participants were inclined to respond to questions by simply recounting the events of the story, instead of making inferences regarding what characters might be thinking or feeling.

An excerpt from The End of Something and an example mental state reasoning question follow: “He was afraid to look at Marjorie. Then he looked at her. She sat there with her back toward him. He looked at her back. ‘It isn’t fun any more. Not any of it.’” Question: Why is Nick afraid to look at Marjorie?

While administering the questions, the experimenter provided no feedback regarding the participant’s responses, and participants were free to respond at any length. Responses were recorded with a digital recorder and later transcribed by an undergraduate research assistant. The task was administered by either the first-author (DDF), another trained PhD student, or trained undergraduate research assistants. Administration of the task, including the time needed for the participant to read the story and the experimenter to administer the questions, typically took around 10 minutes.

Questions and Scoring.

Questions were designed to assess three factors: (a) five questions probed comprehension of the prose and story events (i.e., non-mental state content), (b) eight questions probed explicit mental state reasoning regarding story characters’ beliefs, emotions, intentions, and desires, and (c) one question assessed spontaneous mental state inference (Table 1). Scoring was completed by the first-author (DDF), using the transcriptions, according to a rubric. In order to evaluate inter-rater reliability, 25% of the transcripts were chosen at random and scored by a second independent rater (SHL).

|  | Explicit Mental State Reasoning | Spontaneous Mental State Inference | Comprehension |
| Number of Question(s) | 8 | 1 (Participant is asked to summarize the story with no other prompt) | 5 |
| Individual Question(s) Scored | 0, 1, 2 | Yes, No | 0, 1, 2 |
| 0 | No MS inference; inaccurate MS reasoning | - | Patently inaccurate response |
| 1 | Consideration of only one (or few) perspectives, emotions, intentions; partial understanding of a character(s) MS | - | Partial understanding of non-mental story content |
| 2 | Consideration of several characters’ MS; second-order and higher MS inferences; accurate MS reasoning | - | Full understanding of non-mental story content |
| Yes/No | - | Yes = presence of unprompted MS inference regarding a story character’s beliefs, emotions, desires, or intentions; No = no presence of unprompted MS inference; response recounts only non-mental state story events | - |
| Total Score | 0 - 16 | - | 0 - 10 |

Table 1. Description of Assessment Questions and Scoring Criteria in the Short Story Task.

Note. MS = Mental state.

For comprehension questions, the rubric was designed to assign more points depending on the accuracy of the participant’s response to questions probing the understanding of non-mental state story content. A 0 was assigned for responses that were patently inaccurate; 1 for responses that demonstrated partial understanding; and 2 for responses that demonstrated full understanding. Comprehension scores, which are the sum of scores from the five comprehension questions, can range from 0 – indicating no understanding of the story’s non-mental events and/or prose – to 10 – indicating excellent understanding of the story’s non-mental events and/or prose. This score was used to investigate whether mental state reasoning was associated with general understanding of the non-social aspects of the story.

For explicit mental state reasoning questions (hereafter referred to as mental state reasoning), the rubric was designed to assign points based on the accuracy of the mental state inference, the number of character perspectives/emotions taken into account (i.e., second-order inferences generally received more points than first-order inferences), and understanding of non-verbal/indirect communications (e.g., sarcasm and body language). As with the comprehension questions, each of these questions was assigned a value of 0, 1, or 2, and an overall mental state reasoning score was calculated as the sum of points from the eight mental state reasoning questions. Thus, scores can range from 0 – indicating little to no understanding of the story characters’ mental states – to 16 – indicating excellent understanding of the story characters’ mental states.
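The arithmetic of the two summed scores can be sketched as follows. This is only an illustration of the totals described above: the function and variable names are hypothetical, and the item-level judgments (0, 1, or 2) still come from a human rater applying the rubric.

```python
VALID_ITEM_SCORES = {0, 1, 2}  # each question is scored 0, 1, or 2


def total_score(item_scores, n_items):
    """Sum item scores after checking the count and the allowed values."""
    if len(item_scores) != n_items:
        raise ValueError(f"expected {n_items} item scores, got {len(item_scores)}")
    if any(score not in VALID_ITEM_SCORES for score in item_scores):
        raise ValueError("each item score must be 0, 1, or 2")
    return sum(item_scores)


def score_sst(reasoning_items, comprehension_items):
    """Return both summed scores: reasoning (0-16) and comprehension (0-10)."""
    return {
        "mental_state_reasoning": total_score(reasoning_items, 8),
        "comprehension": total_score(comprehension_items, 5),
    }


# Hypothetical rater judgments for one transcript.
example = score_sst([2, 1, 1, 0, 2, 1, 1, 1], [2, 2, 2, 2, 2])
```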

To assess spontaneous mental state inference, participants were asked a single question that simply asked them to summarize the story. Responses were coded for the presence or absence of a mental state inference. We had originally planned to code these responses not just for the presence versus absence of a mental state inference, but also for the number of mental state inferences, to use as a continuous variable. However, most participants provided very short summaries (1-3 sentences) and either made a single mental state inference (e.g., “Nick felt bad about breaking up with Marjorie.”) or none. Given that the summary question did not explicitly ask participants to make reference to the characters’ mental states, the unprompted mention of mental states should in theory reflect the relative importance and salience of mental states for the participant, and the propensity for the participant to think about mental states. This question was asked first, before the comprehension or mental state reasoning questions, in order not to prime participants with certain aspects of the story to summarize. We note, however, that prior to asking this question, participants were told, as part of the instructions, “We’re also interested in the character’s thoughts, feelings and intentions when it applies to the question.” Thus, though the question itself does not specifically ask for the mention of mental states, the extent to which the mention of mental states here can be considered truly unprimed or spontaneous should be interpreted with caution.

Scoring each participant’s transcript took between 5 and 10 minutes, depending on the length of the responses. All testing materials, including the questions, scoring instructions, and rubric, are provided in Text S1.

Interpersonal Reactivity Index

The Interpersonal Reactivity Index (IRI) is a 28-item self-report questionnaire that consists of the following four subscales: fantasy, perspective-taking, empathic concern, and personal distress [99,100]. The fantasy scale assesses the tendency to identify with fictional characters, become immersed in a narrative, and be mentally transported into a character’s mental and emotional life [92,93] (e.g., “When I am reading an interesting story or novel, I imagine how I would feel if the events in the story were happening to me.”). This subscale has been shown to be highly correlated with another measure of narrative immersion [93]. The perspective taking subscale assesses the tendency to adopt and reason about the mental states of others (e.g., “I sometimes try to understand my friends better by imagining how things look from their perspective.”). The empathic concern subscale assesses the tendency to consider the emotional states and experience sympathy for others (e.g., “I often have tender, concerned feelings for people less fortunate than me.”). The personal distress subscale assesses the tendency to experience negative affect in response to negative events experienced by others (e.g., “Being in a tense, emotional situation scares me.”). Each subscale consists of 7 items that are rated on a scale from 0 (does not describe me well) to 4 (describes me very well).

Reading the Mind in The Eyes Task

In the Reading the Mind in The Eyes Task – Revised (Eyes Task) [38], participants view 36 pictures of the eye region of actors’ faces, and judge which of four adjectives best describes the mental state being expressed through the eyes. Photographs are centrally displayed on the computer screen and the four adjectives (one correct adjective and three distractors) are placed in the four corners of the screen. Participants respond with one of four buttons on a keyboard corresponding to each of the four adjectives. Participants were instructed to respond as accurately as possible. The 36 experimental trials are preceded by a single practice trial. Upon request, participants were provided with a list of the adjectives and their definitions used in the task. E-prime 2.0 was used to present the stimuli and collect accuracy data.

General Procedure

Participants came to the lab to participate in one of several larger ongoing studies investigating social cognition in healthy and clinical populations. Upon entering the lab, all participants completed a general demographics questionnaire and the MINI to ensure eligibility. Most participants completed the IQ assessment and SST after these assessments and before the IRI and Eyes Task; however, a portion of participants completed the IQ assessment, SST, IRI, Eyes Task, and other behavioral experiments/questionnaires unrelated to the current study in a different order. One project did not collect IRI data, leaving 44 participants of the total sample with IRI data. After completing the experimental procedures, participants were debriefed and compensated for their time.

Statistical Analysis

Distributions of the comprehension score, mental state reasoning score, IRI, Eyes Task, and IQ were visually inspected for normality and outliers (±2.5 SD of the mean). Comprehension scores were substantially negatively skewed, indicating a ceiling effect. Given this distribution, these data were dichotomized into two groups: individuals who attained a perfect score of 10 (n = 36) and those who scored below 10 (n = 38). We analyzed comprehension data in this way instead of performing a median split (Mdn = 9), as a median split would have resulted in substantially unequal group n’s. Two participants’ IQ scores were more than 2.5 SD below the mean and were identified as outliers. These two values were Winsorized by replacing them with the next lowest non-outlying IQ score and subtracting 10% of that score to maintain variance.
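The two preprocessing steps can be sketched as below. This is a minimal illustration under stated assumptions, not the analysis code used in the study: it assumes the sample standard deviation, treats only low outliers, and takes the Winsorizing rule literally (replacement = next lowest non-outlying score minus 10% of that score). All names and data are hypothetical.

```python
from statistics import mean, stdev


def dichotomize_comprehension(scores, perfect=10):
    """True for a perfect comprehension score, False for anything below it."""
    return [score == perfect for score in scores]


def winsorize_low_outliers(values, sd_cutoff=2.5, shrink=0.10):
    """Replace values more than sd_cutoff SDs below the mean.

    Assumption: each low outlier is replaced by the lowest non-outlying
    value minus `shrink` (10%) of that value, a literal reading of the text.
    """
    lower_bound = mean(values) - sd_cutoff * stdev(values)  # sample SD assumed
    non_outlying = [v for v in values if v >= lower_bound]
    replacement = min(non_outlying) * (1 - shrink)
    return [replacement if v < lower_bound else v for v in values]


# Hypothetical IQ scores with one low outlier (80).
iq = [120, 118, 125, 122, 119, 121, 123, 117, 124, 120, 80]
iq_winsorized = winsorize_low_outliers(iq)
perfect_group = dichotomize_comprehension([10, 9, 10, 7])
```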

Inter-rater reliability of the comprehension and mental state reasoning score was assessed with the intraclass correlation coefficient (ICC) using the 25% of transcripts scored by the independent judge. Inter-rater agreement on the presence/absence of a spontaneous mental state inference in the spontaneous mental state inference summary question was assessed with the kappa coefficient. Internal consistency of the comprehension and mental state reasoning questions was assessed with Cronbach’s alpha. We note that by emphasizing content validity in the questions asked, that is, by having participants reason about a range of different mental states from a first- and second-order level, alpha levels will be negatively impacted [115,116].
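For readers less familiar with two of these statistics, the sketch below computes Cohen's kappa for two raters' yes/no codes and Cronbach's alpha for a participants-by-items score matrix, using their textbook definitions. It is an illustration only: the data are hypothetical, the ICC is omitted because the specific ICC form used is not stated here, and kappa is undefined when chance agreement equals 1.

```python
from statistics import variance


def cohens_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters' categorical codes."""
    n = len(rater_a)
    categories = set(rater_a) | set(rater_b)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    expected = sum(
        (rater_a.count(c) / n) * (rater_b.count(c) / n) for c in categories
    )
    return (observed - expected) / (1 - expected)


def cronbach_alpha(rows):
    """Internal consistency; rows = participants, columns = item scores."""
    k = len(rows[0])
    item_variances = [variance([row[i] for row in rows]) for i in range(k)]
    total_variance = variance([sum(row) for row in rows])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


# Hypothetical codes (1 = spontaneous inference present, 0 = absent).
kappa_example = cohens_kappa([1, 1, 0, 0], [1, 1, 0, 1])
# Hypothetical two-item scores for three participants.
alpha_example = cronbach_alpha([[0, 0], [1, 1], [2, 2]])
```

Because alpha rises with the average inter-item covariance, items that deliberately sample different kinds of mental states will tend to covary less and yield a lower alpha, which is the trade-off noted above.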

Subsequent analysis addressed four main questions. First, we examined the relationships among the SST variables to determine whether individuals who made a spontaneous mental state inference were also better at explicit mental state reasoning, and whether spontaneous mental state inference and explicit mental state reasoning were related to understanding the non-mental aspects of the story (comprehension score). Second, we examined whether any of the SST scores were related to demographic variables, including age, gender, and education. Third, given the verbal demands of the task, we examined whether any of the ToM variables from the SST (mental state reasoning score, spontaneous mental state inference) were associated with general intelligence (IQ). Fourth, to assess concurrent validity of the SST, we investigated the relationship between SST scores, the IRI, and Eyes Task performance. For all of these analyses, relationships between the mental state reasoning score and the other variables were evaluated with Pearson product-moment correlations, which are accompanied by 95% CIs (bias-corrected and accelerated) derived from 2,000 bootstrap samples. The relationship between the comprehension and spontaneous mental state inference scores was evaluated between groups (i.e., those with/without a perfect comprehension score, and those who made/did not make a spontaneous mental state inference), with two-sample t-tests or chi-square tests where appropriate. Statistical significance was defined as p < .05, two-tailed for all analyses. Statistical analysis was performed with R (
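The idea of a bootstrapped confidence interval around a Pearson correlation can be sketched as follows. Note the substitution: for brevity this sketch uses the simpler percentile interval, not the bias-corrected and accelerated (BCa) interval reported in this study, and it is written in Python for illustration even though the analysis was run in R. Function names, the seed, and the data are hypothetical.

```python
import random
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def bootstrap_percentile_ci(x, y, n_resamples=2000, level=0.95, seed=0):
    """Percentile bootstrap CI for r (a simpler stand-in for BCa)."""
    rng = random.Random(seed)
    n = len(x)
    estimates = []
    while len(estimates) < n_resamples:
        idx = [rng.randrange(n) for _ in range(n)]  # resample pairs with replacement
        xs, ys = [x[i] for i in idx], [y[i] for i in idx]
        if len(set(xs)) < 2 or len(set(ys)) < 2:
            continue  # skip degenerate resamples where r is undefined
        estimates.append(pearson_r(xs, ys))
    estimates.sort()
    tail = (1 - level) / 2
    lower = estimates[int(tail * n_resamples)]
    upper = estimates[int((1 - tail) * n_resamples) - 1]
    return lower, upper


# Hypothetical paired scores for eight participants.
x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [2, 1, 4, 3, 6, 5, 8, 7]
r = pearson_r(x, y)
ci_low, ci_high = bootstrap_percentile_ci(x, y)
```

Resampling participants as pairs keeps each person's two scores together, so the spread of the resampled correlations reflects sampling variability in the relationship itself.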


Inter-Rater Reliability and Internal Consistency

Inter-rater reliability was high for the mental state reasoning score (ICC = .98) as well as the comprehension score (ICC = .90). Inter-rater agreement on the presence versus absence of a spontaneous mental state inference was also high (kappa = .86). Unsurprisingly, given the range of content asked in the questions, internal consistency was low for the mental state reasoning questions (α = .54) and comprehension questions (α = .31).


For all SST scores, we visually inspected the distributions and conducted measures of skewness and kurtosis. For a unimodal normal distribution, a skew value of 0 indicates perfect symmetry of scores around the mean. Positive kurtosis values indicate that the distribution has relatively sharp peaks and fat tails relative to a normal distribution; negative kurtosis values indicate that the distribution has wide peaks and thin tails.
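For readers who want to see how these two statistics are obtained, here is a minimal sketch using the moment-based definitions (skewness as the standardized third moment, kurtosis as the standardized fourth moment minus 3). The exact estimator used in the paper is not stated, so this choice, the function name, and the example scores are assumptions.

```python
def skew_and_excess_kurtosis(values):
    """Moment-based skewness and excess kurtosis (0 and 0 for a normal curve)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # variance (population form)
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    skew = m3 / m2 ** 1.5
    excess_kurtosis = m4 / m2 ** 2 - 3
    return skew, excess_kurtosis


# Hypothetical scores bunched toward the high end, giving a negative skew.
scores = [2, 6, 8, 9, 9, 10, 10, 11]
skew, kurt = skew_and_excess_kurtosis(scores)
print(round(skew, 2), round(kurt, 2))
```

Statistical packages often apply small-sample corrections to these formulas, so values from R or other software can differ slightly from the raw moment-based versions.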

Mental state reasoning scores were relatively normally distributed with a slight negative skew (skew = -.72, kurtosis = .13) indicating an asymmetry in the distribution whereby the majority of scores were on the right side of the distribution (reflecting that the majority of individuals received scores of 8 out of 16 possible points or higher) (Figure 1). Importantly, there was substantial variation in performance across individuals with scores ranging from 2 to 14 (possible scores = 0-16), and no indication of a ceiling effect (0% of participants scoring 16/16 or 15/16). Mean score was 8.6 ± 2.6.

Figure 1. Distribution of the mental state reasoning score.

Data from the spontaneous mental state inference summary question were collected from 70 participants (four participants were not asked the spontaneous mental state inference question due to experimenter error). Fifty percent made at least one spontaneous mental state inference. Further analysis of this variable with other data proceeded with a dichotomized variable (i.e., individuals who did versus did not make a spontaneous mental state inference), as individuals tended to make either a single mental state inference or none.

Comprehension scores exhibited a substantial negative skew due to 48.6% of the participants performing at ceiling (skew = -.98, kurtosis = -.13). Performance ranged from 6 to 10 and the mean score was 9.0 ± 1.2 (possible scores = 0-10). Further analysis of the comprehension score was performed with the dichotomized variable; that is, those individuals who achieved a perfect score (n = 36) and those who did not (n = 38).

Relationship Between the SST Variables

Individuals who made a spontaneous mental state inference in the summary question had higher mental state reasoning scores (M = 9.3, SD = 2.0) compared to those individuals who did not make a spontaneous mental state inference (M = 8.0, SD = 3.0) (Figure 2). This difference was statistically significant, t(68) = 2.19, p = .032, Cohen’s d = .52.

Figure 2. Mental state reasoning score as a function of spontaneous mental state inference.

Mean mental state reasoning score of individuals who did (turquoise-colored bar) and individuals who did not (salmon-colored bar) make a spontaneous mental state inference in the summary question. Error bars depict standard error of the mean.
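The group comparison reported above can be approximated from the summary statistics alone. The sketch below computes a pooled-variance two-sample t statistic and Cohen's d. The group sizes of 35 each are an assumption inferred from the statement that 50% of the 70 participants made an inference, and the function name is hypothetical. Because the published means and SDs are rounded, the result only approximates the reported t(68) = 2.19 and d = .52.

```python
import math


def pooled_t_from_summary(m1, sd1, n1, m2, sd2, n2):
    """Two-sample t (equal variances assumed) and Cohen's d from summary stats."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    pooled_sd = math.sqrt(pooled_var)
    d = (m1 - m2) / pooled_sd
    t = (m1 - m2) / (pooled_sd * math.sqrt(1 / n1 + 1 / n2))
    return t, df, d


# Reported means/SDs; n = 35 per group is inferred, not stated directly.
t, df, d = pooled_t_from_summary(9.3, 2.0, 35, 8.0, 3.0, 35)
print(round(t, 2), df, round(d, 2))  # roughly t = 2.13, df = 68, d = 0.51
```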

Individuals who achieved a perfect score on the comprehension questions performed no differently on the mental state reasoning questions (M = 9.0, SD = 2.3) compared to those who had a score <10 (M = 8.1, SD = 2.9), t(72) = 1.47, p = .15, d = .34. Similarly, individuals who achieved a perfect score on the comprehension questions were as likely to make a spontaneous mental state inference (47.2%) as those who had a score <10 (47.4%), χ2(1, N = 74) = 0, p = 1.0.

Relationship Between ToM Performance on the SST and Demographic Variables

Mental state reasoning scores did not significantly differ by gender (Mmales = 9.0, SD = 2.4; Mfemales = 8.3, SD = 2.8), t(72) = .97, p = .33, d = .24, nor did they correlate with age or education (Table 2).

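| Variable | r | p | 95% CI |
|---|---|---|---|
| Age | -.12 | .29 | [-.36, .07] |
| Education | .19 | .11 | [-.06, .41] |
| IQ | .24 | .047 | [.02, .50] |
| IRI-Fantasy | .37 | .012 | [.17, .53] |
| IRI-Perspective Taking | -.07 | .65 | [-.35, .23] |
| IRI-Empathic Concern | -.07 | .67 | [-.29, .14] |
| IRI-Personal Distress | .05 | .76 | [-.36, .35] |
| Eyes Task | .49 | < .0001 | [.27, .68] |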

Table 2. Relationship Between Mental State Reasoning Score, Demographic Variables, IQ, and Social Variables.

Note. Bold values denote statistical significance at p < .05.

The number of males who made a spontaneous mental state inference (61.5%) did not significantly differ from the number of females who made a spontaneous mental state inference (43.2%), χ2(1, N = 70) = 2.20, p = .14. Similarly, neither age nor education differed between those who made a spontaneous mental state inference and those who did not (Table 3).

| Variable | Spontaneous Mental State Inference Group | No Spontaneous Mental State Inference Group | Between-Group Difference |
|---|---|---|---|
| Age (years) | 26.4 (8.3) | 29.1 (11.2) | t(68) = 1.15, p = .25, d = .27 |
| Education (years) | 15.6 (1.8) | 15.8 (2.0) | t(68) = .44, p = .66, d = .11 |
| IQ | 121.8 (8.3) | 119.5 (9.2) | t(63) = 1.07, p = .29, d = .26 |
| IRI-Fantasy | 17.1 (5.4) | 15.9 (5.5) | t(42) = .75, p = .46, d = .23 |
| IRI-Perspective Taking | 20.3 (5.2) | 18.5 (4.6) | t(42) = 1.23, p = .23, d = .37 |
| IRI-Empathic Concern | 21.2 (5.3) | 20.5 (4.1) | t(42) = .51, p = .61, d = .15 |
| IRI-Personal Distress | 9.9 (5.1) | 10 (5.5) | t(42) = .06, p = .95, d = .02 |
| Eyes Task (% correct) | 78.4 (7.3) | 76.3 (11.6) | t(62) = .85, p = .40, d = .21 |

Table 3. Relationship Between Spontaneous Mental State Inference, Demographic Variables, IQ, and Social Variables.

Note. Values represent means and standard deviations in parentheses. All tests were performed between individuals who did and those who did not make a spontaneous mental state inference.
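The chi-square comparison of spontaneous mental state inference by gender, reported above, can be illustrated with a 2 × 2 test of independence. In the sketch below, the cell counts (16 of 26 males, 19 of 44 females) are back-calculated from the reported percentages and N = 70, so they are an assumption rather than published values. The function name is hypothetical, and no continuity correction is applied.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (df = 1, no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # Upper-tail probability of a chi-square variate with 1 degree of freedom.
    p = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p


# Rows: males, females. Columns: made an inference, did not (inferred counts).
chi2, p = chi_square_2x2(16, 10, 19, 25)
print(round(chi2, 2), round(p, 2))  # roughly 2.20 and 0.14
```

Under these inferred counts, the uncorrected statistic lines up with the reported χ2(1, N = 70) = 2.20, p = .14.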

Relationship Between ToM Performance on the SST and IQ

Mental state reasoning scores exhibited a statistically significant relationship with IQ such that higher mental state reasoning scores were associated with higher IQ (Table 2, Figure 3). Sixty-five participants had IQ data and were asked the spontaneous mental state inference question. There was no difference in IQ between individuals who made a spontaneous mental state inference and individuals who did not (Table 3).

Figure 3. Relationship between mental state reasoning scores and IQ, fantasy, and eyes task scores.

Shaded area represents 95% CIs.

Concurrent Validity of the SST and Other Measures of Social Cognition

In order to evaluate concurrent validity of the SST, we examined ToM performance on the SST with the IRI and Eyes Task. IRI data were collected for 44 participants. Performance on the Eyes Task ranged from 50 to 94.4% correct. Mean performance was 77.4 ± 9.8% correct, which is similar to other studies of non-clinical populations (e.g., [38,103,117]).

Mental state reasoning scores on the SST exhibited a statistically significant relationship with the fantasy subscale such that better performance was associated with higher fantasy scores (Table 2, Figure 3). This relationship was not found with the other IRI subscales. Mental state reasoning scores also exhibited a statistically significant relationship with the Eyes Task such that better performance was associated with greater accuracy on the Eyes Task (Table 2, Figure 3). This relationship was preserved in the subset of 43 participants who had IRI and Eyes Task data, r(41) = .59, p < .0001, 95% CI [.32, .77].

Performance on the Eyes Task was significantly correlated with IQ, r(66) = .24, p = .048, 95% CI [-.03, .50]. Thus, in order to evaluate the relative contribution of IQ to the relationship between SST mental state reasoning and Eyes Task performance, we conducted a partial correlation controlling for IQ, which did not alter the relationship, r(65) = .45, p < .0001, 95% CI [.26, .62]. Fantasy scores were not associated with IQ, r(42) = -.13, p = .39, 95% CI [-.41, .15]. Controlling for IQ also did not alter the relationship between mental state reasoning and IRI fantasy scores, r(41) = .42, p = .003, 95% CI [.18, .63].
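A first-order partial correlation can be computed directly from the three pairwise correlations. The sketch below plugs in the zero-order values reported in the text and Table 2. Because those correlations come from slightly different subsets of participants and are rounded, the output only approximates the partial correlations reported above; the function name is hypothetical.

```python
import math


def partial_r(r_xy, r_xz, r_yz):
    """Correlation between x and y after removing the linear effect of z."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# x = mental state reasoning, z = IQ; y = Eyes Task or IRI fantasy.
eyes_partial = partial_r(r_xy=0.49, r_xz=0.24, r_yz=0.24)
fantasy_partial = partial_r(r_xy=0.37, r_xz=0.24, r_yz=-0.13)
print(round(eyes_partial, 2), round(fantasy_partial, 2))  # roughly 0.46 and 0.42
```

The fantasy value rises slightly once IQ is controlled because IQ correlates positively with mental state reasoning but weakly negatively with fantasy scores.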

To further evaluate whether mental state reasoning, specifically, was associated with the fantasy scale and performance on the Eyes Task, as opposed to some other aspect of the task such as general reading or verbal ability, we looked at these measures as a function of comprehension score. Fantasy scores in the perfect comprehension group (M = 16.6, SD = 4.7) did not differ from those in the <10 group (M = 16.5, SD = 6.0), t(42) = .06, p = .95, d = .02. Similarly, Eyes Task performance in the perfect comprehension group (M = 78.1, SD = 9.5) did not differ from those in the <10 group (M = 76.7, SD = 10.1), t(66) = .59, p = .56, d = .14.

Lastly, we evaluated whether making a spontaneous mental state inference on the summary question was also associated with the IRI and Eyes Task. Individuals who made a spontaneous mental state inference on the summary question had higher scores on all subscales of the IRI (particularly perspective-taking, d = .37) except personal distress, and the Eyes Task; however, none of these differences were statistically significant (Table 3).

Discussion


Here, we report findings from the Short Story Task (SST), a new measure of ToM ability for adults. This task was designed to improve upon limitations inherent in existing ToM tasks. More specifically, the SST was designed to provide a relatively sensitive metric of ToM ability in adults, capable of picking up on individual differences and normal variation in ToM ability, with assessment procedures that were quick and easy to administer and score reliably. Furthermore, the task stimulus (the short story) was representative of a real-world, dynamically unfolding, complicated social scenario that required the application of social knowledge, and participants answered questions that assessed both explicit mental state reasoning and spontaneous mental state inference.

We found that on our measure of explicit mental state reasoning, participants demonstrated substantial variation in performance across almost the full range of possible scores. There was no indication of a ceiling effect as no participant received a perfect score of 16 out of 16 possible points. This variation suggests that the SST is sensitive to individual differences in ToM ability; a clear improvement from many of the existing ToM tasks. The improvement in sensitivity could be related to the fact that participants were asked to reason about a dynamically unfolding social scenario that required the consideration of the social context. This scenario was far more complicated in terms of the social context, emotions, and intentions ostensibly experienced by the story characters compared to the simple vignettes used in other ToM tasks. Furthermore, the scoring rubric was tailored to award higher scores for responses that were not only more accurate, but considered the mental life of several characters at once.

On the spontaneous mental state inference question, half of the participants made an unprompted mention of a character’s belief, emotion, desire, or intention. Interestingly, participants who made a spontaneous mental state inference performed better on the explicit mental state reasoning questions, suggesting that increased salience of mental state information and a greater propensity to think about it are associated with better conscious reasoning about mental states. Though data from young infants [86-88] and individuals with autism spectrum disorders [89,90] suggest that the capacity to spontaneously attribute mental states may be relatively independent of explicit mental state reasoning, our data suggest that, at least in healthy adults, these two processes may be related. Furthermore, the fact that performance is related on these two SST measures, which theoretically index aspects of ToM ability, provides additional evidence that the SST measures the underlying construct of ToM. We also note that a higher percentage of males (61.5%) made a spontaneous mental state inference than females (43.2%). Though not statistically significant (p = .14), this pattern of results is not typical of tasks assessing aspects of ToM and empathy [32,38,118], perhaps, in part, because of the shared variance between ability on these measures and autistic/schizotypal traits, which may be higher in males [119-121]. With that said, many of these findings are with tasks testing explicit mental state reasoning; less is known about gender differences in spontaneous mental state reasoning.

We examined several additional psychometric properties of the SST, including inter-rater reliability, concurrent validity, and internal consistency. Inter-rater reliability was excellent for the mental state reasoning and comprehension scores, as well as judgments on the presence versus absence of a spontaneous mental state inference. This highlights the SST as a measure that is relatively easy to score reliably. We tested whether the SST scores exhibited concurrent validity with other commonly used measures of social cognition that exhibit adequate psychometric properties. We found that better performance on the mental state reasoning questions was positively associated with scores on the fantasy scale of the IRI and performance on the Eyes Task. The fact that the IRI and Eyes Task differ from the SST on several important dimensions (i.e., the IRI being self-report and the fantasy scale measuring the tendency to become immersed in the mental life of fictional characters; the Eyes Task testing mental state decoding) provides strong support for the validity of the SST as measuring the underlying construct of ToM ability. Internal consistency was low for the mental state reasoning questions, which is not surprising given the several different facets of ToM ability probed by the questions (e.g., inferences regarding epistemic, affective, intentional states, first- and second-order inferences, etc.). Here, adequate content validity might have made some questions more difficult than others, decreasing this statistic [115,116], which in our opinion is a worthwhile tradeoff. Furthermore, the alpha values observed for the mental state reasoning score are similar or superior to those derived from other ToM ability tests (e.g., [122,123]).

The correlation between the fantasy scale and SST mental state reasoning performance is consistent with other studies showing significant positive inter-relationships among the fantasy scale, Eyes Task performance, and exposure to fiction. More specifically, healthy adults who report greater exposure to fiction report higher scores on the fantasy scale (but not other subscales of the IRI) and perform better on the Eyes Task, even after controlling for demographic and personality variables [92,93]. Additional research has demonstrated that greater transportation into the emotional life of fictional characters is associated with increased empathy over time [96]. The current data provide additional evidence that individuals who become immersed in the mental life of fictional characters perform better on ToM tasks. This raises the intriguing possibility that fiction reading actually improves ToM ability. Though our data cannot speak to causation, findings from preschoolers demonstrate that increased exposure to storybooks predicts better ToM ability [94]. Given that preschool children are unable to control the type of media they are exposed to, self-selection effects (i.e., individuals who are better at ToM simply enjoy reading fiction more) are unlikely. Furthermore, it has been shown that adults randomly assigned to read a short piece of literary fiction outperform individuals assigned to read non-fiction on a variety of ToM tasks, including the Eyes Task [124]. Fiction reading could improve ToM through several routes. One possibility is that fiction provides an opportunity to simulate the character’s social experience and thus provides a forum for the reader to practice reasoning about others’ mental states, and using that information to imaginatively implement appropriate social behaviors. 
Another possibility is that fiction helps readers build their social knowledge by exposing them to social rules and contingencies presented in the context of the story [91,92]. If reading fiction does indeed improve ToM ability, it would have obvious clinical applications, as it could be an easily implemented and cost-effective intervention for individuals with ToM impairment. Additional research has demonstrated that brief exposure to short fictional stories decreases one’s need for cognitive closure, specifically the need for order and structure and discomfort with ambiguity [125]. Such decreased rigidity regarding intolerance of uncertainty may be a similar skill to that trained by many interventions that aim to improve impaired social cognitive abilities, such as Cognitive Enhancement Therapy [126], Social Cognition and Interaction Training [8], and Social Cognitive Skills Training [127]. These interventions aim, in part, to reduce “jumping to conclusions” (i.e., forming rigid interpretations not amenable to disconfirming evidence) regarding what other individuals may be thinking, feeling, or intending, and foster an individual’s ability to flexibly evaluate multiple interpretations of other individuals’ behavior. Thus, in addition to potentially improving ToM ability per se, fiction reading may additionally cultivate more general skills that subserve social cognitive ability.

Mental state reasoning scores and spontaneously inferring a character’s mental state were unrelated to understanding the non-mental aspects of the story, suggesting that our questions isolated ToM ability and not general reading ability. With that said, despite our efforts to reduce the non-social cognitive demands of the task (i.e., verbal ability, memory) by using a story with relatively easy-to-read prose, allowing participants to refer back to the story as needed, and providing them with the questions, mental state reasoning scores exhibited a significant, although weak, positive association with IQ. We found a similar positive relationship between IQ and our other social-cognitive ability measure, the Eyes Task. Positive associations between IQ or verbal ability and ToM have been found in studies with children [128,129], individuals with schizophrenia [1,34], and individuals with autism spectrum disorders [130-134]. This relationship becomes especially apparent when ToM is tested with verbal stimuli, as in our study. Given the verbal demands of the SST, it is not surprising that there exists some relationship between IQ and ToM ability as measured here. Importantly, despite this relationship, we found SST task performance to be related to the Eyes Task and the fantasy scale even after controlling for IQ. Additionally, the fact that comprehension scores were not related to either the IRI or Eyes Task provides further support that the mental state reasoning score is indexing ToM ability and not some peripheral cognitive process or ability that is concomitant with mental state reasoning.

Several limitations are notable. First, our measure of spontaneous mental state inference, while associated with performance on the mental state reasoning questions, was not associated with performance on either the IRI or Eyes Task. It is noteworthy that individuals who did make a spontaneous mental state inference had higher scores on several IRI subscales and the Eyes Task, with reasonable effect sizes (e.g., perspective-taking d = .37); however, statistical significance (p < .05) was not achieved. We probed spontaneous mental state inference with a single question and coded responses into a dichotomous variable, all of which may have limited the sensitivity of the measure and our ability to pick up on individual differences. Spontaneous mental state inference may be better evaluated with tasks that capture a wider range of performance. Eye-tracking patterns during visual inspection of social images, for example, may be a better proxy of real-world social interaction in which mental state information is often initially processed through gaze following [89]. It will also be important to tease apart the spontaneous mention of mental states relative to the spontaneous mention of non-mental state content (e.g., [135]), something that we were unable to investigate here due to the limited mention of mental states and short overall responses. Furthermore, as part of the instructions, which were administered prior to this question, participants were asked to consider the story characters’ thoughts, feelings, and intentions when it applied to the question. As a consequence, it is unclear whether the mention of mental states here can be considered truly spontaneous. 
With that said, only half of participants made a mental state inference to this question suggesting that the mention of mental states here was not considered mandatory (as could have been interpreted from the instructions), and reflects differences in the salience or importance of mental states to the participant as central to the story’s events. Second, we do not have data speaking to the predictive validity of the SST, specifically concerning real-world social outcomes. Given the relationship between ToM and social functioning, we would expect SST ToM scores to predict social skills and social success both longitudinally and cross-sectionally. Experience sampling methods that allow for repeated, momentary assessment of real-world social interaction would be well suited to address this important question. Lastly, we tested the SST with a relatively small number of participants. As a consequence, many of the analyses may have been underpowered (e.g., the correlations between SST scores and IRI scores where n = 44 or less) and should be interpreted with caution.

In summary, the SST represents a new task for assessing ToM ability in adults that is sensitive to individual differences, correlates with other well-established measures of ToM ability, and is relatively quick and easy to administer and score. Given the diversity of contexts in which mental state attributions are made [25], we recommend the use of this task with other measures of social cognition that test ToM in these different contexts. There is still much progress to be made in the assessment of ToM and we hope that the use of this task will be fruitful in that endeavor.

Supporting Information

Text S1.

Short Story Task Administration and Scoring Materials.



The authors would like to thank Cheryl Best, Matthew Yung, Sarah Rosenkrantz, Jessica Li, Diana Li, Steven Felix, and K. Juston Osborne for assistance with data collection and transcription, T.J. Eisenstein, Laura M. Tully, Laura Germine, and Dianne M. Hezel for helpful feedback regarding task design and implementation, and an anonymous reviewer for helpful comments.

Author Contributions

Conceived and designed the experiments: DDF JPC CIH. Performed the experiments: DDF. Analyzed the data: DDF SHL. Wrote the manuscript: DDF CIH.


  1. 1. Bora E, Yucel M, Pantelis C (2009) Theory of mind impairment in schizophrenia: meta-analysis. Schizophr Res 109: 1-9. doi: PubMed: 19195844.
  2. 2. Sprong M, Schothorst P, Vos E, Hox J, van Engeland H (2007) Theory of mind in schizophrenia: meta-analysis. Br J Psychiatry 191: 5-13. doi: PubMed: 17602119.
  3. 3. Chung YS, Barch D, Strube M (2013) A Meta-Analysis of Mentalizing Impairments in Adults With Schizophrenia and Autism Spectrum Disorder. Schizophr Bull.
  4. 4. Fett AK, Viechtbauer W, Dominguez MD, Penn DL, van Os J et al. (2011) The relationship between neurocognition and social cognition with functional outcomes in schizophrenia: a meta-analysis. Neurosci Biobehav Rev 35: 573-588. doi: PubMed: 20620163.
  5. 5. Couture SM, Penn DL, Roberts DL (2006) The functional significance of social cognition in schizophrenia: a review. Schizophr Bull 32 Suppl 1: S44-S63. doi: PubMed: 16916889.
  6. 6. Tager-Flusberg H (2003) Exploring the relationships between theory of mind and social-communicative functioning in children with autism. In: B. RepacholiV. Slaughter. Individual differences in theory of mind: Implications for typical and atypical development. London: Psychology Press. pp. 197-212.
  7. 7. Eack SM, Greenwald DP, Hogarty SS, Cooley SJ, DiBarry AL et al. (2009) Cognitive enhancement therapy for early-course schizophrenia: effects of a two-year randomized controlled trial. Psychiatr Serv 60: 1468-1476. doi: PubMed: 19880464.
  8. 8. Combs DR, Adams SD, Penn DL, Roberts D, Tiegreen J et al. (2007) Social Cognition and Interaction Training (SCIT) for inpatients with schizophrenia spectrum disorders: preliminary findings. Schizophr Res 91: 112-116. doi: PubMed: 17293083.
  9. 9. Mazza M, Lucci G, Pacitti F, Pino MC, Mariano M et al. (2010) Could schizophrenic subjects improve their social cognition abilities only with observation and imitation of social situations? Neuropsychol Rehabil 20: 675-703. doi: PubMed: 20714969.
  10. 10. Tas C, Danaci AE, Cubukcuoglu Z, Brüne M (2012) Impact of family involvement on social cognition training in clinically stable outpatients with schizophrenia -- a randomized pilot study. Psychiatry Res 195: 32-38. doi: PubMed: 21831453.
  11. 11. Fiszdon JM, Reddy LF (2012) Review of social cognitive treatments for psychosis. Clin Psychol Rev 32: 724-740. doi: PubMed: 23059624.
  12. 12. Kurtz MM, Richardson CL (2012) Social cognitive training for schizophrenia: a meta-analytic investigation of controlled research. Schizophr Bull 38: 1092-1104. doi: PubMed: 21525166.
  13. 13. de Waal FB (2008) Putting the altruism back into altruism: the evolution of empathy. Annu Rev Psychol 59: 279-300. doi: PubMed: 17550343.
  14. 14. Gonzalez-Liencres C, Shamay-Tsoory SG, Brüne M (2013) Towards a neuroscience of empathy: Ontogeny, phylogeny, brain mechanisms, context and psychopathology. Neurosci Biobehav Rev 37: 1537-1548. doi: PubMed: 23680700.
  15. 15. Bruneau EG, Saxe R (2012) The power of being heard: The benefits of 'perspective-giving' in the context of intergroup conflict. J Exp Soc Psychol 48: 855-866. doi:
  16. 16. Cushman F, Young L (2011) Patterns of moral judgment derive from nonmoral psychological representations. Cogn Sci 35: 1052-1075. doi: PubMed: 21790743.
  17. 17. Young L, Cushman F, Hauser M, Saxe R (2007) The neural basis of the interaction between theory of mind and moral judgment. Proc Natl Acad Sci U S A 104: 8235-8240. doi: PubMed: 17485679.
  18. 18. Young L, Saxe R (2009) Innocent intentions: a correlation between forgiveness for accidental harm and neural activity. Neuropsychologia 47: 2065-2072. doi: PubMed: 19467357.
  19. 19. Young L, Saxe R (2011) When ignorance is no excuse: Different roles for intent across moral domains. Cognition 120: 202-214. doi: PubMed: 21601839.
  20. 20. Young L, Saxe R (2008) The neural basis of belief encoding and integration in moral judgment. Neuroimage 40: 1912-1920. doi: PubMed: 18342544.
  21. 21. Young L, Saxe R (2009) An FMRI investigation of spontaneous mental state inference for moral judgment. J Cogn Neurosci 21: 1396-1405. doi: PubMed: 18823250.
  22. 22. Galinsky AD, Maddux WW, Gilin D, White JB (2008) Why it pays to get inside the head of your opponent: the differential effects of perspective taking and empathy in negotiations. Psychol Sci 19: 378-384. doi: PubMed: 18399891.
  23. 23. Long ECJ, Andrews DW (1990) Perspective Taking as a Predictor of Marital Adjustment. J Pers Soc Psychol 59: 126-131. doi:
  24. 24. Franzoi SL, Davis MH, Young RD (1985) The Effects of Private Self-Consciousness and Perspective Taking on Satisfaction in Close Relationships. J Pers Soc Psychol 48: 1584-1594. doi: PubMed: 4020610.
  25. 25. Achim AM, Guitton M, Jackson PL, Boutin A, Monetta L (2013) On what ground do we mentalize? Characteristics of current tasks and sources of information that contribute to mentalizing judgments. Psychol Assess 25: 117-126. doi: PubMed: 22731676.
  26. 26. Corcoran R, Cahill C, Frith CD (1997) The appreciation of visual jokes in people with schizophrenia: a study of 'mentalizing' ability. Schizophr Res 24: 319-327. doi: PubMed: 9134592.
  27. 27. Frith CD, Corcoran R (1996) Exploring 'theory of mind' in people with schizophrenia. Psychol Med 26: 521-530. doi: PubMed: 8733211.
  28. 28. Rowe AD, Bullock PR, Polkey CE, Morris RG (2001) "Theory of mind" impairments and their relationship to executive functioning following frontal lobe excisions. Brain 124: 600-616. doi: PubMed: 11222459.
  29. 29. Corcoran R, Mercer G, Frith CD (1995) Schizophrenia, symptomatology and social inference: investigating "theory of mind" in people with schizophrenia. Schizophr Res 17: 5-13. doi: PubMed: 8541250.
  30. 30. Happé FG (1994) An advanced test of theory of mind: understanding of story characters' thoughts and feelings by able autistic, mentally handicapped, and normal children and adults. J Autism Dev Disord 24: 129-154. doi: PubMed: 8040158.
  31. 31. Stone VE, Baron-Cohen S, Knight RT (1998) Frontal lobe contributions to theory of mind. J Cogn Neurosci 10: 640-656. doi: PubMed: 9802997.
  32. 32. Baron-Cohen S, O'Riordan M, Stone V, Jones R, Plaisted K (1999) Recognition of faux pas by normally developing children and children with Asperger syndrome or high-functioning autism. J Autism Dev Disord 29: 407-418. doi: PubMed: 10587887.
  33. 33. Brunet E, Sarfati Y, Hardy-Baylé MC (2003) Reasoning about physical causality and other's intentions in schizophrenia. Cogn Neuropsychiatry 8: 129-139. doi: PubMed: 16571555.
  34. 34. Brüne M (2003) Theory of mind and the role of IQ in chronic disorganized schizophrenia. Schizophr Res 60: 57-64. doi: PubMed: 12505138.
  35. 35. Abell F, Happe F, Frith U (2000) Do triangles play tricks? Attribution of mental states to animated shapes in normal and abnormal development. Cogn Dev 15: 1-16. doi:
  36. 36. White SJ, Coniston D, Rogers R, Frith U (2011) Developing the Frith-Happe animations: A quick and objective test of Theory of Mind for adults with autism. Autism Res.
  37. 37. Horan WP, Nuechterlein KH, Wynn JK, Lee J, Castelli F et al. (2009) Disturbances in the spontaneous attribution of social meaning in schizophrenia. Psychol Med 39: 635-643. doi: PubMed: 18606048.
  38. 38. Baron-Cohen S, Wheelwright S, Hill J, Raste Y, Plumb I (2001) The "Reading the Mind in the Eyes" Test revised version: a study with normal adults, and adults with Asperger syndrome or high-functioning autism. J Child Psychol Psychiatry 42: 241-251. doi: PubMed: 11280420.
  39. 39. Kerr N, Dunbar RI, Bentall RP (2003) Theory of mind deficits in bipolar affective disorder. J Affect Disord 73: 253-259. doi: PubMed: 12547294.
  40. 40. Bora E, Vahip S, Gonul AS, Akdeniz F, Alkan M et al. (2005) Evidence for theory of mind deficits in euthymic patients with bipolar disorder. Acta Psychiatr Scand 112: 110-116. doi: PubMed: 15992392.
  41. 41. Bora E, Yücel M, Pantelis C (2009) Theory of mind impairment: a distinct trait-marker for schizophrenia spectrum disorders and bipolar disorder? Acta Psychiatr Scand 120: 253-264. doi: PubMed: 19489747.
  42. 42. Shamay-Tsoory SG, Tomer R, Berger BD, Aharon-Peretz J (2003) Characterization of empathy deficits following prefrontal brain damage: the role of the right ventromedial prefrontal cortex. J Cogn Neurosci 15: 324-337. doi: PubMed: 12729486.
  43. 43. Samson D, Apperly IA, Chiavarino C, Humphreys GW (2004) Left temporoparietal junction is necessary for representing someone else's belief. Nat Neurosci 7: 499-500. doi: PubMed: 15077111.
  44. 44. Apperly IA, Samson D, Chiavarino C, Humphreys GW (2004) Frontal and temporo-parietal lobe contributions to theory of mind: neuropsychological evidence from a false-belief task with reduced language and executive demands. J Cogn Neurosci 16: 1773-1784. doi: PubMed: 15701227.
  45. Achim AM, Ouellet R, Roy MA, Jackson PL (2012) Mentalizing in first-episode psychosis. Psychiatry Res 196: 207-213. PubMed: 22377576.
  46. Bazin N, Brunet-Gouet E, Bourdet C, Kayser N, Falissard B et al. (2009) Quantitative assessment of attribution of intentions to others in schizophrenia using an ecological video-based task: a comparison with manic and depressed patients. Psychiatry Res 167: 28-35. PubMed: 19346006.
  47. Bird CM, Castelli F, Malik O, Frith U, Husain M (2004) The impact of extensive medial frontal lobe damage on 'Theory of Mind' and cognition. Brain 127: 914-928. PubMed: 14998913.
  48. Brüne M, Abdel-Hamid M, Lehmkämper C, Sonntag C (2007) Mental state attribution, neurocognitive functioning, and psychopathology: what predicts poor social competence in schizophrenia best? Schizophr Res 92: 151-159. PubMed: 17346931.
  49. Brüne M, Schaub D (2012) Mental state attribution in schizophrenia: what distinguishes patients with "poor" from patients with "fair" mentalising skills? Eur Psychiatry 27: 358-364. PubMed: 21288697.
  50. Gregory C, Lough S, Stone V, Erzinclioglu S, Martin L et al. (2002) Theory of mind in patients with frontal variant frontotemporal dementia and Alzheimer's disease: theoretical and practical implications. Brain 125: 752-764. PubMed: 11912109.
  51. Happé F, Brownell H, Winner E (1999) Acquired 'theory of mind' impairments following stroke. Cognition 70: 211-240. PubMed: 10384736.
  52. Herold R, Tényi T, Lénárd K, Trixler M (2002) Theory of mind deficit in people with schizophrenia during remission. Psychol Med 32: 1125-1129. PubMed: 12214792.
  53. Inoue Y, Yamada K, Hirano M, Shinohara M, Tamaoki T et al. (2006) Impairment of theory of mind in patients in remission following first episode of schizophrenia. Eur Arch Psychiatry Clin Neurosci 256: 326-328. PubMed: 16927040.
  54. Marjoram D, Gardner C, Burns J, Miller P, Lawrie SM et al. (2005) Symptomatology and social inference: A theory of mind study of schizophrenia and psychotic affective disorder. Cogn Neuropsychiatry 10: 347-359. PubMed: 16571466.
  55. Sarfati Y, Hardy-Baylé MC, Besche C, Widlöcher D (1997) Attribution of intentions to others in people with schizophrenia: a non-verbal exploration with comic strips. Schizophr Res 25: 199-209. PubMed: 9264175.
  56. Shamay-Tsoory SG, Aharon-Peretz J (2007) Dissociable prefrontal networks for cognitive and affective theory of mind: a lesion study. Neuropsychologia 45: 3054-3067. PubMed: 17640690.
  57. Slessor G, Phillips LH, Bull R (2007) Exploring the specificity of age-related differences in theory of mind tasks. Psychol Aging 22: 639-643. PubMed: 17874961.
  58. Versmissen D, Janssen I, Myin-Germeys I, Mengelers R, Campo JA et al. (2008) Evidence for a relationship between mentalising deficits and paranoia over the psychosis continuum. Schizophr Res 99: 103-110. PubMed: 17936589.
  59. Zaitchik D, Koff E, Brownell H, Winner E, Albert M (2006) Inference of beliefs and emotions in patients with Alzheimer's disease. Neuropsychology 20: 11-20. PubMed: 16460218.
  60. Corcoran R, Frith CD (2003) Autobiographical memory and theory of mind: evidence of a relationship in schizophrenia. Psychol Med 33: 897-905. PubMed: 12877404.
  61. Craig JS, Hatton C, Craig FB, Bentall RP (2004) Persecutory beliefs, attributions and theory of mind: comparison of patients with paranoid delusions, Asperger's syndrome and healthy controls. Schizophr Res 69: 29-33. PubMed: 15145468.
  62. Bertrand MC, Sutton H, Achim AM, Malla AK, Lepage M (2007) Social cognitive impairments in first episode psychosis. Schizophr Res 95: 124-133. PubMed: 17630261.
  63. Bora E, Gökçen S, Kayahan B, Veznedaroglu B (2008) Deficits of social-cognitive and social-perceptual aspects of theory of mind in remitted patients with schizophrenia: effect of residual symptoms. J Nerv Ment Dis 196: 95-99. PubMed: 18277216.
  64. Martino DJ, Bucay D, Butman JT, Allegri RF (2007) Neuropsychological frontal impairments and negative symptoms in schizophrenia. Psychiatry Res 152: 121-128. PubMed: 17507100.
  65. Zhu CY, Lee TM, Li XS, Jing SC, Wang YG et al. (2007) Impairments of social cues recognition and social functioning in Chinese people with schizophrenia. Psychiatry Clin Neurosci 61: 149-158. PubMed: 17362432.
  66. de Achával D, Costanzo EY, Villarreal M, Jáuregui IO, Chiodi A et al. (2010) Emotion processing and theory of mind in schizophrenia patients and their unaffected first-degree relatives. Neuropsychologia 48: 1209-1215. PubMed: 20026084.
  67. Hooker CI, Bruce L, Lincoln SH, Fisher M, Vinogradov S (2011) Theory of mind skills are related to gray matter volume in the ventromedial prefrontal cortex in schizophrenia. Biol Psychiatry 70: 1169-1178. PubMed: 21917239.
  68. Pijnenborg GH, Withaar FK, Evans JJ, van den Bosch RJ, Timmerman ME et al. (2009) The predictive value of measures of social cognition for community functioning in schizophrenia: implications for neuropsychological assessment. J Int Neuropsychol Soc 15: 239-247. PubMed: 19203437.
  69. Janssen I, Krabbendam L, Jolles J, van Os J (2003) Alterations in theory of mind in patients with schizophrenia and non-psychotic relatives. Acta Psychiatr Scand 108: 110-117. PubMed: 12823167.
  70. Anselmetti S, Bechi M, Bosia M, Quarticelli C, Ermoli E et al. (2009) 'Theory' of mind impairment in patients affected by schizophrenia and in their parents. Schizophr Res 115: 278-285. PubMed: 19818586.
  71. Montag C, Neuhaus K, Lehmann A, Krüger K, Dziobek I et al. (2011) Subtle deficits of cognitive theory of mind in unaffected first-degree relatives of schizophrenia patients. Eur Arch Psychiatry Clin Neurosci 262: 217-226. PubMed: 21892777.
  72. Bora E, Pantelis C (2013) Theory of mind impairments in first-episode psychosis, individuals at ultra-high risk for psychosis and in first degree relatives: Systematic review and meta-analysis. Schizophr Res.
  73. Dorris L, Espie CAE, Knott F, Salt J (2004) Mind-reading difficulties in the siblings of people with Asperger's syndrome: evidence for a genetic influence in the abnormal development of a specific cognitive domain. J Child Psychol Psychiatry 45: 412-418. PubMed: 14982254.
  74. Gokcen S, Bora E, Erermis S, Kesikci H, Aydin C (2009) Theory of mind and verbal working memory deficits in parents of autistic children. Psychiatry Res 166: 46-53. PubMed: 19200606.
  75. Baron-Cohen S, Hammer J (1997) Parents of children with Asperger Syndrome: What is the cognitive phenotype? J Cogn Neurosci 9: 548-554. PubMed: 23968217.
  76. Losh M, Piven J (2007) Social-cognition and the broad autism phenotype: identifying genetically meaningful phenotypes. J Child Psychol Psychiatry 48: 105-112. PubMed: 17244276.
  77. Kim HS, Shin NY, Jang JH, Kim E, Shim G et al. (2011) Social cognition and neurocognition as predictors of conversion to psychosis in individuals at ultra-high risk. Schizophr Res 130: 170-175. PubMed: 21620681.
  78. Green MF, Bearden CE, Cannon TD, Fiske AP, Hellemann GS et al. (2012) Social cognition in schizophrenia, Part 1: performance across phase of illness. Schizophr Bull 38: 854-864. PubMed: 21345917.
  79. Thompson A, Papas A, Bartholomeusz C, Allott K, Amminger GP et al. (2012) Social cognition in clinical "at risk" for psychosis and first episode psychosis populations. Schizophr Res 141: 204-209. PubMed: 22959742.
  80. Wimmer H, Perner J (1983) Beliefs about beliefs: representation and constraining function of wrong beliefs in young children's understanding of deception. Cognition 13: 103-128. PubMed: 6681741.
  81. Baron-Cohen S, Leslie AM, Frith U (1986) Mechanical, behavioural and Intentional understanding of picture stories in autistic children. Br J Dev Psychol 4: 113-125.
  82. Gopnik A, Astington JW (1988) Children's understanding of representational change and its relation to the understanding of false belief and the appearance-reality distinction. Child Dev 59: 26-37. PubMed: 3342716.
  83. Kinderman P, Dunbar R, Bentall RP (1998) Theory-of-mind deficits and causal attributions. Br J Psychol 89: 191-204.
  84. Apperly IA, Samson D, Humphreys GW (2009) Studies of Adults Can Inform Accounts of Theory of Mind Development. Dev Psychol 45: 190-201. PubMed: 19210001.
  85. Frith CD, Frith U (2008) Implicit and explicit processes in social cognition. Neuron 60: 503-510. PubMed: 18995826.
  86. Onishi KH, Baillargeon R (2005) Do 15-month-old infants understand false beliefs? Science 308: 255-258. PubMed: 15821091.
  87. Southgate V, Senju A, Csibra G (2007) Action anticipation through attribution of false belief by 2-year-olds. Psychol Sci 18: 587-592. PubMed: 17614866.
  88. Surian L, Caldi S, Sperber D (2007) Attribution of beliefs by 13-month-old infants. Psychol Sci 18: 580-586. PubMed: 17614865.
  89. Senju A, Southgate V, White S, Frith U (2009) Mindblind eyes: an absence of spontaneous theory of mind in Asperger syndrome. Science 325: 883-885. PubMed: 19608858.
  90. Moran JM, Young LL, Saxe R, Lee SM, O'Young D et al. (2011) Impaired theory of mind for moral judgment in high-functioning autism. Proc Natl Acad Sci U S A 108: 2688-2692. PubMed: 21282628.
  91. Mar RA, Oatley K (2008) The function of fiction is the abstraction and simulation of social experience. Perspect Psychol Sci 3: 173-192.
  92. Mar RA, Oatley K, Hirsh J, dela Paz J, Peterson JB (2006) Bookworms versus nerds: Exposure to fiction versus non-fiction, divergent associations with social ability, and the simulation of fictional social worlds. J Res Pers 40: 694-712.
  93. Mar RA, Oatley K, Peterson JB (2009) Exploring the link between reading fiction and empathy: Ruling out individual differences and examining outcomes. Communications: Eur J Communication Res 34: 407-428.
  94. Mar RA, Tackett JL, Moore C (2010) Exposure to media and theory-of-mind development in preschoolers. Cogn Dev 25: 69-78.
  95. Kidd DC, Castano E (2013) Reading literary fiction improves theory of mind. Science 342: 377-380. PubMed: 24091705.
  96. Bal PM, Veltkamp M (2013) How does fiction reading influence empathy? An experimental investigation on the role of emotional transportation. PLOS ONE 8: e55341. PubMed: 23383160.
  97. Mar RA (2011) The neural bases of social cognition and story comprehension. Annu Rev Psychol 62: 103-134. PubMed: 21126178.
  98. Hemingway E (2003) The end of something. In our time. New York: Scribner.
  99. Davis MH (1980) A multidimensional approach to individual differences in empathy. JSAS Catalog of Selected Documents in Psychology 10: 85.
  100. Davis MH (1983) Measuring Individual-Differences in Empathy - Evidence for a Multidimensional Approach. J Pers Soc Psychol 44: 113-126.
  101. Hooker CI, Verosky SC, Germine LT, Knight RT, D'Esposito M (2008) Mentalizing about emotion and its relationship to empathy. Soc Cogn Affect Neurosci 3: 204-217.
  102. Hooker CI, Verosky SC, Germine LT, Knight RT, D'Esposito M (2010) Neural activity during social signal perception correlates with self-reported empathy. Brain Res 1308: 100-113.
  103. Lawrence EJ, Shaw P, Baker D, Baron-Cohen S, David AS (2004) Measuring empathy: reliability and validity of the Empathy Quotient. Psychol Med 34: 911-919. PubMed: 15500311.
  104. Baron-Cohen S, Ring H, Chitnis X, Wheelwright S, Gregory L et al. (2006) fMRI of parents of children with Asperger Syndrome: a pilot study. Brain Cogn 61: 122-130. PubMed: 16460858.
  105. Castelli I, Baglio F, Blasi V, Alberoni M, Falini A et al. (2010) Effects of aging on mindreading ability through the eyes: an fMRI study. Neuropsychologia 48: 2586-2594. PubMed: 20457166.
  106. Moor BG, Macks ZA, Güroglu B, Rombouts SA, Molen MW et al. (2012) Neurodevelopmental changes of reading the mind in the eyes. Soc Cogn Affect Neurosci 7: 44-52. PubMed: 21515640.
  107. Lombardo MV, Barnes JL, Wheelwright SJ, Baron-Cohen S (2007) Self-referential cognition and empathy in autism. PLOS ONE 2: e883. PubMed: 17849012.
  108. Montag C, Heinz A, Kunz D, Gallinat J (2007) Self-reported empathic abilities in schizophrenia. Schizophr Res 92: 85-89. PubMed: 17350225.
  109. Achim AM, Ouellet R, Roy MA, Jackson PL (2011) Assessment of empathy in first-episode psychosis and meta-analytic comparison with previous studies in schizophrenia. Psychiatry Res 190: 3-8. PubMed: 21131057.
  110. Sabbagh MA (2004) Understanding orbitofrontal contributions to theory-of-mind reasoning: implications for autism. Brain Cogn 55: 209-219. PubMed: 15134854.
  111. Sheehan DV, Lecrubier Y, Sheehan KH, Amorim P, Janavs J et al. (1998) The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J Clin Psychiatry 59 Suppl 20: 22-57. PubMed: 9881538.
  112. Wechsler D (1999) Wechsler abbreviated scale of intelligence. Psychological Corporation.
  113. Blair JR, Spreen O (1989) Predicting premorbid IQ: A revision of the national adult reading test. Clin Neuropsychol 3: 129-136.
  114. Flesch R (1948) A new readability yardstick. J Appl Psychol 32: 221-233. PubMed: 18867058.
  115. Kline P (1999) Handbook of Psychological Testing. New York: Routledge.
  116. Schmitt N (1996) Uses and abuses of coefficient alpha. Psychol Assess 8: 350-353.
  117. Fertuck EA, Jekal A, Song I, Wyman B, Morris MC et al. (2009) Enhanced 'Reading the Mind in the Eyes' in borderline personality disorder compared to healthy controls. Psychol Med 39: 1979-1988. PubMed: 19460187.
  118. McClure EB (2000) A meta-analytic review of sex differences in facial expression processing and their development in infants, children, and adolescents. Psychol Bull 126: 424-453. PubMed: 10825784.
  119. Baron-Cohen S, Wheelwright S, Skinner R, Martin J, Clubley E (2001) The autism-spectrum quotient (AQ): evidence from Asperger syndrome/high-functioning autism, males and females, scientists and mathematicians. J Autism Dev Disord 31: 5-17. PubMed: 11439754.
  120. Bora E, Baysan Arabaci L (2009) Effect of age and gender on schizotypal personality traits in the normal population. Psychiatry Clin Neurosci 63: 663-669. PubMed: 19674380.
  121. Miettunen J, Veijola J, Freimer N, Lichtermann D, Peltonen L et al. (2010) Data on schizotypy and affective scales are gender and education dependent--study in the Northern Finland 1966 Birth Cohort. Psychiatry Res 178: 408-413. PubMed: 20478630.
  122. Patterson ML, Foster JL, Bellmer CD (2001) Another look at accuracy and confidence in social judgments. J Nonverbal Behav 25: 207-219.
  123. Liu NH, Kee-Hong C, Reddy F, Spaulding WD (2011) Heterogeneity and the longitudinal recovery of functioning during inpatient psychiatric rehabilitation for treatment-refractory severe mental illness. Am J Psychiatr Rehabil 14: 55-75.
  124. Kidd DC, Castano E (2013) Reading literary fiction improves theory of mind. Science 342: 377-380. PubMed: 24091705.
  125. Djikic M, Oatley K, Moldoveanu MC (2013) Opening the Closed Mind: The Effect of Exposure to Literature on the Need for Closure. Creativity Res J 25: 149-154.
  126. Hogarty GE, Flesher S, Ulrich R, Carter M, Greenwald D et al. (2004) Cognitive enhancement therapy for schizophrenia: effects of a 2-year randomized trial on cognition and behavior. Arch Gen Psychiatry 61: 866-876. PubMed: 15351765.
  127. Horan WP, Kern RS, Shokat-Fadai K, Sergi MJ, Wynn JK et al. (2009) Social cognitive skills training in schizophrenia: an initial efficacy study of stabilized outpatients. Schizophr Res 107: 47-54. PubMed: 18930378.
  128. Carlson SM, Moses LJ, Breton C (2002) How specific is the relation between executive function and theory of mind? Contributions of inhibitory control and working memory. Infant Child Dev 11: 73-92.
  129. Buitelaar JK, van der Wees M, Swaab-Barneveld H, van der Gaag RJ (1999) Verbal memory and Performance IQ predict theory of mind and emotion recognition ability in children with autistic spectrum disorders and in psychiatric control children. J Child Psychol Psychiatry 40: 869-881. PubMed: 10509882.
  130. Bauminger N, Kasari C (1999) Brief report: theory of mind in high-functioning children with autism. J Autism Dev Disord 29: 81-86. PubMed: 10097997.
  131. Happé FG (1994) Wechsler IQ profile and theory of mind in autism: a research note. J Child Psychol Psychiatry 35: 1461-1471. PubMed: 7868640.
  132. Happé FG (1995) The role of age and verbal ability in the theory of mind task performance of subjects with autism. Child Dev 66: 843-855. PubMed: 7789204.
  133. Happé FG (1993) Communicative competence and theory of mind in autism: a test of relevance theory. Cognition 48: 101-119. PubMed: 8243028.
  134. Ozonoff S, Pennington BF, Rogers SJ (1991) Executive function deficits in high-functioning autistic individuals: relationship to theory of mind. J Child Psychol Psychiatry 32: 1081-1105. PubMed: 1787138.
  135. Rice K, Viscomi B, Riggins T, Redcay E (2013) Performance on a novel spontaneous theory of mind task correlates with the cortical surface area and thickness of social brain regions. San Francisco, CA: Cognitive Neuroscience Society.