Familiar story structures possess an evolutionary edge in memory

Abla Alaoui-Soce; Diana I. Tamir

doi:10.1371/journal.pone.0341671

Abstract

Human beings demonstrate a universal impulse to share and consume stories. Over generations of transmission, within and across cultures, stories have evolved to develop regularities in their internal structures. Here, we investigated how two features of story structure – coherence and familiarity – impact recall as participants retold a story 5 days in a row. We predicted that familiar and coherent structures would be more stable over retellings. We measured stability using two novel story similarity measures of the (1) degree of structural change within storytellers, and (2) similarity in remembered structure across storytellers. Study 1 first validated our story similarity measure. Studies 2 and 3 then tracked the evolution of stories that varied in coherence and familiarity, respectively, using novel stories adapted from the popular “Cinderella” structure. Results showed that all stories became more structurally stable across retellings, with stories moving in a consistent direction (i.e., towards a consistent final form). However, retellings of a story with a more coherent and familiar structure showed both greater stability within and similarity across minds than retellings of a story with an incoherent (Study 2) or unfamiliar structure (Study 3). Thus, using novel tools to measure story evolution, our findings suggest that familiarity and coherence of a story structure offered it an advantage in memory, both within and across minds.

Citation: Alaoui-Soce A, Tamir DI (2026) Familiar story structures possess an evolutionary edge in memory. PLoS One 21(3): e0341671. https://doi.org/10.1371/journal.pone.0341671

Editor: Xiaoming Tian, Xi'an University of Posts and Telecommunications, CHINA

Received: October 5, 2024; Accepted: January 10, 2026; Published: March 3, 2026

Copyright: © 2026 Alaoui-Soce, Tamir. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All data and analysis files are available on OSF, at the following link: https://osf.io/hu7ke/.

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

Familiar story structures possess an evolutionary edge in memory

Long before Disney took over the fairy tale market, stories like Cinderella, Snow White, and Sleeping Beauty had been part of the Western canon and beyond. One of the earliest known variants of Cinderella, “Rhodopis”, for instance, has been traced back to Greece in the first century B.C.E. [1]. Researchers in folklore and mythology have amassed catalogs of folktales in different societies across the world [2,3]. Their work reveals shared traits in stories within and across cultural boundaries, as stories are transmitted both through the generations and across cultures [2–5]. Some stories that still appear today, like the “Devil and the Smith” folktale, can trace their roots as far back as the Bronze Age, thousands of years ago [4]. However, not all stories are successfully transmitted or retain their structure. Stories undergo evolution: Some disappear entirely, others change drastically, and others still survive in more recognizable forms. What factors determine how stories evolve over retellings?

Stories do not evolve of their own accord. They evolve through people. To borrow Terry Pratchett’s metaphor, stories are “parasitic life form[s]” occupying the minds of people [6]. Story evolution relies on the minds that carry them as people pass them along with varying levels of accuracy. The evolution of story structure has taken place over the course of human history, and, as such, involves complex interactions between individuals and communities. Using novel story structure measures, we tracked an aspect of this evolution in real time—focusing on the cognitive determinants of this evolution. In other words, we observed how stories changed in a single person’s mind over several days, to understand why some stories survive better than others. Although we studied only a small set of stories here, we define story broadly: A story is any report of connected events involving behaving agents. From a toddler’s incoherent account of a broken cookie jar to the highly formulaic plotlines of weekly detective dramas, we consider all of these stories—albeit with varying degrees of structure.

The role of schemas in story recall

The human mind is not a blank slate onto which a story is inscribed. Adult minds have expectations about how stories usually unfold. These expectations, or schemas, shape how people encode and recall new inputs [7–9]. In 1932, Bartlett first introduced the concept of schema [10]. In a series of seminal studies, Bartlett investigated the notion of memory as a reconstructive process by tracking how participants recalled narratives. In one study, when participants recalled a culturally unfamiliar Native American story, ‘War of the Ghosts’, they modified the story to fit their existing schemas [10]. They adjusted event sequences, added causal connections they felt were ‘missing’, and deleted elements that did not fit into their schemas. This pattern of distortion replicated across varying study designs, including individual recall (known as repeated reproduction) [11–13], recall in pairs [14], and recall in a chain [10], like the game of telephone. Bartlett’s research into schema and reconstructive memory has generated an impressive and active lineage of research [15].

Here, we build on this foundation to further test how schemas guide memory. We focus on two key factors that impact the extent to which a story fits into people’s schemas: Coherence and familiarity. We hypothesize that the more a story matches a person’s schemas, the more stable their recall.

Coherence and familiarity are key elements of story structure

Coherence and familiarity are critical components of schemas. Schemas capture coherent sequences, in which the events follow a sensible, logical order that may be useful in predicting and navigating future situations [16,17]. As people are repeatedly exposed to similar experiences, they extract regularities across exposures, forming schemas that reflect these common patterns [17–19]. Over time, the more we encounter familiar structures, the more deeply ingrained these schemas become in our understanding of the world.

Past research offers strong evidence that coherence improves recall [20,21]. Coherent texts, both narrative [7,22,23] and expository [24,25], are easier to interpret and remember than incoherent ones [26]. Coherence is a multi-faceted construct: Temporal coherence refers to the logic of the temporal sequence between subsequent sentences. Referential coherence involves the repetition of key concepts or references across different parts of the narrative, helping readers connect information. Causal coherence refers to the causal connections between events, with causally related events remembered better [27,28]. Each type of coherence can independently improve memory, with causal coherence being especially effective [29,30].

In our studies, we disrupt all three aspects of coherence by presenting story events out of order, impairing the comprehensibility of the sequence. For example, if the events of a story were scrambled to read “They were on the train. They were awakened by an alarm. They got on the train”, the disruption of temporal, referential, and causal coherence would make it harder for readers to construct a coherent mental representation of the story. Such scrambling of texts has been associated with poorer memory performance [12,31]. Further, people tend to remember scrambled “scripted” actions (e.g., going to a restaurant or to the dentist) by mentally reordering the events into their more typical, coherent sequence [9,32–34]. Essentially, participants work to restore a coherent version of the events, highlighting coherence’s strong influence on recall.

Familiarity is also integral to the notion of schemas. Schemas form through repeated exposure. Over multiple exposures, people come to understand the shared relationships and regularities in events that comprise a schema [17–19]. Once a person becomes familiar with a story schema, it influences their ability to comprehend, retain, and recall new and related stories [22,35]. Schemas are specific to a social and cultural context. People from different cultures remember stories differently – in a way that aligns with their familiar schemas [13]. For example, when asked to summarize a new story, people who summarized a story that followed culturally familiar story conventions produced more similar summaries than those who summarized an unfamiliar story [35]. Schemas are not fixed, however: Exposure to stories from other cultures can lead people to develop new schemas. As people from Western cultures are repeatedly exposed to unfamiliar stories, they show fewer distortions in recall [15].

In everyday stories, there is overlap between the constructs of coherence and familiarity. Familiar structures are typically more coherent and notions of coherence are developed through experience, as we familiarize ourselves with the statistics and causal relationships of the world [36]. Nevertheless, here we endeavor to manipulate coherence and familiarity independently. We expect that story structures that fit better with people’s schemas—whether because they are coherent or familiar—will be better retained.

Story structure as event schemas

Over the past century, scholars have attempted to catalog and define story structures. Recent work has focused on defining story structure in terms of event structure [37]. This approach suggests that people segment continuous experiences into discrete units of activity, or events [37–39]. These event schemas are essentially canonical sequences of causally linked events that people apply to understand naturalistic experiences [39,40]. As mentioned above, schemas abstract away from single instances to encompass broader patterns of action. People are familiar with a multitude of such naturalistic schemas. For example, one might have a schema for dining out at a restaurant: Sitting down, ordering from the menu, waiting for the food, eating the food, paying the bill. One might have a finer grained schema for paying the bill: Flagging down the waitstaff, receiving the bill, pulling out a credit card, waiting for the processed bill, calculating the tip. Event schemas come into play when we engage with a story, influencing both their perception and recall [37].

We operationalized story structure by the sequence of events it comprises. The sequence of events is defined by both which events are included and the order of these events. We used a novel, automated tool–referred to as “story similarity” –to measure changes in story structure across retellings.

Current research

How does a story structure’s coherence and familiarity determine its evolution in a person’s mind? To capture story evolution within the lab, we employ Bartlett’s (1932) narrative repeated reproduction design: In 3 studies, people read a story and then recall it across 5 subsequent days. By looking at transmission within a mind, our work, like Bartlett, can identify how schemas shape how stories are retained over time. We then build upon this foundation by employing novel, automated methods to quantify changes in story structure across recall. Prior work has often identified and measured distortions in recall by hand [11,14,15]. We capitalize on recent advances in natural language processing to develop a replicable automated process for systematically tracking the degree of change in story structure across retellings. We expect that this novel method will replicate prior work showing that schemas constrain recall, while also bringing new quantifiable insight into the degree of change across different story types.

In particular, we ask how two features of story structure – coherence and familiarity – impact recall. Study 1 first validates a new story similarity measure, in an exploratory test of how it tracks the evolution of a novel incoherent and unfamiliar story over time. We then use this story similarity measure to test a priori hypotheses about how the structure of stories would change across retellings of a coherent vs. incoherent and a familiar vs. unfamiliar story. In Study 2, we test the impact of coherence on the evolution of a story by comparing the evolution of a novel story that follows a familiar and coherent structure with the evolution of a familiar but incoherent version of the novel tale. We manipulate coherence by scrambling a familiarly-structured narrative. In Study 3, we test the impact of familiarity on the evolution of a story by comparing the evolution of that same familiarly structured story with that of an equally coherent, but unfamiliarly structured story. We manipulate familiarity by presenting two novel, equally coherent stories that include the same overall content, but differ in their underlying structure. The familiar story follows a familiar arc, modelled off of the highly familiar Cinderella tale. The Cinderella tale is a long enduring story, with surface level details varying across cultures, mediums and adaptations; over 300 versions have been documented [41]. We then measure the evolution of recall in two ways: (i) within an individual mind, replicating previous work on schematic influence [10–13], and (ii) across independent storytellers, to see if they converge upon a similar retelling. We expect familiar and coherent stories will be more stable in memory and converge to a greater extent across people.

Study 1: Evolution of an incoherent and unfamiliar story

Study 1 has two aims. The first aim is methodological: To validate the new measure of story similarity developed to track how a story evolves across retellings. The second aim is to investigate the evolution of a story that lacks both coherence and familiarity. The story used in Study 1 was designed to read like fluent nonsense—a mishmash of randomly ordered events incompatible with participants’ prior knowledge. Participants should not be able to rely on existing schemas to facilitate encoding and recall. Our goal is to study how such a nonsensical yet fluent story evolves across retellings. Using a novel measure of story similarity, based on a representation of the event structure, we can track shifts in the story across retellings. With no familiar or coherent structure in the initial story to scaffold recall, the story would need to change significantly to better fit pre-existing schemas. Thus, we predict that the story will undergo large changes initially, and then change less across retellings as it becomes more stable and consistent with its final form.

Methods

Participants.

Participants (N = 199) were recruited using Mechanical Turk (www.cloudresearch.com) to complete a 5 day study conducted between April 11^th and June 26^th, 2019. We set a target sample size of 50, after attrition. Of the 199 participants who completed Day 1, 71 participants completed all 5 days of the task. Participants (N = 18) were excluded based on the following a priori exclusion criteria: (i) not meeting task requirements (e.g., writing content unrelated to the story), (ii) writing insufficient text (i.e., word counts below 1 standard deviation from the mean; M = 126.50, SD = 48.53). These exclusions left us with a final sample size of 55.

All participants were U.S. residents fluent in English. Participants received a total of $3.50 for completing all days of the task (10 cents on Day 1, 20 cents on Day 2, 30 cents on Day 3, 40 cents on Day 4, and an additional $2.50 for completing all 5 days). Participants in this and all subsequent studies reported here provided informed consent in accordance with the Princeton University Institutional Review Board. Participant data (pre-exclusions and cleaning, as well as post-exclusions and cleaning) and analysis code for this and all subsequent studies can be accessed on OSF (https://osf.io/hu7ke/).

Procedure.

Participants completed the task over 5 consecutive days on Qualtrics (www.qualtrics.com). On Day 1, participants listened to a nonsense story that lasted 2 minutes and 22 seconds and consisted of 378 words (See Supporting Information). The story follows Sophia, a psychic, as she goes through a series of nonsensical adventures. Participants were asked to not take notes or replay the story to ensure they had no memory aids. After confirming they had listened to the entire story, participants were asked to recall the story as best they could. Participants were encouraged to write as much as possible.

On Days 2–5, participants were asked to recall the story from Day 1. Participants were invited to complete the task at the same time each day. They were allowed a maximum of 12 hours to complete the task. This protocol ensured that the time interval between retellings was no less than 12 hours and no more than 36 hours. By Day 5, participants produced 5 retellings of the initial story listened to on Day 1, one for each day.

Before data analyses, stories were cleaned in the following ways: (1) Spelling errors and typos were corrected. (2) Meta-commentary was removed. This included phrases like “if I remember correctly”, “I recall”, “I believe”, “I think”, as well as content about the difficulty of the task.

Analyses

Story similarity measure.

We developed a novel story similarity measure to track how a story’s structure changes across retellings. This measure captures changes in the content and sequence of events in a story. The identification of events and event boundaries is a complex and ongoing line of research [42,43]. Here we use sentences as a proxy for events. While sentences do not perfectly map onto events, the length of the stories used in these studies allows us to generally ascribe one main action to each sentence.

To assess content changes, we measured the overlap in the events remembered in common across two retellings (e.g., Day 1 vs. Day 3). To assess sequence changes, we measured how similarly ordered the remembered events are across retellings. Both the content and sequence aspects of the story similarity measure used Spacy’s semantic similarity calculator [44], which determines similarity by comparing word vectors, multi-dimensional representations of a word’s meaning. We then combined these two aspects—content and sequence—into a single overall story similarity measure. The following sections describe the steps involved in calculating the content, sequence, and overall story similarity measure.

Story content.

To capture how the content of a story evolves over retellings, we adapted the mnemonic similarity measure developed by Coman et al [45]. This measure tracks which events are remembered and forgotten in common between two retellings. This process proceeds as follows (Fig 1A-1D): First, we broke each story into its component sentences, which serve as a proxy for events in the narrative (Fig 1A). We then compared each sentence in Story A to every other sentence within Story A to determine the most similar sentence. The highest similarity value obtained from this comparison served as the minimum similarity threshold when comparing sentences across different retellings. This threshold allowed us to rule out sentences that likely did not represent the same events. The reason being: This highest similarity value represents the most similar a particular sentence X in Story A can be to another sentence Y in Story A. This other sentence Y describes a different occurrence. If the sentence in Story B that is most similar to sentence X in Story A is below this value, it likely does not depict the same occurrence. Therefore, we cannot qualify it as the same event.

Download:

Fig 1. Story Similarity Measure.

The similarity between two stories, Story A and Story B, is calculated using an automated measure of story content (A-D) and story sequence (E-G). The final story similarity value (H) is obtained by multiplying the content and sequence measures.

https://doi.org/10.1371/journal.pone.0341671.g001

Once we established this threshold, we compared each sentence in Story A to every sentence in Story B. If a sentence in Story B with the highest similarity value exceeded the minimum similarity threshold, we classified it as a Remembered event (Fig 1B-1C). If no sentence met this threshold, we classified it as a Forgotten event. This process was repeated for each sentence in Story A, yielding a list of events from Story A that are remembered and forgotten in Story B. Next, we calculated event content similarity by dividing the number of events remembered from Story A in Story B by the total number of events in Story A (Fig 1D).

This process was then repeated, now starting with Story B as the reference story, and comparing its sentences to those in Story A. The final content similarity measure was calculated by averaging the results from both comparisons (Fig 1D). This ensured symmetry in the way similarity was measured: Story A was just as similar to Story B as Story B was to Story A.

Story sequence.

To capture how the sequence of events in a story evolves over retellings, we developed a novel event order correlation measure (Fig 1A, Fig 1E-1G). Like the content measure, we began by dividing each retelling into sentences (Fig 1A). For each sentence in Story A, we used Spacy’s similarity measure to find the most similar sentence in Story B (Fig 1E). Unlike with the content measure, we did not apply a minimum similarity threshold when comparing sentences for sequence. The sentence with the highest similarity was assigned as a match. This ensured that the best possible match was selected for each sentence. This process was repeated for all the sentences in Story A (Fig 1F). A sentence in Story B could be matched to multiple sentences in Story A.

Once all sentences in Story A were matched with their most similar counterparts in Story B, we calculated the Spearman’s rank order correlation between the sequence of sentences in Story A and the order of their corresponding matches in Story B (Fig 1G). This correlation reflected how similarly the order of events is preserved across retellings.

Then, as with the content measure, this process was repeated, now starting with Story B as the reference and comparing its sentences to those in Story A. The final sequence similarity measure was calculated by averaging the two correlation coefficients (Fig 1G).

Story similarity.

Overall story similarity was derived by combining the content and sequence measures into a single score (Fig 1H). Specifically, we multiplied the content similarity measure by the sequence similarity measure (Story Content x Story Sequence). This combined measure captured how much of the story’s content is retained and how well the sequence of events is preserved across retellings. The resulting story similarity value ranges from −1 and 1, where higher values indicate greater structural similarity between two retellings. We used this measure to compare across retellings and between each retelling and the initial story. All pairwise t-tests were Bonferroni corrected for multiple comparisons and all Cohen’s d effect sizes were hedges corrected. This measure was used consistently for all studies reported.

Measuring the evolution of story similarity.

The story similarity measure allowed us to track how the structure of a story evolved across retellings. We captured this evolution by analyzing three forms of change: Stabilization, Consistency and Modification.

Stabilization is the process by which retellings increase in similarity over time. We measured this as the change in pairwise similarity between subsequent retellings from Day 1–5. We expected the similarity between Day 1 and Day 2 to be lower than the similarity between Day 2 and Day 3 or Day 3 and Day 4, as the story moved towards a more stable form with each retelling.

Consistency is defined as the extent to which retellings evolve in a consistent direction. We used Day 5, the last retelling, as the benchmark against which the earlier retellings are compared. We measured this as change in similarity to the final retelling (Day 5) from Days 1–4. We expected that, as participants continue to retell the story, the retellings would become more similar to the retelling produced on the final day, indicating consistency in the direction of story evolution.

Modification captures the degree of change undergone by the initial story. To measure this, we tracked the similarity between the initial story and each retelling (Days 1–5). We expected that stories with more familiar and coherent structures would undergo less modification across retellings, as they better align with people’s pre-existing schemas.

Validating the story similarity measure.

Before using the story similarity measure to capture evolution in our main analyses, we first validated it to ensure it was both informative and aligned with human judgements of similarity. To do so, we tested the extent to which our measure agreed with human ratings of story similarity, using the stories generated by participants in Study 1. The goal of our measure is to capture people’s overall judgment of similarity between two stories, which we argue is informed by both the content and sequence of events in the story.

To validate the measure, we recruited an independent set of participants (N = 212) from MTurk between September 11^th and 19^th, 2020. Each participant rated the similarity of 5 pairs of stories on a scale from 1 (Extremely Different) to 7 (Extremely Similar). The pairs were selected to cover five specific scenarios: (i) the initial nonsense story paired with a participant-generated retelling, (ii) two retellings from the same participant from different days (e.g., Participant X’s retellings on Days 2 and 4), (iii) two retellings from different participants on the same day of recall (e.g., Day 3 retellings from Participants X and Y), and (iv) two pairs from different participants on different days of recall (e.g., Participant X’s retelling from Day 3 and Participant Y’s from Day 5). This design ensured that we validated our automatic similarity measure using the full range of similarity measured in the study, including comparisons within participants (as needed for all three studies) and across participants (as needed for Studies 2 and 3). Each day of recall appeared at least once for each participant, and each pair of stories was rated by at least 3 different participants.

In total, 153 participants rated all the stories they were given, yielding at least 3 similarity ratings on 94 different pairs of stories. Intercoder reliability was moderate (ICC = 0.584), allowing us to average repeated similarity ratings to derive a single similarity rating for each pair of stories [46]. We then tested these human ratings against the outputs of our story automated similarity measure. We found significant correlations for all three components: (1) The content measure on its own, (r(92)=0.57, p < 0.001), (2) the sequence measure on its own, (r(92)=0.53, p < 0.001), and (3) the combined story similarity measure, (r(92)=0.67, p < 0.001). Importantly, the combined story similarity measure showed stronger alignment with human ratings of similarity than either the content measure (z = 2.11, p < 0.001) or sequence measure (z = 3.51, p < 0.001), confirming the import of both components for human judgments of similarity [47–49].

To ensure that the measure did not simply reflect differences in story length, beyond what a human rater might consider, we tested the correlation between the story similarity measure and human ratings while controlling for differences in word count and sentence count. We found similar correlations between our story similarity measure and human ratings of similarity when controlling (separately) for the effects of word count (pr(92)=0.67, p < 0.001) and sentence count differences (pr(92)=0.67, p < 0.001) [49]. Additional robustness checks further demonstrated that the story similarity measure did not simply reflect low-level story confounds, namely differences in story length (See Supporting Information).

Results

This study asked how a story evolves in the absence of a coherent or familiar structure, using a story that does not follow a pre-existing schema to guide its encoding and recall. We tracked how a nonsense story changes across retellings. We expected that the story would start more unstable, undergoing significant changes in early retellings, and then become more stable in subsequent retellings. In addition, we expected that retellings would evolve in a consistent direction, becoming more similar to the last retelling produced on Day 5.

Stabilization: Similarity across retellings

To test if the story became more stable over retellings, we examined how similar each retelling was to each subsequent retelling (Fig 2A). We found a main effect of retelling on similarity (F(3.26,176.04)=62.681, p < 0.001, η²_G=0.324), such that similarity was lowest between early retellings, and later stabilized into higher similarity values. Specifically, the similarity between the Initial story and Day 1 retelling (M = 0.39, SD = 0.12) was significantly lower than the similarities between all subsequent retellings [50]. Likewise, the similarity between Day 1 and Day 2 was lower than the similarity between all subsequent retellings. By Day 2, the story settled into a stable pattern, with similarity between Day 2 and Day 3 not differing from similarity between any subsequent retellings. These high similarity levels (around 0.7) suggested that the stories became highly stable by the second retellings, with only minor changes occurring across subsequent retellings.

Download:

Fig 2. Stabilization: Similarity Across Retellings.

Similarity increases across subsequent retellings in Studies 1-3, reflecting greater stability over time. Stability is higher for the Coherent, Familiar (blue) story. Color matched asterisks (e.g., red asterisks for Study 1) mark significant differences in story similarity between adjacent retellings. Green asterisks mark the significant differences between conditions. Error bars represent standard error.

https://doi.org/10.1371/journal.pone.0341671.g002

Consistency: Similarity to last retelling

Next, to test if the story evolved in a consistent direction, we tracked similarity to the final retelling on Day 5 (Fig 3A). We found that the similarity of each story to Day 5 retelling increased over time, with a significant main effect of retelling on similarity (F(3.51,189.55)=114.028, p < 0.001, η²_G=0.474). Similarity to the last retelling was lowest in earlier retellings before settling into consistently higher similarity. The similarity between the Initial story and Day 5 (M = 0.29, SD = 0.14) was significantly lower than all the similarities between each subsequent retelling and Day 5. The similarity between Day 1 and Day 5 was also lower than the similarity between each subsequent retelling and Day 5. By Day 2, however, we no longer saw differences in similarity to the last retelling. This suggests that by Day 2, the story has already settled into a relatively stable form consistent with the final retelling on Day 5.

Download:

Fig 3. Consistency: Similarity to Last Retelling.

Similarity to the final retelling increases across subsequent retellings in Studies 1-3, reflecting greater consistency over time. Consistency is higher for the Coherent, Familiar (blue) story. Color matched asterisks (e.g., red asterisks for Study 1) mark significant differences in story consistency between adjacent retellings. Green asterisks mark the significant differences between conditions. The green crosses mark marginal differences. Error bars represent standard error.

https://doi.org/10.1371/journal.pone.0341671.g003

Modification: Similarity to initial story

Lastly, we looked at how much the story was modified by measuring the similarity between the initial story and each retelling (Fig 4A). There was a main effect of retelling on similarity to the initial story (F(3.28,177.16)=13.762, p < 0.001, η²_G=0.074). The story similarity between the Initial story and Day 1 retelling (M = 0.39, SD = 0.12) was significantly higher than the similarities between the Initial story and any of the subsequent retellings. However, the similarity between the initial story did not change across later retellings, suggesting the major structural modifications to the story occurred early, likely by the second retelling.

Download:

Fig 4. Modification: Similarity to Initial Story.

Similarity to the initial story is generally stable after the first retelling. Modification is lower for the Coherent, Familiar retellings (blue), which remain more similar to the initial story than the Incoherent (pink) or Unfamiliar (purple) retellings. Color matched asterisks (e.g., red asterisks for Study 1) mark significant differences in story modification between adjacent retellings. Green asterisks mark the significant differences between conditions. Error bars represent standard error.

https://doi.org/10.1371/journal.pone.0341671.g004

Discussion

Study 1 tracked the evolution of an incoherent, unfamiliar nonsense story over five retellings and found that it reached high levels of stability as early as the second retelling. This suggests that participants rapidly modify the initial story into a form that is more stable in their recall. Most of the modifications to the initial story occurred during the first or second retelling, after which the story remained relatively unchanged. This does not mean that all subsequent retellings are identical, but rather that the core structure–both in terms of story content and sequence–remained highly stable from that point forward.

These findings highlight the utility of our automated similarity measure for tracking changes in stories across retellings. We first validated the measure by comparing it against human judgements of similarity, finding strong alignment between the two that cannot be explained by potentially superficial textual differences (e.g., length). Then, by applying this measure to the progression of an incoherent, unfamiliar story, we established a baseline for how a story, with no initial structure or familiarity, evolves and stabilizes across retellings. This progression illustrates how stories might evolve when they begin without a pre-existing schematic structure.

Armed with this validated story similarity measure, we are now in a position to explore how different factors, such as coherence and familiarity, guide story evolution.

Study 2: Evolution of a coherent vs. incoherent story

In Study 2, we test the influence of coherence on story recall. We manipulate coherence by comparing the evolution of an coherent story against that of an incoherent story. The coherent story is modeled off of the Cinderella-type tale, a tale for which we expect participants to have strong structural priors. The incoherent story was derived by scrambling the coherent story. We expect that coherence gives stories an evolutionary edge, such that they remain more stable, consistent, and less modified in memory. In addition to capturing evolution within a person, we also capture similarity across participants. We expect that the coherent story will be remembered more similarly across participants by the last day of recall.