Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The Situated Assessment Method (SAM2): Establishing individual differences in habitual behavior

  • Léo Dutriaux,

    Roles Data curation, Formal analysis, Investigation, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Laboratoire sur les Interactions Cognition, Action, Émotion (LICAÉ), Université Paris Nanterre, Nanterre Cedex, France

  • Naomi E. Clark,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Writing – review & editing

    Affiliation School of Health and Life Sciences, Glasgow Caledonian University, Glasgow, United Kingdom

  • Esther K. Papies,

    Roles Conceptualization, Funding acquisition, Investigation, Writing – review & editing

    Affiliation School of Health & Wellbeing, University of Glasgow, Glasgow, United Kingdom

  • Christoph Scheepers,

    Roles Formal analysis, Methodology, Software, Writing – review & editing

    Affiliation School of Psychology and Neuroscience, University of Glasgow, Glasgow, United Kingdom

  • Lawrence W. Barsalou

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    lawrence.barsalou@glasgow.ac.uk

    Affiliation School of Psychology and Neuroscience, University of Glasgow, Glasgow, United Kingdom

Abstract

From the perspectives of grounded, situated, and embodied cognition, we have developed a new approach for assessing individual differences. Because this approach is grounded in two dimensions of situatedness—situational experience and the Situated Action Cycle—we refer to it as the Situated Assessment Method (SAM2). Rather than abstracting over situations during assessment of a construct (as in traditional assessment instruments), SAM2 assesses a construct in situations where it occurs, simultaneously measuring factors from the Situated Action Cycle known to influence it. To demonstrate this framework, we developed the SAM2 Habitual Behavior Instrument (SAM2 HBI). Across three studies with a total of 442 participants, the SAM2 HBI produced a robust and replicable pattern of results at both the group and individual levels. Trait-level measures of habitual behavior exhibited large reliable individual differences in the regularity of performing positive versus negative habits. Situational assessments established large effects of situations and large situation by individual interactions. Several sources of evidence demonstrated construct and content validity for SAM2 measures of habitual behavior. At both the group and individual levels, these measures were associated with factors from the Situated Action Cycle known to influence habitual behavior in the literature (consistency, automaticity, immediate reward, long-term reward). Regressions explained approximately 65% of the variance at the group level and a median of approximately 75% at the individual level. SAM2 measures further exhibited well-established interactions with personality measures for self-control and neuroticism. Cognitive-affective processes from the Situated Action Cycle explained nearly all the variance in these interactions. Finally, a composite measure of habitualness established habitual behaviors at both the group and individual levels. Additionally, a composite measure of reward was positively related to the composite measure of habitualness, increasing with self-control and decreasing with neuroticism.

Introduction

From the perspectives of grounded, situated, and embodied cognition, the processes that implement human cognition are not abstract, amodal, and decontextualized but instead incorporate situations, embodiment, and action intrinsically into their operation. Decontextualizing cognition by abstracting over situations, embodiment, and action ignores fundamental processes and produces distorted accounts. Decades of research have established the grounded, situated, and embodied perspectives theoretically and empirically [110].

Inspired by these perspectives, we have developed a new approach for assessing individual differences. Because this approach is grounded in two dimensions of situatedness—situational experience and the Situated Action Cycle—we refer to it as the Situated Assessment Method (SAM2). In the remainder of the introduction, we review properties of traditional assessment instruments. We then use these properties to motivate the SAM2 framework. Finally, we present a specific implementation of this framework: the SAM2 Habitual Behavior Instrument (SAM2 HBI).

Traditional assessment instruments

From the perspective of situated cognition, it is striking how disconnected traditional assessment instruments often are from real-life situations. Typically, assessment instruments use decontextualized items to assess an individual difference of interest. To illustrate, consider an item that assesses conscientiousness in the classic Five Factor Model: “I am someone who perseveres until a task is finished” [1114]. In responding to this item, individuals must abstract over situations to establish a general assessment of how much they agree with it. Similarly consider an item from the Self-Control Scale, “I am good at resisting temptation” [15]. Again, individuals must abstract over situations to establish a general assessment. Many other assessment instruments similarly ask individuals to respond generally across situations to decontextualized items, including the Perceived Stress Scale [16], the Positive and Negative Affect Schedule [17], the Life Satisfaction Scale [18], the Three-Factor Eating Questionnaire [19], and the Five-Facet Mindfulness Questionnaire [20].

Instrument construction.

To develop traditional instruments like these that assess a construct across individuals, an initial pool of decontextualized items related to the construct is developed. To assess neuroticism, for example, items related to emotional reactivity, worry, and moodiness might be sampled. Items can be generated in a bottom up manner from lexical sources such as dictionaries [13] or from preliminary qualitative research [21, 22]. Alternatively, items can be generated in a top-down manner via literature review and expert intuition [12].

To identify a subset of items that coherently measures the target construct, exploratory factor analysis and other methods establish the latent factor structure of the item pool. Items that only load highly on the critical construct are then extracted from the pool and assessed again in a preliminary test instrument. Confirmatory factor analysis and other methods verify that all final test items measure the same target construct well (i.e., high test coherence).

By aggregating scores across these items into an average or sum, an overall measure for the test is established that orders individuals from high to low on the construct (e.g., high to low neuroticism). Ideally, the aggregate measure should order individuals reliably—it must exhibit satisfactory test reliability. As assessed by Cronbach’s alpha and related statistics [2327], an aggregate measure of a presumably stable trait such as neuroticism should exhibit test reliability of at least .7 to .8. Intuitively, alpha estimates how highly aggregate scores for individuals on one occasion would correlate with scores from the same test administered to the same individuals again under ideal conditions (e.g., no change in the construct over time, no carry-over effects from the initial test). In other words, alpha estimates how accurately a test captures stable individual differences in the construct assessed.

A satisfactory measure must also demonstrate construct validity—it must adequately measure the construct of interest and only that construct [28]. Specifically, the measure should first exhibit content validity, covering all relevant features of the target construct, not excluding any important ones. Additionally the measure should also demonstrate convergent and discriminant validity, capturing the same construct as related measures, while being unrelated to measures for different constructs [29]. A measure of neuroticism, for example, should cover all its relevant facets and correlate highly with a measure of emotionality but not with a measure of honesty.

Finally, traditional instruments sometimes assess a related set of constructs, using a single instrument with multiple subscales. For example, the Five Factor Model uses five subscales to assess five domains of personality: extroversion, agreeableness, openness, conscientiousness, and neuroticism [11]. Hierarchical assessment models may contain further subscales. In the Five Factor Inventory, for example, each personality domain is associated with subscales that measure six facets of it [12]. For neuroticism, individual subscales assess the facets of anxiety, angry hostility, depression, self-consciousness, impulsiveness, and vulnerability. Regardless of how many subscales an instrument includes, the criteria of item coherence, test reliability, and construct validity apply to each.

Limitations of traditional assessment instruments

A variety of issues bear on the use of traditional assessment instruments: judgment accuracy, situational variance, and theoretical understanding. Each is addressed in turn.

Judgment accuracy.

What cognitive processes produce responses to decontextualized test items such as, “I am someone who perseveres until a task is finished”? It seems unlikely that one can consult a running average in memory for how efficiently one has performed recent tasks, much less over the past year or even a lifetime. So, how does an individual produce a response to this question? How accurate is it?

One possibility is that individuals have an intuitive theory about themselves that they consult [3034]. According to this account, individuals don’t consult experiences in specific situations but instead draw inferences about themselves from abstract causal theories. Although these theories may reflect actual experience and be somewhat accurate, they may also reflect a variety of beliefs, goals, and biases. Another possibility is that individuals partially sample memories of situational experience but only access memories that are currently available, such that a biased sample results [35, 36]. To the extent that individuals use intuitive theories and/or the availability heuristic, responses to decontextualized test items are unlikely to be fully accurate [although they can be accurate to some extent; 37, 38].

Varying levels of a construct across situations.

Considerable research shows that individuals do not exhibit a constant level of a construct, such as extroversion, across different situations [3847]. Instead, the construct’s manifestation in behavior varies widely. Whereas an individual might be extraverted while dining with their family, they might be introverted while dining with co-workers.

From the classic theoretical perspective of interactionism [39, 48, 49], varying levels of extroversion expressed in different situations result from how the individual interacts with situations dynamically over time. Furthermore, expression in a particular situation does not simply result from a linear combination of the individual and the situation. Instead, different individuals respond to the same situations differently, such that an individual by situation interaction results.

It follows that assessing a construct in different situations for each individual is likely to be informative for characterizing individual differences in it. It further follows that predicting an individual’s behavior will be more accurate when taking specific situations into account than when using only trait-level information about the individual [37, 38, 43, 5053].

Explaining constructs theoretically.

Finally, simply establishing a fixed level of a construct in an individual offers little theoretical insight into it. Often it’s assumed that the level established represents a stable reified disposition or trait within the individual that operates consistently across situations [54]. Many personality researchers, however, question whether such dispositions literally exist within an individual’s cognitive-affective system. Saying, for example, that an individual is neurotic doesn’t necessarily mean that an enduring internal disposition for neuroticism exists in an individual’s cognitive-affective system, producing related behavior.

These kinds of trait attributions make the same error as concluding that a disposition of briskness exists in external climate conditions when one perceives the weather as “brisk.” Instead, briskness, like neuroticism, is a perception that says more about the perceiver than it does about mechanisms in the perceived entity that led to that perception. To mechanistically explain an individual’s level of neurotic behavior in specific situations, it is essential to establish the underlying cognitive and affective mechanisms that interact with specific situations to produce varying levels of neuroticism within specific situations, as well as stable perceived tendencies in behavior across them [3842, 44, 45, 47].

The Situated Assessment Method (SAM2)

SAM2 is a general assessment framework that can be used to measure diverse constructs, including stress, trichotillomania, eating, social connectedness, and sustainable behavior. We developed the SAM2 Habitual Behavior Instrument (SAM2 HBI) for several reasons: First, no instrument currently establishes habitual behaviors for an individual across a broad range of human activities. Second, no current instrument evaluates how extensively individuals vary in habitual behavior, nor addresses factors that predict these individual differences. Third, no current instrument establishes individual difference measures for positive versus negative habits and relates them to measures of personality, such as self-control and neuroticism. Fourth, a large literature on habits offers considerable guidance for developing a psychometric instrument that measures habitual behavior. Finally, the SAM2 HBI has potential to address important unresolved issues concerning the definition of habit and the relation of reward to habitual behavior.

Developing a SAM2 assessment instrument requires three steps: (a) Identify a representative set of situations where the target construct occurs and establish memory cues for sampling them, (b) Identify an appropriate measure of the target construct, (c) Identify scientifically-established factors in the Situated Action Cycle known to influence the target construct, maximizing coverage of influential factors. The next three subsections address each step in turn.

Measuring situational experience.

SAM2 is designed to assess individual differences for a construct in situations where it occurs. Instead of asking individuals to generalize about the construct across situations without taking them into account explicitly, SAM2 first identifies situations where the construct is likely to occur and then asks individuals to assess the construct in each.

Experience sampling offers one approach for assessing situations, where participants assess the behavior of interest at randomly sampled moments in everyday life [38]. Two factors limit this approach for psychometric purposes. First, the time and effort needed to collect experience sampling data lies beyond the scope of many psychometric assessments. Second, the random nature of experience sampling limits the ability to control the situations assessed (i.e., the situations sampled cannot be determined in advance and vary widely across individuals).

For these reasons, SAM2 uses features of situations to cue relevant experiences of situations from memory across individuals systematically. As much research has established, a diverse collection of features constitutes a situation, including spatiotemporal context, people, behaviors, objects, events, emotions, and so forth [5557]. When a situation is experienced, an episodic memory is established that represents its features [5861]. If, for example, someone runs for exercise in a park one day at sunrise, a memory of the episode becomes established that contains features for running, exercise, park, and sunrise.

On later occasions, when one of these features is encountered, it can activate the memory via content-addressable retrieval mechanisms [62]. If, for example, someone is asked how regularly they exercise, the word “exercise” may activate this situational memory and other situational memories like it. In this manner, a feature of a situation can activate situational experiences. Indeed, memory researchers have used this approach to activate and study memories for decades, as exemplified by the Galton-Crovitz method and many others [6366].

In the typical SAM2 instrument, a set of situational features is developed to cue relevant situational experience of interest. By using a constant set of features across individuals, SAM2 systematically assesses situations across individuals efficiently in a psychometric instrument. We assume further, however, that a given feature, such as “exercise,” activates widely differing situational memories between individuals, thereby establishing individual differences of interest.

Our approach to sampling situations is similar to other approaches that sample important life events [67, 68], repetitive daily events [69], and situated interaction [70]. Like SAM2, all these approaches emphasize the importance of taking an individual’s rich situational experience into account. Our approach also resonates strongly with approaches that incorporate the rich structure of everyday experience into psychological research [7175]. Finally, our approach follows longstanding traditions in personality and social psychology that champion the importance of situations in understanding cognition, emotion, and behavior [3840, 45, 76, 77].

Measuring situational experience with the SAM2 HBI. The SAM2 HBI uses behaviors to activate relevant situational memories. Specifically, as Table 1 illustrates, the SAM2 HBI uses 80 common behaviors likely to be habitual from 10 domains of human activity. Because our initial study assessed habitual behavior in college students, we selected behaviors relevant for this population. All later studies used the same 80 behaviors and continued to assess them in the student population. The 10 behavior domains were established by reviewing the literature on habits and exploring online resources associated with habit change. Our aim was to establish a broad representative range of common behaviors likely to be habitual across diverse life activities, ranging from food and drink to protecting the environment.

thumbnail
Table 1. The 80 behaviors assessed for all participants (4 positive and 4 negative from each of 10 domains).

https://doi.org/10.1371/journal.pone.0286954.t001

We do not assume that every behavior in Table 1 is a habitual behavior for every individual. Instead, we simply assume that these are common habitual behaviors in the population. We offer evidence for this claim later, showing that most of these behaviors were highly habitual for at least some individuals. Within each domain, we manipulated behavior valence, sampling both positive (good) and negative (bad) behaviors. As also discussed later, Study 2 provided strong support for the a priori valence assignments in Table 1, along with insight into the valence distinction: Whereas the positive behaviors were highly associated with long-term reward and social approval, the negative behaviors were not.

Measuring the target construct.

For each individual, our goal was to establish the habitualness of each behavior in Table 1. Many measures of habitualness have been proposed, including the frequency of a behavior, the consistency of the situations that elicit it, the automaticity of the behavior, and the automaticity of the cognitions that precede it. Considerable discussion continues to surround these issues [7892].

Our take on this literature is that the most fundamental property of a habitual behavior is the regularity of performing it in situations where doing so is possible. The raw frequency of performing a behavior is not the critical determinant of habitualness—instead it is the regularity of performing the behavior in relevant situations. From this perspective, attending Christmas Eve mass every year is as habitual as turning off the lights every time one leaves a room, even though the latter occurs much more frequently. Thus, for each behavior in Table 1, we asked participants to evaluate, “How frequently do you perform this behaviour in situations where doing so is possible?” (on a 0 to 100 scale). Again, we assume that each behavior cues situational memories that participants evaluate to produce a judgment.

It is certainly possible that behaviors performed regularly are also performed automatically in consistent situations [82, 87]. Many researchers, however, argue that habitual behaviors are not always performed automatically [80, 85, 88, 92]. From the latter perspective, it is an empirical question whether behaviors performed regularly are also performed automatically (and/or consistently). To assess this issue, we included both automaticity and consistency as factors from the Situated Action Cycle that potentially influence regularity (described later).

Establishing situational variability. By assessing how regularly participants perform the 80 behaviors in Table 1, we can establish classic situational effects (via situational memories that the behaviors activate). To the extent that some behaviors are performed more regularly than others, regularity should generally be higher in the associated situations across individuals. Interactions between situations and individuals can also be assessed similarly. To the extent that different individuals exhibit different patterns of regularity across the same behaviors, individuals should interact with the associated situations differently.

Establishing situation-based trait-level measures. Although constructs like regularity are likely to exhibit considerable variability across behaviors and their associated situations, they can also exhibit surprising stability [38, 46, 93, 94]. When the varying levels of a construct across situations are aggregated (e.g., averaged), the resulting ‘trait-level’ measure tends to be highly stable, typically correlating .7 to .9 with comparable aggregate measures of the same construct. Individuals, for example, whose aggregate neuroticism is high across multiple assessments over a week tend to also exhibit high neuroticism across multiple assessments the following week. Although one might be tempted to reify this stability into an internal trait mechanism, a more plausible approach is that it reflects an individual-specific configuration of cognitive-affective processes that produces a relatively stable pattern in behavior while simultaneously adapting to different situations dynamically [3840, 45, 48].

By aggregating assessments of regularity for an individual across the behaviors in Table 1, SAM2 establishes trait-level information about the construct in the associated situations. Importantly, trait-level information in SAM2 is established empirically, statistically aggregating levels across behaviors. As a consequence, trait-level measures in SAM2 are likely to differ from traditional decontextualized measures, which again may instead reflect intuitive theories and the availability heuristic. Previous comparisons of these two approaches for establishing trait-level information do indeed demonstrate significant differences [38, 95].

Item coherence. As we saw earlier, traditional assessment instruments attempt to maximize coherence of the items that assess a latent construct, thereby capturing the construct on a single factor. In contrast, item coherence is not necessary in SAM2 instruments, nor even desirable. In SAM2 instruments, the construct of interest is assessed by cuing a representative sample of situations where the construct occurs. When these situations differ significantly, each is likely to order individuals differently. Individuals exhibiting high levels of the construct in one situation may exhibit low levels in another. As a consequence, large individual by situation interactions emerge, producing low item coherence (i.e., situations do not load on a single factor). While this may not appear desirable from the traditional assessment perspective, it is desirable from the SAM2 perspective, reflecting the sometimes necessary cost of assessing a target construct comprehensively across a diverse sample of situations.

The Spearman-Brown formula indicates how to compensate for low item coherence [2327]. When item coherence is low, assessing a construct with a large number of test items can establish satisfactory test reliability of the aggregate measure. Imagine that coherence is only .05 across situations for a SAM2 instrument. If 80 situations are assessed, the Spearman-Brown formula estimates that alpha will be .81, indicating adequate test reliability. Because the aggregate measure is based on so many items, it is reliable, even though coherence is low. Moreover, using such a large number of situations increases the likelihood that the situations assessed do a reasonable job of covering situations representatively where the construct occurs.

Measuring situational experience with respect to the Situated Action Cycle.

To more completely understand a construct in the situations where it occurs, it is useful to establish its role in the embodied goal-directed action that transpires in them. Thus, SAM2 doesn’t simply measure a construct in relevant situations, it measures additional factors of these situations likely to influence the construct. By doing so, SAM2 establishes both content and convergent validity, comprehensively measuring situational factors related to the target construct of interest.

The Situated Action Cycle provides a powerful theoretical tool for identifying factors that influence a target construct [96]. Fig 1 illustrates the Situated Action Cycle as an idealized series of discrete linear phases. In actual operation, these phases may overlap in time, occur iteratively, or be omitted. Also, various loops and alternative relations between phases may emerge. The idealized representation in Fig 1 simply illustrates critical phases of the Situated Action Cycle and their approximate relations to each other.

thumbnail
Fig 1. The five phases of the situated action cycle, along with measures sampled from it for the SAM2 habits instrument.

Note. Valence in red was assigned a priori during the behavior selection process and validated later in Study 2. Measures in orange were assessed across Studies 1-all, Study 2, and Study 3. Measures in purple were only assessed in Study 2. In later regression analyses, regularity and motivation were dependent variables (DVs), and all other measures were predictors. Measures for the self-relevance and outcome phases of the situated action cycle are the same, given that they can be anticipated initially as self-relevant and then occur later as outcomes.

https://doi.org/10.1371/journal.pone.0286954.g001

As Fig 1 illustrates, phases of the Situated Action Cycle include: the environment, self-relevance, affect, action, and outcomes. In the environment phase, agents experience situations that offer affordances for action and trigger habitual behavioral patterns. In the self-relevance phase, entities and events in the environment implicitly activate self-relevant goals, values, identities, norms, etc. Once a situation’s relevance for the agent is established, it initiates diverse affective states associated with emotion and motivation, often expressed in the body. In turn, emotion and motivation induce diverse actions, ranging from gross bodily movements to eye movements and executive functions. Finally, the actions performed lead to outcomes, such as reward and prediction error. Fig 1 represents the iterative character of the situated action cycle with an arrow that loops back from outcomes to resultant changes in the environment that initiate the next cycle. Fig 1 further represents how each iterative run contributes to conditioning and habit development in long-term memory.

The Situated Action Cycle is hardly a novel idea, with variants playing central roles across disciplines for decades [96]. Behavioral conditioning, for example, offers a classic example of the Situated Action Cycle in psychology, where environmental cues signal reward outcomes that can be obtained with instrumental actions [97, 98]. In cognitive science, theories of goal pursuit offer a different take on the Situated Action Cycle, inserting internal cognitive and affective states between the environment, action, and outcomes central to behaviorism. Examples of this expanded approach can be found in computational accounts of planning and problem solving [99, 100], production systems [101, 102], reinforcement learning [103, 104], and neuroscience [105107]. In discourse analysis, theories of narrative structure propose that conceptual structures similar to the Situated Action Cycle organize people’s knowledge of events during event processing and autobiographical memory [108111]. Grounded theories of conceptual processing similarly propose that knowledge becomes organized around situated action [5, 6, 96, 112, 113]. Because the Situated Action Cycle captures an organized set of processes central to many activities and processes, its ubiquitous role across disciplines is no accident. For this reason, measuring factors across phases of the Situated Action Cycle that influence a target construct is likely to produce comprehensive and informative individual difference measures in a psychometric instrument.

Assessing factors from the Situated Action Cycle in the SAM2 HBI. To establish factors from the Situated Action Cycle that influence behavior regularity, we turned to the large literature on habitual behavior. The red, orange, and purple boxes in Fig 1 illustrate the specific factors in the literature we elected to assess in the studies that follow.

Two important factors associated with habitual behavior are consistency and automaticity. On the one hand, consistency contributes to the development, learning, and triggering of habitual behavior [78, 87, 97, 114, 115]. On the other, as triggering stimuli become frequently and consistently mapped to related responses, automaticity results in both cognition and behavior from repetitive practice [81, 82, 90, 91, 116118]. Considerable literature documents the roles of consistency and automaticity in the conditioning process that produces habitual behavior. Based on this literature, we asked participants to assess the consistency and automaticity of each behavior, predicting that both would be strongly related to behavior regularity, while exhibiting individual differences.

A contentious issue in the literature is whether reward modulates habitual behavior or not. To assess this issue, we asked participants in all studies to evaluate both the immediate and the long-term reward of each target behavior. Based on extensive evidence that reward plays a central role in habitual behavior [86], we predicted that both forms of reward would be positively related to behavior regularity at the group level [also see 88, 89, 92, 97, 101]. We further predicted, however, that consistency and automaticity would be more strongly related to habitual behavior than reward, reflecting the dominant effects of conditioning that make habitual behavior somewhat ballistic. Again, however, we expected to observe large individual differences.

Further literature suggests that as a behavior becomes increasingly habitual, the amount of self-regulation accompanying it decreases [119121]. As a consequence, the amount of conflict experienced with performing the behavior decreases as well. Whereas conflict should arise as people explicitly decide—and perhaps struggle—with what behavior to perform in a particular situation, conflict should decrease as they settle on a single behavior that becomes increasingly habitual. Thus, we predicted that conflict would decrease as behavior regularity increased, again though, expecting individual differences.

Assessing relations between personality and habitual behavior. Previous literature shows that self-control is associated with increased performance of positive behaviors and decreased performance of negative behaviors [88, 122125]. Conversely, previous literature shows that neuroticism is associated with increased performance of negative behaviors and decreased performance of positive behaviors [126129]. To assess whether the SAM2 HBI captures these well-established interactions between personality measures and behavior valence, we included traditional measures of self-control and neuroticism. We also assessed whether factors from the Situated Action Cycle in Fig 1 offer insight into the cognitive and affective processes associated with these personality dispositions.

Methods

Overview of methods in the individual studies

Study 1a was an initial exploratory assessment of the SAM2 HBI performed in our University of Glasgow laboratory (n = 31). Study 1b was an exact replication of Study 1a, again performed in our Glasgow laboratory (n = 31). Study 1c was an exact replication of Studies 1a and 1b combined, performed online with participants sampled across the UK from the Prolific platform (n = 66). Because these three studies were identical except for the details above, and because they exhibited comparable results, they were combined into “Study 1-all” (n = 128).

Study 2 was an online replication of Study 1-all with a larger UK-wide sample from Prolific (n = 199), while assessing three additional factors from the Situated Action Cycle: rated valence, social approval, and motivation (the purple factors in Fig 1).

Study 3 was another replication of Study 1-all using a UK-wide sample from Prolific (n = 115). Whereas Study 1-all and Study 2 collected judgments for regularity, consistency, immediate reward, long-term reward, conflict, and automaticity in a fixed blocked order, Study 3 randomized the order of assessing the predictive factors individually, thereby assessing whether the specific format for collecting judgments was important. Study 3 also added an additional metacognitive measure that assessed whether participants were aware of the factors that predicted their regularity judgments.

Common methods across studies

Participants.

All participants were required to be fluent English speakers, UK residents, and students aged 18 to 30. Online participants (paid £6/hour) were required to have participated in at least 10 Prolific studies and to have achieved at least a 95% approval rate. No participants were excluded in Studies 1a, 1b, 1c, or 2. In Study 3, two participants were excluded for taking very little time and responding randomly on measures that assessed information in the data (established prior to hypothesis testing).

Design.

All participants assessed the same 80 behaviors in Table 1 (40 positive, 40 negative) for the 6 orange factors from the Situated Action Cycle in Fig 1 (behavior regularity, consistency, immediate reward, long-term reward, conflict, automaticity). Additionally, all participants received a score for the Brief Self-Control Scale and for the five subscales of the Five Factor Inventory (extroversion, neuroticism, agreeableness, conscientiousness, openness to experience). In a multilevel design, behavior regularity served as the dependent variable, with consistency, immediate reward, long-term reward, conflict, and automaticity predicting regularity at the behavior level, and with self-control and neuroticism predicting regularity at the individual level.

Materials.

To cover a broad range of potential habits, we first included behaviors addressed regularly in the habits literature and then turned to internet sites that list habits and help people work with them. Besides attempting to cover as many relevant domains as possible, we manipulated behavior valence, including both positive and negative behaviors (Table 1).

Study 2 assessed the validity of our a priori assignments for valence, with all results reported in the supplemental materials, S1 File. As shown there, rated valence in Study 2 verified that the positive behaviors in Table 1 were strongly positive and that the negative behaviors were strongly negative, with no overlap. Regressions further found that the valence manipulation reflected SAM2 predictors from the Situated Action Cycle, with positive behaviors being high in long-term reward and social approval, and with negative behaviors being low.

Procedure.

All participants performed the study online in an anonymous Qualtrics survey. Participants first received an information sheet about the study and clicked a digital form to provide informed consent (ethics approval for this work was received on 1 August 2017 from the Ethics Committee for the College of Science and Engineering at the University of Glasgow, Application 300160200).

Participants then evaluated the 80 behaviors in blocks for behavior regularity, consistency, immediate reward, long-term reward, conflict, and automaticity. Table 2 presents the scales used to collect these judgments. As described shortly, the format for collecting these judgments varied across Studies 1-all, 2, and 3. Additionally, Study 2 included judgments for habit motivation, valence, and social approval (described in S1 File).

thumbnail
Table 2. Measures assessed in Studies 1-all, 2, and 3, with interrater agreement for each.

https://doi.org/10.1371/journal.pone.0286954.t002

After making these judgments, participants completed the Five Factor Inventory [14] and the Brief Self-Control Scale [15]. Finally, participants provided demographic information, were debriefed, and thanked for their participation. Participants typically took about 50 to 60 min to complete the study, with Studies 2 and 3 taking a little longer than Study 1-all. To ensure that the construct of habit was not salient to participants, the term “habit” was not mentioned in Studies 2 and 3 and was mentioned only once in the introduction of Study 1-all.

Analyses

Analyses performed include computations of intraclass correlations [25, 26], Cronbach’s alpha [23, 24], the Spearman-Brown formula [24, 26, 27], and multilevel mixed-effect regressions [130, 131]. S1 File presents the analysis pipeline used for all regression analyses.

Most analyses to follow focus on effect size, replicability, and generalizability, with little formal hypothesis testing [132134]. We report p values when inferences about statistical significance are potentially informative.

Unique methods for individual studies

Data collection mode.

To assess whether our results replicated across different modes of data collection, Studies 1a and 1b were performed in our laboratory, whereas Studies 1c, 2, and 3 were performed online.

Data collection format.

In Studies 1a, 1b, and 1c, participants performed three blocks of judgments. In the first block, participants assessed each of the 80 behaviors one at a time for both regularity and situational consistency. In the second block, participants assessed each behavior for both immediate reward and long-term reward. In the third block, participants assessed each behavior for both conflict and automaticity. In each block, the same 80 behaviors were randomly ordered for each participant. S1 File presents our reasons for blocking judgments in this manner, together with results that justify our reasoning.

In Study 2, the data collection format was the same as the procedure of Study 1-all, with the following exceptions. Judgments of behavior motivation were added to block 2, and a new block 4 was added that included rated valence and social approval. Blocks 1 and 3 were the same as in Study 1-all. To streamline the main text here, results for behavior motivation, rated valence, and social approval are only presented in S1 File. In all cases, they complement the main results reported here and do not raise issues that bear on them.

In Study 3, only one judgment was made on each trial instead of two or three. Because behavior regularity was the dependent variable, it was always assessed in a first block of the 80 randomized behaviors (so as not be influenced by the subsequent blocks for the five predictive factors). The predictive factors (consistency, immediate reward, long-term reward, conflict, automaticity) were then assessed one at a time in five subsequent blocks randomized for each participant, with the 80 behaviors randomized within each. Of interest was whether collecting unblocked judgments with the predictive factors in a random order replicated the pattern of results obtained with the blocked judgment procedure.

S1 File provides examples of the screens used to collect judgments in all studies. S1 File also provides the rationale for implementing blocked judgments initially in Studies 1-all and 2. As will be seen in the main text, data collection format had no impact on the general pattern of results obtained. S1 File presents additional evidence bearing on this issue.

Metacognitive judgments.

In Study 3, participants were asked to estimate how much influence each of the SAM2 factors—consistency, immediate reward, long-term reward, conflict, and automaticity—has on them when performing the 80 behaviors in Table 1 (just prior to filling out the Five Factor Inventory and Self-Control instruments). S1 File presents results showing that participants were relatively unaware of the factors associated with their regularity judgments.

Results

For all results in the main text and S1 File, data and analysis scripts are publicly available online. S1 File presents the correlation matrix for the measures common across studies, along with judgment means for the 80 behaviors across SAM2 measures.

Individual differences in behavior regularity

The SAM2 HBI provides an overall trait-level measure of an individual’s behavior regularity. Averaging an individual’s 80 regularity judgments provides an aggregate measure of how regularly they perform the 80 behaviors in Table 1. The median of these individual means in Studies 1-all, 2, and 3 was 54.91, 56.25, and 56.29, respectively, with inter-quartile ranges of 24.27, 21.60, and 21.46. These values provide a general sense of behavior regularity in the population for these kinds of behaviors, and for how much it varies across individuals. Because the scale value of 50 was labeled, “half the time,” these values indicate that participants performed the 80 behaviors in Table 1 over half the time, on average, in situations where possible, with some participants exhibiting more regularity and others exhibiting less.

It is important to note, however, that a given individual performed many behaviors much more regularly than their overall mean, indicating that some behaviors were performed very regularly. To see this, consider the heatmap in Fig 2 that visualizes each participant’s regularity judgments in Study 2 (S1 File presents analogous heatmaps for Studies 1-all and 3). The number below each column corresponds to the number of the corresponding behavior in Table 1 (with positive behaviors on the left and negative behaviors on the right). When a participant performed a behavior 50% to 60% of the time when possible, its cell is relatively white in the heat map. As can be seen, however, the cells for many behaviors are bright red, indicating that they were performed 80% to 100% of time. Later when we address an individual’s most habitual behaviors, we will again see that many behaviors were performed with very high regularity.

thumbnail
Fig 2. A heatmap that visualizes the individual × behavior interaction for behavior regularity in Study 2.

Note. The 80 regularity judgments for each of the 199 participants are presented in a single row, with their judgments for 40 positive behaviors in the left half, and their judgments for 40 negative behaviors in the right half. The number below each column corresponds to the number of the corresponding behavior in Table 1. As a cell becomes increasingly red, the regularity judgment increasingly approached 100 (on a scale of 0 to 100). As a cell becomes increasingly blue, the regularity judgment increasingly approached 0. As a cell becomes increasingly white, the regularity judgment increasingly approached 50. On the left, a hierarchical clustering dendrogram establishes groups of participants having similar vectors of regularity values across positive and negative behaviors (from hierarchical clustering with the Ward D measure). Analogous heatmaps for Studies 1-all and 3 can be found in S1 File. Table 1 provides intraclass correlations that assess interrater reliability of the judgments in this map.

https://doi.org/10.1371/journal.pone.0286954.g002

The SAM2 HBI further provides trait-level measures for how regularly individuals perform positive and negative behaviors. Fig 3 presents two means for how regularly each individual performs the 40 positive behaviors and the 40 negative behaviors in Table 1. Across studies, the median of these participant means was around 60 for positive behaviors and around 50 for negative behaviors, indicating that positive behaviors were performed more regularly than negative behaviors. Again, however, the bright red cells on the left and right of Fig 2 illustrate that a given individual performed many positive and negative behaviors 80% to 100% of the time.

thumbnail
Fig 3. For each individual participant, the mean judgment on each measure (regularity, consistency, immediate reward, long-term reward, conflict, automaticity) for positive (POS) and negative (NEG) behaviors.

Note. Each point is the mean judgment across 40 behaviors for a single individual. Box and whisker plots show the median and inter-quartile range of the means across participants for each measure.

https://doi.org/10.1371/journal.pone.0286954.g003

Interestingly, trait-level measures for the regularity of positive and negative behaviors tended to be negatively correlated. When the mean regularity of performing positive behaviors was correlated with the mean regularity of performing negative behaviors across participants, the correlations were -.22 in Study 1-all, -.03 in Study 2, and -.21 in Study 3 (t(126) = -2.54, p = .01; t(197) = -.16, p = .71; t(113) = -2.34, p = .02). As individuals tended to perform positive behaviors more regularly, they did not also tend to perform negative behaviors more regularly. In other words, these two individual difference measures exhibited divergent validity, measuring different trait-level constructs in an individual. Later we will see that personality measures for self-control and neuroticism integrate these divergent measures of positive and negative regularity, and that factors from the Situated Action Cycle explain relations between them.

The SAM2 HBI provides additional individual difference measures for factors from the Situated Action Cycle that influence habitual behavior. Consider the individual means for consistency and automaticity in Fig 3, whose median values were generally around 50 or higher across studies. These relatively high values indicate that the 80 behaviors in Table 1 tended to be performed automatically in consistent situations—hallmarks of habitual behavior.

Fig 3 also presents the individual means for immediate reward and long-term reward across positive and negative behaviors. As can be seen, the median values for immediate reward tended to be greater than 0, indicating that participants tended to find both positive and negative behaviors immediately rewarding. Perhaps surprisingly, positive behaviors were consistently associated with higher immediate reward than were negative behaviors. As Fig 3 further illustrates, long-term reward behaved more as expected, with a positive median value for positive behaviors but with a negative median value for negative behaviors.

Overall, the pattern of results for regularity, consistency, automaticity, immediate reward, and long-term reward across positive and negative behaviors offers construct validity for the SAM2 HBI. Across the Situated Action, Cycle factors related to habitual behavior reflected well-established patterns in the literature.

Individual differences in behavior regularity across situations

We anticipated that large differences would exist in how regularly individuals perform the 80 behaviors in Table 1. Specifically, we expected that the behaviors one individual performed regularly would differ considerably from the behaviors that another individual performed regularly—an individual by behavior interaction. To assess this prediction, we used the intraclass correlation to assess interrater agreement in how regularly participants performed the 80 behaviors [25]. Conceptually, the intraclass correlation in this context estimates the average correlation between all possible pairs of individuals in their regularity judgments across behaviors, establishing the overall similarity in their judgments (i.e., interrater agreement).

The first row of Table 2 presented earlier displays the intraclass correlations for behavior regularity across studies, with the lower rows presenting values for the other five measures from the Situated Action Cycle. Because these values were computed using the random effects version of the intraclass correlation [the ICC2 measure in 24, 25], they generalize beyond the current individuals in each study to other individuals in the larger population.

As the columns for all 80 behaviors in Table 2 illustrate, interrater agreement for behavior regularity was only around .30 across studies, suggesting large individual differences in how regularly individuals perform specific behaviors. On the average, the regularity of one individual’s behaviors only correlated about .30 with the regularity of another individual’s behaviors. As the additional values of the ICC2 illustrate for the 40 positive behaviors and for the 40 negative behaviors, comparable levels of interrater agreement for regularity tended to hold within each subset of behaviors as well. Although this low agreement between individuals could simply reflect noise, we will see later that it is highly systematic.

Fig 2 offers insight into the relatively low interrater agreement for regularity. As can be seen by comparing different columns with one another, regularity varied widely and systematically across the 80 behaviors. Nevertheless, the pattern of regularity across the 80 behaviors varied in different manners for different individuals, as can be seen by comparing different rows. It is these differences in patterns of regularity across rows that low interrater reliability in Table 2 is capturing.

In general, differences in the columns and rows of Fig 2 capture classic sources of variance associated with the interactionist approach [3840, 45, 48]. First, individuals exhibit different trait-level forms of a construct, as captured by overall differences in regularity across rows. Second, a construct takes widely varying forms across behaviors (and their associated situations), as captured by overall differences in regularity across columns. Third, a construct exhibits an individual by behavior interaction, with the construct’s levels across behaviors exhibiting different patterns for different individuals (reflected in low interrater agreement).

Additionally, the dendrogram on the left of Fig 2 establishes clusters of individuals who exhibited similar patterns of regularity across the 80 behaviors. Whereas the cluster comprising the top half of Fig 2 captures individuals who perform positive habits more than negative habits, the cluster comprising the bottom half captures individuals who perform positive habits relatively less and negative habits relatively more. Within each of these large clusters, smaller clusters capture other profiles of habit regularity. For example, a cluster just above the midline captures individuals who perform positive habits the most and negative habits the least.

Finally, interrater agreement in Table 2 for other measures from the Situated Action Cycle varied widely, ranging from ~.10 for conflict to ~.50 for long-term reward, with automaticity (~.18), consistency (~.18), and immediate reward (~.20) falling in between. In general, these values indicate that the SAM2 HBI exhibits large individual differences across all its measures. The relatively high agreement for long-term reward across all 80 behaviors (~.50) most likely reflects the salient difference between positive and negative behaviors. When agreement was assessed separately for positive and negative behaviors, interrater reliability for long-term reward dropped considerably to ~.10, indicating much less agreement within each behavior subset.

Test reliability

As discussed earlier, SAM2 establishes an overall trait-level measure of behavior regularity for each individual across the 80 behaviors in Table 1, along with more specific trait-level measures for positive and negative behaviors (Fig 3). Of interest next is how reliable these trait-level measures are (i.e., how accurately they capture individual differences in behavior regularity). We used Cronbach’s alpha to establish test reliability [23, 26; specifically the ICC3k measure in 24, 25].

The top row of Table 3 presents alpha for the three SAM2 measures of behavior regularity. As can be seen, these alphas are satisfactory, clustering around the conventionally acceptable level of .70 to .80. The rows immediately below similarly demonstrate acceptable values for factors from the Situated Action Cycle that influence behavior regularity.

thumbnail
Table 3. Test reliability and inter-behavior coherence for all measures in Studies 1-all, 2, and 3.

https://doi.org/10.1371/journal.pone.0286954.t003

The Spearman-Brown formula offers insight into the values of alpha in Table 3 [2327]. According to Spearman-Brown, alpha increases as a function of two parameters: (a) the coherence between test items (how much they intercorrelate), and (b) the number of test items. Given that the SAM2 aggregate measures used 40 to 80 items, the values of alpha at the top of Table 3 might be expected to be much higher than .80. The relatively low values of item coherence in the lower half of Table 3 explain the relatively low alphas at the top (the fixed-effects version of the intra-class correlation, ICC3, was used to compute these coherence values, reflecting the assumption that a constant set of behaviors is used to assess regularity across occasions [24, 25].

For the overall measure of behavior regularity across all 80 behaviors, inter-behavior coherence was around .04. These low values indicate that the 80 behaviors differed considerably in how they ordered individual participants from high to low on behavior regularity—individuals high in regularity for one behavior were low for another. In the columns of Fig 2, the widely varying patterns of regularity across individuals for different behaviors visualize the low coherence between behaviors. Most importantly, however, 80 test items are sufficient to produce acceptable alphas near .80, even when coherence is a low .04 (following Spearman-Brown).

Why might we be seeing such low coherence for the SAM2 HBI? According to the Situated Assessment Method, measuring a construct effectively requires assessing it in the situations where it occurs. To ensure that an aggregate measure across situations is not biased toward one kind of situation, the behavior should be assessed in a representative sample. It follows that if situations within a representative sample vary widely, then using them as a set of SAM2 test items is likely to exhibit low coherence. To the extent that different situations order individuals differently on the construct from high to low, coherence across a representative set of situations could be low.

One potential source of variance in habitual behaviors is whether they are positive or negative. If so, then computing coherence separately for positive vs. negative behaviors should produce greater coherence. As Table 3 illustrates, coherence for regularity increased from .04 to around .11 for both positive and negative behaviors. As a consequence, corresponding values of alpha in the top-half of Table 3 increased as well.

Construct validity with respect to the Situated Action Cycle: Group level

If the SAM2 measure of behavior regularity is a valid measure of habitualness, then it should be associated with factors known to influence habitualness in the scientific literature (Fig 1). To the extent that a SAM2 HBI measure of regularity behaves as expected when factors such as consistency, automaticity, immediate reward, and long-term reward vary, it demonstrates construct validity.

We used multilevel mixed-effect modelling to assess whether group-level data collected with the SAM2 HBI exhibited this pattern of construct validity. Specifically, the SAM2 measure of behavior regularity was regressed onto the factors in the Situated Action Cycle common across studies in Fig 1. All models included all main effects, two-way interactions, and three-way interactions, together with random intercepts for participants and behaviors, and with relevant random slopes. S1 File provides a detailed account of the regression pipeline, along with detailed regression results for findings reported here and later.

Fig 4 presents the standardized regression coefficients from these analyses with their 95% confidence intervals. By virtue of being standardized, these coefficients provide a measure of effect size. To demonstrate robustness of the coefficients across studies (and especially for small sample sizes), Fig 4 includes coefficients for the three parts of Study 1-all. The value for each coefficient in Fig 4 was obtained from an analysis where random slopes were included for the respective fixed effect, thereby conservatively assessing its generalizability across participants and behaviors.

thumbnail
Fig 4. Estimated regression coefficients and their 95% confidence intervals at the group level for the prediction of behavior regularity across Studies 1-all, 2, and 3, including predictors for a priori valence, consistency, immediate reward, long-term reward, conflict, and automaticity.

Note. To demonstrate replicability, even with small sample sizes, Studies 1a, 1b, and 1c are shown, along with these three studies combined for Study 1-all. All coefficients are standardized and were established in regression models that included random intercepts, random slopes, and interactions (all two- and three-way). S1 File provides details of the regression analysis procedure and results.

https://doi.org/10.1371/journal.pone.0286954.g004

As Fig 4 illustrates, a common pattern emerged across studies, demonstrating that the SAM2 HBI produces highly replicable results. The same pattern of results occurred regardless of whether sample size was 31 (Studies 1a and 1b) or 199 (Study 2), regardless of whether data were collected in the lab (Studies 1a and 1b) or online (Studies 1c, 2, and 3), regardless of whether judgment blocks were fixed (Studies 1-all and 2) or randomized (Study 3).

Most importantly, these results establish construct validity for the SAM2 HBI. Consistent with the habits literature, factors associated with conditioning—consistency and automaticity—were most strongly related to behavior regularity. A behavior became more regular as it was performed more automatically in increasingly consistent situations. Additionally, immediate and long-term reward were also associated with behavior regularity, albeit to a weaker extent than consistency and automaticity. Unexpectedly, conflict was not consistently related to regularity, with most standardized coefficients clustering around 0. Although valence exhibited a weak positive relation with regularity, its most potent effects emerged through interactions with self-control and neuroticism, addressed later.

Finally, these group-level analyses generally explained 65% to 70% of the total variance in behavior regularity (S1 File provides detailed results). This explanatory success demonstrates content validity of the SAM2 HBI: Its measures across the Situated Action Cycle are sufficiently thorough and complete to explain the bulk of the variance in habitual behavior.

Construct validity with respect to the Situated Action Cycle: Individual level

As we will see both here and in the remaining sections, results at the individual level further demonstrate construct validity for the SAM2 HBI. We first address whether each individual’s judgments of consistency, automaticity, immediate reward, long-term reward, and conflict tended to exhibit predicted relations with habitual behavior observed in the literature. As participants judged each of these factors, did their judgments covary with their judgments of behavior regularity? Notably, participants did not appear to be explicitly aware of how the various predictors covary with behavior regularity, given their general inability to report these relationships accurately (see supporting evidence from an analysis of awareness judgments in S1 File). Thus, if these factors from the Situated Action Cycle covary with behavior regularity, they are likely to instead reflect implicit structure across habitual behaviors that the SAM2 HBI captures.

Each row of the heat map in Fig 5 visualizes the correlations between behavior regularity and the five predictive factors from the Situated Action Cycle for each of the 199 participants in Study 2 (S1 File presents analogous heatmaps for Studies 1-all and 3). As a cell becomes redder, the correlation approaches +1; as it becomes bluer, it approaches -1; as it becomes whiter, it approaches 0. Table 4 summarizes the correlations in Fig 5, presenting the 25th, 50th, and 75th quartiles for each correlation across participants in Studies 1-all, 2, and 3.

thumbnail
Fig 5. A heatmap that visualizes correlations between behavior regularity and individual factors from the Situated Action Cycle (conflict, immediate reward, long-term reward, automaticity, and consistency).

Note. The six correlations for each of the 199 participants in Study 2 appear in a single row. As a cell becomes increasingly red, the correlation was increasingly positive. As a cell becomes increasingly blue, the correlation was increasingly negative. As a cell becomes increasingly white, the correlation increasingly approached 0. On the left, a hierarchical clustering dendrogram establishes groups of participants having similar prediction vectors (from hierarchical clustering with the Ward D measure). Analogous heatmaps for Studies 1-all and 3 can be found in S1 File. Table 4 summarizes the correlations in all three studies.

https://doi.org/10.1371/journal.pone.0286954.g005

thumbnail
Table 4. Medians (interquartile ranges) of the individual correlations in Fig 5 between behavior regularity and factors from the Situated Action Cycle (Consistency, immediate reward, long-term reward, conflict, and automaticity).

https://doi.org/10.1371/journal.pone.0286954.t004

As both Fig 5 and Table 4 illustrate, the general pattern for individual prediction conformed to the pattern at the group level. Again, consistency and automaticity were most strongly related to behavior regularity, exhibiting significant median correlations around .75 and .70, respectively. Again, immediate and long-term reward were also related to regularity significantly but to a lesser extent, exhibiting median correlations around .45 and .35, respectively. Again, conflict appeared relatively unrelated to regularity. These results further establish construct validity for the SAM2 HBI. Not only does it capture well-established relations between constructs at the group level, it also captures them at the individual level.

We hasten to add that considerable individual differences were nevertheless present. In Fig 5, behavior regularity failed to correlate highly with consistency or automaticity for quite a few individuals. The absence of correlations for some individuals is even more prevalent for reward, especially for long-term reward. Furthermore, major differences in individual prediction vectors differentiate clusters of related individuals, as captured by the hierarchical clustering dendrogram on the left.

Content validity in individual regressions

Earlier we saw considerable individual differences in judgments of regularity (ICC2 values in Table 2), consistent with a large individual by behavior interaction (Fig 2). Of interest here is whether these large individual differences simply reflected noise or whether they were systemic. If they were systematic, then factors from the Situated Action Cycle in Fig 1 should generally explain high levels of variance in the regularity judgments of individual participants. In other words, the SAM2 HBI should again illustrate a high level of content validity.

To assess this prediction, a simple linear regression was performed for each participant in Studies 1-all, 2, and 3 on their individually standardized data (n = 128, 199, 115, respectively). Each regression only modeled fixed effects, given that no random effects were logically possible in the individual models. Each regression included behavior regularity as the dependent variable, along with consistency, immediate reward, long-term reward, conflict, and automaticity as predictors. Only main effects were modeled with no interactions. The goal of these simple multiple regressions was to construct a prediction profile of behavior regularity for each individual.

Fig 6 presents the variance explained (R2) in the individual regressions, with the median R2 being 77% in Study 1-all, 75% in Study 2, and 72% in Study 3 (with R2 expressed as a percentage). The R2 for the same main-effects-only individual model with no random effects or interactions when run at the group level is plotted as the lower dashed purple line. The R2 for a group-level model with random intercepts and interactions is plotted as the upper dashed blue line. As can be seen, the median R2 for the individual models was significantly higher than the variance explained by both group-level models in every study.

thumbnail
Fig 6. R2 plotted as percentages for the individual regressions in Studies 1-all, 2, and 3 (in percentage points).

Note. Each dot is the R2 for a participant’s individual regression, where each regression predicts the individual’s judgments of behavior regularity from their judgments of consistency, immediate reward, long-term reward, conflict, and automaticity (with no interactions or random effects). A box and whisker plot shows the median and inter-quartile range in each study. The lower dashed purple line shows R2 for the same main-effects-only individual model with no interactions or random effects when run at the group level. The upper dashed blue line shows R2 for Model 1 at the group level with random intercepts and interactions from Table 5. Each median R2 for the individual regressions in Studies 1-all, 2, and 3 differed significantly from the respective main-effects-only group model in the predicted positive direction (using Mann-Whitney tests, all p < 1.0e-6, one-tailed). Each median also differed significantly from the respective random-effects-plus-interactions group model in the predicted positive direction (using Mann-Whitney tests, all p < 1.0e-4, one-tailed).

https://doi.org/10.1371/journal.pone.0286954.g006

thumbnail
Table 5. The 80 behaviors ordered by the composite measure of habitualness (averaged across Study 2 participants), with their composite reward values shown as well.

https://doi.org/10.1371/journal.pone.0286954.t005

These results for the individual regressions indicate that the relatively low interrater agreement in Table 2 did not reflect noise but instead reflected systematic individual differences. Specifically, high R2 values for the individual regressions show that the SAM2 judgments within individuals tended to be highly systematic. As a result, low agreement between individuals for behavior regularity reflected large differences in their systematic judgments of it. These results demonstrate the content validity of the SAM2 HBI at the individual level, explaining even more variance in behavior regularity than at the group level.

Construct validity with respect to self-control and neuroticism

As described earlier, the literature on habitual behavior has established interactions between personality measures and habit valence. As an individual’s self-control increases, they tend to perform positive behaviors more and negative ones less (a self-control × valence interaction). Conversely, as an individual’s neuroticism increases, they tend to perform negative behaviors more and positive ones less (a neuroticism × valence interaction). If the SAM2 measure of behavior regularity is a valid measure of habitualness, it should capture these interactions.

Figs 7 and 8 present these interactions across studies, including the three sub-studies of Study 1-all to demonstrate robustness in small samples. As Fig 7 illustrates, self-control was associated with increasing regularity of positive behaviors and decreasing regularity of negative behaviors across studies. As Fig 8 illustrates, neuroticism was associated with decreasing regularity of positive behaviors and increasing regularity of negative behaviors. S1 File presents detailed analyses of these interactions, all remaining robust when random slopes were included for participants and behaviors, demonstrating their generalizability.

thumbnail
Fig 7. The interaction of behavior valence with self-control across Studies 1-all, 2, and 3.

Note. Each interaction modeled here was established in a mixed-effect regression analysis that predicted behavior regularity as a function of only valence and self-control (with all three variables standardized prior to analysis). Valence was an a priori variable that contrasted positive versus negative behaviors (Table 1). S1 File provides details of the regression analysis procedure and results.

https://doi.org/10.1371/journal.pone.0286954.g007

thumbnail
Fig 8. The interaction of behavior valence with neuroticism across Studies 1-all, 2, and 3.

Note. Each interaction modeled here was established in a mixed-effect regression analysis that predicted behavior regularity as a function of only valence and neuroticism (with all three variables standardized prior to analysis). Valence was an a priori variable that contrasted positive versus negative behaviors (Table 1). S1 File provides details of the regression analysis procedure and results.

https://doi.org/10.1371/journal.pone.0286954.g008

These findings further establish construct validity for the SAM2 HBI. Consistent with previous literature on relations between personality and habit valence, self-control and neuroticism strongly affected the regularity of positive and negative behaviors in opposite manners.

Using the Situated Action Cycle to understand self-control and neuroticism

As discussed earlier, viewing self-control and neuroticism as internal trait mechanisms that cause behavior offers limited theoretical understanding of these behavioral dispositions. To understand self-control and neuroticism, it is instead necessary to develop accounts of the underlying cognitive and affective processes as they interact with situations to produce these stable tendencies in behavior [3842, 44, 45, 47]. As will see in these next analyses, the SAM2 framework offers a novel and effective approach for establishing such accounts.

Of interest in these analyses was whether factors from the Situated Action Cycle for habitual behavior in Fig 1—consistency, automaticity, immediate reward, long-term reward, and conflict—explain variance in self-control and neuroticism. To see this, first consider the self-control × valence interactions in Fig 7, which resulted from regressing behavior regularity onto only self-control, valence, and their two-way interaction (including random intercepts and slopes as described in S1 File). If factors from the Situated Action Cycle for habitual behavior in Fig 1 underlie self-control, then adding all these factors into the initial regression models should eliminate self-control’s interactions with valence. If these factors don’t underlie self-control, then the self-control × valence interactions should remain.

When all factors from the Situated Action Cycle for habitual behavior were added to the original regression models, the self-control × valence interactions in Fig 7 completely disappeared across studies. When the same procedure was applied to the neuroticism × valence interactions in Fig 8, they, too, disappeared completely. These results demonstrate that factors from the Situated Action Cycle underlie the behavioral tendencies associated with self-control and neuroticism. S1 File presents these analyses in detail.

We next used stepwise regression to identify the specific SAM2 factor(s) from the Situated Action Cycle that explained variance in these interactions. At each regression step, all factors were entered, one at a time, to assess how much adding each alone decreased the estimated regression coefficient for the interaction of interest. After each step, we added the (remaining) factor that decreased the coefficient the most, continuing until all factors had been added. S1 File provides the details of these analyses.

Fig 9 presents results from the stepwise process for the self-control × valence interactions in Studies 1-all, 2, and 3. Across studies, automaticity, consistency, long-term reward, and immediate reward explained the increased regularity of positive behaviors and the decreased regularity of negative behaviors associated with self-control. Fig 10 presents the analogous results for the neuroticism × valence interactions. Across studies, automaticity, long-term reward, and consistency explained the decreased performance of positive behaviors and the increased performance of negative behaviors associated with neuroticism.

thumbnail
Fig 9. Results from stepwise regression to establish the SAM2 predictors that explained variance in the valence × self-control interaction in Studies 1-all, 2, and 3 (Fig 7).

Note. In the original analyses (left column), behavior regularity was regressed onto only valence and self-control in Model 2 from the analysis pipeline to test the valence × self-control interaction maximally. The five primary SAM2 predictors (Fig 1) were then added one at a time into the original regression to assess how much each alone decreased the coefficient for the valence × self-control interaction (estV×S). The predictor that decreased estV×S the most is shown in the second column, along with a plot of the resulting interaction and the new value of estV×S. In four further iterations of the stepwise process, the remaining SAM2 predictors were again added one by one to identify the predictor that next decreased the estV×S interaction the most. These results are shown in the third, fourth, fifth, and sixth columns. S1 File provides details of the regression analysis procedure and results.

https://doi.org/10.1371/journal.pone.0286954.g009

thumbnail
Fig 10. Results from stepwise regression to establish the SAM2 predictors that explained variance in the valence × neuroticism interaction in Studies 1-all, 2, and 3 (Fig 8).

Note. In the original analyses (left column), behavior regularity was regressed onto only valence and neuroticism in Model 2 from the analysis pipeline to test the valence × neuroticism interaction maximally. The five primary SAM2 predictors (Fig 1) were then added one at a time into the original regression to assess how much each alone decreased the coefficient for the valence × neuroticism interaction (estV×N). The predictor that decreased estV×N the most is shown in the second column, along with a plot of the resulting interaction and the new value of estV×N. In four further iterations of the stepwise process, the remaining SAM2 predictors were again added one by one to identify the predictor that next decreased the estV×N interaction the most. These results are shown in the third, fourth, fifth, and sixth columns. S1 File provides details of the regression analysis procedure and results.

https://doi.org/10.1371/journal.pone.0286954.g010

These analyses further demonstrate construct validity of the SAM2 HBI. Not only does this instrument capture well-established relations between behavior regularity and relevant personality measures, it also provides insights into factors from the Situated Action Cycle that underlie the associated behavioral dispositions.

Establishing habitual behaviors

The dimension of habitualness.

We next illustrate how the SAM2 HBI can be used to establish habitual behaviors for both a group and an individual. As discussed earlier, considerable disagreement surrounds the criteria that establish a behavior as a habit. For example, a habit could be a behavior produced automatically or a behavior preceded by automatic cognition. A habit could be a behavior that is performed frequently in consistent situations. A habit could be a ballistic behavior not modulated by reward. A habit could be a behavior performed regularly as conditions allow, regardless of automaticity, consistency, and reward.

To address this issue, we developed a composite measure of habitualness that combined regularity, consistency, and automaticity (i.e., for each behavior in Table 1, the average of an individual’s judgements of regularity, consistency, and automaticity). Essentially, this measure assumes that as behavior becomes increasingly regular, consistent, and automatic, it becomes increasingly habitual in a continuous manner.

Establishing habitualness at the group level.

Table 5 presents the 80 behaviors from Table 1 ordered by composite habitualness averaged across the 199 participants in Study 2. The 40 most habitual behaviors appear on the left, and the 40 least habitual behaviors appear on the right. Examining the most and least habitual behaviors confers face validity on the composite measure of habitualness. The most habitual behaviors strike us as intuitively likely to be highly habitual (e.g., saying please and thank you, check my phone, shower every day, worry). In contrast, the least habitual behaviors strike us as intuitively likely to be less habitual (e.g., chew on pencils, skip lectures, use substances to relax, litter).

For each study, Fig 11 presents a histogram for the 80 behaviors as a function of composite habitualness. Strong similarity in the distributions holds across studies, with maximal density in the 50 to 60 range. An interesting question is where one might establish a threshold on this distribution for classifying a behavior as a habit. Because all measures going into composite habitualness used a 0 to 100 likelihood scale, a value of 50 can be viewed as a behavior that was performed regularly, automatically, and in consistent situations about 50% of the time. What should the threshold of this measure be for calling a behavior a habit? If the threshold were set at 80%, very few behaviors would be habits at the group level. If the threshold were set around 50%, many habits would be performed regularly and automatically in consistent situations 50% of the time. Should these be called habits? Many are behaviors that people might well consider habitual, such as take study breaks, eat healthy snacks, and use shopping lists.

thumbnail
Fig 11. Histograms showing counts of the 80 behaviors in Table 1 on their compositive values for habitualness averaged across participants in Studies 1-all, 2, and 3.

https://doi.org/10.1371/journal.pone.0286954.g011

An alternative way of viewing these plots is to conceptualize habitualness as a dimension that varies continuously across behaviors. From this perspective, attempting to establish a category of bona fide habits is not likely to be easy, successful, or theoretically meaningful. Similarly, instead of asking how long it takes to learn a habit [135], it perhaps is more useful to establish how the habitualness of a behavior grows continuously over time.

Establishing habitualness at the individual level.

As we saw earlier in Fig 2 and Table 2, considerable differences occurred in how regularly individuals perform the 80 behaviors in Table 1. Thus, the values of composite habitualness for specific behaviors are likely to vary considerably across individuals as well.

We first addressed this issue as follows. For each study, we computed an intraclass correlation that established how much the composite measure of habitualness varied between individuals [ICC2; 24, 25]. Analogous to Table 2, these intraclass correlations assessed interrater agreement in habitualness for the 80 behaviors. For Studies 1-all, 2, and 3, these values were .31, .26, and .27, respectively, indicating large individual differences in habitualness. Because the ICC2 estimates random effects, these values generalize to other individuals in the population.

To further document individual differences in habitualness, we established how many of the 80 behaviors in Table 1 were the first most habitual behavior for at least one individual. In Studies 1-all, 2, and 3, this number was 35, 53, and 39 out of 80, respectively. We then assessed the number of behaviors that fell into the top five most habitual behaviors for at least one individual. In Studies 1-all, 2, and 3, this number was 65, 73, and 70 out of 80, respectively. Earlier we proposed that the 80 behaviors in Table 1 could potentially be highly habitual for at least some individuals. These results support that claim.

Fig 12 further illustrates these large individual differences, showing the top ten most habitual behaviors for eight individuals in Study 2. Whereas the four individuals on the left were relatively high in self-control and low in neuroticism, the four individuals on the right were relatively low in self-control and high in neuroticism. As can be seen, considerable variability existed in the top ten most habitual behaviors for even individuals with the same personality profile. Even more striking differences occurred between profiles.

thumbnail
Fig 12. The top ten most habitual behaviors from four Study 2 participants who were high in self-control and low in neuroticism (left), and the top ten most habitual behaviors from four Study 2 participants who were high in neuroticism and low in self-control (right).

Note. Composite values of habitualness and reward are shown for each behavior. Each inset Fig shows the habitualness histogram for a participant’s 80 behaviors, with the same axes as the histograms in Fig 11.

https://doi.org/10.1371/journal.pone.0286954.g012

For each individual, an inset figure plots the histogram of their 80 behaviors for the measure of composite habitualness (with the axes being the same as the group distributions in Fig 11). As can be seen, the individual distributions varied widely, departing considerably from the group distributions. Again, large individual differences occurred, and it is not clear how one might motivate a consistent habit threshold across them in a principled manner.

Establishing the relation between habitualness and reward

Finally, we illustrate how the SAM2 HBI can be used to establish the relation between habitualness and reward at both the group and individual levels. As described earlier, some researchers propose that habits run ballistically, not modulated by reward. In contrast, other researchers make compelling empirical and theoretical cases that reward modulates the implementation of habitual behavior.

To establish the relation between habitualness and reward, we constructed a composite measure of reward from the SAM2 measures for immediate and long-term reward: the average of an individual’s judgements of immediate and long-term reward for each behavior in Table 1. Table 5 presents the compositive values averaged across participants in Study 2. Fig 12 presents the compositive values for the top ten most habitual behaviors of eight participants in Study 2.

Assessing the relation between habitualness and reward at the group level.

Fig 13 presents scatterplots for the composite measures of habitualness and reward across the 80 behaviors at the group level (averaged across participants within Studies 1-all, 2, and 3). The point in a scatterplot for each behavior is designated by its number in Tables 1 and 5. As can be seen, habitualness and reward were positively related across studies, exhibiting correlations of .57, .65., and .47 for Studies 1-all, 2, and 3, respectively (every p < .0001, two-tailed). As a behavior became increasingly habitual, it became increasingly rewarding as well.

thumbnail
Fig 13. Scatterplots of the 80 behaviors in Table 1 on their compositive values for reward and habitualness averaged across participants in Studies 1-all, 2, and 3.

Note. The behavior numbers correspond to those in Table 1. rHR is the correlation between the two composite means across behaviors.

https://doi.org/10.1371/journal.pone.0286954.g013

Examining specific behaviors in Fig 13 is informative. Across studies, Saying please and thank you (behavior 28) was simultaneously most habitual and most rewarding. Conversely, at the other end, littering (behavior 77) was simultaneously least habitual and least rewarding. Other interesting cases include worry (behavior 22)—a highly habitual behavior low in reward—and participate in sports activities and clubs (behavior 11)—a relatively non-habitual behavior high in reward. The position of a specific behavior within a scatterplot was strikingly consistent across studies, again demonstrating the replicability of results obtained with the SAM2 HBI.

Assessing the relation between habitualness and reward at the individual level.

We next assessed the correlation between the composite measures of habitualness and reward for each individual across the 80 behaviors. Fig 14 plots the distribution of these correlations in each study, where a point represents the correlation for one individual. Similar to the analogous correlations at the group level, habitualness tended to be positively related to reward across individuals, with mean values of .45, .48, and .41 for Studies 1-all, 2, and 3, respectively (all different from 0, two-tailed: t = 26.27, SE = .02, p < .0001; t = 32.04, SE = .02, p < .0001; t = 16.86, SE = .02, p < .0001).

thumbnail
Fig 14. Values of the correlation between the composite measures of habitualness and reward, rHR, for individual participants in Studies 1-all, 2, and 3.

https://doi.org/10.1371/journal.pone.0286954.g014

Finally, we assessed whether individual correlations of habitualness and reward were related to the personality measures of self-control and neuroticism. As Fig 15 illustrates, a consistent pattern emerged across studies. As an individual’s self-control increased, the positive relation between reward and habitualness tended to become stronger, exhibiting positive correlations of .51, .48., and .41 for Studies 1-all, 2, and 3, respectively (every p < .0001, two-tailed). In contrast, as an individual’s neuroticism increased, the positive relation between reward and habitualness tended to become weaker, exhibiting negative correlations of -.38, -.40., and -.39 for Studies 1-all, 2, and 3, respectively (every p < .0001, two-tailed). These results suggest that self-control was associated with increased reward monitoring, whereas neuroticism was associated with decreased reward monitoring.

thumbnail
Fig 15. Individual correlations in Studies 1-all, 2, and3 between the composite measures of habitualness and reward, rHR, plotted as a function of individual measures for self-control and neuroticism.

Note. The composite measures of habitualness and reward were both transformed to z scores before being plotted along the X axis.

https://doi.org/10.1371/journal.pone.0286954.g015

Discussion

Summary of results

Three studies demonstrated that the SAM2 HBI produces highly replicable findings. The same robust pattern of results that occurred in the larger samples of Studies 2 and 3 (n = 199 and n = 115, respectively) also occurred in samples as small as n = 31 (Studies 1a and 1b in Study 1-all). This pattern also replicated regardless of whether SAM2 judgments were collected in the laboratory (Studies 1a and 1b) or online (Studies 1c, 2, and 3), whether judgments were collected in a fixed order (Studies 1-all and 2) or in a random order (Study 3), whether additional measures were collected (Study 2). We next summarize the robust pattern of results that replicated across studies.

Trait-level measures.

The SAM2 HBI produced a variety of trait-level measures that provided insight into behavior regularity at both the group and individual levels. Most basically, the SAM2 HBI produced a trait-level measure of behavior regularity, showing that, on average, participants performed the 80 behaviors in Table 1 about 50% of the time when possible (with some behaviors performed more regularly and others performed less). More specific trait-level measures for the regularity of positive behaviors and for the regularity of negative behaviors captured personality differences in self-control and neuroticism (Figs 3, 7 and 8).

Finally, a composite measure of habitualness established habitual behaviors at the group and individual levels (Figs 11 and 12), and a composite measure of reward established relations between habitualness and reward (Figs 1315).

Individual differences.

Participants exhibited considerable differences in the behaviors they performed regularly. When we assessed interrater reliability between participants in how regularly they performed the 80 behaviors, we only observed agreement around .30 (Table 2). For a composite measure of habitualness, participants similarly only exhibited agreement around .28. Behaviors that were habitual for some participants were not habitual for others. These results indicate that the SAM2 HBI captured large individual differences in habitual behavior. Consistent with the classic interactionist perspective [3840, 45, 48], an individual’s behavior regularity varied considerably across situations, with the pattern of variability taking different forms for different individuals (i.e., the individual × behavior interaction in Fig 2).

Although such variability could simply reflect noise in judgments of regularity, individual regressions explained a median variance of approximately 75% in individual regularity judgments (Fig 6). Such a high level of explained variance suggests that individual regularity judgments were highly systematic—not noisy. If low interrater agreement had simply reflected noise, such high percentages of explained variance at the individual level would not have been observed.

Test reliability.

When Cronbach’s alpha was used to assess test reliability, acceptable values around .80 and above were observed (Table 3). Interestingly, the 80 behaviors in the overall measure of behavior regularity exhibited relatively low coherence around .04, indicating that individual behaviors ordered participants quite differently. As we suggested, such low coherence most likely resulted from attempting to sample behaviors representatively. Broad coverage ensured that an overall measure of regularity reflected as many potentially relevant behaviors as possible.

Including a large number of 80 behaviors, however, compensated for low coherence, producing satisfactory values of alpha [2326]. When coherence was computed separately for positive and negative behaviors, alpha increased as the behaviors within each group became more coherent (around .11).

Content and construct validity with respect to the Situated Action Cycle.

When behavior regularity was regressed onto factors from the Situated Action Cycle associated with habitual behavior, multilevel mixed-effect regressions generally explained approximately 65% of the total variance at the group level. This explanatory success demonstrates content validity of the SAM2 HBI: Its measures across the Situated Action Cycle are sufficiently thorough and complete to explain the bulk of the variance in habitual behavior.

Two additional findings consistent with the habits literature further demonstrate construct validity for the SAM2 HBI (Fig 4). First, regularity was most highly associated with factors that reflect conditioning—consistency and automaticity. Second, immediate and long-term reward were also associated with regularity, albeit to a lesser extent. Because the SAM2 measure of behavior regularity captured this well-established pattern of results, it exhibited construct validity.

Regressions performed at the individual level were even more successful in establishing construct validity. As just noted, individual regressions explained more variance in regularity than did group-level regressions, with the median explained variance around 75% (Fig 6). Because regularity exhibited large individual differences, individual regressions explained more variance than group regressions, where individual differences attenuated explained variance. Explanatory success at the individual level again demonstrates content validity, with factors from the Situated Action Cycle explaining the bulk of the variance in behavior regularity.

Similar to the group-level regressions, regularity in the individual regressions tended to be best predicted by consistency and automaticity, with immediate and long-term reward making lesser contributions (Table 4). Nevertheless, substantial individual differences occurred, with different individuals exhibiting different patterns of prediction (Fig 5). As these patterns illustrate, the SAM2 HBI established individual models of behavior regularity, specifying factors from the Situated Action Cycle that best predicted regularity for each individual.

Validity with respect to self-control and neuroticism.

Interactions of behavior valence with self-control and neuroticism further established the validity of the SAM2 HBI (Figs 7 and 8). In every study, increasing self-control was associated with increasing regularity of positive behaviors and decreasing regularity of negative behaviors (a self-control × valence interaction). Conversely, increasing neuroticism was associated with decreasing regularity of positive behaviors and increasing regularity of negative behaviors (a neuroticism × valence interaction). The fact that the SAM2 HBI captured these well-established interactions between personality and valence further demonstrates its construct validity.

Additional construct validity follows from SAM2’s ability to explain these interactions with factors from the Situated Action Cycle. As Figs 9 and 10 illustrate, automaticity, consistency, long-term reward, and immediate reward explained the variance in the interactions between valence, self-control, and neuroticism (across all three studies). These results suggest that self-control and neuroticism can be understood as emerging from individual patterns in these basic cognitive-affective processes [3840, 45, 48].

The construct of habitualness.

As Fig 5 illustrates, the automaticity of executing a behavior was not strongly associated with behavior regularity for all participants, nor was consistency. These results question whether automaticity and consistency are universal features of habitual behavior. As Fig 5 further illustrates, the predictive models for individual participants varied widely. Although clusters of individuals exhibited similar profiles of the factors that predict regularity, considerable differences existed between clusters. These large individual differences in predictive models further suggest caution in trying to develop a narrow definition of habitual behavior.

Still another problem is the distributions of the composite habitualness measure in Figs 11 and 12. On the one hand, it’s not clear where one can effectively set a threshold that distinguishes habits from non-habits. On the other, many behaviors fall in the middle of the distribution, where behaviors exhibit regularity, consistency, and automaticity a little over 50% of the time. Many of these behaviors could well be habitual, at least to some extent.

Together, these results suggest that a definition of habits based on necessary features such as automaticity and consistency may not be feasible or theoretically justifiable. Instead, the construct of habitualness may be better understood as a family resemblance structure, having prototypical features that are not definitional [136, 137]. From this perspective, habitual behaviors are statistically likely to be automatic, consistent, and rewarding, with none of these features being necessary. Because habitualness takes so many different forms, attempting to understand how its complex multidimensional structure manifests across individuals and behaviors may be more productive than attempting to establish a rigid definition. Similar conclusions have been reached about the construct of automaticity [138140].

Finally, it may be useful to view this heterogeneous construct of habitualness as a continuous dimension. Rather than attempting to distinguish categories of habits from non-habits, it may be more useful to ask how habitual a behavior is, where habitualness tends to be continuously associated with increasing regularity, consistency, and automaticity.

The relation of reward to habitualness.

A common view is that a behavior only becomes a habit if it is elicited ballistically, no longer modulated by reward. In their classic review of the non-human conditioning literature, Yin and Knowlton [118] document how organisms generally monitor reward and adjust behavior accordingly, with increases in reward increasing response rates to obtain reward (and vice versa). Reward only falls away at the very end of the conditioning process, following extreme levels of practice. Researchers often argue that humans never reach these levels, with reward always continuing to modulate behavior [78, 80, 88, 92]. Recent reviews document extensive evidence for the ever-present role of reward in habitual behavior, along with compelling theoretical arguments for why we should expect this [79, 83, 86].

Several of our results bear on this issue. Most basically, immediate and long-term reward were consistently related to behavior regularity at both the group and individual levels (Table 4, Figs 4 and 5). Additionally, composite measures of habitualness and reward were positively related at both the group and individual levels (Figs 13 and 14). As a behavior became more habitual, it became more rewarding, challenging the widespread assumption that reward becomes increasingly non-influential as behaviors become increasingly habitual. Instead, high levels of reward may induce high levels of regularity to maintain or increase the reward.

Finally, the relation between the composite measures of reward and habitualness behaved quite differently as self-control vs. neuroticism varied (Fig 15). As self-control increased, the positive relation between reward and habitualness increased as well. One speculative interpretation is that individuals high in self-control closely monitor sources of reward in their experience. When they find a reward of interest, they increasingly regularize the behavior needed to obtain it [68, 95, 106]. Conversely, the positive relation between reward and habitualness decreased as neuroticism increased (Fig 15). One speculative possibility is that the strong emotional reactions associated with neuroticism deplete the regulatory resources available for monitoring reward and adapting behavior as reward changes [141]. As a consequence, behavior becomes increasingly ballistic as poorly monitored behaviors become rigidly automated in response to situational cues.

Limitations

Correlational results.

As informative as these results are in assessing habitual behaviors at the individual and group levels, their correlational nature does not support causal interpretations. Indeed, one must be careful to avoid the temptation of drawing causal conclusions from the rich descriptive data available.

Responsible speculation about causality, however, can still be useful. On the one hand, correlational evidence can contribute to the accumulation of evidence related to causal claims, being consistent or inconsistent with them. On the other, correlational evidence is typically easier to gather than causal evidence and can thus serve as an effective form of discovery, to be followed later by careful experimental assessment when useful and possible.

Population sampled.

Another limitation of the current findings is that they were obtained from UK students aged 18–30. It is important to explore whether the SAM2 HBI exhibits similar results in other populations. Although we are optimistic that these results will generalize robustly, verification is necessary. To the extent that different populations exhibit different habitual behaviors, the behaviors in Table 1 must be adapted. To the extent that relevant factors from the Situated Action Cycle vary, they may need to be adapted as well (Fig 1). We assume, however, that other implementations are likely to reproduce many features of the results reported here. Specifically, we anticipate that behavior regularity will exhibit similar relations to conditioning, reward, and personality, along with a comparable range of individual differences.

Developing SAM2 assessment instruments

The SAM2 framework is sufficiently open-ended to support a wide variety of instruments for diverse purposes. We next explore a few of the possibilities.

Full-length SAM2 instruments.

In contexts where thorough assessment of a construct is desired, a relatively comprehensive and representative set of situations may be optimal, along with a thorough sampling of relevant factors from the Situated Action Cycle. The SAM2 HBI explored here constitutes an example.

If one wanted to develop a version of the SAM2 HBI for another population, one could sample behaviors and factors from the Situated Action Cycle differently. To assess habitual behavior in children across levels of socioeconomic status, for example, one could sample behaviors related to education, play, and family, along with factors from the Situated Action Cycle related to resources, supervision, and choice. Alternatively, to assess habitual behavior in socially anxious adults, one could focus on social and achievement behaviors, along with factors from the Situated Action Cycle related to threat, social support, and acceptance.

More generally, SAM2 instruments can be developed for many other constructs besides habitual behaviors. In our work, we have explored instruments for stress, trichotillomania, social connectedness, eating, drinking, emotion regulation, mindfulness, wellbeing, and sustainable behavior. In each case, we first establish relevant situations where the target construct occurs and develop a set of systematic cues for activating them (often informed by empirical norming). We then establish a measure of the construct. Finally, we examine the relevant scientific literature to identify factors from the Situated Action Cycle known to influence the construct in the situations of interest. We then build a SAM2 instrument around this content. Typically, we find that such instruments take 30 to 60 minutes for participants to complete, depending on the domain.

Brief SAM2 instruments.

We appreciate the need for briefer instruments in some contexts. One brief instrument that may often be sufficient is to only obtain judgements of the target construct across situations and to not collect judgments for influential factors from the Situated Action Cycle. To the extent that a larger initial study has validated judgments of the construct with respect to these factors, judgments of the target construct alone may be sufficient for many purposes.

Consider the SAM2 HBI developed here. A brief version would only include regularity judgments for the 80 situations (taking 5–10 minutes). We now know that regularity judgments exhibit construct validity because we validated them here in relation to factors from the Situated Action Cycle and to personality measures. We also know that constructing separate overall trait-level measures for the regularity of positive vs. negative behaviors can be informative, making them potentially applicable in a variety of clinical and applied settings.

A second way to create a brief SAM2 instrument is to compress either or both dimensions of situatedness: (a) situations, (b) factors from the Situated Action Cycle. To compress situations, cluster analyses could establish clusters of related behaviors, with the most central behavior sampled for each cluster. Another possibility would be to sample one positive behavior and one negative behavior from each of the domains in Table 1, reducing the number of behaviors assessed from 80 to 20. To compress factors from the Situated Action Cycle, a set of existing factors, such as those in Fig 1, could be submitted to exploratory factor analysis. Again, groups of closely related measures from the Situated Action Cycle could be identified, with only one from each group included in a brief instrument.

In this manner, brief compressed versions of full-length instruments can be developed. It is an empirical question whether a brief version is as informative as a complete version. In clinical settings, another interesting question is whether using a brief version leads to comparable insight and behavior change in clients as using a complete version.

Targeted SAM2 instruments.

A different approach to creating brief instruments is to increasingly focus the target construct and then only sample situations and factors from the Situated Action Cycle relevant to this narrower focus. In the domain of habitual behavior, for example, one could focus on health behaviors, social behaviors, or sustainable behaviors. For each focus, a smaller set of behaviors becomes relevant, and perhaps also fewer factors from the Situated Action Cycle.

Another approach is to create a brief SAM2 instrument adapted to a specific individual. By only including situations and factors relevant for the individual, a brief instrument for individual use can be created. In the domain of habitual behaviors, for example, a unique instrument for each individual could be developed that tracks their most positive and negative behaviors over time. Such instruments might also be useful in experience sampling applications.

Clinical use. SAM2 instruments could be designed to support a variety of clinical functions. Most basically, such instruments could map out a client’s habitual tendencies and identify factors from the Situated Action Cycle associated with them. For example, such instruments could establish the regularity of performing positive versus negative behaviors (or other target behaviors of interest). A predictive model could then be developed of factors from the Situated Action Cycle associated with these behaviors, with this model then being used to guide behavior change interventions. Finally, longitudinal assessment could track the effectiveness of these interventions over time.

Conclusion

From the theoretical perspectives of grounded, situated, and embodied cognition, we have developed a new approach for assessing individual differences—the Situated Assessment Method (SAM2). Rather than asking individuals to abstract across situations when evaluating a target construct, SAM2 situates the construct in two ways: First, SAM2 assesses the construct in situations where it occurs (using features of these situations to activate them). Second, SAM2 assesses factors from the Situated Action Cycle that potentially influence the construct in these situations. The result is a rich descriptive profile for each individual that embeds the construct in their unique situational experience of it.

In the implementation for habitual behaviors developed here, we saw that this approach produces useful measures of behavior regularity that are reliable, valid, and potentially useful for a variety of functions. For each individual, a detailed profile of habitual behavior emerges. The general SAM2 framework can be implemented in diverse ways to achieve a wide variety of assessment goals.

Supporting information

S1 File. Supplemental materials for the methods, analyses, and results of all studies.

https://doi.org/10.1371/journal.pone.0286954.s001

(PDF)

Acknowledgments

We are grateful to the Healthy Cognition Research Group in the School of Psychology and Neuroscience at the University of Glasgow for ongoing contributions, support, and advice on this project. We are also grateful to Daniel Cervone for directing us to relevant literature on personality and for insightful discussion about it.

References

  1. 1. Aydede M, Robbins P. The Cambridge handbook of situated cognition. Cambridge: Cambridge University Press; 2009.
  2. 2. Barsalou LW. Perceptual symbol systems. Behav Brain Sci. 1999;22:577–660. pmid:11301525
  3. 3. Barsalou LW. Grounded cognition. Annu Rev Psychol. 2008;59:617–45. pmid:17705682
  4. 4. Barsalou LW. Simulation, situated conceptualization, and prediction. Philos Trans R Soc B Biol Sci. 2009 May 12;364(1521):1281–9. pmid:19528009
  5. 5. Barsalou LW. Situated conceptualization: Theory and applications. In: Foundations of embodied cognition: Volume 1 Perceptual and emotional embodiment. East Sussex: Psychology Press; 2016. p. 11–37.
  6. 6. Barsalou LW. Situated conceptualization offers a theoretical account of social priming. Curr Opin Psychol. 2016 Dec;12:6–11.
  7. 7. Coello Y, Fischer MH, editors. Foundations of embodied cognition: Volume 1. Perceptual and emotional embodiment. Oxford, UK: Routledge; 2016.
  8. 8. Coello Y, Fischer MH, editors. Foundations of embodied cognition: Volume 2. Conceptual and interactive embodiment. Oxford, UK: Routledge; 2016.
  9. 9. Newen A, Bruin LD, Gallagher S, editors. The Oxford Handbook of 4E Cognition. Oxford, New York: Oxford University Press; 2018. 960 p. (Oxford Library of Psychology).
  10. 10. Pecher D, Zwaan RA, editors. Grounding cognition: The role of perception and action in memory, language, and thinking. New York: Cambridge University Press; 2005. 336 p.
  11. 11. McCrae RR, Costa PT Jr.. The Five-Factor Theory of personality. In: Handbook of personality: Theory and research. New York: Guilford; 1999. p. 139–53.
  12. 12. Costa PT Jr., McCrae RR. Domains and Facets: Hierarchical Personality Assessment Using the Revised NEO Personality Inventory. J Pers Assess. 1995 Feb;64(1):21–50. pmid:16367732
  13. 13. Goldberg LR. An alternative ‘description of personality’: the big-five factor structure. J Pers Soc Psychol. 1990. pmid:2283588
  14. 14. John OP, Srivastava S. The Big Five trait taxonomy: History, measurement, and theoretical perspectives. In: Handbook of personality: Theory and research, 2 [Internet]. New York: Guilford; 1999 [cited 2017 May 3]. p. 102–38. http://www.rc.usf.edu/~jdorio/Personality/BIG%20FIVE%20TRAIT%20TAXONOMY.pdf
  15. 15. Tangney JP, Baumeister RF, Boone AL. High Self-Control Predicts Good Adjustment, Less Pathology, Better Grades, and Interpersonal Success. J Pers. 2004 Apr;72(2):271–324. pmid:15016066
  16. 16. Cohen S, Kamarck T, Mermelstein R. A global measure of perceived stress. J Health Soc Behav. 1983 Dec;24:385. pmid:6668417
  17. 17. Watson D, Anna L, Tellegen A. Development and Validation of Brief Measures of Positive and Negative Affect: The PANAS Scales. J Pers Soc Psychol. 1988;54:1063–70. pmid:3397865
  18. 18. Diener E, Emmons RA, Larsen RJ, Griffin S. The Satisfaction With Life Scale. J Pers Assess. 1985 Feb 1;49(1):71–5. pmid:16367493
  19. 19. de Lauzon Blandine, Romon M, Deschamps V, Lafay L, Borys JM, Karlsson J, et al. The Three-Factor Eating Questionnaire-R18 Is Able to Distinguish among Different Eating Patterns in a General Population. J Nutr. 2004 Sep 1;134(9):2372–80. pmid:15333731
  20. 20. Baer RA. Using Self-Report Assessment Methods to Explore Facets of Mindfulness. Assessment. 2006 Mar 1;13(1):27–45. pmid:16443717
  21. 21. Branson V, Dry MJ, Palmer E, Turnbull D. The Adolescent Distress-Eustress Scale: Development and Validation. SAGE Open. 2019 Jul 1;9(3):2158244019865802.
  22. 22. Peel R, Caltabiano N. The relationship sabotage scale: an evaluation of factor analyses and constructive validity. BMC Psychol. 2021 Sep 19;9(1):146. pmid:34538259
  23. 23. Cronbach LJ. Coefficient alpha and the internal structure of tests. Psychometrika. 1951 Sep 1;16(3):297–334.
  24. 24. de Vet HCW, Mokkink LB, Mosmuller DG, Terwee CB. Spearman–Brown prophecy formula and Cronbach’s alpha: different faces of reliability and opportunities for new applications. J Clin Epidemiol. 2017 May;85:45–9. pmid:28342902
  25. 25. Shrout PE, Fleiss JL. Intraclass correlations: Uses in assessing rater reliability. Psychol Bull. 1979 Mar;86(2):420–8. pmid:18839484
  26. 26. Bloch R, Norman G. Generalizability theory for the perplexed: A practical introduction and guide: AMEE Guide No. 68. Med Teach. 2012 Nov;34(11):960–92. pmid:23140303
  27. 27. Allen MJ, Yen WM. Introduction to measurement theory. Monterey, CA: Brooks/Cole; 2001. 330 p.
  28. 28. Cronbach LJ, Meehl PE. Construct validity in psychological tests. Psychol Bull. 1955;52:281–302. pmid:13245896
  29. 29. Campbell DT, Fiske DW. Convergent and discriminant validation by the multitrait-multimethod matrix. Psychol Bull. 1959;56:81–105. pmid:13634291
  30. 30. Ajzen I. Intuitive theories of events and the effects of base-rate information on prediction. J Pers Soc Psychol. 1977;35(5):303–14.
  31. 31. Gelman SA, Legare CH. Concepts and Folk Theories. Annu Rev Anthropol. 2011 Oct 21;40(1):379–98. pmid:23436950
  32. 32. Nisbett RE, Wilson TD. Telling more than we can know: Verbal reports on mental processes. Psychol Rev. 1977;84(3):231.
  33. 33. Ross L. The Intuitive Psychologist And His Shortcomings: Distortions in the Attribution Process. In: Berkowitz L, editor. Advances in Experimental Social Psychology. Academic Press; 1977. p. 173–220.
  34. 34. Wilson TD. Strangers to ourselves. Cambridge, MA: Harvard University Press; 2004.
  35. 35. Tversky A, Kahneman D. Availability: A heuristic for judging frequency and probability. Cognit Psychol. 1973 Sep 1;5(2):207–32.
  36. 36. Wänke M, Schwarz N, Bless H. The availability heuristic revisited: Experienced ease of retrieval in mundane frequency estimates. Acta Psychol (Amst). 1995 Jun 1;89(1):83–90.
  37. 37. Fleeson W, Gallagher MP. The Implications of Big-Five Standing for the Distribution of Trait Manifestation in Behavior: Fifteen Experience-Sampling Studies and a Meta-Analysis. J Pers Soc Psychol. 2009 Dec;97(6):1097–114. pmid:19968421
  38. 38. Fleeson W, Jayawickreme E. Whole traits: Revealing the social-cognitive mechanisms constituting personality’s central variable. In: Advances in Experimental Social Psychology [Internet]. Elsevier; 2021 [cited 2021 Nov 25]. p. 69–128. https://linkinghub.elsevier.com/retrieve/pii/S0065260120300277
  39. 39. Bandura A. The self system in reciprocal determinism. Am Psychol. 1978;33(4):344–58.
  40. 40. Cervone D. Personality Architecture: Within-Person Structures and Processes. Annu Rev Psychol. 2005 Feb;56:423–52. pmid:15709942
  41. 41. Cervone D, Shadel WG, Jencius S. Social-Cognitive Theory of personality assessment. Personal Soc Psychol Rev. 2001 Feb;5(1):33–51.
  42. 42. Collins MD, Jackson CJ, Walker BR, O’Connor PJ, Gardiner E. Integrating the context-appropriate balanced attention model and reinforcement sensitivity theory: Towards a domain-general personality process model. Psychol Bull. 2017 Jan;143(1):91–106. pmid:27893219
  43. 43. Mischel W. Personality and assessment. New York: Wiley; 1968.
  44. 44. Mischel W. Toward a cognitive social learning reconceptualization of personality. Psychol Rev. 1973;80:252–83. pmid:4721473
  45. 45. Mischel W, Shoda Y. A Cognitive-Affective System Theory of Personality: Reconceptualizing Situations, Dispositions, Dynamics, and Invariance in Personality Structure. Psychol Rev. 1995;102:246–68. pmid:7740090
  46. 46. Fleeson W. Toward a Structure- and Process-Integrated View of Personality: Traits as Density Distributions of States. J Pers Soc Psychol. 2001;80:1011–27.
  47. 47. Fleeson W. Moving Personality Beyond the Person-Situation Debate: The Challenge and the Opportunity of Within-Person Variability. Curr Dir Psychol Sci. 2004 Apr;13(2):83–7.
  48. 48. Endler NS, Magnusson D. Toward an interactional psychology of personality. Psychol Bull. 1976;83(5):956–74. pmid:794903
  49. 49. Reynolds K, Turner J, Branscombe N, Mavor K, Bizumic B, Subasic E. Interactionism in Personality and Social Psychology: An Integrated Approach to Understanding the Mind and Behaviour. Eur J Personal. 2010 Aug 1;24:458–82.
  50. 50. Ajzen I, Fishbein M. Attitude-behavior relations: A theoretical analysis and review of empirical research. Psychol Bull. 1977;84:888–918.
  51. 51. Ajzen I, Fishbein M. The influence of attitudes on behavior. In: The handbook of attitudes. Mahwah, New Jersey: Erlbaum; 2005. p. 173–221.
  52. 52. Glasman LR, Albarracín D. Forming attitudes that predict future behavior: A meta-analysis of the attitude-behavior relation. Psychol Bull. 2006;132(5):778–822. pmid:16910754
  53. 53. Siegel JT, Navarro MA, Tan CN, Hyde MK. Attitude–behavior consistency, the principle of compatibility, and organ donation: A classic innovation. Health Psychol. 2014;33(9):1084–91. pmid:24707844
  54. 54. Costa PT Jr., McCrae RR. The Revised NEO Personality Inventory (NEO-PI-R). In: The SAGE handbook of personality theory and assessment, Vol 2: Personality measurement and testing. Thousand Oaks, CA, US: Sage Publications, Inc; 2008. p. 179–98.
  55. 55. Funder DC. Taking Situations Seriously: The Situation Construal Model and the Riverside Situational Q-Sort. Curr Dir Psychol Sci. 2016 Jun;25(3):203–8.
  56. 56. Cree GS, McRae K. Analyzing the factors underlying the structure and computation of the meaning of chipmunk, cherry, chisel, cheese, and cello (and many other such concrete nouns). J Exp Psychol Gen. 2003;132:163–201. pmid:12825636
  57. 57. Wu LL, Barsalou LW. Perceptual simulation in conceptual combination: Evidence from property generation. Acta Psychol (Amst). 2009;132:173–89. pmid:19298949
  58. 58. Conway MA. Sensory–perceptual episodic memory and its context: autobiographical memory. Philos Trans R Soc Lond B Biol Sci. 2001 Sep 29;356(1413):1375–84. pmid:11571029
  59. 59. Conway MA. Episodic memories. Neuropsychologia. 2009 Sep 1;47(11):2305–13. pmid:19524094
  60. 60. Rubin DC. The basic-systems model of episodic memory. Perspect Psychol Sci. 2006;1(4):277–311. pmid:26151796
  61. 61. Rubin DC, Umanath S. Event memory: A theory of memory for laboratory, autobiographical, and fictional events. Psychol Rev. 2015;122(1):1–23. pmid:25330330
  62. 62. Bechtel W, Abrahamsen A. Connectionism and the mind. Parallel processing, dynamics, and evolution in networks. Oxford: Blackwell Publishers; 2002.
  63. 63. Robinson JA. Sampling autobiographical memory. Cognit Psychol. 1976 Oct 1;8(4):578–95.
  64. 64. Rubin DC. On the retention function for autobiographical memory. J Verbal Learn Verbal Behav. 1982;21:21–38.
  65. 65. Cabeza R, St Jacques P. Functional neuroimaging of autobiographical memory. Trends Cogn Sci. 2007 May 1;11(5):219–27. pmid:17382578
  66. 66. Conway MA, Loveday C. Remembering, imagining, false memories & personal meanings. Conscious Cogn. 2015 May 1;33:574–81.
  67. 67. Cohen S, Gianaros PJ, Manuck SB. A Stage Model of Stress and Disease. Perspect Psychol Sci. 2016 Jul 1;11(4):456–63. pmid:27474134
  68. 68. Monroe SM. Modern approaches to conceptualizing and measuring human life stress. Annu Rev Clin Psychol. 2008 Apr;4:33–52. pmid:17716038
  69. 69. Charles ST, Piazza JR, Mogle J, Sliwinski MJ, Almeida DM. The Wear and Tear of Daily Stressors on Mental Health. Psychol Sci. 2013 Mar 26;24(5):733–41. pmid:23531486
  70. 70. Pangallo A, Zibarras L, Lewis R, Flaxman P. Resilience through the lens of interactionism: A systematic review. Psychol Assess. 2015 Mar;27(1):1–20. pmid:25222438
  71. 71. Barsalou LW. Establishing generalizable mechanisms. Psychol Inq. 2019 Oct 2;30(4):220–30.
  72. 72. Brunswik E. Representative design and probabilistic theory in a functional psychology. Psychol Rev. 1955 May;62(3):193–217.
  73. 73. Dhami MK, Hertwig R, Hoffrage U. The Role of Representative Design in an Ecological Approach to Cognition. Psychol Bull. 2004;130(6):959–88. pmid:15535744
  74. 74. Hammond KR, Stewart TR. The Essential Brunswik: Beginnings, Explications, Applications. Oxford: Oxford University Press; 2001. 562 p.
  75. 75. Miller LC, Shaikh SJ, Jeong DC, Wang L, Gillig TK, Godoy CG, et al. Causal Inference in Generalizable Environments: Systematic Representative Design. Psychol Inq. 2019 Oct 2;30(4):173–202. pmid:33093760
  76. 76. Yang Y, Read SJ, Denson TF, Xu Y, Zhang J, Pedersen WC. The Key Ingredients of Personality Traits: Situations, Behaviors, and Explanations. Pers Soc Psychol Bull. 2014 Jan;40(1):79–91. pmid:24072753
  77. 77. Zayas V, Whitsett DD, Lee JJY, Wilson N, Shoda Y. From situation assessment to personality: Building a social-cognitive model of a person. In: Handbook of personality theory and testing. Thousand Oaks, CA, US: Sage; 2008. p. 377–401.
  78. 78. Danner UN, Aarts H, Vries NK. Habit vs. intention in the prediction of future behaviour: The role of frequency, context stability and mental accessibility of past behaviour. Br J Soc Psychol. 2008 Jun;47(2):245–65. pmid:17678574
  79. 79. De Houwer J. On How Definitions of Habits Can Complicate Habit Research. Front Psychol. 2019;10:2642. pmid:31849762
  80. 80. de Wit S, Kindt M, Knot SL, Verhoeven AAC, Robbins TW, Gasull-Camos J, et al. Shifting the balance between goals and habits: Five failures in experimental habit induction. J Exp Psychol Gen. 2018 Jul;147(7):1043–65. pmid:29975092
  81. 81. Gardner B. Habit as automaticity, not frequency. Eur Health Psychol. 2012;14:32–6.
  82. 82. Gardner B. A review and analysis of the use of ‘habit’in understanding, predicting and influencing health-related behaviour. Health Psychol Rev. 2015;9(3):277–95. pmid:25207647
  83. 83. Hommel B, Wiers RW. Towards a Unitary Approach to Human Action Control. Trends Cogn Sci. 2017 Dec 1;21(12):940–9. pmid:29150000
  84. 84. Ji MF, Wood W. Purchase and Consumption Habits: Not Necessarily What You Intend. J Consum Psychol. 2007 Oct 1;17(4):261–76.
  85. 85. Judah G, Gardner B, Kenward MG, DeStavola B, Aunger R. Exploratory study of the impact of perceived reward on habit formation. BMC Psychol. 2018 Dec;6(1):62. pmid:30572936
  86. 86. Kruglanski AW, Szumowska E. Habitual Behavior Is Goal-Driven. Perspect Psychol Sci. 2020 Sep;15(5):1256–71. pmid:32569529
  87. 87. Mazar A, Wood W. Defining Habit in Psychology. In: Verplanken B, editor. The Psychology of Habit: Theory, Mechanisms, Change, and Contexts [Internet]. New York: Springer; 2018 [cited 2019 Jul 9]. p. 13–29. https://doi.org/10.1007/978-3-319-97529-0_2
  88. 88. McCarthy MB, Collins AM, Flaherty SJ, McCarthy SN. Healthy eating habit: A role for goals, identity, and self-control? Psychol Mark. 2017 Aug;34(8):772–85.
  89. 89. Ouellette JA, Wood W. Habit and intention in everyday life: The multiple processes by which past behavior predicts future behavior. Psychol Bull. 1998;124(1):54–74.
  90. 90. Verplanken B, Orbell S. Reflections on Past Behavior: A Self-Report Index of Habit Strength1. J Appl Soc Psychol. 2003;33(6):1313–30.
  91. 91. Wood W, Neal DT. The habitual consumer. J Consum Psychol. 2009 Oct;19(4):579–92.
  92. 92. Marien H, Custers R, Aarts H. Studying Human Habits in Societal Context: Examining Support for a Basic Stimulus–Response Mechanism. Curr Dir Psychol Sci. 2019;614–8.
  93. 93. Epstein S. The stability of behavior: I. On predicting most of the people much of the time. J Pers Soc Psychol. 1979;37(7):1097–126.
  94. 94. Epstein S. The stability of behavior: II. Implications for psychological research. Am Psychol. 1980 Sep;35(9):790–806.
  95. 95. Fleeson W. Situation‐Based Contingencies Underlying Trait‐Content Manifestation in Behavior. J Pers. 2007 Aug;75(4):825–62. pmid:17576360
  96. 96. Barsalou LW. Challenges and opportunities for grounding cognition. J Cogn. 2020;3(1):31. pmid:33043241
  97. 97. Bouton ME, Todd TP. A fundamental role for context in instrumental learning and extinction. Behav Processes. 2014 May;104:13–9. pmid:24576702
  98. 98. Domjan M. The principles of learning and behavior. Independence, KY: Cengage Learning; 2014. 450 p.
  99. 99. Miller GA, Galanter E, Pribram KH. Plans and the structure of behavior. New York: Holt Reinhart & Winston; 1960. 226 p.
  100. 100. Newell A, Simon HA. Human problem solving. Englewood Cliffs, NJ: Prentice-Hall; 1972.
  101. 101. Anderson JR. The architecture of cognition. Cambridge, MA: Cambridge University Press; 1983.
  102. 102. Newell A. Production systems: Models of control structures. In: Visual Information Processing [Internet]. Academic Press; 1973 [cited 2020 May 26]. p. 463–526. http://www.sciencedirect.com/science/article/pii/B9780121701505500160
  103. 103. Daw ND, Frank MJ. Reinforcement learning and higher level cognition: Introduction to special issue. Cognition. 2009 Dec;113(3):259–61. pmid:19819429
  104. 104. Sutton RS, Barto AG. Introduction to reinforcement learning [Internet]. MIT Press; 1998 [cited 2014 Nov 21]. http://dl.acm.org/citation.cfm?id=551283
  105. 105. Botvinick MM, Niv Y, Barto AC. Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective. Cognition. 2009 Dec;113(3):262–80. pmid:18926527
  106. 106. Ito M, Doya K. Multiple representations and algorithms for reinforcement learning in the cortico-basal ganglia circuit. Curr Opin Neurobiol. 2011 Jun 1;21(3):368–73. pmid:21531544
  107. 107. O’Doherty JP. Beyond simple reinforcement learning: the computational neurobiology of reward-learning and valuation. Eur J Neurosci. 2012;35(7):987–90. pmid:22487029
  108. 108. Baerger DR, McAdams DP. Life story coherence and its relation to psychological well-being. Narrat Inq. 1999;9(1):69–96.
  109. 109. Edson Escalas J. Narrative processing: Building consumer connections to brands. J Consum Psychol. 2004;14(1):168–80.
  110. 110. Reese E, Haden CA, Baker-Ward L, Bauer P, Fivush R, Ornstein PA. Coherence of personal narratives across the lifespan: A multidimensional model and coding method. J Cogn Dev. 2011 Oct 1;12(4):424–62. pmid:22754399
  111. 111. Stein NL, Hernandez MW. Assessing understanding and appraisal during emotional experience. In: Handbook of emotion elicitation and assessment. Oxford: Oxford University Press; 2007. p. 298–317.
  112. 112. Barsalou LW. Situated simulation in the human conceptual system. Lang Cogn Process. 2003;18:513–62.
  113. 113. Barsalou LW, Dutriaux L, Scheepers C. Moving beyond the distinction between concrete and abstract concepts. Philos Trans R Soc B Biol Sci. 2018 Aug 5;373(1752):20170144. pmid:29915012
  114. 114. McCloskey K, Johnson BT. Habits, Quick and Easy: Perceived Complexity Moderates the Associations of Contextual Stability and Rewards With Behavioral Automaticity. Front Psychol. 2019 Jul 24;10:1556. pmid:31396122
  115. 115. Orbell S, Verplanken B. Progress and prospects in habit research. In: The psychology of habit: Theory, mechanisms, change, and context. Switzerland: Springer Nature; 2018. p. 397–410.
  116. 116. Schneider W, Shiffrin RM. Controlled and automatic human information processing: I. Detection, search, and attention. Psychol Rev. 1977;84(1):1–66.
  117. 117. Shiffrin RM, Schneider W. Controlled and automatic human information processing: II. Perceptual learning, automatic attending and a general theory. Psychol Rev. 1977;84(2):127–90.
  118. 118. Yin HH, Knowlton BJ. The role of the basal ganglia in habit formation. Nat Rev Neurosci. 2006 Jun;7(6):464–76. pmid:16715055
  119. 119. Hofmann W, Rauch W, Gawronski B. And deplete us not into temptation: Automatic attitudes, dietary restraint, and self-regulatory resources as determinants of eating behavior. J Exp Soc Psychol. 2007 May;43(3):497–504.
  120. 120. Hofmann W, Schmeichel BJ, Baddeley AD. Executive functions and self-regulation. Trends Cogn Sci. 2012 Mar;16:174–80. pmid:22336729
  121. 121. Neal DT, Wood W, Wu M, Kurlander D. The Pull of the Past When Do Habits Persist Despite Conflict With Motives? Pers Soc Psychol Bull. 2011;37(11):1428–37. pmid:21859902
  122. 122. Adriaanse MA, Kroese FM, Gillebaart M, De Ridder DTD. Effortless inhibition: habit mediates the relation between self-control and unhealthy snack consumption. Front Psychol. 2014 May 16;5:444. pmid:24904463
  123. 123. de Ridder DTD, Lensvelt-Mulders G, Finkenauer C, Stok FM, Baumeister RF. Taking Stock of Self-Control: A Meta-Analysis of How Trait Self-Control Relates to a Wide Range of Behaviors. Personal Soc Psychol Rev. 2012 Feb;16(1):76–99. pmid:21878607
  124. 124. Galla BM, Duckworth AL. More than resisting temptation: Beneficial habits mediate the relationship between self-control and positive life outcomes. J Pers Soc Psychol. 2015 Sep;109(3):508–25. pmid:25643222
  125. 125. Wood W. Habit in Personality and Social Psychology. Personal Soc Psychol Rev. 2017 Nov;21(4):389–403. pmid:28737111
  126. 126. Duggan KA, Friedman HS, McDevitt EA, Mednick SC. Personality and Healthy Sleep: The Importance of Conscientiousness and Neuroticism. Arias-Carrion O, editor. PLoS ONE. 2014 Mar 20;9(3):e90628. pmid:24651274
  127. 127. Elliston KG, Ferguson SG, Schüz B. Personal and situational predictors of everyday snacking: An application of temporal self-regulation theory. Br J Health Psychol. 2017 Nov;22(4):854–71. pmid:28722217
  128. 128. Keller C, Siegrist M. Does personality influence eating styles and food choices? Direct and indirect effects. Appetite. 2015 Jan 1;84:128–38. pmid:25308432
  129. 129. Terracciano A, Löckenhoff CE, Crum RM, Bienvenu OJ, Costa PT. Five-Factor Model personality profiles of drug users. BMC Psychiatry. 2008 Dec;8(1):22. pmid:18405382
  130. 130. Gelman A, Hill J. Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge University Press; 2006. 651 p.
  131. 131. Barr DJ, Levy R, Scheepers C, Tily HJ. Random effects structure for confirmatory hypothesis testing: Keep it maximal. J Mem Lang. 2013;68(3):255–78. pmid:24403724
  132. 132. Cumming G. The New Statistics: Why and How. Psychol Sci. 2014 Jan 1;25(1):7–29. pmid:24220629
  133. 133. Price R, Bethune R, Massey L. Problem with p values: why p values do not tell you if your treatment is likely to work. Postgrad Med J. 2020 Jan 1;96(1131):1–3. pmid:31662411
  134. 134. Ellis PD. The Essential Guide to Effect Sizes: Statistical Power, Meta-Analysis, and the Interpretation of Research Results. Cambridge University Press; 2010. 193 p.
  135. 135. Lally P, van Jaarsveld CHM, Potts HWW, Wardle J. How are habits formed: Modelling habit formation in the real world. Eur J Soc Psychol. 2010 Oct;40(6):998–1009.
  136. 136. Rosch E, Mervis CB. Family resemblances: Studies in the internal structure of categories. Cognit Psychol. 1975;7:573–605.
  137. 137. Wittgenstein L. Philosophical investigations. Oxford: Basil Blackwell; 1953.
  138. 138. Lebois LAM, Wilson-Mendenhall CD, Barsalou LW. Are automatic conceptual cores the gold standard of semantic processing? The context-dependence of spatial meaning in grounded congruency effects. Cogn Sci. 2015;39:1764–801. pmid:25243925
  139. 139. Moors A, De Houwer J. Automaticity: A Theoretical and Conceptual Analysis. Psychol Bull. 2006;132(2):297–326. pmid:16536645
  140. 140. Sharma L, Markon KE, Clark LA. Toward a theory of distinct types of ‘impulsive’ behaviors: A meta-analysis of self-report and behavioral measures. Psychol Bull. 2014 Mar;140(2):374–408. pmid:24099400
  141. 141. Crow AJD. Associations Between Neuroticism and Executive Function Outcomes: Response Inhibition and Sustained Attention on a Continuous Performance Test. Percept Mot Skills. 2019 Aug 1;126(4):623–38. pmid:31146642