Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Assessing toxic leadership in high-performance sports: Psychometric validation of a Swedish Version of the Toxic Leadership Scale for Sports (TLS-S)

  • Carolina Lundqvist ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Supervision, Writing – original draft, Writing – review & editing

    carolina.lundqvist@liu.se

    Affiliations Department of Behavioural Sciences and Learning, Linköping University, Linköping, Sweden, Athletics Research Center, Linköping University, Linköping, Sweden

  • Saga Johansson

    Roles Conceptualization, Data curation

    Affiliation Department of Behavioural Sciences and Learning, Linköping University, Linköping, Sweden

Abstract

This study evaluated the internal consistency and factorial validity of a sport-adjusted 15-item version of the Toxic Leadership Scale (TLS-S). The sample consisted of 189 Swedish elite team sport athletes (56 females, 132 males, 1 preferred not to disclose; mean age = 17.9 years, SD = 3.3) from 21 different teams (ice hockey n = 68; soccer n = 60; American football n = 26; basketball n = 15; handball n = 14; volleyball n = 3; other team sports not specified or multiple sports n = 3). Participants competed at different performance levels: sub-elite (n = 15), national junior elite (n = 105), international junior elite (n = 26), national senior elite (n = 30), international senior elite (n = 12), and other not specified (n = 1). Results showed adequate internal consistency (ωt > .70) for the total TLS-S score and all subscales except the authoritarian leadership subscale (ωt = .61). Confirmatory factor analysis indicated acceptable model fit for both a five-factor and a one-factor model. While the five-factor model (χ²(80)=135.87, p < .001; CFI = .98; RMSEA = .062 [90% CI: .044–.080]; SRMR = .044) was superior to the one-factor solution (difference test: χ²(10)=53.60, p < .001), intercorrelations across subscales were high (range: .73−.92). A bifactor model supported a unidimensional structure (explained common variance = .79; percentage of uncontaminated correlations = .86). The findings provide psychometric evidence for the TLS-S and support a unidimensional structure. This study constitutes an important initial step toward facilitating the quantitative assessment of toxic leadership within sport environments, with the TLS-S emerging as a promising instrument for this purpose.

1. Introduction

In high-performance sport, athletic success depends not only on physical capacity, talent, and technical skills but also on the quality of leadership. Effective leadership can enhance team cohesion, foster personal development, and support sustainable performance [1,2]. However, when cultural norms within high-performance sport endorse the belief that strict, dominant, and controlling leadership is essential for sports success, destructive and toxic leadership may become normalized or even incentivized by sports organizations [36]. Such practices can contribute to dysfunctional team dynamics, fear-based relationships between leaders and followers, adverse health outcomes (e.g., psychological distress, injuries, diminished self-esteem), early career termination or turnover as well as compromised performance and negative organizational consequences [3,79]. While traditional leadership frameworks (e.g., situational leadership, the social identity approach, transformational leadership theory) have been applied in sport to explain how coaching influences athlete and team functioning,[e.g., 1012] these approaches primarily emphasize leadership effectiveness in promoting wellbeing and performance. In contrast, the more destructive and toxic dimensions of leadership, those that undermine athlete welfare, team cohesion, and performance, have only recently begun to receive systematic attention in sport research [3]. Consequently, the destructive and toxic side of sports leadership remains comparatively underexplored and measurement tools are lacking. To advance a comprehensive understanding of leadership in sport, researchers must address both its constructive and destructive dimensions, acknowledging the dual potential of leadership to either support or undermine athlete welfare, health, and short- and long-term performance.

1.1. Definitions of toxic leadership

Toxic leadership is well documented in domains outside the sports sector (e.g., academia, business, politics, the military) that share historical connections or organizational similarities with sports, such as a performance-oriented nature and a hierarchical power structure [8,1317]. Although there is currently no universally accepted definition of toxic leadership, and numerous partially overlapping constructs related to destructive leadership exist (e.g., abusive supervision, despotic leadership, tyrannical leadership, leader narcissism, derailed leadership), these conceptualizations converge on a core premise: leaders are considered to engage in, or intend to engage in, behaviors perceived as harmful or destructive to one or more followers or the organization [8,9,18]. The term toxic leadership typically denotes leadership considered ‘poisonous’ to those within its sphere of influence and is conceptualized as a distinct subset within the broader framework of destructive leadership [18,19].

Toxic leadership is associated with self-serving, authoritarian, manipulative, narcissistic, ignorant, and exploitative behaviors, whereby leaders abuse power for personal gain, prioritizing their own interests and goals over collective needs [79,14,16,17,20]. Lipman-Blumen [20] defined toxic leaders as ‘individuals, who by dint of their destructive behaviors and dysfunctional personal qualities generate a serious and enduring poisonous effect on the individuals, families, organizations, communities, and even societies they lead’ (p. 30). Similarly, Gandolfi and Stone [7] characterize toxic leadership as an ‘intentional or unintentional series of acts that undermine and discourage those followers who genuinely seek to carry out the mission and vision of the organization, who then become stifled in the process of achievement by self-serving leaders who put missional or personal gain above the needs of followers, creating a demoralized state that deteriorates organizations from the inside out’ (p. 25). Building on definitions from non-sports contexts and recognizing through a scoping review that research on destructive and toxic leadership in sports remains at a nascent stage of development, Lundqvist et al. [3] introduced toxic leadership in high-performance sports as ‘a broad umbrella term encompassing various destructive, harmful, or socially undesirable leadership characteristics, behaviors, or intentions, as well as aligned processes that enable and maintain such leadership within the sports or organizational context, ultimately resulting in perceived or actual short- or long-term detrimental outcomes for affected individuals, organizations, or communities’ (p. 21).

Toxic leadership is not a single behavior or personality trait, and toxic leaders need not exhibit destructive behaviors in all situations, nor employ the same destructive behaviors consistently [14,16,20,21]. Rather, it is argued to represent the cumulative effect of demotivating actions that erode morale and create an unhealthy and unproductive organizational climate over time [14]. A one‑sided focus on individual characteristics or traits addresses the symptoms of toxicity, but not its underlying systemic causes and how organizational culture and climate can foster toxic leadership and shape subsequent behaviors and outcomes [17,22]. Thoroughgood et al. [19] criticized leader-centric approaches and argued for integrative views on leadership phenomena that involve interactions between flawed, toxic, or ineffective leaders, susceptible followers, and conducive environments [19,23]. Destructive leadership is typically characterized by individual-level behaviors that negatively target followers or subordinates, whereas toxic leadership emphasizes organizational-level dysfunction and its systemic consequences [19,21,23]. Thus, toxic leadership links harmful leader behaviors to their broader impact on followers, organizations, and the surrounding environment that collectively enable and perpetuate toxicity within organizational systems [23]. This dynamic shapes organizational culture, affects subordinates’ experiences, and impacts both organizational and individual performance outcomes [17].

1.2. Assessment of toxic leadership in high-performance sports

A limiting factor in advancing empirical understanding on leaders perceived as toxic in high-performance sport settings, is the current lack of validated measurement tools [3]. Leadership scales traditionally used in sports research, such as the Leadership Scale for Sports [24], the Coaching Behaviour Scale for Sport [25], and the Differentiated Transformational Leadership Inventory [26], generally do not include items that explicitly assess toxic leadership dimensions. To date, much of the available literature on various destructive leadership practices in high-performance sports has relied on qualitative methodologies, which, while valuable for capturing nuanced experiences, may limit the ability to quantify prevalence and expand the theoretical understanding of toxic leadership and its impact (e.g., health, performance, athlete dropout/staff turnover) on individuals, teams and organizations in high-performance sports settings [3].

The Toxic Leadership Scale (TLS) was specifically developed to assess toxic leadership, originally for military and civilian contexts such as business [27,28], and has yet not been utilized within sports. The TLS is a multidimensional measure intended to capture followers’ perceptions of core dimensions of toxic leadership, based on the assumption that toxic leadership comprises a constellation of behaviors that predictably undermine subordinate groups. The scale is grounded in the operationalization of toxic leaders as ‘narcissistic, self-promoters who engage in an unpredictable pattern of abusive and authoritarian supervision’ (p. 57) [27]. Accordingly, the TLS incorporates elements of self-promotion, unpredictability, abusive supervision, narcissism, and authoritarianism. An underlying assumption is that the destructive behaviors of toxic leaders extend beyond direct targets; by shaping the organizational climate, these behaviors may negatively affect a broader group of individuals, including those who merely witness such conduct [27]. In light of qualitative descriptions of toxic leadership noted in the nascent yet evolving field of research on toxic leadership in the sports domain [3], exploring the psychometric properties of the TLS adapted for use in a sports context could not only enable quantitative assessment of toxic leadership dimensions in sports research but also support the development of more effective strategies for identifying and mitigating harmful leadership practices in athletic environments. The purpose of this study was therefore to conduct an initial psychometric evaluation of the internal consistency and factorial validity of the TLS when adapted for use in sports settings (TLS-S). Since toxic leadership is closely related to culture and the climate in group settings, we chose to focus on team sports in this study to ensure that the athletes responding to the scale were embedded in a team context where leadership can exert influence on the group climate.

2. Materials and methods

This study employed a cross-sectional design, using a web-based survey administered to team sport athletes playing at the national or international elite level within their respective age categories.

2.1. Participants

To be eligible for inclusion in this study, participants were required to meet the following criteria: (a) be at least 15 years of age; (b) compete at a national or international elite level in a team sport, as determined by league classification; and (c) possess sufficient proficiency in Swedish to comprehend the survey content. Participants were excluded from the study if they reported experiencing severe mental health issues requiring active psychiatric treatment (e.g., substance abuse, psychotic disorders, obsessive-compulsive disorder or similar conditions) or if they were undergoing a significant personal crisis (e.g., divorce, death within the family or another close relationship, or similar serious events) at the time of data collection. Eligibility was assessed through initial screening questions. A total of 202 Swedish team sport athletes completed the survey, of whom 13 reported yes on one exclusion criterion (severe mental health issue, n = 10; personal crisis, n = 3) and were excluded from this study. After exclusion of non-eligible participants, the study sample consisted of 189 team sport athletes (females: n = 56; males: n = 132; prefer not to disclose: n = 1; mean age: 17.9 years, SD = 3.3).

The study sample represented athletes from 21 different teams in various team sports: ice hockey (n = 68), soccer (n = 60), American football (n = 26), basketball (n = 15), handball (n = 14), volleyball (n = 3), and other team sports not specified or multiple sports (n = 3). Participants reported competing at different performance levels: sub-elite (n = 15), national junior elite (n = 105), international junior elite (n = 26), national senior elite (n = 30), international senior elite (n = 12), and other not specified (n = 1). On average, athletes reported training 11.3 (SD = 5.5) hours per week and participating in 3.9 (SD = 2.9) games per week. The average number of coaches associated with each team was 4.2 (SD = 1.4), with most athletes indicating that their head coach was male (n = 175; 92.6%).

2.2. Measures

Demographics collected were self-reported sex, age, team sport practiced at an elite level, performance level, average/training hours in their main sport/week, average games played/week, number of coaches associated to the players main team, sex of the head coach for the players main team.

The Toxic Leadership Scale [27,28] is available in two formats: a full version comprising 30 items and a shortened version with 15 items. In the present study, the short version was used, which assesses toxic leadership with five subscales and three items per subscale: Self-promotion, abusive supervision, unpredictability, narcissism, and authoritarian leadership. All items are rated on a 5-point response scale ranging from 1 ‘strongly disagree’ to 5 ’strongly agree’, with subscale scores ranging from 3 to 15. Higher scores indicate greater perceived toxic leadership. The TLS has demonstrated adequate internal consistency in both military and civilian samples (all subscales α > .70), and previous research has supported its five-factor structure [27,28]. In this study, the scale was first translated into Swedish using a back-translation procedure. Minor semantic adjustments to the scale, hereafter referred to as The Toxic Leadership Scale for Sports (TLS-S), were then made to better reflect the language and context relevant to athletes. For example, the phrase ‘My current supervisor’ was replaced with ‘My current head coach.’ An example item is: ’My current head coach publicly belittles team members’ (original: ’Publicly belittles subordinates’). The full set of translated items is provided in S1 Appendix.

2.3. Data collection

Data were collected from 01/04/2025–17/12/2025 using a web-based survey administered via the Survey & Report platform. Participants were recruited by convenience sampling through direct contact with representatives of team sports clubs, such as sport directors, head coaches or related staff. The recruitment strategy aimed to include both female and male athletes. Team representatives received both oral and written information detailing the study’s purpose and procedures. The web-based survey was distributed by their coaches to players through a link and QR-code during scheduled team gatherings (e.g., training sessions, travels) or by e-mail, depending on logistical preferences by the team’s coach or sporting director. The survey was administered anonymously to enhance the likelihood of obtaining honest responses. Participants were asked to enter a pre-assigned team code, which enabled an analytical clustering of athletes who belonged to the same team and, by extension, shared the same coach. The team code served no purpose beyond facilitating this analytical clustering.

2.4. Statistical analyses

Descriptive statistics (mean and standard deviation (SD)) were used to explore sample characteristics of TLS-S. Intercorrelations between subscales were examined using Spearman rank-order correlations, a non-parametric test appropriate for the ordinal data used in this study. Acknowledging that cutoffs for measures of correlation coefficients strengths need to be interpreted with caution because different guidelines exist [29,30], we followed rule of thumbs suggesting interpretations of coefficients of .10, .20. and .30 as relatively small, typical and relatively large respectively [30]. Scale reliability was assessed using McDonald’s omega (ωt) adopting a cutoff of .70 to interpret acceptable internal consistency [31,32]. Omega (ωt) has many desirable properties over coefficient alpha as a measure of reliability, such as less stringent and unrealistic assumption concerning, for instance, essential tau-equivalence [33]. Descriptive analyses, non-parametric tests and assessment of scale reliability were performed using SPSS Statistical Package version 30.

Confirmatory factor analyses (CFA) were conducted using MPlus version 9 to investigate the factor structure of three á priori specified models: (a) A first order measurement model with five latent correlated factors (self-promotion, abusive supervision, unpredictability, narcissism, authoritarian leadership; Fig 1), (b) a first order measurement model with one latent factor (Fig 2), (c) a bifactor model with one general factor and five latent factors (Fig 3). The models were hypothesized based on the scale developer’ s original psychometric studies of the TLS [27,28], as well as arguments in the literature that toxic leadership is a cumulative effect of demotivating actions and thereby a unidimensional construct [8,9,14,16,17,21].

thumbnail
Fig 1. Conceptual model of a first order measurement model with five latent correlated factors.

https://doi.org/10.1371/journal.pone.0343533.g001

thumbnail
Fig 2. Conceptual model of a first order measurement model with one latent factor.

https://doi.org/10.1371/journal.pone.0343533.g002

thumbnail
Fig 3. Conceptual model of a bifactor model with one general factor and five latent factors.

https://doi.org/10.1371/journal.pone.0343533.g003

Prior to running CFAs, variance inflation factor (VIF) values were examined to diagnose multicollinearity. There are several rule of thumbs discussed to interpret VIF and we adopted the commonly used guideline that VIF values should be < 10 to indicate no serious concerns with multicollinearity [34,35]. The dataset was also screened for multivariate outliers using Mahalanobis distance. Cases were classified as multivariate outliers if they exceeded the chi-square (χ²) threshold corresponding to the appropriate degrees of freedom at a significance level of p < .001 [36]. To mitigate the risk of such outliers exerting undue influence on parameter estimates and compromising overall model fit, identified outliers were removed before conducting the CFAs. To assess the model fit of the hypothesized model, and given the ordinal nature of the data, weighted least squares mean and variance (WLSMV) [37,38] estimation with theta parameterization was used. The WLSMV estimator does not require the assumption of multivariate normality. The CLUSTER command in MPlus was used to model for non-independence in the data related to clusters of respondents having the same coach. Model fit was evaluated using the Comparative Fit Index (CFI) [39] and the Root Mean Square Error of Approximation (RMSEA) [40]. A good model fit is indicated by CFI > .95 and RMSEA < .06. RMSEA values between .08 and .10 indicate a mediocre fit, while values > .10 suggest poor model fit [41]. It should be noted, however, that these conventional cut-off criteria were based on ML-estimation and were not intended to be broadly generalized to models under distinctly different conditions, or estimators, than those in the original study. Furthermore, the idea of one-size-fits-all cut-off criteria for SEM is also a contested topic in the literature [42,43]. Nested models were compared with difference testing described by Muthén and Muthén as appropriate when WLSMV is used as estimator [37]. Statistical significance in all analyses was determined by a p-value < .05.

Explained common variance (ECV) was calculated to determine the extent to which the general factor in the bifactor model dominates relative to the group factors [44,45]. ECV was computed as the proportion of the sum of squared standardized loadings on the general factor relative to the total sum of squared standardized loadings across both the general and group factors [45]. A higher ECV value (>.70) indicates that the general factor accounts for most of the common variance, supporting essential unidimensionality. Conversely, lower ECV values suggest that specific factors explain a greater proportion of variance, indicating a multidimensional structure [44,45]. Percentage of uncontaminated correlations (PUC) was also calculated to complement ECV by considering the overall data structure. PUC represents the proportion of item correlations that are unaffected by shared group factors. PUC tends to be higher when there are many group factors with relatively few items per factor, compared to situations where there are fewer group factors but a larger number of items within each factor. When both ECV (>.70) and PUC (>.70) are high, the common variance can be interpreted as essentially unidimensional, with only minor bias introduced by group factors [45]. Missing data in all analyses were handled using pairwise deletion.

2.5. Ethical statement

The study was submitted to the Swedish Ethical Review Authority for ethical approval. According to Swedish ethical rules, parental consent is required for children and young people aged 14 or younger. Based on the study’s design, where all participants were aged 15 years or older and the survey was conducted anonymously without collecting any personal data that could reveal the identity of the participants, the Swedish Ethical Review Authority evaluated that the study did not fall within the scope of the Swedish Ethical Review Act and therefore was not in need of formal ethical approval by the Swedish Ethical Review Authority (Dnr: 2025-00838-01). At the beginning of the survey, all participants were presented with written information about the study to enable an informed decision regarding their participation. Written informed consent was obtained through the first question of the survey, in which participants were asked whether they consented to take part in the study. Those who declined to provide consent at this stage were unable to proceed, as the survey was automatically terminated.

3. Results

3.1. Descriptives of the TLS-S, internal consistency and subscale intercorrelations

Descriptive statistics (mean, SD) for the subscales of the TLS-S, along with intercorrelations, internal consistency estimates as well as skewness and kurtosis, are presented in Table 1. Spearman rank-order correlations revealed relatively large associations among the subscales (self-promotion, abusive supervision, unpredictability, narcissism, authoritarian leadership) in the TLS-S. The authoritarian leadership subscale exhibited the weakest intercorrelations with the other subscales. Internal consistency, assessed using McDonald’s omega, indicated satisfactory reliability for four subscales as well as the TLS-S total score (ωₜ > .70). In contrast, the authoritarian leadership subscale (ωₜ = .61) fell below the accepted threshold, indicating inadequate reliability. Item-level response distributions for the TLS-S, grouped by gender and sports, are presented in S2 Appendix. Partitioning the sample into subsamples resulted in small groups and item response patterns revealed that certain scale points were not utilized within some subsamples, likely due to these small sizes. Therefore, at this initial stage of psychometric evaluation of the TLS-S, we decided to conduct all subsequent analyses on the total sample.

thumbnail
Table 1. Descriptive statistics, intercorrelations and internal consistency for the original subscales of the TLS-S.

https://doi.org/10.1371/journal.pone.0343533.t001

Table 1. Descriptive statistics, intercorrelations and internal consistency for the original subscales of the TLS-S.

3.2. Confirmatory factor analyses

Multicollinearity was not identified as a problem in the data, as VIF values were below 10 for all items (range: 1.23–3.01). Mahalanobis distance identified eight multivariate outliers (χ2(15) = 37.7, p  < .001), which were removed before CFAs were performed. Thus, a final sample of 181 cases was used for CFA analyses. Although both the five-factor and the one-factor model overall exhibited acceptable fit indices, indices favored the five-factor solution (Table 2). The difference test for WLSMV confirmed that the five-factor model provided significantly better model fit than the one-factor solution (p < .001). As shown in Fig 4, all standardized factor loadings in the five-factor model exceeded .60, except for item 13, which had a lower loading (.38). This finding suggests that item 13 may not represent the underlying construct (i.e., authoritarian leadership) as strongly as other items on the subscale. Intercorrelations among the five factors were high (range: .73−.92), indicating strong associations between the five subscales (Fig 4). The bifactor model demonstrated good overall fit to the data. Because this specification was not nested within the alternative models, statistical difference testing was not applicable. Instead, model adequacy was evaluated descriptively through fit indices, which supported the appropriateness of the bifactor structure.

thumbnail
Table 2. Confirmatory Factor Analysis and χ² Difference Testing for Nested Model Comparison.

https://doi.org/10.1371/journal.pone.0343533.t002

thumbnail
Fig 4. Confirmatory factor analysis with the five-factor correlated measurement model (standardized factor loadings).

https://doi.org/10.1371/journal.pone.0343533.g004

Standardized factor loadings and item-level explained variance (R2) for the bifactor model are presented in Table 3. Overall, items exhibited stronger loadings on the general factor compared to the subscales. Two exceptions emerged: Item 13 loaded more strongly on the authoritarian leadership subscale than on the general factor, and item 1 showed comparable factor loadings on the general factor and the self-promotion subscale. The computed explained common variance (ECV) was .79 and the percentage of uncontaminated correlations (PUC) was .86. Together, the high values (>.70) of both ECV and PUC suggest that most variance in the TLS-S are primarily explained by the general factor and that most item correlations are unaffected by shared group factors, supporting the model’s unidimensional nature.

thumbnail
Table 3. Standardized Factor Loadings and Item-Level Explained Variance (R²) in the Bifactor Model.

https://doi.org/10.1371/journal.pone.0343533.t003

4. Discussion

The present study assessed the psychometric properties of the Toxic Leadership Scale adapted for sport settings (TLS-S). Overall, the findings supported the reliability and the factorial validity of the TLS-S within a team sport context. Internal consistency was satisfactory (ωₜ > .70) for the total TLS-S score and four of the five subscales: self-promotion, abusive supervision, unpredictability, and narcissism. The authoritarian leadership subscale demonstrated weaker reliability, with item 13 (‘My current head coach controls how team members complete their tasks’) also exhibiting a high degree of uniqueness. Users of the scale should be attentive to this issue, and interpret particularly item 13 with caution. If similar patterns are observed in future research, this item should be considered for removal or reformulation to more accurately reflect authoritarian leadership in sport settings. The findings provided initial evidence for the factorial validity of the TLS-S. While all three tested models (i.e., the five-factor model, the one-factor model, the bifactor model) demonstrated acceptable model fit, the subscales displayed high intercorrelations. Results from the bifactor model provided evidence for a predominantly unidimensional structure of the TLS-S. The general factor accounted for the majority of variance, suggesting that toxic leadership, as assessed by the TLS-S, is appropriately conceptualized as a single overarching construct.

Researchers have debated whether destructive leadership (e.g., abusive supervision, narcissism, authoritarianism) should be treated as distinct constructs or as overlapping dimensions within a unified framework, with arguments supporting both views [18,46]. Differentiating between specific forms of destructive leadership may enable more nuanced analyses and targeted interventions; however, empirical evidence suggests that leaders’ traits or behaviors may not, in isolation, be perceived by followers as inherently toxic. For example, leaders exhibiting authoritarian tendencies, characterized by unilateral decision-making, strict discipline, and expectations of unquestioning obedience [47], or narcissistic traits such as self-centeredness and grandiosity [48], have traditionally been regarded as less effective or “dark” due to their controlling and autonomy-suppressive characteristics [49]. Yet, contemporary research indicates that neither authoritarian nor subclinical narcissistic tendencies alone are necessarily destructive or harmful to followers. Authoritarian leadership has been associated with both positive and negative outcomes depending on group context and objectives [47]. Similarly, studies on subclinical narcissism and leadership effectiveness report mixed findings. A systematic review [49] revealed a curvilinear relationship between narcissism and leadership effectiveness, suggesting that moderate levels of narcissism may be beneficial. Nevertheless, narcissistic leaders often engage in self-enhancement, particularly when outcomes are assessed through self-report measures, raising concerns about the reliability and validity of such data [49]. In performance-driven environments such as elite sports, perceived coach narcissism may even be associated with outcomes athletes regard as advantageous [50,51]. Specifically, athletes may interpret narcissistic traits in coaches as beneficial, as these traits can reinforce athletes’ self-enhancement motives and create opportunities to showcase their skills during competition [50].

Our findings support previous scholars who have suggested that toxic leadership should be investigated as a holistic construct [7,14,1622]. Arguably, athletes are likely to respond to leadership measures from judgments about their coach across multiple situations over time. Rather than differentiating between clearly delineated and distinct leadership behaviors or traits, athletes may integrate diverse cues into a general perception of whether the coach is toxic. Moreover, assessing toxic leaders solely through their characteristics and traits without considering cultural and system-related factors remains incomplete, as it neglects the influence of follower behaviors and organizational culture in enabling or amplifying toxicity [14,19,2123]. Given the self-serving motives and behavioral patterns inherent in toxic leadership, such leaders can, however, be difficult to detect because they often conceal harmful actions from individuals perceived as influential or capable of advancing their careers [7,14,20,21]. Lipman-Blumen [20] contended that, although single whistleblowers may occasionally try to disclose such leaders, most followers, even in the presence of highly toxic leadership, typically remain silent and passive due to psychological, existential, financial, political, and social barriers to disengagement. As a result, followers may endure extended periods of inaction, anticipating that others will ultimately hold toxic leaders accountable. Meanwhile, toxic leaders can continue to perpetuate reassuring illusions of omnipotence and benevolence, presenting themselves as indispensable saviors, even as the consequences of their actions generate diverse forms and degrees of harm for both individuals and organizations [20].

The prevailing performance-oriented culture in high-performance sports, where funding and future opportunities for athletes, coaches, and organizations are closely tied to demonstrated status and medal counts [3,52,53], may also serve to excuse, normalize, or even incentivize toxic leadership within sports organizations. Although toxic leaders may deliver short-term results, any gains typically occur at the long-term expense of athletes’ wellbeing, impaired interpersonal relationships, and organizational stability [14]. Consequently, prioritizing short-term funding or medal objectives above all else may inadvertently create conditions that legitimize harmful leadership practices and undermine both long-term athlete development and organizational integrity. For organizations, such prioritization could therefore constitute a form of self-destructive behavior, whereby perceived short-term benefits ultimately lead to long-term adverse consequences. Furthermore, unquestioned and empirically unsubstantiated assumptions regarding the necessity of harsh, tough and controlling leadership to foster athletic success can produce further systemic vulnerabilities and a notably resistance to change within sports organizations [54]. When core beliefs are reinforced by unspoken traditions, individuals and organizations often cling to outdated practices, assuming the status quo is both optimal and necessary. This mindset can foster resistance to new ideas and methods, ultimately hindering development and adaptation to norms supported by research as enhancing both health and performance over the long term [54]. Once toxic dynamics become entrenched, they are exceedingly difficult to disrupt, as the underlying processes may become self-reinforcing and perpetuate themselves by attracting and socializing new leaders who adopt similar values and toxic behavioral patterns. Early identification, for example through regular and systematic assessments of both the constructive and destructive dimensions of leadership, and proactive measures to curb the development of toxic leadership from the outset are therefore essential [3].

While this study represents a novel effort to facilitate the assessment of toxic leadership within sport contexts, several limitations nevertheless warrant consideration. First, the sample size was relatively small and consisted exclusively of athletes from team sports, which restricts the generalizability of the findings. Moreover, teams included in this study had predominantly male coaches. Although this distribution reflects the gender composition typically observed among elite-level coaches, it is important to note that the athletes’ evaluations primarily pertain to male coaches which may introduce bias in the results. The sample size in this study did not allow for subgroup analyses, such as tests of measurement invariance across gender or sport type. We encourage researchers to pursue this work in future studies. Given the ordinal nature of the data, we employed WLSMV as the estimation method, as it typically yields less biased parameter estimates compared to alternatives such as robust maximum likelihood (MLR). However, prior research indicates that WLSMV can moderately overestimate interfactor correlations when sample sizes are small [55]. Although model fit in this initial validation study was adequate, replication in larger samples is recommended. Second, the study population was drawn from a single Scandinavian country. Culture may influence athletes’ perceptions of leadership, including which behaviors are regarded as effective or detrimental. These cultural nuances highlight the need for future research to evaluate the scale across diverse cultural settings. Such efforts would strengthen understanding of its relevance, applicability, and potential for cross-cultural comparisons among athletes, coaches, managers, stakeholders, and other members of the sporting entourage in varied national contexts. Third, in performance-oriented cultures, toxic leadership and even abusive behaviors can become normalized. Despite their inherently destructive nature, such behaviors may be perceived as acceptable or even encouraged within these environments [35]. Consequently, individuals completing self-report scales may struggle to identify themselves as followers of toxic leaders. Awareness of these harmful dynamics often emerges only retrospectively, once individuals have exited the environment and gained critical distance [3], This represents a general limitation of self-report instruments and poses a risk of systematic underreporting of toxic leadership, even in contexts where it is highly prevalent. It is also plausible that individuals exhibiting toxic leadership tendencies may decline participation in leadership studies, thereby introducing sampling bias in research reliant on voluntary participation. In the present study, we used convenience sampling, and thus we cannot exclude the possibility that our findings were influenced by sampling bias. To estimate the prevalence of toxic leadership in future investigations, the use of random probability sampling is strongly recommended.

5. Conclusions

In conclusion, the TLS-S is a promising instrument for assessing toxic leadership within team sport contexts. Our results provide support for the scale’s internal consistency and for a unidimensional factor structure. This study represents a novel contribution to the quantitative assessment of toxic leadership in sport and establishes a foundation for subsequent research aimed at further validation and systematic refinement of the scale across more diverse athletic populations and cultural contexts.

Supporting information

S1 Appendix. The Toxic Leadership Scale for Sports (TLS-S).

The Swedish translation the Toxic Leadership Scale adapted to sports (TLS-S) and a preliminary English version.

https://doi.org/10.1371/journal.pone.0343533.s001

(DOCX)

S2 Appendix. Item-level response patterns of the TLS-S.

Item-by-item analysis of the TLS‑S by gender and sports.

https://doi.org/10.1371/journal.pone.0343533.s002

(DOCX)

References

  1. 1. Ramírez-Bravo I, Pulido JJ, Díaz-García J, et al. What are the individual and collective consequences of transformational leadership coaching? A longitudinal structural equation model analysis with youth footballers. Curr Psychol. 2025;44(5):4084–95.
  2. 2. Jowett S. 25 years of relationship research in sport: The quality of the coach-athlete relationship as defined by Closeness, Commitment, Complementarity and Co-orientation (3 + 1Cs Model). Psychol Sport Exerc. 2025;80:102909. pmid:40473086
  3. 3. Lundqvist C, Camps J, Vertommen T, Barker-Ruchti N, Kolbeinsson Ö. Toxic leadership in high-performance sports and its consequences for mental health and performance: a scoping review. International Review of Sport and Exercise Psychology. 2025;:1–27.
  4. 4. Roberts V, Sojo V, Grant F. Organisational factors and non-accidental violence in sport: A systematic review. Sport Management Review. 2020;23(1):8–27.
  5. 5. Carleton EL, Barling J, Christie AM, Trivisonno M, Tulloch K, Beauchamp MR. Scarred for the Rest of My Career? Career-Long Effects of Abusive Leadership on Professional Athlete Aggression and Task Performance. J Sport Exerc Psychol. 2016;38(4):409–22. pmid:27634106
  6. 6. Runquist EB 3rd, Adenaiye OO, Sarzaeim M, Milroy J, Wyrick D, Tuakli-Wosornu YA. Associations of abusive supervision among collegiate athletes from equity-deserving groups. Br J Sports Med. 2025;59(16):1124–34. pmid:40032295
  7. 7. Gandolfi F, Stone S. Toxic leadership: Behaviors, characteristics, and consequences. J Manag Res. 2022;22(1):19–27.
  8. 8. Akinyele AI, Chen Z. Dark clouds of leadership: causes and consequences of toxic leadership. International Studies of Management & Organization. 2024;55(4):476–503.
  9. 9. Milosevic I, Maric S, Lončar D. Defeating the Toxic Boss: The Nature of Toxic Leadership and the Role of Followers. Journal of Leadership & Organizational Studies. 2019;27(2):117–37.
  10. 10. Arthur CA, Bastardoz N. Leadership in sport. Handbook of sport psychology: Social perspectives, cognition, and applications. 4th ed. Wiley. 2020. 344–71.
  11. 11. Chelladurai P. Leadership in sports. Handbook of Sport Psychology. Wiley. 2007. 111–35.
  12. 12. Cotterill ST, Loughead TM, Fransen K. Athlete Leadership Development Within Teams: Current Understanding and Future Directions. Front Psychol. 2022;13:820745. pmid:35250758
  13. 13. Gallus JA, Walsh BM, van Driel M, Gouge MC, Antolic E. Intolerable Cruelty: A Multilevel Examination of the Impact of Toxic Leadership on U.S. Military Units and Service Members. Military Psychology. 2013;25(6):588–601.
  14. 14. Reed GE. Toxic leadership. Military Review. 2004;84(4):67–71.
  15. 15. Heppell T. Toxic leadership: applying the lipman-blumen model to political leadership. Representation. 2011;47(3):241–9.
  16. 16. Lipman-Blumen J. Toxic Leadership: A Conceptual Framework. Handbook of Top Management Teams. Palgrave Macmillan UK. 2010. 214–20.
  17. 17. Smith N, Fredricks-Lowman I. Conflict in the workplace: a 10-year review of toxic leadership in higher education. International Journal of Leadership in Education. 2019;23(5):538–51.
  18. 18. Mackey JD, Parker Ellen B III, McAllister CP, Alexander KC. The dark side of leadership: A systematic literature review and meta-analysis of destructive leadership research. Journal of Business Research. 2021;132:705–18.
  19. 19. Thoroughgood CN, Sawyer KB, Padilla A, Lunsford L. Destructive Leadership: A Critique of Leader-Centric Perspectives and Toward a More Holistic Definition. J Bus Ethics. 2016;151(3):627–49.
  20. 20. Lipman-Blumen J. Toxic leadership: When grand illusions masquerade as noble visions. Leader to Leader. 2005;2005(36):29–36.
  21. 21. Ronnie L. The challenge of toxic leadership in realising Sustainable Development Goals: A scoping review. SA j hum resour manag. 2024;22.
  22. 22. Pelletier KL. Leader toxicity: An empirical investigation of toxic behavior and rhetoric. Leadership. 2010;6(4):373–89.
  23. 23. Padilla A, Hogan R, Kaiser RB. The toxic triangle: Destructive leaders, susceptible followers, and conducive environments. The Leadership Quarterly. 2007;18(3):176–94.
  24. 24. Chelladurai P, Saleh SD. Dimensions of Leader Behavior in Sports: Development of a Leadership Scale. Journal of Sport Psychology. 1980;2(1):34–45.
  25. 25. Côté J, Yardley J, Hay J, et al. An exploratory examination of the Coaching Behavior Scale for Sport. Avante. 1999;5:82–92.
  26. 26. Callow N, Smith MJ, Hardy L, Arthur CA, Hardy J. Measurement of Transformational Leadership and its Relationship with Team Cohesion and Performance Level. Journal of Applied Sport Psychology. 2009;21(4):395–412.
  27. 27. Schmidt AA. Development and validation of the toxic leadership scale. University of Maryland. 2008. https://scispace.com/pdf/development-and-validation-of-the-toxic-leadership-scale-2dr91qipqn.pdf
  28. 28. Schmidt AA. An examination of toxic leadership, job outcomes, and the impact of military deployment. Faculty of the Graduate School of the University of Maryland, College Park. 2014. https://api.drum.lib.umd.edu/server/api/core/bitstreams/c637939b-5e85-4dfb-9375-31ab24d7c765/content
  29. 29. Schober P, Boer C, Schwarte LA. Correlation Coefficients: Appropriate Use and Interpretation. Anesth Analg. 2018;126(5):1763–8. pmid:29481436
  30. 30. Gignac GE, Szodorai ET. Effect size guidelines for individual differences researchers. Personality and Individual Differences. 2016;102:74–8.
  31. 31. Cheung GW, Cooper-Thomas HD, Lau RS, Wang LC. Reporting reliability, convergent and discriminant validity with structural equation modeling: A review and best-practice recommendations. Asia Pac J Manag. 2023;41(2):745–83.
  32. 32. Zinbarg RE, Revelle W, Yovel I, Li W. Cronbach’s α, Revelle’s β, and Mcdonald’s ωH: their Relations with Each Other and Two Alternative Conceptualizations of Reliability. Psychometrika. 2005;70(1):123–33.
  33. 33. Green SB, Yang Y. Evaluation of Dimensionality in the Assessment of Internal Consistency Reliability: Coefficient Alpha and Omega Coefficients. Educational Measurement. 2015;34(4):14–20.
  34. 34. O’brien RM. A Caution Regarding Rules of Thumb for Variance Inflation Factors. Qual Quant. 2007;41(5):673–90.
  35. 35. Field A. Discovering statistics using IBM SPSS statistics. 5th ed. SAGE. 2018.
  36. 36. Tabachnick BG, Fidell LS. Using multivariate statistics. 6th ed. Pearson. 2014.
  37. 37. Muthén LK, Muthén BO. Mplus statistical analysis with latent variables: User’s guide. 8th ed. Muthén & Muthén. 2018.
  38. 38. DiStefano C, Morgan GB. A Comparison of Diagonal Weighted Least Squares Robust Estimation Techniques for Ordinal Data. Structural Equation Modeling: A Multidisciplinary Journal. 2014;21(3):425–38.
  39. 39. Bentler PM. Comparative fit indexes in structural models. Psychol Bull. 1990;107(2):238–46. pmid:2320703
  40. 40. Browne MW, Cudeck R. Alternative Ways of Assessing Model Fit. Sociological Methods & Research. 1992;21(2):230–58.
  41. 41. Hu L, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling: A Multidisciplinary Journal. 1999;6(1):1–55.
  42. 42. Bazzoli A. Magic Number .95? Or was it .08? A Refresher on SEM Approximate Fit Indices Thresholds for Applied Psychologists and Management Scholars. Group & Organization Management. 2024;50(4):1136–44.
  43. 43. McNeish D, An J, Hancock GR. The Thorny Relation Between Measurement Quality and Fit Index Cutoffs in Latent Variable Models. J Pers Assess. 2018;100(1):43–52. pmid:28631976
  44. 44. Rodriguez A, Reise SP, Haviland MG. Evaluating bifactor models: Calculating and interpreting statistical indices. Psychol Methods. 2016;21(2):137–50. pmid:26523435
  45. 45. Correction to: Applying Bifactor Statistical Indices in the Evaluation of Psychological Measures. J Pers Assess. 2016;98(4):444. pmid:26699286
  46. 46. Foley KO, Shaughnessy Plemmons S. Negative leader behavior: What do our scales measure?. United States Army Research Institute for the Behavioral and Social Sciences. 2021. https://apps.dtic.mil/sti/trecms/pdf/AD1127211.pdf
  47. 47. Pizzolitto E, Verna I, Venditti M. Authoritarian leadership styles and performance: a systematic literature review and research agenda. Manag Rev Q. 2022;73(2):841–71.
  48. 48. Miller JD, Back MD, Lynam DR, Wright AGC. Narcissism Today: What We Know and What We Need to Learn. Curr Dir Psychol Sci. 2021;30(6):519–25.
  49. 49. Grijalva E, Harms PD, Newman DA, Gaddis BH, Fraley RC. Narcissism and Leadership: A Meta‐Analytic Review of Linear and Nonlinear Relationships. Personnel Psychology. 2014;68(1):1–47.
  50. 50. Nevicka B, van Gerven EJG, Sedikides C. Narcissistic coaches and athletes’ individual rowing performance. Sci Rep. 2023;13(1):21010. pmid:38030807
  51. 51. Stanford J, Roberts R, Johnston JP, Sarkar M, Holmes P, Healy LC. “You’re just like me, so we must be great together”: How similarity in narcissism impacts the quality of the coach–athlete relationship. Sport, Exercise, and Performance Psychology. 2025;14(3):437–56.
  52. 52. Purdy LG, Lang M. Raibuliavimas (making ripples): student-athletes action safe sport in Lithuania. Sport, Education and Society. 2023;30(1):15–28.
  53. 53. Lundqvist C, Schary DP, Jacobsson J, Korhonen L, Timpka T. Aligning categories of mental health conditions with intervention types in high-performance sports: A narrative cornerstone review and classification framework. J Sci Med Sport. 2024;27(8):525–31. pmid:38796375
  54. 54. Gandolfi F, Stone S, Gano M, Nasrah SKM. Toxic Engagement: Identifying Toxic Leadership Through Employee Engagement to Benefit the SDGs. jlsdgr. 2024;5(1):e03164.
  55. 55. Li C-H. Confirmatory factor analysis with ordinal data: Comparing robust maximum likelihood and diagonally weighted least squares. Behav Res Methods. 2016;48(3):936–49. pmid:26174714