Figures
Abstract
The return-to-sport (RTS) process is multifaceted and complex, as multiple variables may interact and influence the time to RTS. These variables include intrinsic factors related the player, such as anthropometrics and playing position, or extrinsic factors, such as competitive pressure. Providing an individualised estimation of time to return to play is often challenging, and clinical decision support tools are not common in sports medicine. This study uses epidemiological data to demonstrate a Bayesian Network (BN). We applied a BN that integrated clinical, non-clinical factors, and expert knowledge to classify time day to RTS and injury severity (minimal, mild, moderate and severe) for individual players. Retrospective injury data of 3374 player seasons and 6143 time-loss injuries from seven seasons of the professional German football league (Bundesliga, 2014/2015 through 2020/2021) were collected from public databases and media resources. A total of twelve variables from three categories (player’s characteristics and anthropometrics, match information and injury information) were included. The response variables were 1) days to RTS (1–3, 4–7, 8–14, 15–28, 29–60, > 60, and 2) injury severity (minimal, mild, moderate, and severe). The sensitivity of the model for days to RTS was 0.24–0.97, while for severity categories it was 0.73–1.00. The user’s accuracy of the model for days to RTS was 0.52–0.83, while for severity categories, it was 0.67–1.00. The BN can help to integrate different data types to model the probability of an outcome, such as days to return to sport. In our study, the BN may support coaches and players in 1) predicting days to RTS given an injury, 2) team planning via assessment of scenarios based on players’ characteristics and injury risk, and 3) understanding the relationships between injury risk factors and RTS. This study demonstrates the how a Bayesian network may aid clinical decision making for RTS.
Citation: Yung KKY, Wu PPY, aus der Fünten K, Hecksteden A, Meyer T (2025) Using a Bayesian network to classify time to return to sport based on football injury epidemiological data. PLoS ONE 20(3): e0314184. https://doi.org/10.1371/journal.pone.0314184
Editor: Benjamin F. Mentiplay,, La Trobe University, AUSTRALIA
Received: November 6, 2024; Accepted: February 13, 2025; Published: March 20, 2025
Copyright: © 2025 Yung et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Dataset cannot be shared publicly because the data used for this study were collected, summarised and verified from various media outlets/databases and therefore include patient privacy. At this level of play (Bundesliga 1), effective annoymisation is also not possible. Due to above reason and to protect patient privacy, Saarland University does not allow the data to be shared publicly. Data are available from the from the Institute for Sports and Preventive Medicine, Saarland University (contact via sportmed@mx.unisaarland.de) for researchers who meet the criteria to access to confidential data.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Challenges in return to sport decision making
Return to sport (RTS) is defined as when an injured athlete can return to full unrestricted team training and play without modifications in duration and/ or activities[1–3]. Forecasting or estimating the return date of an injured athlete is crucial for team planning, performance optimisation, and game strategy development. By having an estimated return date, medical staff can create an individualised rehabilitation plan to support a graduated progression in the athlete’s conditioning program to improve the athlete’s condition and assist in mitigating re-injury risk. The provision of predicted timeframe for recovery also helps coaching staff to adjust their game plans and strategies, maximising the team’s chances of success.
Athletes operate within a complex system and are influenced by factors such injury history, current injury, body mass index, playing positions, sociological factors, psychological status, and the nature of the sports event [4,5]. Accurately predicting an athlete’s RTS can be a challenging task for medical staff due to the intricate interactions between various variables and their influence on injuries. Understanding the interactions between the human physiological system’s and injury risk is crucial to provide accurate injury prognoses and project an athlete’s likely RTS time range (e.g., there is a 75% chance for the player to return between 4–5 weeks). Such RTS projections could help medical staff and athletes to prepare RTS programmes based on both the lower and upper projection limits.
Large-scale epidemiological studies have estimated the expected RTS time for major injuries in football at the population level accounting for associated injury risk factors[6–10]. These studies describe the risk factors for a disease or injury and the extent of the problem. Medical staff can then use an “anchor and adjust” strategy to optimise predictive accuracy, thus better estimating RTS times[11]. While typical epidemiological studies aim to estimate population-level effect and trends, tools to translate population-level data into individual-level estimations would be helpful.
Forecasting the time to RTS based on individual characteristics can be challenging for several reasons. First, the Strategic Assessment of Risk and Risk Tolerance Framework recommends considering personal characteristics and factors, such as age and playing position[12]. However, quantifying and synthesising this information in the clinical reasoning process can be difficult. For example, what is the difference between the rehabilitation time required for a 20-year-old striker versus a 30-year-old defender with a same type of hamstring strain injury? Second, synthesising useful information from broad population-level epidemiological data covering entire leagues and multiple seasons and then personalising it to a specific player-context is almost cognitively infeasible for a single individual or even a small group of individuals. There are no tools to help translate the sheer volume and complexity of the population-level data and tailor them to an individual level. Third, medical staff, like any other humans, are susceptible to various decision-making challenges. They have limited information processing capacity[13] and face potential cognitive biases[14,15]. Human judgment can be vulnerable, particularly when it comes to statistical and probabilistic reasoning[16]. Without an objective and reliable decision-making tool specifically designed to provide individualised RTS estimations, medical staff often rely heavily on their clinical experience and population-level epidemiological data.
Data scientists have the potential to develop decision support tools that can effectively synthesise broad epidemiological data and personalise it to specific player-contexts, overcoming the cognitive limitations faced by individuals. These tools can provide valuable additional information that would otherwise be challenging for a single human to process. In practical settings, data scientists can undertake the setting up and maintaining the decision support tool, while users such as medical staff and coaches utilise the tool in planning for an athlete’s RTS, making necessary adjustments for the team, and ensuring the team remains competitive. This collaboration between data scientists and end users can lead to more informed decision-making and strategic planning.
Developing a computer-based decision support tool that uses a complex systems approach may be helpful to overcome the previous challenges and offer a competitive advantage in forecasting RTS[17]. Specifically, the key differentiator of a complex systems approach in RTS is the explicit modelling of factor interactions that could be queried and used by medical staff in multiple ways. This includes, but is not limited to understanding the strength of influence of different variables and predicting the outcome based on custom RTS scenarios.
Computer-based decision support tools can be divided into predictive and descriptive modelling[18]. Predictive modelling can be used for injury diagnosis, severity estimation, and rehabilitation planning. In particular, the Bayesian network (BN) is well suited for providing injury prognosis in sports, due to its capacity to model complex systems and to integrate clinical data, non-clinical factors and expert knowledge. However, it has not been previously used for such a purpose.
About the Bayesian network
A BN is a graph-based modelling method where the relationships between variables (nodes) are represented with arrows (arcs) (See Fig 1). The presence of an arc denotes the influence of one node on another, and the absence of one assumes conditional independence[19]. While real data often contains a mixture of discrete and continuous variables, BN structure learning algorithms often assume the random variables are discrete. This type of BN is called the discrete BN, which involves discretising continuous variables in the dataset into categories. Although some information is lost when continuous data are categorised[20], there are merits of using the discrete Bayesian network that are worth discussing.
In practical settings, people often find it easier to work with discrete representations rather than continuous data. Discrete variables tend to be more interpretable, facilitating abstract reasoning[21]. For example, word tokens often enable fast and exact processing[21]. Consider the comparison between the words “tall” and “short” versus the numerical values “183cm” and “150cm”. Some may find the descriptive terms are easier to grasp and apply in reasoning. Given the complexity of the human body and interacting processes involved in RTS, discretisation may help capture the resolution of the data available and its relevance to the decision support scenario at hand[22].
However, it’s also important to note the disadvantages to discretising continuous variables[23]. Discretisation can lead to information loss, as the finer details and nuances captured by continuous data may be overlooked. By converting continuous variables into discrete categories, we sacrifice the precision and granularity inherent in the original data. Additionally, the choice of how to discretise the variables can introduce subjectivity and bias into the analysis and therefore, it is crucial to ensure transparency in how variables are discretised to mitigate these potential issues.
BNs provide a platform for inferring state probabilities given observations, referred to as evidence, of one or more nodes in the network. In the discrete BN, the relationship between parent nodes and a child node can be quantified using conditional probability tables (CPTs). The CPT reflects the probability of child node states (or outcomes) given every possible combination of parent node states[24]. As new evidence comes in, changes may be made to the node’s marginal probability[25], which is known as the posterior probability. The posterior probability is the updated probability of a hypothesis or event after considering new evidence. They combine prior beliefs with the likelihood of the data to provide an updated estimate of the event’s probability.
BNs can perform both predictive and diagnostic inference. For example, medical staff can use the former to predict the outcome of an injury for a given clinical diagnosis, anthropometric and match factors (predictive inference); but can also enter the injury severity as an observation to examine what injury factors could explain that observation (diagnostic inference). As an important feature to end users who may not be familiar with statistics, the BN provides visuals to facilitate understanding and supporting decision making across teams of users, such as, among athletes, coaching staff, medical and non-medical personnel.
A is the parent of B, while B is the child of A; B is the parent of C and D, while C and D are the children of B; and so on. A is the ancestor of B, C, D, E and F, while B, C, D, E and F are the descendants of A; and so on. The arrows indicate the direction of influence.
Another key feature of a BN is that medical staff can integrate and visualise data from multiple sources into a single BN, such as clinical data, empirical evidence and expert knowledge[26], or combinations of these[27]. Expert knowledge can be invaluable when empirical data is scarce or unavailable. It also plays a significant role in developing models, selecting data or variables, estimating parameters, interpreting results and determining the uncertainty characteristics. Since the BN can be customised based on the experts’ knowledge[28,29], it may be appealing for small sample size research in elite sports research[30–32], where some data may be missing or not be feasible to collect (e.g., limitations in applied settings)[26]. BNs have been used to support clinical decisions[33–36], analyse complex systems in ecology [22,37,38] and logistics[39].
Study objectives
The objective of this study was to demonstrate the use of a Bayesian network and its potential utility in becoming a decision support tool for medical staff for RTS. Specifically, a discrete BN was modelled based on a set of basic epidemiological data to demonstrate how medical staff can use BN to understand 1) the most influential variable to the outcomes, 2) the strength of influence of different variables, and to 3) demonstrate its use with a case scenario.
Materials and methods
Study design
The study is a retrospective analysis of prospectively collected injury data from the German professional men’s highest football league (Bundesliga) between 2014/2015–2020/2021. Neither research ethics board approval nor a trial registration was required as all data were collected from publicly available sources[9,40]. We reported the result with reference to the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD) statement[41].
Participants
Players who played in the Bundesliga in the above-mentioned seven seasons were included in the study. Injured players who did not return to the Bundesliga in the same season and those without complete data were excluded from the analysis. All participants were identified using a publicly available database, including Kicker SportmagazinTM, and clubs’ official websites. Data collection was performed via methods established in previous investigations[9,42]. The data has been collected according to the Fuller et al. consensus statement for football injury research and the same definitions have been the same throughout the 7 seasons[43]. 3374 player Bundesliga seasons were registered over the seven seasons from 2014/2015–2020/2021. A total of 6653 time-loss injuries were recorded. After removing injuries without complete data, 6143 time-loss injuries remained.
Construction of the Bayesian network
The modelling process and validation were performed in GeNIe 2.0 (Bayes Fusion, LLC)[44]. There are four main steps to creating and validating a BN:
- 1) Identify the variables (nodes) of the BN and the key response variables. In this study, the key response variables are the (i) severity of the injury (severity) and (ii) days to RTS (days_rts). They are used to evaluate the performance of the BN. The nodes and sources of information are outlined in Table 1.
- 2) Learn the graphical structure of the BN and display the relationships between all the nodes.
- 3) Learn the probability distribution governing the relationships between the nodes.
- 4) Validate the BN to evaluate how well the BN can classify injury based on the key response variables: (i) severity and (ii) days_rts.
Step 1: Identify the main variables.
This model is intended to capture the conditional probabilistic relationships among personal characteristics, match and injury information. The variables in this study are commonly included in epidemiologic studies[45] and we have summarised them in Table 1.
All time-loss injuries that occurred during football competitions and training sessions were included, and the day of the injury is counted as day zero. A time-loss injury is when a player cannot fully participate in training or competition due to injury[43]. RTS is defined as when the player has received medical clearance to allow full participation in training and is available for match selection[2]. Details of the injury (type of injury, body region, contact/non-contact injury, training or match) were labelled based on the Fuller et al. consensus of data collection in football[43]. The time of the season was based on an injury epidemiologic study in European football[46]. Personal characteristics and anthropometrics, match information and the above information are used as the explanatory variables for constructing the model.
We are interested in determining the days to RTS, therefore, the two key response variables in this model are injury severity (severity) and days to RTS (days_rts). Injury severity (severity) was categorised according to the days of absence in match or training as outlined by the Fuller et al. consensus statement on football injury studies[43]: minimal (1–3 days), mild (4–7 days), moderate (8–28 days), or severe (>28 days). We further create a new variable based on days to RTS (days_rts) to evaluate if the model could classify the days to RTS into more precise categories. There are two additional categories in days_rts as compared to severity: 1–3, 4–7, 8–14, 15–28, 29–60, > 60 days. In context, these variables form the nodes in the BN. A summary of the variables and source of the data are presented in Table 1.
Step 2: Define the graphical structure.
We incorporated expert knowledge to constrain the search to help ensure graph structures produced are consistent with clinical science (See S1 Fig). We first specified the temporal order of the variables, ensuring there were no arcs from variables that occurred later (e.g., injury) to nodes happening earlier (e.g., main playing position). Following this, Author KY established the relationships (arcs) within the graphical structure. Notably, the key response variable, days_rts, was directly linked to severity in the graphical structure. This mapping was based on the definition of severity outlined in the Fuller et al. consensus statement on football injury studies, which defines severity by the duration of unavailability for full training[43]. This mapping was done manually based on clinical knowledge and could not be learnt from algorithms. Finally, with the constraints in place, we used the Bayesian Search algorithm in GeNIe 2.0 to find the best-fitted network for the optimal network configuration that aligns with the collected data and expert knowledge[19,47].
Step 3: Define the probability distribution.
There are multiple ways to discretise continuous data, including manual, unsupervised and supervised. Each method offers distinct advantages, such as improved model performance, easier interpretation, and computational efficiency[48]. Manual discretisation involves manually defining thresholds or categories to discretise the continuous data. This approach provides flexibility and allows domain experts to incorporate their knowledge, ensuring that the categories align with relevant domain-specific consensus statements and the specific needs of the analysis. Unsupervised discretisation involves using clustering algorithms to identify natural groupings or patterns in the data. These clusters can then be treated as discrete categories. This is particularly useful when prior knowledge or predefined categories are unavailable[49]. Supervised discretisation utilises labelled data to guide the discretisation process. Machine learning algorithms or decision trees can be employed to learn optimal thresholds or categories that maximise predictive performance. This method can enhance both model performance and interpretability[49].
We manually discretised the continuous data based on domain-specific decision categories to maximise usability across various practitioner types and clubs. Specifically, the continuous data are discretised based on categories that are easy to apply practically or with reference to relevant consensus statements. We discretised age, height, and weight into three categories that represents the typical values observed in the sample, to create uniform count categories that were both meaningful and easy to interpret. severity was discretised based on the Fuller et al. consensus of epidemiological data collection into four categories (minimal, mild, moderate and severe)[43]. days_rts was discretised into six categories: 1–3, 4–7, 8–14, 15–28, 29–60, > 60 days, with reference to the Fuller et al. consensus of epidemiological data collection[43]. Descriptions of the nodes and categories are summarised in Table 2. We used the Expectation Maximisation (EM) algorithm in GeNIe 2.0 to determine the probability distribution (parameter learning) of the dataset[50,51].
Step 4: Validation of the BN.
A crucial element of learning is to validate the model. Validation was performed on the two target nodes, i.e., severity and days_rts, which are the main outcomes of interest. Ten-fold cross-validation was performed where the dataset was split into ten parts of equal probability[52]. The model was trained on nine parts and tested on the remaining tenth part of the unseen data (holdout test sets). The process was repeated ten times, with a different part of the data being used for testing. The model evaluation technique implemented in GeNIe 2.0 keeps the model structure fixed and re-learns the model parameters during each of the folds. We compiled the results of the test splits to report the sensitivity and user’s accuracy[53]. The sensitivity is calculated by dividing the total number of correctly classified injury severity by the total number of actual occurrences of injury of that severity and represents the true positive rate for each category.
The user’s accuracy is calculated by dividing the total number of correctly classified injury severity by the total classified true occurrences of injury of that severity. User’s accuracy provides insights into the precision of the model’s predictions for each category and how reliable it is from the users’ perspective.
By considering both the sensitivity and user’s accuracy, we can evaluate the model’s performance in terms of ability to classify correctly and accuracy of predictions within each category. This is particularly relevant when dealing with models that have multiple categories, such as our current model’s injury severity classification.
Results
Demographics
The demographics and the main playing positions of injured players are shown in S1 Table. The breakdown of injury by nature of injury (contact or non-contact), event (match or training), body region, and types of injuries are available in S2 Table.
Bayesian network
The network and the probability distribution of each variable are presented in Fig 2. The model’s sensitivity in classifying days_rts and severity is presented in Table 3. In terms of categorising days_rts, the sensitivity ranges from 0.24 to 0.97, with the best performance for shorter days (i.e., below 3 days) and the worst performance for the mid-range category (i.e., 8–14 days). In classifying the injury’s severity, the sensitivity ranges from 0.73 to 1.00, with the best performance for severe and the worst for minimal. In terms of categorising days_rts, the user’s accuracy ranges from 0.52 to 0.83, with the best performance for days 3–7 and the worst performance for the mid-range category (i.e., 8–14 and 15–28 days). In classifying the injury’s severity, the sensitivity ranges from 0.67 to 1.00, with > 0.90 in all categories except minimal.
The percentages indicate the distribution of values under each variable, while the blue arrows indicate how factors influence one another.
In our model, the type of injury and the injured body region are directly connected to days_rts (Fig 2). The percentages indicate the distribution of values under each variable, with the blue arrows indicate how factors influence one another (Fig 2). Based on the sensitivity analysis, the type of injury and the injured body region are most influential to the days_rts, followed by age, contact or non-contact injury and the nature of the event (training or match) (see Fig 2). Time of the season, weight, and height only had a minor influence on the result. The time of the season is also associated with the nature of the event (training or match) and the type of injury, which is supported by empirical evidence[54].
Sensitivity analysis, feature selection and strength of influence
A sensitivity analysis was performed using built-in functionality in GeNIe 2.0 to determine the influence of the individual nodes. Sensitivity analysis helps determine the influence of observing the states of specific nodes (i.e., prior and conditional probabilities) on the output variables (i.e., posterior probabilities)[55], which in this case, are the days to RTS and severity of the injury (Fig 3). This can help to support the selection of key variables to be included in a model (feature selection). Results of the sensitivity analysis are visually summarised in the form of tornado charts in S2 Fig.
Red-coloured nodes: contain variables important for calculating posterior probability distributions in days_rts. Key: A darker red colour indicates a higher degree of influence. Grey-coloured nodes: the node has no influence on the posterior probability distributions of days_rts.
Highly sensitive variables affect the inference results more significantly. Identifying the highly sensitive variables directs medical staff to specific areas to focus on to affect the BN’s outcome. As an example, in Fig 3, we have set days_rts as the key response variable. Nodes coloured in red contain variables important for calculating posterior probability distributions in days_rts. Grey-coloured nodes have no influence on the posterior probability distributions of days_rts, as determined by data and knowledge. Based on the sensitivity analysis, body_region, injury_type and contact_noncontact are the most influential to the outcome in days_rts (Fig 3). A numeric form of the strength of influence of different variables can be found in S3 Table.
Discussion
This study demonstrates the applicability of a BN in an epidemiological dataset to project rehabilitation timelines and provides evidence-based support to RTS decisions. We integrated personal characteristics and anthropometrics, match information and injury information to construct a BN that may inform the days to RTS and injury severity.
The BN in the context of RTS
In this model, we use a hybrid approach in constructing the BN; that is, we have combined clinical knowledge and data-driven learning (Bayesian Search algorithm) when constructing the graphical structure. This is because constructing the BN using pure algorithmic approaches (i.e., unsupervised learning) can sometimes produce graph structures with unreasonable arcs, such as the model attempting to explain age with severity. However, if we use a pure domain approach, we may miss some of the patterns and linkages between variables that were not observed by clinicians. Therefore, we opted to use a hybrid approach when constructing the BN.
The BN model may help medical staff working in the Bundesliga to view athletes and injuries with a complex systems approach. For example, the model captured the complex relationships between the time of the season, the injury type and injury occurrence (Fig 2). Specifically, in our BN model, the time of the season correlates with the weather, which is then associated with the ground condition [54] and injury occurrence[56,57]. As a key feature of a complex systems approach, the BN can explicitly model the factor interactions/relationships and query in multiple ways, such as comparing the outcomes of custom RTS scenarios in the section titled Illustrative application of the BN), understanding the most influential factors (in the section titled Sensitivity analysis, feature selection and strength of influence) and the strength of influence (in the section titled Sensitivity analysis, feature selection and strength of influence).
To our knowledge, no study in the Bundesliga has studied the correlations of the above factors, so we use an example from the English Premier League (EPL) to illustrate the possible correlations. In the EPL, the ground condition tends to be drier in the preseason. Warm, dry and hard surfaces may be associated with higher injury occurrence[56,57], possibly due to a higher level of shoe-surface traction influence [58] and faster running speed[59]. On the contrary, wet and muddy ground is associated with lower injury occurrence[57], possibly due to changes in playing style (e.g., less high-speed tackles) and reduced shoe-surface traction. While the time of the season affects the injury occurrence, there has been no direct effect on days to RTS. Analysing these complex relationships would be difficult without the use of computers and advanced statistical modelling.
The model is more accurate for classifying injuries into 4 categories under severity (sensitivity = 0.75 – 1.00), compared to 6 categories days_rts (sensitivity = 0.24 – 0.97). This is not surprising because the increase in the number of categories challenged the model to provide a higher accuracy. The model is most accurate in classifying injuries with shorter days_rts (days below 3, sensitivity = 0.97) and least accurate in classifying mid-range injuries (days 8–14, sensitivity = 0.24). A possible explanation is that injuries with minimal days of absence have particular injury patterns, for example, they may be upper body injuries and hematoma/contusion/bruise injuries. The model is least accurate for classifying mid-range injuries (8–14 days), possibly because the model lacks information that may differentiate the prognoses. This information may be the extent of tissue damage (e.g., the sub-classification of muscle injury) and the specific location of the injury (e.g., involvement of central tendon injury). When such information becomes available, it is possible to incorporate them into the BN modelling as BNs are quite flexible in accommodating new information. However, the BN will have to be trained again with the new data to see if the model’s performance has been improved
Illustrative application of the BN
Here, we use a hypothetical case study to demonstrate the practicability of using the BN in classifying the days to RTS and the injury severity of a player. We input the player’s characteristics and anthropometrics, match information and injury information into the BN constructed earlier. The probability distribution is shown in Fig 4
A player (age: 25, weight: 77 kg, height: 185 cm, attacker) playing in the German professional football league sprained his ankle ligament in a non-contact injury and pulled out of the preseason training session. The coach would like to know when the player is available for selection.
The BN indicates the joint multivariate probability distribution: the likelihood of the injury to be minimal is 63%, mild 24%, moderate 12% and severe 1% (Fig 4). The likelihood of RTS below 3 days is 52%, 3–7 days is 33%, 7–14 days is 10%, 14–28 days is 4%, 28–60 days and more than 60 days are less than 1%, respectively. This information, in combination with clinical assessment, may be used to support coaches and players in predicting days to RTS and support team planning by assessing the number of players available for training and competition.
In Fig 4, the thickness of the arcs indicates the strength of influence. Medical staff can use the strength of influence analysis to understand the local relationships among the variables in a network and how they contribute to the posterior marginal probability (i.e., outcome) for each node in the network (Fig 4). The strength of influence calculates the average effect of changing the state of a parent node on the probability distribution of states in the child node[60]. The arcs are normalised, thus the thickest arc indicates the one with the highest influence. From Fig 4, the nature of the injury (contact or non-contact) is related to the injury type and the body region. Injury type and body region are directly associated with the days_rts. Medical staff may use days_rts in conjunction with the clinical knowledge to project the rehabilitation timeline.
Practical application and future developments
The software we used for constructing the BN, GeNIe 2.0, could be used as a decision support tool by sports practitioners, including but not limited to coaches, team managers, sports science and medical staff. Once the BN has been constructed, for example, with the use of GeNIe 2.0, the end users do not need any machine-learning knowledge or advanced computer skills to use the BN. Instead, the only input from the sports practitioners is to collect and input the players’ characteristics into the BN. The new data from every week or season can be input into the system’s database by data scientists (either in-house or third party) to ensure the model is up-to-date. As technology continues to advance, the process of constructing and utilising BN is expected to become easier and more efficient. This increased ease of use may encourage medical staff to invest their efforts in learning and adopting BNs to support their decision-making processes.
When planning for RTS for an athlete, the BN can provide a personalised risk analysis from broad epidemiological data, as exemplified in section “Illustrative application of the BN”. While the results of the BN for estimating time to RTS align with existing clinical systems and therefore some practitioners may question the need for a computational decision support tool, it is important to see the potential benefits of integrating BN into sports medical practice in future. Conventional clinical guidelines often apply universally to all individuals regardless of age, sex, sport, or level of play. Consequently, they are not intended to offer an individualised approach to RTS estimation. Given that athletes’ diverse needs and circumstances, coupled with the growing emphasis on personalised and precision medicine, a more tailored approach to estimating RTS becomes essential. In this regard, the incorporation of BN can serve as a valuable addition to sports medical practice to improve decision making.
In our example, we explained the use of BN with a RTS scenario and estimated the time to RTS based on a player’s basic characteristics. However, it is important to note that this study is a pilot for future studies, and the potential applications of BN extend beyond this specific use case. They can be leveraged for various purposes, such as forecasting expected performance or injury risk in a game by considering factors like personal characteristics, opponent playing style, and recent training and game performance. While in this study, the BN yielded similar results to existing clinical systems, its true benefits lie in its ease of use, efficiency, and the capacity to incorporate a wider range of data into the decision-making process. The BN’s ability to identify the strength of influence allows medical staff to address the variables that significantly impact outcomes, a task that cannot be adequately addressed by conventional clinical guidelines alone. By leveraging BN, medical staff may enhance their decision-making processes and provide more tailored and effective care to athletes.
Computational models are more likely to be implemented in applied sports settings if their accuracy, interpretability and functionality fit with the operation framework of a sports organisation[61]. In this case, BN provides an intuitive visualisation of the complex relationship of injuries, which medical staff may understand even with little or no experience in computer analytics. This may increase the model’s transparency and may improve the medical staff’s trust in the model[62]. In terms of practical applicability, medical staff can use the model to create scenarios that facilitate the evaluation of different management options or the development of rehabilitation protocols for players with various positions or injuries occurring at different times during the season. These scenarios enable the assessment of the combined effects of risk factors[26]. This may enable them to proactively manage injury risk, and in case of injury, make more accurate predictions regarding the days to RTS for individual athletes based on epidemiological evidence. The BN can be modified to suit the specific context as determined by the user and updated with new information when available. In summary, BNs seem promising for modelling the relationships of variables in a complex system and may be further explored to support clinical decision making.
The epidemiological data we used in this study is static; thus, we use a BN to represent the system as a time-aggregated model. However, most systems, including athletes, have been well recognised to change over time[4,63]. To capture the change over time and the feedback loops, a BN can be further extended into a dynamic Bayesian network (DBN)[64]. The DBN is an extension of the BNs and replicates the BN model at discrete points in time (time slices) and captures temporal relationships between the variables. DBNs can represent complex questions, such as how changes in rehabilitation training load affect the time to RTS in athletes with different demographic and anthropometric characteristics. Modifiable variables, such as rehabilitation training exposure, can be collected continuously over time as time-series data[65]. DBNs have been applied in injury prediction[66], medical diagnosis or prognosis [67–69] and are also a promising avenue for further investigation in subsequent injury risk modelling. Specifically, DBNs may provide medical practitioners with a tool to predict the probability of a subsequent injury to occur given time-series data, such as injury history, recovery time and players’ physiological state over time. Future work could investigate using Wu et al. (2019)’s approach to analyse scenario-based complex systems[70].
Limitations
The proposed BN model has limitations that should be considered when interpreting its results. Firstly, it is important to recognise that the model was constructed using data solely from the Bundesliga, and therefore, the generalisability of the results to other leagues or levels of play is limited. Second, the quality of the input variables directly impacts the model’s predictive accuracy. In the case of our BN, it was constructed using basic epidemiological data sourced from public databases. While this approach maximises the availability of injury events for the model, it lacks pertinent information related to prognosis, such as diagnosis (e.g., structural damage or functional disorder) and the precise location of the injury (e.g., involvement of central tendon in hamstring injury) [71]. An accurate diagnosis of an injury plays a vital role in determining prognosis[10,72]. For instance, the recovery time to RTS can vary significantly between a minor partial tear in the hamstring muscle (requiring 17 days to RTS) and a moderate partial tear (requiring a longer time of 36 days to RTS). Therefore, the lack of detailed diagnostic information can impact the model’s graphical structure and prediction accuracy.
Further studies may consider including more information into the model, such as quality and duration of rehabilitation training, RTS performance, reinjury incidence, the importance of upcoming competition, the remaining contract length, the club’s geographical regions and the players’ transfer value. Potentially, depending on the purpose of the model, we can integrate epidemiological data from other larger datasets into the model to enlarge the database to reduce the chance of overfitting and improve result transportability.
Comparing the machine-learned BN with the existing scientific understanding of injury and recovery would also be a valuable future avenue of research. This comparison can provide insights into the performance and predictive capabilities of a complex model that incorporates a wide range of variables in contrast to a simpler model, such as an epidemiological one. By doing so, researchers can evaluate the added value of BN as a decision-support tool.
Conclusion
This discrete BN provides a decision support tool to help medical staff, coaches and players manage injury. The BN has a high sensitivity, ranging from 0.73 to 1.00 in predicting severity and provides a graphical representation of the investigated interdependencies. Medical staff can use BN to understand the strength of influence of different variables on the outcome and analyse the outcome based on custom RTS scenarios. This information may help medical staff evaluate different injury scenarios and better respond to individual player’s rehabilitation and team planning. BNs seem promising for modelling the relationships of variables from multiple sources and can be further explored to support clinical decision making.
Supporting information
S1 Table. The demographics and the main playing positions of injured players.
https://doi.org/10.1371/journal.pone.0314184.s003
(PDF)
S3 Table. Strength of influence of all arcs in the model.
https://doi.org/10.1371/journal.pone.0314184.s005
(PDF)
Acknowledgments
Our Bayesian network was built using GeNIe 2.0 Modeler (BayesFusion 2019), available free of charge for academic research and teaching use from https://www.bayesfusion.com/. The authors would also like to thank Tobias Tröß and Abed Hadji for the data collection and injury database management.
References
- 1. Waldén M, Mountjoy M, McCall A, Serner A, Massey A, Tol JL, et al. Football-specific extension of the IOC consensus statement: methods for recording and reporting of epidemiological data on injury and illness in sport 2020. Br J Sports Med. 2023;57(21):1341–50. pmid:36609352
- 2. Ardern CL, Glasgow P, Schneiders A, Witvrouw E, Clarsen B, Cools A, et al. 2016 Consensus statement on return to sport from the First World Congress in Sports Physical Therapy, Bern. Br J Sports Med. 2016;50(14):853–64. pmid:27226389
- 3. Pol R, Balagué N, Ric A, Torrents C, Kiely J, Hristovski R. Training or synergizing? Complex systems principles change the understanding of sport processes. Sports Med Open. 2020;6(1):28. pmid:32661759
- 4. Yung KK, Ardern CL, Serpiello FR, Robertson S. Characteristics of complex systems in sports injury rehabilitation: examples and implications for practice. Sports Med Open. 2022;8(1):24. pmid:35192079
- 5. Balague N, Torrents C, Hristovski R, Davids K, Araújo D. Overview of complex systems in sport. J Syst Sci Complex. 2013;26(1):4–13.
- 6. Hallén A, Ekstrand J. Return to play following muscle injuries in professional footballers. J Sports Sci. 2014;32(13):1229–36. pmid:24784885
- 7. López-Valenciano A, Ruiz-Pérez I, Garcia-Gómez A, Vera-Garcia FJ, De Ste Croix M, Myer GD, et al. Epidemiology of injuries in professional football: a systematic review and meta-analysis. Br J Sports Med. 2020;54(12):711–8. pmid:31171515
- 8. Lu D, McCall A, Jones M, Kovalchik S, Steinweg J, Gelis L, et al. Injury epidemiology in Australian male professional soccer. J Sci Med Sport. 2020;23(6):574–9. pmid:32008909
- 9. Aus der Fünten K, Tröß T, Hadji A, Beaudouin F, Steendahl IB, Meyer T. Epidemiology of football injuries of the German Bundesliga: a media-based, prospective analysis over 7 consecutive seasons. Sports Med Open. 2023;9(1):20. pmid:36867257
- 10. Ekstrand J, Krutsch W, Spreco A, van Zoest W, Roberts C, Meyer T, et al. Time before return to play for the most common injuries in professional football: a 16-year follow-up of the UEFA Elite Club Injury Study. Br J Sports Med. 2020;54(7):421–6. pmid:31182429
- 11. Hecksteden A, Keller N, Zhang G, Meyer T, Hauser T. Why humble farmers may in fact grow bigger potatoes: a call for street-smart decision-making in sport. Sports Med Open. 2023;9(1):94. pmid:37837528
- 12. Shrier I. Strategic Assessment of Risk and Risk Tolerance (StARRT) framework for return-to-play decision-making. Br J Sports Med. 2015;49(20):1311–5. pmid:26036678
- 13. Miller GA. The magical number seven, plus or minus two: some limits on our capacity for processing information. Psychol Rev. 1956;63(2):81–97.
- 14. Blumenthal-Barby JS, Krieger H. Cognitive biases and heuristics in medical decision making: a critical review using a systematic search strategy. Med Decis Making. 2015;35(4):539–57. pmid:25145577
- 15. Yung KK, Ardern CL, Serpiello FR, Robertson S. A framework for clinicians to improve the decision-making process in return to sport. Sports Med Open. 2022;8(1):52. pmid:35416633
- 16. Tversky A, Kahneman D. Judgment under uncertainty: heuristics and biases. Science. 1974;185(4157):1124–31. pmid:17835457
- 17. Kawamoto K, Houlihan CA, Balas EA, Lobach DF. Improving clinical practice using clinical decision support systems: a systematic review of trials to identify features critical to success. BMJ. 2005;330(7494):765. pmid:15767266
- 18.
Han J. Data mining concepts and techniques. 3rd ed. Kamber M, Pei J, editors. Waltham, Mass.: Morgan Kaufmann Publishers; 2012.
- 19. Heckerman D, Geiger D, Chickering DM. Mach Learn. 1995;20(3):197–243.
- 20. Binney ZO, Mansournia MA. Methods matter: (mostly) avoid categorising continuous data - a practical guide. Br J Sports Med. 2024;58(5):241–3. pmid:38050063
- 21. Cartuyvels R, Spinks G, Moens M-F. Discrete and continuous representations and processing in deep learning: looking forward. AI Open. 2021;2:143–59.
- 22. Wu PP-Y, Mengersen K, McMahon K, Kendrick GA, Chartrand K, York PH, et al. Timing anthropogenic stressors to mitigate their impact on marine ecosystem resilience. Nat Commun. 2017;8(1):1263. pmid:29093493
- 23. Kim S-K, Frisby CL. Gaining from discretization of continuous data: the correspondence analysis biplot approach. Behav Res Methods. 2019;51(2):589–601. pmid:30406507
- 24.
Fenton N, Neil M. Risk assessment and decision analysis with Bayesian networks. Boca Raton: Chapman and Hall/CRC; 2018.
- 25. Eugene C. Bayesian networks without tears. AI Magazine. 1991;12(4).
- 26. Fuster-Parra P, Tauler P, Bennasar-Veny M, Ligęza A, López-González AA, Aguiló A. Bayesian network modeling: a case study of an epidemiologic system analysis of cardiovascular risk. Comput Methods Programs Biomed. 2016;126:128–42. pmid:26777431
- 27.
Howard RA. The foundations of decision analysis revisited. In: von Winterfeldt D, Miles Jr RF, Edwards W, editors. Advances in decision analysis: from foundations to applications. Cambridge: Cambridge University Press; 2007. p. 32–56.
- 28. Heckerman DE, Nathwani BN. Toward normative expert systems: Part II. Probability-based representations for efficient knowledge acquisition and inference. Methods Inf Med. 1992;31(2):106–16. pmid:1635462
- 29. Constantinou AC, Fenton N. Things to know about Bayesian networks: decisions under uncertainty, part 2. Significance. 2018;15(2):19–23.
- 30. Hecksteden A, Kellner R, Donath L. Dealing with small samples in football research. Sci Med Footb. 2022;6(3):389–97. pmid:35862155
- 31. Mengersen KL, Drovandi CC, Robert CP, Pyne DB, Gore CJ. Bayesian estimation of small effects in exercise and sports science. PLoS One. 2016;11(4):e0147311. pmid:27073897
- 32. Hecksteden A, Forster S, Egger F, Buder F, Kellner R, Meyer T. Dwarfs on the shoulders of giants: Bayesian analysis with informative priors in elite sports research and decision making. Front Sports Act Living. 2022;4:793603. pmid:35368412
- 33. Olmedilla A, Rubio VJ, Fuster-Parra P, Pujals C, García-Mas A. A Bayesian approach to sport injuries likelihood: Does player’s self-efficacy and environmental factors plays the main role? Front Psychol. 2018;9:1174. pmid:30034359
- 34.
Dimitrova L, Petkova KG, editors. Bayesian network - based causal analysis of injury risk in elite rhythmic gymnastics; 2014.
- 35. Lau CL, Mayfield HJ, Sinclair JE, Brown SJ, Waller M, Enjeti AK, et al. Risk-benefit analysis of the AstraZeneca COVID-19 vaccine in Australia using a Bayesian network modelling framework. medRxiv. 2021:2021.09.30.21264337.
- 36. El Khediri S, Thaljaoui A, Alfayez F. A novel decision-making process for COVID-19 fighting based on association rules and bayesian methods. Comput J. 2021;65(9):2360–76.
- 37. Johnson S, Mengersen K, de Waal A, Marnewick K, Cilliers D, Houser AM, et al. Modelling cheetah relocation success in southern Africa using an Iterative Bayesian Network Development Cycle. Ecol Model. 2010;221(4):641–51.
- 38. Wu PP, McMahon K, Rasheed MA, Kendrick GA, York PH, Chartrand K, et al. Managing seagrass resilience under cumulative dredging affecting light: Predicting risk using dynamic Bayesian networks. J Appl Ecol. 2017;55(3):1339–50.
- 39. Wu PP-Y, Pitchforth J, Mengersen K. A hybrid queue-based Bayesian network framework for passenger facilitation modelling. Transp Res Part C: Emerg Technol. 2014;46:247–60.
- 40. Beaudouin F, Demmerle D, Fuhr C, Tröß T, Meyer T. Head impact situations in professional football (Soccer). Sports Med Int Open. 2021;5(2):E37–44. pmid:33718592
- 41. Heus P, Damen JAAG, Pajouheshnia R, Scholten RJPM, Reitsma JB, Collins GS, et al. Uniformity in measuring adherence to reporting guidelines: the example of TRIPOD for assessing completeness of reporting of prediction model studies. BMJ Open. 2019;9(4):e025611. pmid:31023756
- 42. aus der Fünten K, Faude O, Lensch J, Meyer T. Injury characteristics in the German professional male soccer leagues after a shortened winter break. J Athl Train. 2014;49(6):786–93. pmid:25365132
- 43. Fuller CW, Ekstrand J, Junge A, Andersen TE, Bahr R, Dvorak J, et al. Consensus statement on injury definitions and data collection procedures in studies of football (soccer) injuries. Br J Sports Med. 2006;40(3):193–201. pmid:16505073
- 44.
Druzdzel MJ, editor. SMILE: Structural Modeling, Inference, and Learning Engine and GeNIe: a development environment for graphical decision-theoretic models. Aaai/Iaai; 1999.
- 45. Ekstrand J, Hägglund M, Waldén M. Injury incidence and injury patterns in professional football: the UEFA injury study. Br J Sports Med. 2011;45(7):553–8. pmid:19553225
- 46. Hägglund M, Waldén M, Ekstrand J. Risk factors for lower extremity muscle injury in professional soccer: the UEFA injury study. Am J Sports Med. 2013;41(2):327–35. pmid:23263293
- 47. Cooper GF, Herskovits E. A Bayesian method for the induction of probabilistic networks from data. Mach Learn. 1992;9(4):309–47.
- 48. Beuzen T, Marshall L, Splinter KD. A comparison of methods for discretizing continuous variables in Bayesian networks. Environ Model Softw. 2018;108:61–6.
- 49. Liu H, Hussain F, Tan CL, Dash M. Data Min Knowl Discov. 2002;6(4):393–423.
- 50. Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B: Stat Method. 1977;39(1):1–22.
- 51. Lauritzen SL. The EM algorithm for graphical association models with missing data. Comput Stat Data Anal. 1995;19(2):191–201.
- 52.
Hastie T, Tibshirani R, Friedman J. Model assessment and selection. The elements of statistical learning: data mining, inference, and prediction. New York, NY: Springer New York; 2009. p. 219–59.
- 53. Congalton RG. A review of assessing the accuracy of classifications of remotely sensed data. Remote Sens Environ. 1991;37(1):35–46.
- 54. Woods C, Hawkins R, Hulse M, Hodson A. The football association medical research programme: an audit of injuries in professional football-analysis of preseason injuries. Br J Sports Med. 2002;36(6):436–41; discussion 441. pmid:12453838
- 55. Kjærulff U, Van Der Gaag LC, editors. Making sensitivity analysis computationally efficient. Conference on Uncertainty in Artificial Intelligence. Stanford (CA); 2013.
- 56. Orchard J. Is there a relationship between ground and climatic conditions and injuries in football? Sports Med. 2002;32(7):419–32. pmid:12015804
- 57. Azubuike SO, Okojie OH. An epidemiological study of football (soccer) injuries in Benin City, Nigeria. Br J Sports Med. 2009;43(5):382–6. pmid:18927169
- 58. Thomson A, Whiteley R, Bleakley C. Higher shoe-surface interaction is associated with doubling of lower extremity injury risk in football codes: a systematic review and meta-analysis. Br J Sports Med. 2015;49(19):1245–52. pmid:26036677
- 59. Stanitski CL, McMaster JH, Ferguson RJ. Synthetic turf and grass: a comparative study. J Sports Med. 1974;2(1):22–6. pmid:4468319
- 60.
Koiter JR. Visualizing inference in Bayesian networks. Delft University of Technology; 2006.
- 61. Fernández J, editor. Decomposing the immeasurable sport: a deep learning expected possession value framework for soccer. MIT Slogan Sports Analytics Conference. Boston (MA); 2019.
- 62. Bullock GS, Hughes T, Arundale AH, Ward P, Collins GS, Kluzek S. Black box prediction methods in sports medicine deserve a red card for reckless practice: a change of tactics is needed to advance athlete care. Sports Med. 2022;52(8):1729–35. pmid:35175575
- 63. Bittencourt NFN, Meeuwisse WH, Mendonça LD, Nettel-Aguirre A, Ocarino JM, Fonseca ST. Complex systems approach for sports injuries: moving from risk factor identification to injury pattern recognition-narrative review and new concept. Br J Sports Med. 2016;50(21):1309–14. https://www.researchgate.net/publication/215721980_Dynamic_Bayesian_Networks_Representation_Inference_and_Learning pmid:27445362
- 64.
Murphy KP, Russell SJ, editors. Dynamic bayesian networks: representation, inference and learning; 2002.
- 65. Nielsen RØ, Malisoux L, Møller M, Theisen D, Parner ET. Shedding light on the etiology of sports injuries: a look behind the scenes of time-to-event analyses. J Orthop Sports Phys Ther. 2016;46(4):300–11. pmid:26954269
- 66. Peterson KD, Evans LC. Decision support system for mitigating athletic injuries. Int J Comput Sci Sport. 2019;18(1):45–63.
- 67. Sandri M, Berchialla P, Baldi I, Gregori D, De Blasi RA. Dynamic Bayesian Networks to predict sequences of organ failures in patients admitted to ICU. J Biomed Inform. 2014;48:106–13. pmid:24361388
- 68. van Gerven MAJ, Taal BG, Lucas PJF. Dynamic Bayesian networks as prognostic models for clinical patient management. J Biomed Inform. 2008;41(4):515–29. pmid:18337188
- 69. Galán SF, Aguado F, Díez FJ, Mira J. NasoNet, modeling the spread of nasopharyngeal cancer with networks of probabilistic events in discrete time. Artif Intell Med. 2002;25(3):247–64. pmid:12069762
- 70. Wu PP, Mengersen K, Caley MJ, McMahon K, Rasheed MA, Kendrick GA. Analysing the dynamics and relative influence of variables affecting ecosystem responses using functional PCA and boosted regression trees: a seagrass case study. Methods Ecol Evol. 2019;10(10):1723–33.
- 71. Mueller-Wohlfahrt H-W, Haensel L, Mithoefer K, Ekstrand J, English B, McNally S, et al. Terminology and classification of muscle injuries in sport: the Munich consensus statement. Br J Sports Med. 2013;47(6):342–50. pmid:23080315
- 72. Ekstrand J, Askling C, Magnusson H, Mithoefer K. Return to play after thigh muscle injury in elite football players: implementation and validation of the Munich muscle injury classification. Br J Sports Med. 2013;47(12):769–74. pmid:23645834