Figures
Abstract
Prior research has identified that school absences harm children’s academic achievement. However, this literature is focused on brief periods or single school years and does not consistently account for the dynamic nature of absences across multiple school years. This study examined dynamic trajectories of children’s authorised and unauthorised absences throughout their compulsory school career in England. It investigated the consequences of these absence trajectories for children’s achievement at the end of compulsory schooling. We analyse linked administrative data on children’s absences and achievement from the National Pupil Database and survey data from the Millennium Cohort Study for a representative sample of children born in 2000/2001 in England (N = 7218). We used k-means clustering for longitudinal data to identify joint authorised-unauthorised absence trajectories throughout compulsory schooling and a regression-with-residuals approach to examine the link between absence trajectories and achievement. We identified five distinct absence trajectories: (1) ‘Consistently Low Absences’, (2) ‘Consistently Moderate Authorised Absences’, (3) ‘Moderately Increasing Unauthorised Absences’, (4) ‘Strongly Increasing Unauthorised Absences’, and (5) ‘Strongly Increasing Authorised Absences’. We found substantial differences between trajectory groups in GCSE achievement, even when accounting for significant risk factors of school absences. Compared to ‘Consistently Low Absences’, ‘Strongly Increasing Unauthorised Absences’ reduced achievement by -1.23 to -1.48 standard deviations, while ‘Strongly Increasing Authorised Absences’ reduced achievement by -0.72 to -1.00 SD for our continuous outcomes. ‘Moderately Increasing Unauthorised Absences’ (-0.61 to -0.70 SD) and ‘Consistently Moderate Authorised Absences’ (-0.13 to -0.21 SD) also negatively affected achievement compared to ‘Consistently Low Absences’. Our research underscores the critical importance of examining entire trajectories of absenteeism and differentiating between types of absences to fully grasp their associations with academic outcomes and design targeted interventions accordingly.
Citation: Dräger J, Klein M, Sosu EM (2024) Trajectories of school absences across compulsory schooling and their impact on children’s academic achievement: An analysis based on linked longitudinal survey and school administrative data. PLoS ONE 19(8): e0306716. https://doi.org/10.1371/journal.pone.0306716
Editor: Leonard Moulin, National Institute of Demographic Studies: INED, FRANCE
Received: January 11, 2024; Accepted: June 17, 2024; Published: August 12, 2024
Copyright: © 2024 Dräger et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The Millennium Cohort Study (MCS) sweeps used in the paper are publicly available at the UK Data Service (https://ukdataservice.ac.uk/). "The Millennium Cohort Study: Linked Education Administrative Datasets (National Pupil Database), England: Secure Access" data (https://beta.ukdataservice.ac.uk/datacatalogue/studies/study?id=8481) used in the paper can be accessed via the UK Data Service's Secure Lab. It is the UK Data Service’s flagship secure environment providing researcher’s access to these sensitive and confidential data. Data accessed in this way cannot be downloaded. Once researchers and their projects are approved, they can analyse the data remotely from their organisational desktop, or by using their Safe Room. Access to the data for this project has been granted by the data providers and UK Data Service following an application process, including completion of project application form, ESRC research proposal, Secure Access User Agreement, ESRC Accredited Researcher Application, and Safe Researcher Training (SRT). For more information on how to apply to access con trolled data in the Secure Lab see: https://ukdataservice.ac.uk/find-data/access-conditions/secure-application-requirements/ or contact secure.applications@ukdataservice.ac.uk.
Funding: Our study is funded by the Nuffield Foundation (FR-000023241). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Many countries worldwide have reported substantial increases in school absenteeism since schools reopened after the pandemic [1–4]. In turn, there is ample evidence of the harmful consequences of school absences on children’s academic achievement [5–9], which later translates into lower educational attainment and poorer labour market outcomes [10, 11]. While this suggests that cumulative exposure to schooling over time determines a child’s achievement [12], the school attendance literature has not comprehensively addressed the dynamic and multi-dimensional nature of absences and their effects on educational outcomes. Our article addresses three shortcomings in the literature.
First, school absences are not static but change as children progress through their education. The trajectory of absence may matter for children’s achievement as it captures the extent, timing, and variability of exposure to school-based learning. However, the school absenteeism literature has not consistently examined the dynamic nature of school absences and their impact on achievement. Most studies measured absences in one year or averaged absences up to three school years [9, 13–15]. This restriction may mask important differences between students and likely underestimates the effect of absences on achievement [16]. The few studies examining absence trajectories and achievement [17–20] have primarily focused on primary and early secondary school stages. None of these studies has considered absence trajectories throughout a student’s school career.
Second, studies have increasingly emphasised the importance of analysing the reason for school absence [21–24]. Absences can be caused by authorised (e.g., sickness) or unauthorised (e.g., truancy) reasons, which can change throughout a child’s educational career and affect achievement differently. For instance, evidence from the UK context suggests that unauthorised absences are much more prevalent in later school stages [25]. Although all reasons for absences harm children’s achievement [24], unauthorised absences are more detrimental to school performance than authorised absences [21, 22]. However, previous studies on absence trajectories and achievement have focused on absences as a whole [17, 20] and thus have yet to determine the intersecting role of absence reason and temporal variation of absences when analysing achievement. Examining the joint trajectories of authorised and unauthorised absences on achievement is crucial, given that they likely interact and vary in frequency over time.
Lastly, most of the existing studies on absence trajectories are based on regionally restricted samples in the US context [16, 17, 19, 26, 27], and there is limited research on the dynamics of absence trajectories from other contexts. This is significant because educational policies differ across countries, and we do not know yet whether there are more similarities or differences in the trajectories and consequences of school absences across countries. In addition, if studies link absence trajectories to achievement, they are unable to account for relevant baseline confounders of this association or fail to adequately account for time-varying confounders such as student behaviour, which may confound absences at a later stage but are influenced by absences at an earlier stage.
Our study addresses these gaps in the literature by identifying the joint trajectories of authorised and unauthorised absences throughout the compulsory school career using linked administrative data on absences and standardised achievement tests from the National Pupil Database (NPD) and survey data from the Millennium Cohort Study (MCS) for England. We use k-means clustering for longitudinal data (KML) to identify these typical school absence trajectories [28, 29]. In addition, we use a regression-with-residuals approach (RWR) and a rich set of confounders to analyse the extent to which the identified trajectories impact achievement at the end of compulsory schooling [30]. This novel approach allows us to appropriately account for time-varying confounders, such as student behaviour, without introducing overcontrol and collider bias. We ask the following research questions:
- What joint authorised-unauthorised absence trajectories emerge across compulsory school careers?
- To what extent do these absence trajectories affect achievement?
Absence trajectories and achievement
The Faucet theory suggests that students improve their skills through frequent exposure to schooling, and they cease making educational gains when their exposure to school is cut off [31]. Consequently, students who receive fewer hours of instruction during the school year are disadvantaged in their learning, receive lower grades, perform worse in exams, and are more likely to drop out of school [9]. This argument aligns well with empirical evidence demonstrating a link between classroom instruction time and academic achievement [32, 33]. In addition, students frequently absent from school may feel less connected to their classmates and struggle to participate in classroom activities and interactions with teachers and peers, which is detrimental to their academic development [34].
While there is abundant evidence on the negative consequences of school absences on children’s school achievement [5, 7–9, 35], these studies did not consider that absences may be differently associated with achievement depending on their temporal sequencing and their reasons over time. The pattern of absences during primary and secondary school may vary over time and for different students across the school life span. Their detrimental effects may not manifest until after prolonged exposure [12]. While the literature on school absences focuses on measures of chronic absenteeism (typically 10% or more days) over a given school year, it does not account for the persistence of absences over multiple school years. For instance, absences from school in a single year may not be as disruptive to children’s learning as absences in multiple years. Therefore, snapshot measures may underestimate the cumulative effect of school absences on later achievement. This requires a holistic measurement of school absences via clustering of individual trajectories.
Studies examining whether students vary in their absence trajectories during specific school stages (e.g., kindergarten to elementary; middle; or high school) have commonly found between four and seven clusters of absence trajectories for US students [18–20, 26]. However, only one study investigated the relationship between absence trajectories in elementary school and achievement [20]. Using data from the US Early Childhood Longitudinal Study–Kindergarten Class of 1998–1999 (ECLS-K), they identified four latent absence trajectories between kindergarten and fifth grade: a low absence trajectory (46%), a decreasing absence trajectory (24%), an increasing absence trajectory (22%) and a high absence trajectory (8%). Students in the low absence group performed the best in math and reading, while those in the high absence group performed the worst. No statistically significant difference existed between students with increasing and decreasing absence trajectories.
Absences may have varying consequences for achievement depending on the reason for absence. Teachers may view unauthorised absences negatively, resulting in increased student-teacher conflict, decreased closeness, and increased teacher irritation and frustration towards students who miss school without authorisation [36, 37]. As a result, teachers may be less willing to support students in catching up on missed lessons if they have taken unauthorised absences from school. Unauthorised absences are also linked to problem behaviours such as alcohol and substance abuse [38, 39] or crime and delinquency [40]. These behaviours may influence students’ motivations to learn and can exacerbate the negative impact of absences on achievement [41]. On the other hand, if students are absent from school for authorised reasons, they may be more motivated to make up for lost time and engage with missed lesson content. Teachers and parents may also be more willing to assist them in catching up on lesson content.
Some studies examining the impact of various reasons for absence have found that unauthorised absences were more harmful to achievement than authorised absences [5, 21, 22, 42]. A study on secondary pupils in Scotland found that excused absences due to sickness and exceptional domestic circumstances (e.g., bereavement) were just as damaging to achievement as unauthorised absences [24]. This may suggest that students’ learning suffers if the excused reason prevents them from focusing on the missed lesson content. For example, authorised absences due to sickness may indicate health issues that negatively impact learning and achievement in the longer term [43].
Therefore, the temporal dynamics of authorised and unauthorised absences must be considered concurrently. However, previous studies have explored trajectories for overall absences [20, 26] or unauthorised absences such as truancy [19]. Only one study has examined the joint trajectory of authorised and unauthorised absences over a single high school year, assuming a single trajectory for all students [16]. They found an increase in unauthorised absences and a decrease in authorised absences across the school year. It remains to be seen whether multiple joint trajectories exist for distinct student groups across the school life span and the extent to which these trajectories are associated with achievement.
We advance this literature in several meaningful ways. First, we examine latent absence trajectories throughout students’ entire school careers and study their impact on achievement at the end of compulsory schooling. Second, beyond the modelling of overall absences, we investigate profiles of joint trajectories of authorised and unauthorised absences and their consequences. Third, in contrast to previous research, our linked school administrative and survey data allow us to control for a comprehensive set of risk factors of school absences [44] and achievement, including time-varying confounders such as early cognitive ability, student behaviour and attitude towards school.
Data and methods
Data
For the analysis, we used data from the Millennium Cohort Study (MCS), a large-scale longitudinal study of children born in 2000 or 2001 and living in the UK (England, Northern Ireland, Scotland, and Wales) [45]. In total, 19,244 families (12,387 in England) were recruited and first surveyed when children were nine months old. Follow-up assessments took place at age 3 (sweep 2), age 5 (sweep 3), age 7 (sweep 4), age 11 (sweep 5), age 14 (sweep 6), and age 17 (sweep 7). These data are linked with the National Pupil Database (NPD), a register dataset of all students in state schools in England [46]. We received ethical approval for the study from the University of Strathclyde ethics committee in written form.
All MCS participants residing in England during sweeps 3–5 (N = 9,047) were asked for consent to link their data to the NPD. Consent was granted from 8,489 participants, and 8,438 were successfully matched to the NPD database. We restricted our analysis sample to participants who agreed to data linkage and were linked in sweep 4 (N = 8,206), which enabled us to use MCS survey weights. Moreover, we excluded all students for whom we lacked information on absences for a full academic year or achievement measures. This resulted in N = 7,218 cases for the analysis (see diagram on sample restrictions in S1 File).
We combined MCS weights and inverse probability weights [47] to correct for the non-random inclusion of students in our sample. Specifically, we multiplied MCS weights for participation in England in sweep 4 with the inverse of the probability that participants gave consent to data linkage, have been successfully linked, and have complete absence and achievement data. We estimated the probability of being included in our analytic sample with a logistic regression using socio-demographic, child, and family characteristics as our predictors (see S2 File). By weighting the analysis this way, we created a pseudo-population with the same characteristics as the initial MCS sample of children living in England.
To address item non-response on control variables, we imputed missing values on covariates using multiple imputation based on Categorization and Regression Trees [CART, 48]. CART is a nonparametric recursive algorithm that creates groups with maximum intragroup and minimum intergroup homogeneity using binary splits. The advantage of using CART for multiple imputation is that the algorithm finds the best predictors of missing data from all potential covariates, including non-linear patterns and interactions. We created 20 imputed datasets and applied Rubin’s rules to obtain standard errors [49].
Variables
School absences.
School attendance policy in England is guided by the Education Act of 1996, which established the legal framework for school attendance. This legislation made parents or guardians legally responsible for ensuring their children’s regular attendance at school from age five until the end of the academic year in which they turn sixteen. During our observation period, the UK government set school attendance targets and implemented truancy-reduction policies. Local education authorities and schools were responsible for monitoring attendance and taking appropriate action when attendance fell below acceptable levels. This includes issuing warnings, fines, or taking legal action against parents who do not ensure their children’s regular attendance at school.
In the comprehensive school system of England, students attend primary school for six years (key stages 1 and 2) from ages 5/6 and compulsory secondary school for five years (key stages 3 and 4) from ages 11/12. Accordingly, the NPD contains information regarding the number of possible school days, the number of days missed due to authorised absences, and the number missed due to unauthorised absences in the autumn, spring, and first half of the summer term for each year of compulsory schooling, from year 1 (ages 5 to 6) to year 11 (ages 15 to 16). After the 2012/13 academic year, data on absences for the second half of the summer term were also collected [50]. To maintain consistency in measuring student absences over time and since we are interested in modelling absence trajectories across school years, we combined data from the fall, spring, and summer terms into annual absence data. Students could have attended about 1,700 school days in this period.
Authorised absences are those with permission from a teacher or other authorised school representatives, which is only granted if a satisfactory explanation for the absence, such as illness, has been provided. Unauthorised absences are absences for which the school has not granted permission. We calculated the percentage of days missed due to authorised or unauthorised absences because the number of possible days varies between years and between students within the same year. Average total, authorised, and unauthorised absences per year, correlations over time, and correlations between authorised and unauthorised absences are presented in the S3 File. Average authorised absences fall slightly from 4.9% in year one to 3.4% in year six, then rise to 4.5% in year eleven. On the other hand, unauthorised absences remain stable at around 0.5% in years 1–7 before rising to 2.0% in year 11. Over the school career, the association between authorised and unauthorised absences was small (r = 0.08 to 0.20), indicating that these two types of absences are relatively distinct.
Academic achievement.
The dependent variable is student achievement upon completion of compulsory schooling. Students sit for their GCSE (General Certificate of Secondary Education) examinations at the end of year 11 (key stage 4), which are consequential for future education and labour market outcomes [51]. GCSE qualifications are offered in distinct subject areas. English and mathematics are considered "core subjects" that are compulsory. Although the precise number differs among individuals, it is customary for students to undertake a minimum of five GCSEs. A student is allocated a point value ranging from 1 (indicating the lowest) to 9 (representing the highest grade). For our analysis, we consider four different outcomes.
- ‘Attainment 8’ score (Mean = 47.3, SD = 19.1, Range 0–90), combining scores from eight best-performing GCSE subjects, including English, math, and six additional exam results from GCSE-level qualifications. English and math are counted twice. Three GCSE subjects must come from qualifications that count towards the English Baccalaureate (EBacc), such as sciences, languages, or history;
- GCSE Math attainment (Mean = 10.1, SD = 4.2, Range 0–18);
- GCSE English attainment (Mean = 9.2, SD = 4.3, Range 0–18);
- Whether students passed five or more GCSEs, including Math and English. Students passed a GCSE exam if they attained a point score of more than or equal to four. 59% of pupils in our sample passed five or more GCSEs.
For the analysis, we standardised the ‘Attainment 8’ score and GCSE English and math scores to have a mean of zero and a standard deviation of one in the weighted sample.
Covariates.
The MCS enabled us to account for risk factors of school absenteeism identified in the literature [44], which also influence student achievement. We included multiple measures of students’ socio-economic background (parental education, parental occupational class, household income, housing tenure, neighbourhood deprivation) and child and family demographics (gender, date of birth, ethnicity, family structure, household size, number of children, region). Moreover, we adjusted for students’ and parents’ attitudes towards school and educational aspirations, child behaviour problems (externalizing and internalizing), parental involvement in school, child health, birth conditions (e.g., birthweight, smoking during pregnancy), disruptive events in the student’s life (e.g., residential mobility and school changes), whether the family pays tuition fees for the school, and which ability stream and set the child attends. Finally, we considered children’s cognitive ability and performance in standardised achievement tests at the end of the second and sixth school years. We measured cognitive ability, educational motivation and aspirations, behaviour problems, parental involvement, ability streams and set, and school changes at multiple times points. A description of the timing of the measurement of covariates is shown in the S4 File, the measurement of latent covariates, such as attitudes towards school, is described in the S5 File, and the distribution of all covariates is shown in the S6 File.
Methods
Identifying absence trajectories
We applied k-means for longitudinal data (KML) to identify clusters of students with similar joint trajectories on authorised and unauthorised absences from years 1 to 11, using the R package kml3d [28, 29]. The optimal number of clusters is unknown and cannot be deduced from current research. Existing research has identified between four and seven distinct absence trajectory clusters based solely on total or unauthorised absences [19, 20, 26]. We tested cluster solutions with two to eight clusters and opted for an optimal cluster solution based on fit indices, predictive validity, the number of observations in the smallest cluster, and interpretability. For more information on identifying the optimal cluster solution, see S7 File.
Estimating effects of absence trajectories on achievement
To model the effect of absence trajectories on achievement, we accounted for important risk factors of school absenteeism [44], including time-varying covariates, such as early cognitive ability, educational motivation, and behavioural issues. However, some of these time-varying covariates may be both consequences of earlier absences and confounders for the effect of later absences on achievement [52]. Consequently, controlling for these time-varying confounders may remove some of the effects of earlier absences and result in overcontrol and collider bias. This issue is referred to as confounder feedback bias [47].
To address this problem, we employed a regression-with-residuals approach [RWR, 30]. In the first step, we regress each time-varying covariate on baseline covariates, all earlier measures of absences and all earlier measures of time-varying covariates, and obtain the residuals of these regressions. In the second step, we regress achievement on absence trajectories, baseline confounders, and the residualised time-varying confounders. The residualised risk factors that could be both the consequence of earlier absences and the cause of later absences are indicated by an “R” in S3 Table in S3 File (cognitive abilities, attitudes towards school, child behaviour, parental involvement, and school characteristics), while baseline risk factors which are not affected by absences are indicated by a “B” in S3 Table in S3 File.
We used linear regression models for Attainment 8, English and math attainment, and a linear probability model for obtaining five or more GCSEs. Under the assumption that there is no unmeasured confounding, positivity, and correct model specification, the RWR estimates for absence trajectories on our achievement outcomes can be interpreted as average causal effects in the English pupil population covered by the MCS. Given that our covariate set contains longitudinal and high-quality measures of risk factors for school absences, we believe these assumptions are plausible.
Results
Clusters of absence trajectories
Based on fit indices, predictive validity, and the number of observations in the smallest cluster, we identified five joint authorised-unauthorised absence trajectories. First, the five clusters resulted in the highest Calinksi-Harabasz index [29]. Second, it significantly increases the variance in achievement explained compared to two or three cluster solutions and nearly explains as much variance as seven or eight cluster solutions. Finally, the smallest cluster in the five-cluster solution is nearly twice the sample size of the smallest cluster in the six-cluster solution. The S7 File explains in detail why we prefer the solution with five clusters over other solutions. Fig 1 depicts the mean trajectory of authorised and unauthorised absences from Years 1 to 11 for the different clusters. The S8 File provides information on the exact percentages of absences each year and averaged across the 11 years for each identified cluster.
Note. Linked MCS-NPD data, N = 7,218, weighted. CLA = Consistently Low Absence, CMAA = Consistently Moderate Authorised Absences, MIUA = Moderately Increasing Unauthorised Absences, SIAA = Strongly Increasing Authorised Absences, SIUA = Strongly Increasing Unauthorised Absences. Mean absences above 40% are truncated.
About two-thirds of students fall into a cluster characterised as a Consistently Low Absence (CLA) trajectory. On average, across 11 school years, students in this cluster had 2.6% authorised absences and 0.3% unauthorised absences (see S8 File).
The second largest cluster contains about 28% of students and is characterised by Consistently Moderate Authorised Absences (CMAA). In most years, authorised absences in this cluster ranged between 6% and 8%. Additionally, students had, on average, 1.0% unauthorised absences.
In addition to these large clusters, we found three smaller clusters of increasing absences. 3.5% of students belong to a cluster with Moderately Increasing Unauthorised Absences (MIUA). These students had comparatively high unauthorised absences in years 1 to 6, ranging between 2% and 3%, which increased to 17% in year 11. Moreover, students in this cluster consistently had 7% to 8% of authorised absences.
The penultimate cluster is characterised by Strongly Increasing Authorised Absences (SIAA), including 1.62% of the student sample. Authorised absences in this cluster increased from around 7% in primary school to more than 30% in year 11. Additionally, unauthorised absences increased in secondary school but remained below 6% even in the final years.
Students in the final cluster, Strongly Increasing Unauthorised Absences (SIUA), show moderately low unauthorised absences in primary school but have a much stronger increase in unauthorised absences in secondary school. The 0.82% of students in this cluster had more than 10% unauthorised absences in year 8 and more than 60% unauthorised absences in year 11.
Absence trajectories and achievement
After adjusting for school absenteeism risk factors, Fig 2 depicts the differences in achievement by absence trajectory compared to the Consistently Low Absence (CLA) trajectory. For our binary outcome of five or more GCSE passes, differences are presented as percentage points. In contrast, effect sizes for continuous outcomes (Attainment 8 score, English GSCE score, and Math GCSE score) are presented as standard deviations. Bivariate statistics are presented in the S9 File, while the complete regression tables are presented in the S10 File.
Note. Linked MCS-NPD data, N = 7,218, weighted. Reference category: Consistently Low Absence (CLA). CMAA = Consistently Moderate Authorised Absences, MIUA = Moderately Increasing Unauthorised Absences, SIAA = Strongly Increasing Authorised Absences, SIUA = Strongly Increasing Unauthorised Absences. Thick vertical lines indicate the 84%-Confidence Interval, and thin vertical lines the 95%-Confidence Interval.
Even after controlling for all risk factors for school absences, large differences in student achievement persist across absence trajectories. For all four achievement outcomes, students with moderate or increasing absence trajectories perform worse than those with the Consistently Low Absence trajectory. The negative impact on achievement increases from Consistently Moderate Authorised Absences, Moderately Increasing Unauthorised Absences, Strongly Increasing Authorised Absences to its most severe form, Strongly Increasing Unauthorised Absences.
A Strongly Increasing Unauthorised Absence (SIUA) trajectory decreases students’ probability of obtaining 5 GCSEs by 33.1 percentage points (95%-Confidence interval [CI]: 24.9–41.3). For students with a Strongly Increasing Authorised Absence (SIAA) trajectory, the likelihood is lowered to 28.6 percentage points (95%-CI: 20.8–36.4); for students with a Moderately Increasing Unauthorised Absence (MIUA), it is 27.1 percentage points (95%-CI: 22.4–31.8); and for students with a Consistently Moderate Authorised Absence (CMAA) trajectory it is 8.7 percentage points (95%-CI: 6.3–11.1). All effects are statistically significant at the 5% level. However, the effect of SIUA, SIAA, and MIUA on obtaining five or more GCSEs was similar (as indicated by the 84%-confidence intervals).
When examining the effects of trajectories on Attainment 8 outcomes, students with a Strongly Increasing Unauthorised Absence (SIUA) trajectory have an Attainment 8 score that is 1.48 standard deviations (95%-CI: 1.29–1.67) lower than students with a Consistently Low Absence Trajectory (CLA). A Strongly Increasing Authorised Absence (SIAA) trajectory leads to an Attainment 8 score of 1.00 standard deviations lower (95%-CI: 0.84–1.17). The lower attainment among students with a Moderately Increasing Unauthorised Absence (MIUA) amounts to 0.70 standard deviations (95%-CI: 0.60–0.81). In comparison, it is 0.21 standard deviations (95%-CI: 0.17–0.24) for students with a Consistently Moderate Authorised Absence (CMAA) trajectory. All effects are statistically significant at the 5% level. The effects of each trajectory on Attainment 8 scores were significantly different from one another (as all 84%-confidence intervals do not overlap). The findings for GCSE English and GCSE Math mostly mirror the Attainment 8 results, albeit the difference between SIAA and MIUA in English and Math scores is not statistically significant.
Discussion
The dynamic nature of absences throughout children’s schooling has not been consistently addressed in the prior literature on school absenteeism. Predominantly in the early stages of children’s education, numerous studies have examined absences within a single school year or over a brief period. Moreover, the distinction between authorised and unauthorised absence reasons has not been modelled dynamically in prior studies. As unauthorised absences become more prevalent throughout children’s schooling, absence trajectories cannot be evaluated in isolation from the type of absence. Our study contributes to the literature by identifying joint authorised and unauthorised absence trajectories throughout a child’s compulsory school career for a representative sample of children born in 2000/2001 in England and examining their consequences for academic achievement at the end of compulsory secondary schooling.
Using k-means longitudinal clustering, we found five distinct absence trajectories, each characterised by different levels and patterns of authorised and unauthorised absences. Two-thirds of students fell into a cluster characterised by Consistently Low Absences. More than a quarter of students fell into a cluster that exhibited Consistently Moderate Authorised Absences from years 1 to 11. Additionally, we found three smaller clusters with increasing levels of authorised and unauthorised absences: Moderately Increasing Unauthorised Absences, Strongly Increasing Unauthorised Absences, and Strongly Increasing Authorised Absences. In contrast to previous research [19, 20, 26], we did not identify a cluster of significantly declining absences over time. However, these previous studies did not investigate absence trajectories over primary and secondary school careers. In addition, unlike previous studies, we examined the joint trajectory of authorised and unauthorised absences.
Furthermore, our study investigated the consequences of these absence trajectories for achievement, accounting for various risk factors associated with school absenteeism. Our findings revealed significant differences in student achievement across absence trajectories. Compared to students with a Consistently Low Absence trajectory, students with a Consistently Moderate Authorised Absence trajectory face a significant achievement disadvantage of 0.13–0.21 standard deviations, and students in the three increasing absence trajectories face a huge achievement disadvantage of 0.61–1.48 standard deviations. However, there is also substantial heterogeneity in student achievement in the three increasing trajectories, with students in the Moderately Increasing Unauthorised Absence trajectory being least disadvantaged and students with a Strongly Increasing Unauthorised Absence trajectory being most disadvantaged.
Our findings show that both timing of absences and cumulative exposure matter. Consistent attendance over multiple years ensures students receive the necessary instruction and support to master new concepts and skills. In addition, it provides opportunities for students to develop long-lasting positive peer networks and relationships with teachers. Any transition to more frequent absences, whether unauthorised or authorised, derails these gains, and has long-term consequences for academic performance. The vast differences in achievement between pupils in different absence trajectories underscore the importance of modelling absences in a holistic and dynamic way. Relying on snapshot measures or limited periods underestimates the severity of the detrimental consequences of absences for achievement. The findings, therefore, emphasise the importance of examining entire trajectories of absenteeism and its associations with academic outcomes. They further stress the importance of distinguishing the reason for absence because trajectories of unauthorised vs. authorised absences have different negative effects on achievement.
The findings also question the predominant focus on “chronic” or “persistent” absenteeism, often defined as students missing more than 10% of classes in a certain period. Neither students in the Consistently Low Absence trajectory nor students in the Consistently Moderate Authorised Absence trajectory break the threshold of 10% absences per year. However, over the 11 years, their absences accumulate and substantially impact achievement. Considering that students could have attended about 1,700 schooldays, students in the Consistently Moderate Authorised Absence trajectory missed about 80 schooldays more than students in the Consistently Low Absence trajectory. These important differences are missed when focusing on only chronic absenteeism. Likewise, students in all three increasing absence trajectories have, on average, absences above 10% per year throughout secondary school but differ drastically in achievement.
Our study has significant limitations. First, this study describes the absence trajectories of students born in 2000 and 2001 from 2006 to 2017. The absence trajectories of more recent cohorts of students affected by school closures during the Covid-19 pandemic may differ and have a different impact on academic achievement. Second, because information on private schools is not included in the NPD data, our findings cannot be generalised to the entire population of pupils in England. Third, while we have accounted for many significant confounders of the relationship between absence trajectories and achievement, we cannot rule out the possibility of unobserved heterogeneity. As a result, we must proceed with caution when drawing causal conclusions from our findings. Finally, absence trajectories and their impact on achievement may differ by sociodemographic groups, such as gender. Future research should investigate this possibility, particularly if population-level data is available to ensure sufficient statistical power.
Despite these limitations, our research has significant educational policy and practice implications. Since schools reopened following the Covid-19 pandemic, there has been an alarming increase in school absenteeism rates in the UK and globally. For example, England’s absence rate rose from 4.3% in the autumn of 2018 to 7.5% in 2022 [3]. While our findings apply to a pre-pandemic cohort, identifying typical absence trajectories in this study lays the groundwork for understanding how specific absenteeism patterns may evolve in cohorts after the pandemic. Furthermore, it emphasises the likely extent of the negative consequences of moderate and escalating absence trajectories, whether authorised or unauthorised, on future achievement in a post-pandemic world if not addressed by policy and practice.
The current emphasis on minimum attendance thresholds and penalties for non-compliance in policy frameworks may not effectively address the diverse absence trajectories that our study has uncovered. Various absence trajectories from consistently low to strongly increasing absences suggest that a more nuanced and individualised approach to addressing absenteeism post-pandemic is necessary. Tailored interventions are required depending on specific patterns of absences. For instance, while moderate levels of absences might require supporting and encouraging attendance, patterns of increasing absences will require interventions that address root causes, such as underlying health conditions. Furthermore, implementing policy measures for the early detection of attendance issues is critical, allowing for timely intervention and support to prevent further disruptions to students’ education after the pandemic. Collaboration among schools, policymakers, and educators is important to proactively identify and help at-risk students, with the goal of preventing any decline in their academic progress.
Our findings also highlight the importance of developing the capacity of schools to analyse and identify school attendance patterns. This will enable them to implement targeted interventions to address the needs of those with specific patterns of absences. For instance, practitioners will benefit from considering the specific reasons behind different patterns of absenteeism and developing flexible and responsive policies that consider each student’s unique circumstances. Finally, the findings underscore the importance of a proactive and preventive approach to absenteeism rather than a reactive one. Educators, parents, and policymakers must collaborate to create support systems and resources to help students overcome challenges before absenteeism becomes a significant issue. This will involve identifying root causes of absenteeism, such as health issues, family circumstances, or academic challenges, and developing targeted interventions to address these underlying issues.
In conclusion, our research on school absenteeism underscores the need for a nuanced and dynamic approach to educational policy and practice. By identifying diverse absence trajectories throughout a child’s compulsory school career and revealing significant consequences for academic achievement, we emphasise the importance of examining patterns of attendance rather than the predominant focus on attendance thresholds and penalties. Our findings highlight the importance of early intervention and developing the capacity of schools to analyse and identify patterns of absences. It also stresses the need for a collaborative approach among educators, parents, and policymakers to address the varied reasons behind absenteeism and develop tailored interventions based on each student’s unique circumstances to provide a supportive and responsive educational environment for all learners.
Supporting information
S4 File. Timing of measurement of covariates.
https://doi.org/10.1371/journal.pone.0306716.s004
(DOCX)
S6 File. Distribution of achievement and covariates.
https://doi.org/10.1371/journal.pone.0306716.s006
(DOCX)
S8 File. Yearly mean absences in preferred cluster solution.
https://doi.org/10.1371/journal.pone.0306716.s008
(DOCX)
S9 File. Bivariate associations between absence trajectories and achievement.
https://doi.org/10.1371/journal.pone.0306716.s009
(DOCX)
Acknowledgments
We are grateful to the advisory group of our Nuffield Foundation project (FR-000023241) for their comments on an earlier version of this manuscript as well as to the anonymous reviewers. We are also grateful to the Centre for Longitudinal Studies (CLS), UCL Social Research Institute, for the use of these data and to the UK Data Service for making them available. However, neither CLS nor the UK Data Service bear any responsibility for the analysis or interpretation of these data.
References
- 1.
Australian Curriculum, Assessment and Reporting Authority. National Report on Schooling in Australia 2022 [Internet]. 2022. Available from: https://dataandreporting.blob.core.windows.net/anrdataportal/ANR-Documents/ANR2022/nationalreportonschoolinginaustralia_2022_Chapter4.pdf#page=3
- 2. Dee TS. Higher Chronic Absenteeism Threatens Academic Recovery from the COVID-19 Pandemic. Proc Natl Acad Sci USA. 2024 Jan;121(3):e2312249121. pmid:38194454
- 3.
Department for Education. Pupil absence in schools in England. Autumn term 2022/23 [Internet]. 2023. Available from: https://explore-education-statistics.service.gov.uk/find-statistics/pupil-absence-in-schools-in-england#dataBlock-c215103a-7e6c-4870-acf4-d5861f3bba3e-tables
- 4.
Tusla Education Support Service. School Attendance Data Primary and Post-Primary Schools And Student Absence Reports Primary and Post-Primary Schools. 2019–2022 [Internet]. 2023. Available from: https://www.tusla.ie/uploads/content/AAR_SAR_2019_22.pdf
- 5. Aucejo EM, Romano TF. Assessing the effect of school days and absences on test score performance. Econ Educ Rev. 2016 Dec;55:70–87.
- 6. Gottfried MA. Evaluating the Relationship Between Student Attendance and Achievement in Urban Elementary and Middle Schools: An Instrumental Variables Approach. Am Educ Res J. 2010 Jun;47(2):434–65.
- 7. Gottfried MA, Kirksey JJ. “When” Students Miss School: The Role of Timing of Absenteeism on Students’ Test Performance. Educ Res. 2017 Apr;46(3):119–30.
- 8. Kirksey JJ. Academic Harms of Missing High School and the Accuracy of Current Policy Thresholds: Analysis of Preregistered Administrative Data From a California School District. AERA Open. 2019 Jul;5(3):233285841986769.
- 9. Morrissey TW, Hutchison L, Winsler A. Family income, school attendance, and academic achievement in elementary school. Dev Psychol. 2014 Mar;50(3):741–53. pmid:23914750
- 10. Liu J, Lee M, Gershenson S. The short- and long-run impacts of secondary school absences. J Public Econ. 2021 Jul;199:104441.
- 11. Cattan S, Kamhöfer DA, Karlsson M, Nilsson T. The long-term effects of student absence: Evidence from Sweden. The Economic Journal. 2023 Feb;133(650):888–903.
- 12. Bronfenbrenner U, Morris P. The bioecological model of human development. Handb Child Psychol Vol 1 Theor Models Hum Dev. 2006;793–828.
- 13. Gottfried MA. Chronic Absenteeism and Its Effects on Students’ Academic and Socioemotional Outcomes. J Educ Stud Placed Risk JESPAR. 2014 Apr 3;19(2):53–75.
- 14. Ready DD. Socioeconomic Disadvantage, School Attendance, and Early Cognitive Development: The Differential Effects of School Exposure. Sociol Educ. 2010 Oct;83(4):271–86.
- 15. Smerillo NE, Reynolds AJ, Temple JA, Ou SR. Chronic absence, eighth-grade achievement, and high school attainment in the Chicago Longitudinal Study. J Sch Psychol. 2018 Apr;67:163–78. pmid:29571532
- 16. Liu J, Lee M. Beyond Chronic Absenteeism: The Dynamics and Disparities of Class Absences in Secondary School. 2022; Available from: https://docs.iza.org/dp15664.pdf
- 17. Anderson S, Romm K. Absenteeism across the Early Elementary Grades: The Role of Time, Gender, and Socioeconomic Status. Elem Sch J. 2020 Dec 1;121(2):179–96.
- 18. Chen CC, Culhane DP, Metraux S, Park JM, Venable JC. The Heterogeneity of Truancy among Urban Middle School Students: A Latent Class Growth Analysis. J Child Fam Stud. 2016;25(4):1066–75.
- 19. Schoeneberger JA. Longitudinal Attendance Patterns: Developing High School Dropouts. Clear House J Educ Strateg Issues Ideas. 2012 Jan;85(1):7–14.
- 20. Simon O, Nylund-Gibson K, Gottfried M, Mireles-Rios R. Elementary absenteeism over time: A latent class growth analysis predicting fifth and eighth grade outcomes. Learn Individ Differ. 2020 Feb;78:101822.
- 21. Gershenson S, Jacknowitz A, Brannegan A. Are Student Absences Worth the Worry in U.S. Primary Schools? Educ Finance Policy. 2017 Apr;12(2):137–65.
- 22. Gottfried MA. Excused Versus Unexcused: How Student Absences in Elementary School Affect Academic Achievement. Educ Eval Policy Anal. 2009 Dec;31(4):392–415.
- 23. Hancock KJ, Gottfried MA, Zubrick SR. Does the reason matter? How student-reported reasons for school absence contribute to differences in achievement outcomes among 14–15 year olds. Br Educ Res J. 2018 Feb;44(1):141–74.
- 24. Klein M, Sosu EM, Dare S. School Absenteeism and Academic Achievement: Does the Reason for Absence Matter? AERA Open. 2022 Jan;8:233285842110711.
- 25.
Department for Education. A profile of pupil absence in England. 2011. (Research Report). Report No.: DFE-RR171.
- 26. Benner AD, Wang Y. Shifting attendance trajectories from middle to high school: Influences of school transitions and changing school contexts. Dev Psychol. 2014 Apr;50(4):1288–301. pmid:24364827
- 27. London R, Sanchez M, Castrechini S. The Dynamics of Chronic Absence and Student Achievement. Educ Policy Anal Arch. 2016;24(112).
- 28. Genolini C, Pingault JB, Driss T, Côté S, Tremblay RE, Vitaro F, et al. KmL3D: A non-parametric algorithm for clustering joint trajectories. Comput Methods Programs Biomed. 2013 Jan;109(1):104–11. pmid:23127283
- 29. Genolini C, Alacoque X, Sentenac M, Arnaud C. kml and kml3: R Packages to Cluster Longitudinal Data. J Stat Softw [Internet]. 2015 [cited 2022 Dec 9];65(4). Available from: http://www.jstatsoft.org/v65/i04/
- 30. Wodtke GT, Almirall D. Estimating Moderated Causal Effects with Time-varying Treatments and Time-varying Moderators: Structural Nested Mean Models and Regression with Residuals. Sociol Methodol. 2017 Aug;47(1):212–45. pmid:29391654
- 31. Entwisle DR, Alexander KL, Olson LS. Keeping the Faucet Flowing: Summer Learning and the Home Environment. Am Educ. 2001;25:10–5.
- 32. Fitzpatrick MD, Grissmer D, Hastedt S. What a difference a day makes: Estimating daily learning gains during kindergarten and first grade using a natural experiment. Econ Educ Rev. 2011;30(2):269–79.
- 33. Marcotte DE, Hemelt SW. Unscheduled School Closings and Student Performance. Educ Finance Policy. 2008;3(3):316–38.
- 34. Korpershoek H, Canrinus ET, Fokkens-Bruinsma M, de Boer H. The relationships between school belonging and students’ motivational, social-emotional, behavioural, and academic outcomes in secondary education: a meta-analytic review. Res Pap Educ. 2020;35(6):641–80.
- 35. Gottfried MA. The Detrimental Effects of Missing School: Evidence from Urban Siblings. Am J Educ. 2011 Feb;117(2):147–82.
- 36. Roorda DL, Koomen HM. Student–teacher relationships and students’ externalizing and internalizing behaviors: A cross-lagged study in secondary education. Child Dev. 2021;92(1):174–88. pmid:32700777
- 37. Wilson V, Malcolm H, Edward S, Davidson J. ‘Bunking off’: The impact of truancy on pupils and teachers. Br Educ Res J. 2008;34(1):1–17.
- 38. Eaton DK, Brener N, Kann LK. Associations of Health Risk Behaviors With School Absenteeism. Does Having Permission for the Absence Make a Difference? J Sch Health. 2008 Apr;78(4):223–9. pmid:18336682
- 39. Hallfors D, Vevea JL, Iritani B, Cho H, Khatapoush S, Saxe L. Truancy, Grade Point Average, and Sexual Activity: A Meta-Analysis of Risk Indicators for Youth Substance Use. J Sch Health. 2002 May;72(5):205–11. pmid:12109176
- 40. Rocque M, Jennings WG, Piquero AR, Ozkan T, Farrington DP. The Importance of School Attendance: Findings From the Cambridge Study in Delinquent Development on the Life-Course Effects of Truancy. Crime Delinquency. 2017;63(5):592–612.
- 41. Jeynes WH. The relationship between the consumption of various drugs by adolescents and their academic achievement. Am J Drug Alcohol Abuse. 2002 Jan;28(1):15–35. pmid:11853131
- 42.
Hancock KJ, Shepherd CCJ, Lawrence D, Zubrick SR. Student attendance and educational outcomes: Every day counts. Canberra; 2013. (Report for the Department of Education, Employment and Workplace Relations).
- 43. Pijl EK, Vanneste YTM, De Rijk AE, Feron FJM, Mathijssen J. The prevalence of sickness absence among primary school pupils–reason to be worried? BMC Public Health. 2021 Dec;21(1):170. pmid:33472603
- 44. Gubbels J, van der Put CE, Assink M. Risk Factors for School Absenteeism and Dropout: A Meta-Analytic Review. J Youth Adolesc. 2019 Sep;48(9):1637–67. pmid:31312979
- 45. Joshi H, Fitzsimons E. The Millennium Cohort Study: the making of a multi-purpose resource for social science and policy. Longitud Life Course Stud [Internet]. 2016 Oct 26 [cited 2022 Dec 13];7(4). Available from: http://www.llcsjournal.org/index.php/llcs/article/view/410
- 46. Jay MA, Mc Grath-Lone L, Gilbert R. Data Resource: the National Pupil Database (NPD). Int J Popul Data Sci [Internet]. 2019 Mar 20 [cited 2023 Jan 31];4(1). Available from: https://ijpds.org/article/view/1101 pmid:32935030
- 47.
Hernan M, Robins J. Causal Inference: What if. Boca Raton: Chapman & Hall/CRC.; 2020.
- 48. Burgette LF, Reiter JP. Multiple Imputation for Missing Data via Sequential Regression Trees. Am J Epidemiol. 2010 Nov 1;172(9):1070–6. pmid:20841346
- 49.
Rubin DB. Multiple Imputation for Nonresponse in Surveys. New York: Wiley; 1987.
- 50.
Department for Education. A guide to absence statistics. 2019. Available from: https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/468924/Guide_to_absence_statistics_15102015.pdf
- 51.
Hodge L, Little A, Weldon Matthew. GCSE attainment and lifetime earnings. Department for Education; 2021 p. 69.
- 52. Panayiotou M, Finning K, Hennessey A, Ford T, Humphrey N. Longitudinal pathways between emotional difficulties and school absenteeism in middle childhood: Evidence from developmental cascades. Dev Psychopathol. 2021 Dec 27;1–12. pmid:34955109