Abstract
The phenomenon of grade inflation has been studied extensively at the high school and undergraduate levels, yet little is known about its occurrence in graduate education. This study bridges this gap by examining graduate grade inflation using data from one U.S. research-intensive university, covering two decades of admissions across 75 master’s programs (N = 24,815) and 78 doctoral programs (N = 15,701). Relying on both linear and ordinal multilevel models, we investigated the presence of grade inflation and potential variation in its magnitude across degree levels and individual academic programs. Our findings provide preliminary evidence for the presence of graduate grade inflation and suggest that its magnitude differs across individual academic programs. The trend of grade inflation also differed significantly between master’s and doctoral programs. This apparent inflation undermines the signaling value of grades for employers and admissions committees in the labor market and academic selection, as well as for research, feedback, and learning purposes. Future research should replicate our findings using multi-institutional samples and examine drivers of graduate grade inflation to more accurately estimate its magnitude.
Citation: Lee V, Kuncel NR, Sackett PR (2026) Graduate grade inflation at a U.S. research-intensive university: A 22-year longitudinal analysis. PLoS One 21(3): e0341315. https://doi.org/10.1371/journal.pone.0341315
Editor: Manuel Salas-Velasco, University of Granada, Spain
Received: April 17, 2025; Accepted: January 6, 2026; Published: March 25, 2026
Copyright: © 2026 Lee et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data and code used to conduct all analyses are available via https://osf.io/3ywsz/overview?view_only=97c39642aab34e9da35cd261010f002f.
Funding: This research was supported by a grant from the Institute of Mental Chronometry to the University of Minnesota, PRS and NRK principal investigators. The funding agency had no role in study design, data collection, analysis, preparation of the manuscript, or decision to publish.
Competing interests: The authors have declared that no competing interests exist.
Introduction
In recent decades, the phenomenon of grade inflation has received widespread attention across the world [e.g., 1–3]. Grade inflation can be defined as increases in course grades that are not attributable to concurrent increases in the quality of students’ work or student ability [4,5], and typically manifests as an increase in GPAs or a shift in the grade distribution, where top grades become more common, leading to compression at the upper end of the grading scale. For instance, in the U.S., researchers have found that universities show an increasing trend in average GPA over time and that the “A” is now the most common grade among college students [3,6]. Similarly, Bachan [1] reported that, even after controlling for student ability, there is still evidence for grade inflation at the undergraduate level in the UK, such that the proportion of students gaining first class and upper second-class honor degrees increased over time.
Importantly, although researchers have sometimes equated observed grade increases with grade inflation, observed increases are only a necessary, not a sufficient, condition for true grade inflation: they may stem from factors that do not correspond to a concurrent increase in the quality of students’ work [e.g., 5,7]. Existing research on grade inflation in undergraduate education has identified three major categories of factors that could lead to observed grade increases: institutional/departmental, instructor, and student characteristics. For example, institutions and/or departments may implement grading policies to attract students to less popular fields or deter students from more popular fields in order to stabilize enrollment [8]. The increasingly commercialized nature of higher education in recent years may also shape instructor characteristics such as leniency bias when it comes to assessing students. Berezvai et al. [9] found that assigning students better grades was associated with more positive student evaluations of teaching at two universities in Central Europe. More recently, using a sample of Korean university students, Park and Cho [10] found that when students received grades lower than expected, they might retaliate by providing lower evaluation ratings. Given that student evaluations of teaching can have a substantial impact on faculty promotion and tenure decisions [5,11], instructors may choose to be lenient and assign better grades out of concern for their careers, contributing to observed grade increases.
Indeed, Matos-Díaz [12] showed, in a sample of Puerto Rican students, that grade incentives can be powerful in raising students’ expected grades and increasing students’ willingness to take another course with the same instructor, both of which are likely to be linked to higher student evaluations of teaching. Simultaneously, the cost-cutting trend of hiring more part-time adjunct faculty, who often face even more pressure to keep students happy and satisfied, may exacerbate such bias [13]. Finally, changes in student characteristics such as ability, demographics, course-taking patterns, and effort also play a significant role in explaining observed grade increases. However, these factors are typically not considered determinants of grade inflation, as they likely correspond to genuine changes in the quality of student work [5]. Grades are not inflated if they reflect an improvement in learning and the quality of assignments. Thus, controlling for changes in student quality, and thereby isolating the remaining explanations, is crucial for detecting the presence of grade inflation, accurately estimating its magnitude, and distinguishing it from legitimate grade improvement.
Indeed, a number of studies have documented grade inflation/observed grade increases over time and examined the factors contributing to this trend across educational levels and countries. For instance, using a nationally representative sample of Portuguese secondary school students from 2010–2019, Silva et al. [14] revealed evidence of rising grades, particularly among private schools compared to public schools. Similar findings have also been observed among high school students in the United States [15]. To better understand drivers underlying grade inflation in secondary education, Arrafii [16] investigated grading practices in Indonesia and identified both academic and non-academic influences on teachers’ grading decisions, including students’ effort, participation, and extracurricular involvement. Comparable trends have also been observed at the undergraduate level [e.g., 1,3,6], where grade inflation appears across both introductory and advanced courses [17]. Concerned with the phenomenon of grade inflation, Ullah et al. [18] conducted in-depth interviews with faculty members in Pakistan and found that institutional pressures, such as maintaining enrollment, accommodating underprepared students, and improving teaching evaluations, were key contributors to grade inflation. Collectively, these findings suggest that grade inflation is a pervasive global phenomenon, which has prompted educators and researchers to come up with ways to combat grade inflation [19,20].
Despite extensive research at the high school and undergraduate levels [3,6,17,21], grade inflation remains a relatively underexplored phenomenon in graduate education. A report published by UNESCO in 2024 [22] found that global enrollment in tertiary education across North America, Europe, Latin America, Africa, and Asia more than doubled from 2000 to 2022. In the United States, the number of master’s and doctoral degrees awarded increased by 16 percent and 20 percent, respectively, from 2011–2012 to 2021–2022 [23]. A similar pattern is observed in Europe, where the proportion of people attaining tertiary education, which includes master’s and doctoral graduates, has risen steadily in recent years [24]. Given that the number of graduate degrees awarded is on the rise across the world, it becomes increasingly crucial to assess the magnitude of grade inflation at the graduate level in order to better gauge the utility of school grades to schools, employers, and students over time. Graduate grade inflation negatively affects our ability to evaluate admission assessments, conduct research on graduate school learning, and make use of graduate school grades for administrative purposes like awards and fellowships. To this end, our study has two primary goals: First, using US-based data from 75 master’s programs (N = 24,815) and 78 doctoral programs (N = 15,701) within a single institution, we explored the presence of graduate grade inflation at the individual program level across 22 years while controlling for students’ prior ability. Second, in line with existing literature on undergraduate grade inflation, we investigated potential differences in the magnitude of graduate grade inflation by program type and degree level. It is our hope that this study fills a critical gap in the literature and stimulates more research on grade inflation at the graduate level.
Understanding whether grade inflation exists at a graduate level is a first step toward addressing other questions asked at secondary and undergraduate levels.
Grade inflation: why should we care?
Signaling theory [25] suggests that because underlying characteristics are difficult to observe directly, individuals rely on signals to guide important decisions such as hiring and school admission. In the educational context, academic achievement (i.e., GPA) serves as one such signal. Beyond reflecting persistence and mastery across a variety of subjects, grades provide an observable indicator of student quality to schools and employers and serve as feedback to students about their own learning progress. However, grade increases not tied to genuine improvements in student performance create situations where high-performing and mediocre students receive indistinguishable grades (e.g., 4.0), rendering school grades useless as an information source for differentiating between them. In this sense, grade inflation erodes the external screening function of grades for prospective employers and admission officers [26], while also weakening their internal function of providing feedback on student progress.
Signaling power to schools and employers: student quality
Understanding the phenomenon of grade inflation is crucial for helping schools and employers gauge the power of school grades in signaling student quality, whether for school admissions or when students enter the workforce [27]. In school, grades are frequently used, alongside other criteria, for making decisions about opportunities like merit-based scholarships and fellowships. Similarly, graduate school grades earned in a master’s program are often considered for admission into a doctoral program. Students performing well academically have a higher chance of receiving such rewards. The reduction in grade variability that comes with grade inflation may make it more difficult to differentiate students. Over time, this may render school grades less useful for making such decisions.
In the workplace, despite doubts surrounding the value of school grades in helping organizations hire talent [28], excellent grades signal students’ continuous effort and persistence in performing well in their courses. Such effort is often tied to conscientiousness, a personality trait that individuals bring to their jobs and that can significantly impact job-related outcomes like counterproductive work behavior and on-the-job performance [29,30]. Indeed, recent meta-analytic evidence confirmed the predictive validity of academic performance (at the high school, undergraduate, and graduate levels) for predicting job performance [31]. Thus, graduate grade inflation may compromise the criterion-related validity of school grades for predicting future on-the-job outcomes, making it increasingly difficult to distinguish between good and exceptional performers in both educational and occupational settings over time. Grade inflation at individual institutions may also undermine a university’s reputation in the eyes of prospective students and employers.
Signaling power to students: learning and degree progress
Grade inflation is likely to reduce the effectiveness of school grades as feedback to students, as it runs the risk of miscommunicating performance standards [20]. Grades often serve as a feedback tool when individualized feedback from instructors is not feasible [11,32]. They have the potential to influence student performance, motivation, and learning strategies. Koenka et al. [33] conducted a meta-analysis on the impact of grades in K-12 education; they found that students who received grades performed better than students who did not, but at the same time displayed lower academic motivation, supporting the notion that school grades play a role in influencing student performance and motivation.
Primary studies also confirmed the impact of school grades on performance and motivation. For instance, Main and Ost [34] found that course grades on a prior exam predicted student performance on a subsequent exam. Similarly, Gray and Bunte [35] found that low grades are tied to increased subsequent course performance. These findings are not surprising as students may change their self-regulatory strategies and reallocate their resources toward learning based on the grades (i.e., feedback) they receive [36].
Grades can also prompt students to seek additional information about their performance and improve their work or learning. There is some evidence suggesting that feedback seeking mediates the relationship between conscientiousness and final grades [37]. Accordingly, if grade inflation reduces the meaningfulness of school grades and creates a scenario where everything yields an A, their utility as feedback to students will also decline. After all, students have little reason to seek additional feedback or alter their performance or learning strategies if doing so makes little to no difference to their grade. Critically, when instructor-assigned grades provide limited signal to students, alternative evaluation methods, such as structured peer assessment where students provide and receive feedback, may be considered. In a group of graduate students, Landry et al. [38] found that peer assessments can yield grading comparable to instructors’ and that feedback improved assignment quality. Meta-analytic evidence supports the notion that peer assessments are generally effective in improving academic performance [39]. Furthermore, narrative feedback focused on describing how students are currently doing and how they can improve might also be used in lieu of traditional numeric grades to better support student learning and performance [40].
Other reasons for why school grades matter
In addition to the utility of school grades in predicting important outcomes and providing feedback on student learning, school grades are often used as a measure of learning and academic performance, constructs of interest to many researchers. Because grade inflation corresponds to an increasingly restricted range of school grades, and because the correlation between two variables is a function of their variability [41], grade inflation might also paint a misleading picture of the criterion-related validity of other predictors used in graduate school admission, such as GRE scores. In other words, since grades are often used to measure learning as an outcome, to the extent that school grades are range restricted, the observed criterion-related validity of their predictors will be attenuated and distorted.
Hypothesis and research question development
Detecting the presence of graduate grade inflation.
Grade inflation at a graduate level is arguably more complex than at an undergraduate level because of more extreme restriction of talent and different attitudes towards grades among graduate instructors. On the one hand, grade inflation may not be present in graduate programs because the admissions process tends to be more selective; competitive programs often have stricter standards than undergraduate admission and underqualified applicants may self-select out of applying. This implies a greater degree of range restriction on ability and academic performance in graduate education, leaving less room for average grades to grow over time. On the other hand, grade inflation could still occur as instructors may be more inclined to award higher grades to graduate students than to undergraduates.
At the undergraduate level, school grades play an instrumental role in signaling student quality to graduate school admission offices as well as recruiters for entry-level positions, which is not always the case at the graduate level. Knowing that differentiating students may not carry as much weight as it does in undergraduate education, instructors may be more likely to adopt a criterion-referenced or self-referenced grading orientation, where students are evaluated against an absolute grading standard or against their past selves, than a norm-referenced grading orientation, where students are evaluated relative to other students in a course [11,42]. This means that a greater number of A’s may be awarded to graduate students compared to undergraduate students. As mentioned, because observed grade increases could be attributed to reasons other than increases in student quality and may not reflect grade inflation, we controlled for admitted students’ baseline ability, as indexed by GRE scores, as a potential explanation for observed grade increases. If grade increases are still observed after controlling for prior ability, then they are unlikely to be due to changes in student ability, providing evidence for grade inflation.
Research question 1. Is there evidence for graduate grade inflation, as shown by a consistent increasing trend in school grades over time even after controlling for program-level baseline ability?
Potential differences in the magnitudes of graduate grade inflation
Academic Programs across Disciplines. The magnitude of grade inflation may differ by type of program due to potential disciplinary differences in rigor and in the grading philosophies endorsed by faculty. The idea that there are interdisciplinary differences in the magnitude of grade inflation has been extensively studied, and such differences have been detected at the undergraduate level [e.g., 11]. For instance, using a sample from the University of Michigan, Achen and Courant [8] explored potential differences in grades by department and found that, on average, instructors of courses in fields such as math and chemistry tended to record lower grades than those in English. One explanation might be objectivity in scoring: science and math courses tend to have assignments and quizzes with right or wrong answers that are graded objectively, whereas humanities courses rely on more subjective evaluation. Similarly, Rojstaczer and Healy [3] examined the magnitude of grade inflation using a sample from 119 institutions and found that schools with a science and engineering focus, where students tend to take more STEM courses, are less likely to award high grades (i.e., A’s and B’s) compared to other schools that are equally selective.
However, as discussed, grading in graduate programs often emphasizes research skills, critical thinking, and scientific contributions, which may result in more subjective, interpretation-based grading practices even within STEM disciplines. Furthermore, arguments about controlling class sizes and maintaining enrollment that are often used to explain stricter grading in STEM fields at the undergraduate level may be less relevant in graduate programs, where smaller and more specialized classes are the norm. Therefore, it remains unclear whether similar interdisciplinary grading differences exist at a graduate level.
Research question 2. Does the magnitude of graduate grade inflation differ across academic programs?
Master’s versus Doctoral Programs. Aside from disciplinary differences, the magnitude of grade inflation may differ by degree level. The phenomenon of graduate grade inflation is inherently more complex than undergraduate grade inflation because there is more than one degree level within graduate programs, including master’s and doctoral, each with its own selection standards, admission objectives, course content, and grading policies. These distinctions can contribute to varying magnitudes of grade inflation by degree level. Grade inflation may be more pronounced in master’s than doctoral programs for two primary reasons: First, because doctoral programs are generally more selective than master’s programs [43], one would reasonably expect students’ academic performance to be more range restricted to begin with [41], thereby leaving less room for growth over time. Second, the leniency bias in grading might be less common in doctoral programs because funding and financial assistance are more readily available at a doctoral level [44], which could alleviate instructors’ pressure to please students or attract enrollment that is common in self-funded master’s programs.
Alternatively, one could argue that grade inflation may be stronger in doctoral than master’s programs due to differences in curriculum design and the perceived importance of school grades. Whereas master’s coursework is more likely to be lecture-based with larger class sizes, doctoral courses tend to be discussion-based with smaller class sizes and a focus on developing critical thinking and research skills so that students can contribute to the body of scientific literature in their respective fields [45]. These differences in course size and instructional methods may manifest as differences in school grades. For example, Ekstrom et al. [11] found that class size is inversely correlated with students’ grades and that faculty tend to use different instructional methods for introductory compared to advanced courses. Furthermore, the perceived value of school grades may also differ for master’s and doctoral graduates. Grades often matter less to employers and students at the doctoral level, as doctoral students are generally valued more for their domain-specific knowledge and specific accomplishments like publications than for their school grades, and they face less pressure when it comes to educational advancement. Hence, instructors may feel less pressure to use normative grading, as its implications for future decision making are minimal. Taken together, it is unclear whether grade inflation would be stronger at the master’s or doctoral level.
Research question 3. Is the magnitude of grade inflation significantly different for master’s versus doctoral programs?
The current project
The current study aims to develop a deeper understanding of the phenomenon of grade inflation at the graduate level. Using a large dataset on 75 master’s programs (N = 24,815) and 78 doctoral programs (N = 15,701) covering over two decades of graduate admissions at a single university, we leverage multilevel modeling (MLM) to investigate the presence of grade inflation in graduate education at the level of the individual academic program, as well as the extent to which observed grade increases at the program level are attributable to actual grade inflation. Our study makes two substantial contributions. First, we fill a critical gap in the literature by investigating the presence of grade inflation in graduate education. Second, by exploring graduate grade inflation, our findings provide important information to educators and employers on the meaning and utility of school grades at the graduate level.
Method
The study was approved by the institutional review board at the University of Minnesota (IRB#STUDY00024904) with a waiver of informed consent.
Sample
A sample of N = 44,678 graduate students from one large U.S. Midwestern university, admitted across 24 years (1999–2022), was included in the initial dataset. The data, which were anonymized in advance, were shared with the research team by the graduate school in August 2023. Data were accessed for the current project on October 23, 2023. Students were enrolled in 105 master’s programs and 90 doctoral programs. All students had already completed their final exams, if at the master’s level, or preliminary exams, if at the doctoral level. Due to the nature of the analysis, students admitted in 2021 and 2022 were excluded because the outcome criterion was unavailable for most of these students. Additionally, we removed programs where data were too sparse due to missing data at many time points. This resulted in a final sample of N = 24,815 students from 75 master’s programs and N = 15,701 students from 78 doctoral programs.
Measures
Classification of instructional program (CIP) codes. Our grouping variable was academic programs, which were recoded to be consistent with CIP, a taxonomy of academic titles developed by the U.S. Department of Education [46]. Examples of CIP programs include research and experimental psychology, educational psychology, history, physics, and geography. The full list of CIP programs available in the current study is included in Tables S1a and S1b in S1 File in the supporting materials.
Graduate Cumulative Grade Point Average (GPA). Individual graduate school performance was operationalized as students’ cumulative GPA after they had completed all required coursework for their program. GPA information was extracted from university records.
Graduate Record Examination (GRE). To control for students’ ability upon entering graduate school as a potential explanation for observed grade increases, students’ GRE total scores (i.e., the sum of verbal and quantitative component scores) were used as a level 1, student-level covariate. Because the scale of the GRE changed in 2011, old scores (out of 800 per component) were transformed onto the new scale (out of 170 per component) based on a concordance chart from Educational Testing Service, which can be found in our supplementary materials. Because missing GRE scores are likely policy-driven (the GRE was optional in some academic programs, and policies changed during the Covid-19 pandemic), rather than missing at random or missing completely at random, a missingness indicator was created to distinguish cases with and without missing scores [47]. Observed GRE scores were grand-mean centered to aid interpretation, and missing values were then imputed with 0, which corresponds to the post-centering grand mean.
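As an illustration of the missingness-indicator approach, the centering and imputation steps can be sketched as follows (the GRE values and column names here are our own toy examples, not the authors’ code or data):

```python
import numpy as np
import pandas as pd

# Toy student-level records; NaN marks students with no GRE on file
# (e.g., from test-optional programs). Scores are illustrative only.
df = pd.DataFrame({"gre_total": [310.0, 325.0, np.nan, 301.0, np.nan, 318.0]})

# 1. Flag missingness so the model can estimate a separate intercept
#    shift for students without scores.
df["gre_missing"] = df["gre_total"].isna().astype(int)

# 2. Grand-mean center the observed scores, then impute missing values
#    with 0 -- the post-centering grand mean -- so other coefficients
#    are interpretable at the average GRE.
grand_mean = df["gre_total"].mean()  # mean over observed scores only
df["gre_c"] = (df["gre_total"] - grand_mean).fillna(0.0)
```

Entering both `gre_c` and `gre_missing` into the model keeps all students in the analysis without letting the imputed values distort the estimated GRE slope.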
Degree Level. This variable was coded based on the specific program students were enrolled in. 0 refers to master’s programs and 1 refers to doctoral programs.
Sex. Students’ sex was dummy coded as 0 for female students and 1 for male students.
Ethnicity. Ethnicity was coded as one of the following six groups: White, Black, Hispanic, Asian, Other, and Non-specified. White was used as the reference group.
Analytic approach
To study the presence and magnitude of graduate grade inflation across time, a series of eight linear mixed effects models, in which students (level 1) were nested within academic programs (level 2), was fitted. Time and GRE, including both GRE total scores and the GRE missingness indicator, were added as student-level covariates at level 1; degree level was entered as a program-level covariate at level 2. Covariates were added one at a time, allowing us to directly test our research questions. Additionally, although we considered a unified three-level model (students nested within programs nested within degree levels), two-level models were used because a unified model would contain only two level-3 units (i.e., master’s and doctoral), and past simulation research has shown that small sample sizes for higher-level groupings may lead to inaccurate estimation of variance components and inaccurate inferences [48]. Similarly, although other longitudinal approaches (e.g., fixed effects panel models) are viable, we employed multilevel modeling (i.e., linear mixed effects models) for both interpretability and alignment with our research questions [49]. Fixed effects models handle clustering by including higher-level grouping variables (in this case, academic programs) as dummy variables. With more than 150 master’s and doctoral programs in our dataset, this approach would yield over 150 dummy variables, which would be cumbersome, whereas mixed effects models provide a more parsimonious and interpretable framework by modeling academic program as a random effect. Second, and more importantly, mixed effects models enable a direct examination of level 2 random effects, which is central to our research goals: RQ2 evaluates random slopes for time across programs, and RQ3 investigates the cross-level interaction between degree level (a level 2 variable) and time.
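To make the model sequence concrete, here is a minimal sketch of the random-intercepts and random-intercepts-random-slopes specifications on synthetic data, using Python’s statsmodels. The effect sizes, program counts, and variable names are invented for illustration; the authors’ actual data and code are on OSF.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(42)

# Synthetic analogue of the two-level structure: students (level 1)
# nested in academic programs (level 2). All effect sizes are made up.
n_prog, n_per = 30, 60
program = np.repeat(np.arange(n_prog), n_per)
time = rng.integers(0, 22, size=program.size)      # years since baseline
u0 = rng.normal(0.0, 0.15, n_prog)[program]        # random program intercepts
u1 = rng.normal(0.0, 0.002, n_prog)[program]       # random program slopes
gpa = 3.5 + 0.005 * time + u0 + u1 * time + rng.normal(0.0, 0.2, program.size)
df = pd.DataFrame({"gpa": gpa, "time": time, "program": program})

# Random-intercepts model (akin to Model 2 in the text): fixed effect
# of time, random intercept per program.
m_ri = smf.mixedlm("gpa ~ time", df, groups=df["program"]).fit(reml=False)

# Random-intercepts-random-slopes model: each program also gets its
# own growth rate in grades via re_formula.
m_rs = smf.mixedlm("gpa ~ time", df, groups=df["program"],
                   re_formula="~time").fit(reml=False)
```

Fitting both by maximum likelihood (`reml=False`) allows the two specifications to be compared with a likelihood ratio test, mirroring the model-comparison step described above.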
The assumptions of linearity, normality, homoskedasticity, and independence of observations were checked. Our baseline model was a random intercepts model with no predictors. Then, to estimate the average grade increase across time, time was entered as a predictor. Afterwards, a random intercepts-random slopes model, which assumes differential growth rates in grades across academic programs, was fitted and compared to the random intercepts model on model fit. To explore potential non-linear effects of time, we fitted natural splines of time with varying degrees of freedom (df). A natural spline breaks the time predictor into segments and fits smooth cubic curves to each segment, joining them at knots. This flexibility allows us to test whether the effect of time departs from a simple linear trend. Following recommendations from Harrell [50], we started with df = 3 and increased the df up to 6, stopping when there was no significant improvement in model fit as shown by likelihood ratio tests. Once the functional form of the relationship between time and GPA was determined, we controlled for individual-level GRE scores as a proxy for baseline ability to obtain more accurate estimates of grade inflation over time. The GRE missingness indicator was also included. Finally, degree level and the interaction between degree level and time were included to understand whether the magnitude of grade inflation differs for master’s versus doctoral programs. For all models, residual variance (σ²) and R² were reported to illustrate the amount of variance explained. Whereas marginal R² denotes the proportion of variance in the outcome that is explained by fixed effects only, conditional R² denotes the proportion of variance that is explained by both fixed and random effects.
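The stopping rule for the spline degrees of freedom rests on likelihood ratio tests between nested models fitted by maximum likelihood; a generic sketch (the log-likelihood values below are illustrative, not the paper’s):

```python
from scipy.stats import chi2

def lr_test(ll_simple, ll_complex, df_diff):
    """Likelihood ratio test for nested models fitted by ML.

    Returns the chi-square statistic and its p-value; a significant
    p-value favors the more flexible model (e.g., more spline df).
    """
    stat = 2.0 * (ll_complex - ll_simple)
    return stat, chi2.sf(stat, df_diff)

# Illustrative comparison: spline with df = 4 vs. df = 3
# (one extra parameter, hence df_diff = 1).
stat, p = lr_test(ll_simple=-1000.0, ll_complex=-990.0, df_diff=1)
```

If `p` falls above the chosen alpha, the extra spline flexibility is not warranted and the simpler model is retained.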
To supplement our analyses, we conducted separate analyses for each degree level as a sensitivity check. In addition, because GPA values are skewed and clustered near the top of the scale, standard methods may give misleading results. Accordingly, we also fitted Bayesian multilevel ordinal models to confirm that the observed upward trend is real and not an artifact of model choice, given that individual grades are not continuous but grouped into ordered categories even though cumulative GPA forms a continuous variable [51,52]. A Bayesian approach was selected over a frequentist approach because it is better suited to estimating complex models with many parameters, in this case natural spline terms and random effects, while producing more stable estimates. Specifically, GPA values were grouped into six approximately equal-sized categories exclusive of the upper bound (i.e., [-Inf, 3.50), [3.50, 3.70), [3.70, 3.80), [3.80, 3.90), [3.90, 4.00), [4.00, +Inf)). The assumption of proportional odds (PO) for time, which asserts that the effect of time is consistent across all GPA categories, was explicitly tested.
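The six ordered GPA categories can be reproduced with pandas; left-closed, right-open intervals implement “exclusive of the upper bound,” so an exact 4.00 lands in the top category (the example GPA values below are made up):

```python
import numpy as np
import pandas as pd

# One made-up GPA per intended category.
gpa = pd.Series([3.45, 3.60, 3.75, 3.85, 3.95, 4.00])

# Cutpoints from the text; right=False makes each interval left-closed
# and right-open, i.e., [3.50, 3.70), ..., [4.00, +Inf).
edges = [-np.inf, 3.50, 3.70, 3.80, 3.90, 4.00, np.inf]
gpa_cat = pd.cut(gpa, bins=edges, right=False, labels=False)
```

The integer codes (0 through 5) then serve as the ordered outcome for the ordinal model.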
Results
Student-level descriptive statistics for GPA and GRE by admission year and degree level are summarized in Table 1. The mean GPAs for both master’s and doctoral programs exhibit an increasing trend from 1999 to 2020 (d_masters = 0.56; d_doctoral = 0.36). For doctoral programs, there is a gradual but noticeable reduction in the standard deviation of GPA over time. For GRE, there is also an increasing but weaker trend over time (d_masters = 0.33; d_doctoral = 0.53).
To test our assumptions, residuals at both level 1 and level 2 were inspected using QQ plots, showing evidence of normality. For homoskedasticity, we examined plots of fitted values versus residuals and found evidence of a violation. This is unsurprising, as our dependent variable, GPA, is bounded between 0 and 4. Therefore, we used robust standard errors produced by a sandwich estimator for our fixed-effect estimates to ensure accurate inference. Finally, the dependence of observations was confirmed by computing the intraclass correlation coefficient (ICC) from the baseline model. Specifically, we estimated an ICC of 0.21, which indicates that 21% of the variance in student GPA can be explained by the academic program to which students belong, providing strong justification for our multilevel modeling approach.
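The ICC from a random-intercepts baseline model is the between-program variance divided by the total variance. A minimal sketch, using hypothetical variance components chosen only so the result matches the reported 0.21:

```python
# tau00:  between-program (random intercept) variance - hypothetical
# sigma2: within-program residual variance            - hypothetical

def icc(tau00, sigma2):
    """Proportion of outcome variance attributable to program membership."""
    return tau00 / (tau00 + sigma2)

# Illustrative components consistent with the reported ICC of 0.21:
print(round(icc(0.021, 0.079), 2))  # 0.21
```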
Overall, across the seven fitted models, marginal R² values ranged from .00 to .08 while conditional R² values ranged from .21 to .34, indicating that both fixed and random effects accounted for a substantial proportion of the variance. Residual variance decreased steadily with the addition of predictors, suggesting improved model fit.
First, to answer our research question of whether there were significant grade increases over time (i.e., RQ1), a random intercepts-fixed slope model with only time as a predictor was fitted in Model 2. As shown in Table 2, time significantly predicted GPA at p < .001, consistent with grade increases over time. Specifically, a one-year increase in time corresponds to a 0.0049-point increase in GPA (95% CI = [0.0040, 0.0058]). It is worth noting that this increase is occurring with grades that are, on average, already very high, compressing an already small variance.
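To put the yearly slope in perspective, the implied cumulative change over the full 1999-2020 window is a simple back-of-the-envelope product (ignoring the non-linearity explored later):

```python
# Linear-model slope from Model 2 and the span of admission years studied.
slope = 0.0049        # estimated GPA increase per year
span = 2020 - 1999    # 21 years

print(round(slope * span, 3))  # roughly 0.103 GPA points overall
```

On a 4.0 scale where the mean is already in the "A" range, an increase of about a tenth of a grade point represents a meaningful compression of the remaining headroom.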
Second, to test RQ2, which concerns whether the extent of grade increases significantly differs across academic programs, we compared the model fit of the random intercepts-fixed slope model to that of the random intercepts-random slopes model using likelihood ratio tests (i.e., Model 3). As shown in Table 2, the likelihood ratio test revealed that the random intercepts-random slopes model demonstrated significantly better fit (χ²[2] = 197.57, p < .001), providing evidence that the rate of grade change differs across programs.
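The likelihood ratio test used for these nested comparisons can be sketched in a few lines. The log-likelihood values below are hypothetical, chosen only to reproduce the reported statistic; the chi-square reference distribution is standard.

```python
from scipy.stats import chi2

def lrt(llf_reduced, llf_full, df_diff):
    """Likelihood ratio test for nested models.

    llf_reduced / llf_full: log-likelihoods of the two fitted models.
    df_diff: number of additional parameters in the fuller model.
    Returns the LR statistic and its chi-square p-value.
    """
    stat = 2.0 * (llf_full - llf_reduced)
    return stat, chi2.sf(stat, df_diff)

# Hypothetical log-likelihoods reproducing the reported chi-square of 197.57 on 2 df:
stat, p = lrt(-1000.0, -901.215, 2)
print(round(stat, 2), p < 0.001)  # 197.57 True
```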
To test the linearity of the effect of time, we fitted the random intercepts-random slopes model with natural splines with degrees of freedom ranging from 3 to 6. A series of model comparisons using likelihood ratio tests revealed that the model with 4 degrees of freedom emerged as the best-fitting model (i.e., Model 4), which was used as the basis for later models. Four degrees of freedom allow up to four segments joined at three interior knots in the relationship between time and GPA.
Third, to account for student demographic characteristics as a potential explanation for the observed grade increases, individual-level sex and ethnicity were entered as level 1 covariates in Model 5. Both sex and ethnicity significantly predicted GPA such that male students and students who identified as Black or Hispanic had lower GPAs on average.
Fourth, to rule out increased student baseline ability as an alternative explanation for the observed grade increases, individual-level GRE total scores and the GRE missingness indicator were added as level 1 covariates in Model 6. To make intercept estimates more interpretable, GRE was grand-mean centered by subtracting the grand mean of GRE scores across all programs and all time points. Unsurprisingly, as shown in Table 2, GRE scores significantly predicted GPA such that a one-point increase in GRE above the grand mean predicted a 0.0050-point increase in GPA. The GRE missingness indicator was not statistically significant, indicating that cumulative GPA did not differ significantly between students with and without GRE scores. Regarding the effect of time, results continued to show that time significantly and nonlinearly predicted GPA even after controlling for GRE scores, indicating potential grade inflation.
Finally, to explore whether the rate of grade inflation differs for master’s versus doctoral programs (i.e., RQ3), we entered degree level as a level 2 covariate as well as the interactions between degree level and time in two steps. The first model included only the main effect of degree level (i.e., Model 7), and the second model included both the main effect of degree level and the interactive effects between degree level and time (i.e., Model 8). Results are summarized in Table 2. In Step 1, we found a significant effect of degree level at p < .001 such that students enrolled in doctoral programs had, on average, cumulative GPAs 0.06 points higher than students enrolled in master’s programs. In Step 2, we compared the two nested models using a likelihood ratio test. Results showed that the model with the interactive effects had significantly better fit (χ²[4] = 20.55, p < .001), providing support for the idea that the magnitude of grade inflation varies by degree level. Importantly, this is not in conflict with the nonsignificant coefficients reported for the individual interaction terms. Because natural splines decompose time into multiple basis functions, each term captures only a portion of the overall trajectory. As a result, the significance of the interaction must be evaluated jointly across all spline terms, rather than from any single coefficient. The overall finding suggests that the trend of grade inflation is non-linear and that this trend is statistically different for master’s and doctoral programs. To better illustrate the differences by degree level, Fig 1 shows the non-linear trend of predicted GPA by degree level conditional on student characteristics. For master’s programs, GPA shows a steady upward trend in the earlier years, followed by a period of (relative) stability, and a marked increase in recent years. For doctoral programs, GPA remained relatively flat initially, then rose modestly before displaying a sharper increase in the most recent admission years. These patterns suggest that grade inflation has accelerated in both types of programs, though it appears to unfold differently across degree levels.
Follow-up analyses using linear mixed effects models fitted separately to master’s and doctoral programs confirmed the observed non-linear effects of time.
Results from a series of Bayesian multilevel ordinal models, including one overall model and two degree-specific models, also indicated the presence of grade inflation, suggesting that the upward trend in GPA over time was observed even under an alternative model choice. There was no evidence of a violation of the proportional-odds assumption in the overall and master’s-level models, whereas the doctoral-level model showed mild evidence of a potential violation, suggesting that the magnitude of grade inflation may not be uniform across GPA categories. Because time was modeled using natural splines, its effect cannot be summarized by a single regression coefficient. Thus, we summarized the effect of time by comparing the model’s predicted log-odds of being in a higher GPA category at two time points (1999 and 2020) and exponentiating this difference to obtain an odds ratio. The odds ratio represents the change in the cumulative odds of being in a higher GPA category between these two admission years while holding other covariates constant. Consistent with the linear models, results from the overall Bayesian multilevel ordinal model indicated a shift toward higher GPA categories over time; specifically, the cumulative odds of being in a higher GPA category were 43% greater for students admitted in 2020 compared to those admitted in 1999 (OR = 1.43, 95% CrI = [1.28, 1.60]). This general upward trend was similarly observed in the degree-specific models. Moreover, the ordinal models also provided evidence that the magnitude of grade inflation likely differed across degree levels. Detailed results from the Bayesian multilevel ordinal models are provided in Tables S2-S4 in S1 File in the supporting materials.
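The odds-ratio summary described above reduces to exponentiating a difference in predicted cumulative log-odds. A minimal sketch, using hypothetical log-odds values chosen only so the result matches the reported OR of 1.43:

```python
import math

# Hypothetical predicted cumulative log-odds of being in a higher GPA
# category at the two comparison years (covariates held constant).
logodds_1999 = 0.00
logodds_2020 = math.log(1.43)  # chosen to reproduce the reported OR

# Exponentiating the log-odds difference yields the odds ratio.
odds_ratio = math.exp(logodds_2020 - logodds_1999)
print(round(odds_ratio, 2))  # 1.43, i.e., 43% greater cumulative odds
```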
Discussion
Our study demonstrated that grade increases are observed over time at the graduate level for both master’s and doctoral programs, even after controlling for student sex, ethnicity, and prior ability, providing preliminary evidence for graduate grade inflation. Our findings showed that graduate grade inflation is nonlinear rather than linear, with a sharp increase among students admitted between 2017 and 2020. This is likely due to grading and instructional changes during the COVID-19 pandemic. As cumulative GPA was typically recorded at the end of year 2 for master’s students and year 3 for doctoral students, the grades of students admitted around this period were partially affected and are likely to be inflated, as noted at some institutions (e.g., [53]). Importantly, there is still a notable grade increase between 1999 and 2017 (d_masters = 0.10; d_doctoral = 0.22).
In terms of degree-level comparison, we found that the cumulative magnitude of grade inflation between 1999 and 2020 is slightly higher for master’s programs than doctoral programs, presumably due to potential ceiling effects among doctoral programs, which had higher starting mean academic performance in our sample. Indeed, program intercepts and slopes tend to be negatively correlated such that programs with higher starting intercepts (i.e., mean GPA in 1999) tend to show less growth over time (r = −.48). To gain further insight into the differences between master’s and doctoral programs, we explored grade distributions across time at both degree levels descriptively. As shown in Fig 2, both degree levels saw an overall increase in the proportion of students obtaining a 4.0, the maximum grade possible, in recent years. However, there is also a noticeable increase in the proportion of students obtaining 3.75–4.00 and a visible decrease in the 3.25–3.50 and 3.50–3.75 ranges for master’s-level programs, a pattern not seen for doctoral-level programs, revealing distinct changes in grade distribution between master’s and doctoral programs.
All buckets are exclusive of the upper bound.
In addition, as shown in Figs 3 and 4, the trajectory of grade inflation varied substantially across academic programs. While nearly all programs exhibited increases between 2017 and 2020, patterns from 1999 to 2016 were more heterogeneous. In fact, several programs, such as aerospace engineering, industrial engineering, multidisciplinary studies, and media studies, displayed clear downward trends at both master’s and doctoral levels. These declines may reflect program-specific grading policies or philosophies, though our data do not allow us to directly examine or control for these mechanisms. This highlights the importance of considering program-level factors when examining grade inflation in future research. Moreover, contrary to the expectation that non-STEM programs might be more prone to inflation due to greater grading subjectivity, we did not observe systematic differences between STEM and non-STEM fields at either degree level.
Predicted values are conditional on model covariates and reflect the reference groups used in the analyses (female and White students), with GRE held at its mean (GRE = 0) and the missing GRE indicator set to 0.
Predicted values are conditional on model covariates and reflect the reference groups used in the analyses (female and White students), with GRE held at its mean (GRE = 0) and the missing GRE indicator set to 0.
Study contributions
The current study makes substantial contributions to both theory and practice. First, consistent with prior research on grade inflation at the undergraduate level, our findings indicate that grade inflation likely occurs in graduate education, thereby bridging a crucial gap in the achievement and educational testing literature. Second, using MLM, our study contributes to the literature by exploring the phenomenon of graduate grade inflation at the program level and producing program-level estimates of grade inflation, whereas the existing literature has tended to explore grade inflation at either the institutional or the student level. Third, to our knowledge, this is the first study to investigate potential non-linear trends in grade inflation (while controlling for student ability), suggesting that the phenomenon of grade inflation may be more complex and nuanced than previously understood.
Additionally, this study provides practical and invaluable information to both employers and educators to aid selection decisions in organizational and educational contexts. The existence of graduate grade inflation, as evidenced by this upward trend in GPA and an increasingly compressed grade spread, suggests that the ability of grades to signal differences in student academic achievement is impaired and that the meaning of grades is changing over time even at the graduate level [54,55]. Admission offices and employers might be concerned about the devaluation of graduate degree qualifications. Importantly, an increasingly compressed range of grades may compromise school grades’ criterion-related validity (and hence their utility) for predicting important outcomes such as future academic performance and job performance, thereby increasing the risk of admitting or hiring undesirable candidates, which can be costly to organizations [e.g., 43]. Finally, to the extent that grades are intended to give students accurate feedback about their performance, the results suggest that, increasingly, grading may be failing at that task. The average GPA was already an “A” 25 years ago, and continued increases have steadily compressed what was already a narrow range of GPAs. Educators might reconsider the utility of school grades as a feedback tool for students to reflect on their progress in their program. To combat graduate grade inflation, educators might consider implementing standardized grading rubrics or offering coaching sessions to help faculty develop standardized assessment practices. In addition, because instructors may be incentivized to give higher grades in exchange for more positive student evaluations, institutions should also rethink the use of student evaluations in tenure and promotion decisions.
Limitations and future directions
However, the current project is not without limitations. First, the dataset came from one institution, a large US Midwestern state university. Although the dataset is large and spans more than twenty years, one may ask whether the findings are idiosyncratic to the institution in question. We believe that our findings will very likely generalize to other institutions and geographical regions for three reasons. First, the sample comes from a large university where there were considerable changes in faculty and university leadership over the twenty-two-year timeframe, so it is unlikely that the observed grade increases are due to a single policy or a single group of faculty unique to the institution under study. Second, as at most research-intensive universities, graduate students come from all over the world to study in a large number of disciplines and are likely to have varying characteristics, suggesting that this university’s candidate pool is broad and does not reflect local candidate-pool features that would make this an idiosyncratic sample. Third, grading is not centrally managed; faculty are largely free to determine their own grading standards, which should lead to differences in grading rather than a consistent trend of grade inflation. Together, these factors make it unlikely that graduate grade inflation is merely a localized issue specific to the institution under study and instead suggest that it aligns with broader trends observed at the high school and undergraduate levels. In any case, we acknowledge that exploring this phenomenon with data from a single institution is limited and that the magnitude of grade inflation observed may not generalize fully to other institutions.
Future research should attempt to replicate these findings across institutions and geographical regions to confirm the presence and magnitude of graduate grade inflation elsewhere and to test institution-level effects. If possible, replicating our findings using multi-institutional datasets would be optimal. If graduate grade inflation is consistently observed, students, educators, and prospective employers should re-evaluate the value of school grades in conveying meaningful information about student learning progress and overall student quality. New learning techniques like peer assessment, which is linked to learning, may be necessary for helping students reflect on their learning progress [56,57]. Instructors may also need to rely on formative assessments to gauge students’ level of learning. Likewise, in the workplace, employers may place less emphasis on school grades when distinguishing applicants and consider alternative measures of skills and competencies.
Second, we acknowledge that using GRE scores as a proxy for baseline student ability is imperfect. The GRE has undergone reforms, including changes in test format and scoring, and many universities, including the one examined in this study, have gradually adopted test-optional policies in recent years. Nevertheless, standardized test scores such as the GRE have long been regarded as a relatively direct and objective measure of cognitive ability in fields such as economics and industrial/organizational psychology. This approach is also consistent with prior grade inflation research: for example, Kostal et al. [5] controlled for student-level SAT scores in their analysis, and Rojstaczer and Healy [6] used SAT scores as a proxy for institutional selectivity. Taken together, these examples demonstrate that the use of standardized test scores as proxies for student ability is well established in the literature, thereby justifying our approach. At the same time, we encourage future researchers to triangulate multiple indicators of student ability whenever possible when studying grade inflation.
Third, while the current study provides evidence of graduate grade inflation, it was not possible to tease out the factors driving the observed grade increases. Only baseline ability, sex, and ethnicity were included; no other individual-level variables (e.g., undergraduate GPA) could be included due to the limited information retained in student records. Other sources of variance, including institutional/departmental, instructor, and student characteristics such as curricular changes and shifts in grading philosophy, might also be potential alternative explanations for the observed grade increases [e.g., 39]. However, there were no major policy reforms with respect to grading during this time. Given that the effect was observed across a wide number of programs, which are effectively isolated from each other with respect to grading philosophy, changes in grading policy are unlikely to fully explain the observed trend. To the extent that other indicators of student quality, such as student effort and course-taking patterns, account for the observed increases over time, the magnitude of the observed grade inflation would be reduced. However, rather than providing a universal estimate of grade inflation, our intent is to provide preliminary evidence for grade inflation at the graduate level. We encourage future research to consider alternative explanations for the observed grade increases, such as socioeconomic status, social support, effort, and motivation to learn, to obtain a more precise estimate of the magnitude of graduate grade inflation. Additionally, to deepen our understanding of grade inflation in graduate education, future research should also explore the role of course design (e.g., curriculum content and class size) and faculty characteristics (e.g., biases, composition, and grading philosophy), which prior studies have identified as potential mechanisms contributing to genuine grade inflation [e.g., 5].
These factors can be examined quantitatively, by incorporating the above variables as controls, and qualitatively, through focus groups or in-depth interviews, to understand the drivers of grade inflation from faculty and student perspectives.
Fourth, the current treatment of degree level may be overly simplistic, as there might be substantial differences among academic programs at the same degree level. For instance, thesis-based and practice-oriented master’s programs may have substantially different grading policies, as differences in course content may necessitate different learning evaluation methods, and students and course instructors may have different motivations. Future research should include program type (i.e., research versus practice) as a predictor and examine whether grade inflation is more readily observed in practice-oriented master’s programs. Similarly, the current study only examined grade inflation in research-oriented doctoral programs; professional doctoral programs such as medicine and dentistry were not included in our analyses due to data sparsity across years. Future research could use multi-institutional samples with larger sample sizes to detect the presence and extent of grade inflation in these disciplines, which will be essential for developing a broader understanding of the utility of school grades in graduate education.
Conclusion
To summarize, the current study addresses a critical gap in the literature by investigating the presence of graduate grade inflation at the program level. In parallel with prior research on undergraduate education, our results suggest that graduate grade inflation exists and that its extent varies across different academic programs. Grade inflation appears to be more pronounced at the master’s level than at the doctoral level. Our findings offer valuable insights to both employers and educators by demonstrating that the signaling power of graduate school grades in reflecting student quality may not remain constant over time.
Supporting information
S1 File.
Table S1a. List of CIP master’s programs included in the current study. Table S1b. List of CIP doctoral programs included in the current study. Table S2a. Results from linear mixed-effects models for master’s programs. Table S2b. Results from linear mixed-effects models for doctoral programs. Table S3a. Results from Bayesian multilevel ordinal models for master’s programs. Table S3b. Results from Bayesian multilevel ordinal models for doctoral programs. Table S4. Results from Bayesian multilevel ordinal models for both degree levels.
https://doi.org/10.1371/journal.pone.0341315.s001
(ZIP)
References
- 1. Bachan R. Grade inflation in UK higher education. Studies in Higher Education. 2015;42(8):1580–600.
- 2. Jephcote C, Medland E, Lygo-Baker S. Grade inflation versus grade improvement: are our students getting more intelligent?. Assessment & Evaluation in Higher Education. 2021;46(4):547–71.
- 3. Rojstaczer S, Healy C. Where a is ordinary: The evolution of American college and university grading, 1940-2009. Teachers College Record. 2012;114(7):1–23.
- 4. Bejar II, Blew EO. Grade Inflation and the Validity of the Scholastic Aptitude Test. Am Educational Res J. 1981;18(2):143–56.
- 5. Kostal JW, Kuncel NR, Sackett PR. Grade inflation marches on: grade increases from the 1990s to 2000s. Educational Measurement. 2015;35(1):11–20.
- 6. Rojstaczer S, Healy C. Grading in American colleges and universities. Teachers College Record: The voice of scholarship in education. 2010.
- 7. Juola AE. Grade inflation in higher education. 1976.
- 8. Achen AC, Courant PN. What Are Grades Made Of?. J Economic Perspectives. 2009;23(3):77–92.
- 9. Berezvai Z, Lukáts GD, Molontay R. Can professors buy better evaluation with lenient grading? The effect of grade inflation on student evaluation of teaching. Assessment Evaluation in Higher Education. 2020;46(5):793–808.
- 10. Park B, Cho J. How does grade inflation affect student evaluation of teaching?. Assessment & Evaluation in Higher Education. 2023;48(5):723–35.
- 11. Ekstrom RB, Villegas AM, Bruschi BA, Cline FA. College grades: an exploratory study of policies and practices. ETS Research Report Series. 1994;1994(1).
- 12. Matos-Díaz H. Student evaluation of teaching, formulation of grade expectations, and instructor choice: explorations with random-effects ordered probability models. Eastern Econ J. 2012;38(3):296–309.
- 13. Kezim B, Pariseau SE, Quinn F. Is grade inflation related to faculty status?. Journal of Education for Business. 2005;80(6):358–64.
- 14. Silva PL, DesJardins S, Biscaia R, Sá C, Teixeira P. Public and Private School Grade Inflations Patterns in Secondary Education. SSRN Journal. 2023.
- 15. Gershenson S. Grade Inflation in High Schools (2005–2016). The Thomas B. Fordham Institute; 2018 Sept.
- 16. Arrafii M, Arsyad A. Grades and grade inflation: exploring teachers’ grading practices in Indonesian EFL secondary school classrooms. Pedagogy, Culture & Society. 2020;28(3):477–99.
- 17. Nieswiadomy M, Kim M. Grade inflation at 34 public universities in Texas (2012–2019). Appl Econ. 2024;57(43):6820–32.
- 18. Ullah H, Qureshi AW, Ali R. Grade inflation in Pakistan higher education: teachers’ perspective. J Further and Higher Education. 2024;48(7):700–13.
- 19. Butcher KF, McEwan PJ, Weerapana A. The Effects of an Anti-Grade Inflation Policy at Wellesley College. J Economic Perspectives. 2014;28(3):189–204.
- 20. Chowdhury F. Grade inflation: causes, consequences and cure. JEL. 2018;7(6):86.
- 21. Sanchez EI, Moore R. Grade inflation continues to grow in the past decade. ACT. 2022.
- 22. UNESCO. Higher education: Figures at a glance. 2024. https://unesdoc.unesco.org/ark:/48223/pf0000389069
- 23. National Center for Education and Statistics. Graduate Degree Fields. Condition of Education. https://nces.ed.gov/programs/coe/indicator/ctb. 2024. Accessed 2024 October 22.
- 24. Eurostat. Educational attainment statistics. https://ec.europa.eu/eurostat/statistics-explained/index.php?title=Educational_attainment_statistics#Development_of_educational_attainment_levels_over_the_last_ten_years. Accessed 2024 October 22.
- 25. Spence M. Job Market Signaling. The Quarterly J Econ. 1973;87(3):355.
- 26. Stiglitz JE. The theory of “screening,” education, and the distribution of income. The American Economic Review. 1975;65(3):283–300.
- 27. Pattison E, Grodsky E, Muller C. Is the Sky Falling? Grade Inflation and the Signaling Power of Grades. Educ Res. 2013;42(5):259–65. pmid:25288826
- 28. Davenport TH, Harris J, Shapiro J. Competing on talent analytics. Harvard Business Review. 2010.
- 29. Barrick MR, Mount MK. The big five personality dimensions and job performance: A meta‐analysis. Personnel Psychology. 1991;44(1):1–26.
- 30. Wilmot MP, Ones DS. A century of research on conscientiousness at work. Proc Natl Acad Sci U S A. 2019;116(46):23004–10. pmid:31666330
- 31. Van Iddekinge CH, Arnold JD, Krivacek SJ, Frieder RE, Roth PL. Making the grade? A meta-analysis of academic performance as a predictor of work performance and turnover. J Appl Psychol. 2024;109(12):1972–93. pmid:39190439
- 32. Lambating J, Allen JD. How the multiple functions of grades influence their validity and value as measures of academic achievement. New Orleans, LA, 2002.
- 33. Koenka AC, Linnenbrink-Garcia L, Moshontz H, Atkinson KM, Sanchez CE, Cooper H. A meta-analysis on the impact of grades and comments on academic motivation and achievement: a case for written feedback. Educational Psychology. 2019;41(7):922–47.
- 34. Main JB, Ost B. The impact of letter grades on student effort, course selection, and major choice: a regression-discontinuity analysis. J Economic Education. 2014;45(1):1–10.
- 35. Gray T, Bunte J. The Effect of Grades on Student Performance: Evidence from a Quasi-Experiment. College Teaching. 2021;70(1):15–28.
- 36. Thibodeaux J, Deutsch A, Kitsantas A, Winsler A. First-Year College Students’ Time Use: Relations With Self-Regulation and GPA. J Advanced Academics. 2017;28(1):5–27.
- 37. Wang Y, Cullen KL, Yao X, Li Y. Personality, freshmen proactive social behavior, and college transition: Predictors beyond academic strategies. Learning and Individual Differences. 2013;23:205–12.
- 38. Landry A, Jacobs S, Newton G. Effective use of peer assessment in a graduate level writing assignment: A case study. IJHE. 2014;4(1):p38.
- 39. Double KS, McGrane JA, Hopfenbeck TN. The impact of peer assessment on academic performance: A meta-analysis of control group studies. Educ Psychol Rev. 2020;32(2):481–509.
- 40. Kelleher M, Kinnear B, Sall DR, Weber DE, DeCoursey B, Nelson J. Warnings in early narrative assessment that might predict performance in residency: signal from an internal medicine residency program. Perspect Med Educ. 2021;10(6):334–40.
- 41. Sackett PR, Yang H. Correction for range restriction: an expanded typology. J Appl Psychol. 2000;85(1):112–8. pmid:10740961
- 42. Geisinger KF, Rabinowitz W. Individual Differences Among C. J Instructional Psychol. 1980;7(1):20–7.
- 43. Zhou E. Graduate Enrollment and Degrees: 2011 to 2021. Washington, DC: Council of Graduate Schools. 2022.
- 44. Johnson V. Master’s or PhD? My advice - follow the funding. https://www.profellow.com/tips/masters-or-phd-my-advice-follow-the-funding/. 2017. Accessed 2024 October 23.
- 45. Duke University. Should I pursue a master’s or a Ph.D.?. https://gradschool.duke.edu/admissions/how-choose-right-graduate-school/should-i-pursue-masters-or-phd/. 2024. Accessed 2024 October 23.
- 46. National Center for Education and Statistics. CIP: The Classification of Instructional Programs. 2020. https://nces.ed.gov/ipeds/cipcode/Default.aspx?y=56
- 47. Dziak JJ, Henry KL. Two-part predictors in regression models. Multivariate Behav Res. 2017;52(5):551–61. pmid:28622056
- 48. Lee E, Hong S. Adequate sample sizes for a three-level growth model. Front Psychol. 2021;12:685496.
- 49. McNeish D, Kelley K. Fixed effects models versus mixed effects models for clustered data: Reviewing the approaches, disentangling the differences, and making recommendations. Psychol Methods. 2019;24(1):20–35. pmid:29863377
- 50. Harrell FE. Regression Modeling Strategies. Springer International Publishing. 2015.
- 51. Bauer DJ, Sterba SK. Fitting multilevel models with ordinal outcomes: performance of alternative specifications and methods of estimation. Psychol Methods. 2011;16(4):373–90. pmid:22040372
- 52. Cao J, Wang X, Damiano MD, Zhou C, Zhu J. A Bayesian multilevel ordinal regression model for fish maturity data: Difference in maturity ogives of skipjack tuna (Katsuwonus pelamis) between schools in the western and central Pacific Ocean. Front Marine Sci. 2021;8:736462.
- 53. Sumida N. UC Berkeley grade inflation: these majors seeing biggest GPA jumps. San Francisco Chronicle. 2024.
- 54. Singleton R, Smith ER. Does grade inflation decrease the reliability of grades?. J Educational Measurement. 1978;15(1):37–41.
- 55. Taylor HC, Russell JT. The relationship of validity coefficients to the practical effectiveness of tests in selection: discussion and tables. J Applied Psychol. 1939;23(5):565–78.
- 56. Li H, Xiong Y, Hunter CV, Guo X, Tywoniw R. Does peer assessment promote student learning? A meta-analysis. Assessment & Evaluation in Higher Education. 2019;45(2):193–211.
- 57. Yeritsyan A, Mjelde JW, Litzenberg KK. Grade inflation or grade increase. J Agric Appl Econ. 2022;54(2).