Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Dear admission committee…: Which moves in application essays predict student master grades?

  • Timon de Boer ,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Project administration, Validation, Visualization, Writing – original draft

    t.h.j.deboer@uu.nl

    Affiliations Copernicus Institute of Sustainable Development, Utrecht University, Utrecht, Utrecht State, The Netherlands, Dialogic Innovation and Interaction, Utrecht, Utrecht State, The Netherlands

  • Frank Van Rijnsoever,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Project administration, Supervision, Validation, Writing – review & editing

    Affiliation Copernicus Institute of Sustainable Development, Utrecht University, Utrecht, Utrecht State, The Netherlands

  • Hans de Bresser

    Roles Conceptualization, Formal analysis, Supervision, Writing – review & editing

    Affiliation Earth Sciences Department, Utrecht University, Utrecht, Utrecht State, The Netherlands

Abstract

Application essays are a commonly used admission instrument for students entering higher education. The quality of the essay is usually scored, but this score is often subjective and has poor interrater reliability due to the unstructured format of the essays. This results in mixed findings on the validity of application essays as an admission instrument. We propose a more objective method of using application essays, using Latent Dirichlet Allocation (LDA), a text mining method, to distinguish seven moves occurring in application essays written by students who apply to a master degree program. We use the probability that these moves occur in the essay to predict study success in the master. Thereby we answer the following research question: What is the effect of discussing different moves in students’ application essays on the student grades in a master program? From the seven different moves (functional unit of text) we distinguished, five of which have a significant effect on student grades. The moves we labeled as ‘master specific’ and ‘interest to learn’ have a positive effect on student grades, and the moves we labeled as ‘research skills’, ‘societal impact’ and ‘city and university’ have a negative effect. Our interpretation of this finding is that topics related to intrinsic motivation and specific knowledge, as opposed to generic knowledge, are positively related with study success. We thereby demonstrate that application essays can be a valid predictor of study success. This contributes to justifying their use as admission instruments.

1. Introduction

Since, the 1990s, the number of academic studies on the predictive validity of admission criteria for university students has steadily increased [1, 2]. Among these, studies on admission criteria that test the students’ cognitive skills, such as grades or standardized tests, are dominant [1, 35]. However, academics and practitioners have noted that assessing only the students’ cognitive skills yields an incomplete representation of their abilities, and increases the likelihood of incorrect admission decisions [4]. For this reason, studies into non-cognitive admission criteria have become increasingly prevalent [6, 7]. Examples of such non-cognitive criteria are teamwork, communication, empathy and motivation [8].

Of these con-cognitive criteria, motivation is among the most prominent. A student’s motivation is well known to affect the study success and study behavior [9, 10]. A commonly used admission instrument to test this criterion is the students’ application essay, also known as personal statement or motivation letter [11]. Universities often require students to write an application essay during the admission process. In the essay students explain different reasons why they should be admitted to the study program. These reasons are considered in the admission decision. For students, application essays provide an opportunity to state why they are suitable for a program. This can be done by writing about their non-cognitive skills, stressing their motivation for entering the program, or highlighting qualities that are not immediately discernable from their prior education and grades, such as personal values, extracurricular activities or work experience [1214]. Application essays also allow students to provide context to aspects of their application. For example, why they experienced a delay in completing their Bachelor program. Moreover, application essays are well suited for students to reflect on themselves. The ability to self-reflect is often associated with successful students [15, 16].

To admissions offices, application essays are a relatively efficient means to gain insight in why a student wants to study a particular program, to understand their context, and to better assess cognitive and non-cognitive skills. In addition, application essays give admissions offices some insights in the writing skills of the student. Finally, if used properly, letters can give insights in the socio-economic background of the student, which can help to keep the admissions sufficiently diverse [17].

However, some scholars doubt if application essays are a useful admission instrument [18, 19] because the empirical evidence on the use and the predictive validity of application essays for study success is scarce and with ambiguous results [17, 19]. Application essays may also favor students from an advantaged socio-economic background because of their particular writing skills and stronger access to useful advice [20]. Furthermore, the format of an application essay is often relatively unstructured, meaning there is not provided template, leading to sizable differences between them. This unstructured format and the sheer volume of text make it difficult to systematically assess the effect of application essays on study success for admissions offices, and thereby to evaluate the usefulness of application essays as an admission instrument. This criticism indicates a need to further the study predictive validity of the commonly used admission instrument.

Recently, scholars shifted their attention to studying the contents of application essays. This is an important shift, as earlier studies on this topic often measure the quality of application essays with a rating of one or more assessors [19, 21, 22] and sometimes find positive effects on student grades [23, 24]. However, the interrater reliability of such ratings is often poor [21, 22, 25]. There are a number of studies studying the contents of application essays, identifying topics discussed in them [2628], but do not relate these to grades. Furthermore, these paper rely on more qualitative methods, namely discourse and genre analysis. Using a quantitative approach, Pennebaker et al. [11] study the language students use in application essays using linguistic analysis. They find that students that use categorical language, about complex concepts and objects, acquire higher grades. Lower grades are acquired by students who write more about personal narratives. Alvero et al. [17] also use a linguistic analysis to show that the writing style and contents of the application essay are strongly related to household income and SAT scores. Hoblos and Kakkonen et al. [29, 30] use a LDA to replace assessor rating with an automatic grading process for the quality of the application essay, and predict student success accurately with this method. The paper of Brown [31], which combines qualitative and quantitative methods, reports that students dedicating more space in their application essay to their research experience later become more successful in their study program.

However, to our knowledge, there is no research so far that directly links the topics occurring in an application essay to subsequent academic success in a master program. For example, we do not know if discussing expectations and ambitions for the master’s program is more likely to lead to study success than discussing professional development after the master’s program, while both are likely topics in an application essay. Examining the content of application essays, and relating these to study success can give admission officers an heuristic to make better use of this admission instrument.

To fill this knowledge gap, we analyze the application essays from four cohorts of in total 820 graduate students from a large university in the middle of the Netherlands. We do so by using Latent Dirichlet Allocation (LDA). This text mining application seeks patterns within the texts by looking at the co-occurrence of words. This enables us to identify topics that are discussed in the letters, and to assign probabilities to topics in letters. We then link these topic probabilities to grades scored in the master as indicator for study success. This allows us to formulate an answer to the following research question:

What is the effect of discussing different moves in students’ application essays on the student grades in a master program?

This paper contributes to student admission literature by filling knowledge gaps on one of the most commonly used, but most uncommonly studied, admission criteria: application essays. Our LDA approach ends up yielding seven moves, which are in our case, units of text with the purpose to gain admission to a master program [26]. of which five are significantly related to student grades. We report that two moves related to intrinsic motivation, which we labeled ‘master specific’ and ‘interest to learn’, have a positive effect on student grades. Three moves we labeled as ‘research skills’, ‘societal impact’ and ‘city and university’ have a negative effect on student grades. This confirms existing theory that intrinsic motivation is an important driver of study success. Our findings have practical implications for universities, and influences their use of application essays as a admission criterion. First, we find support that application essays can indeed be used as a valid admission instrument. Second, we report a number of topics that influence student grades, which can thus be used as evaluation criteria during the admission process. Third, we argue that intrinsic motivation and discussing specific knowledge, as opposed to generic knowledge, are important overarching admission criteria, which should be considered in the admission decision. Application essays can be a useful instrument to evaluate both.

2. Theoretical background

This chapter provides theoretical context for the variables used in this study. We first define our dependent variable: student success. We then provide theoretical background for our independent variables: the themes students write about in their essay. We draw from existing studies on this topic, to gain insight in the themes students discuss in their essays. This way, we can later observe if the topic we distill from essays using text mining are similar to the topics already discussed in academic literature. We finally discuss why we included the control variables we used in this study.

2.1. Defining student success

We define student success as ‘academic achievement, engagement in educationally purposeful activities, satisfaction, acquisition of desired knowledge, skills, and competencies, persistence, and attainment of educational objectives [32, p. 7]’. From this definition it follows that it needs to be assessed if these educational objectives are achieved. Most educational objectives are commonly achieved and assessed by completing a series of educational activities, such as courses and a thesis, which culminate in a degree. Therefore, degree attainment is a common way to measure student success. After all, students go to university to acquire knowledge and skills in a coherent program, and need to complete the entire program successfully to fully master the contents of the program. However, simply attaining a degree by finishing courses does not fully capture how proficient students’ are at acquiring knowledge. Many universities and students’ future employers value proficiency as high measure of success. Just because two students both graduate from the same program, does not mean they’ve reached a similar level of mastery. Grades are a more suitable method of demonstrating mastery and proficiency of desired learning objectives [33]. They are the most commonly used indicator for student success [1], as they are the primary method of indicating how well the student performed in tests and coursework. Grades also reflect students’ self-regulatory competencies [34]. Grades are awarded to students throughout the program, which means they provide a more refined insight into the students development over time. In this paper, we will consider attained grades during the master as indicator for student success.

2.2. Themes from application essays

Students write admission essays to demonstrate the fit between themselves and their study program of choice in order to gain admission [26]. Prior studies showed that students can cover several themes in their essay, such as their motivation and reasoning for applying to the program and university of their choice, why they are qualified to follow the program, their personality and relevant life experience, and future career goals [26, 28]. Our paper starts out abductively. We discuss four possible themes in the next paragraphs, but we won’t use these four themes to code and label the topics we find using our text mining methodology. Moreover, we won’t use them to formulate hypotheses. Nonetheless, this chapter can help us to gain a better grasp on what themes application essays contain, and some minor expectations about how these relate to study success. We will observe how these four themes relate to the topics we find using our text mining approach.

2.2.1. Motivation for applying.

First, students can be intrinsically motivated to study the program. Intrinsic motivation comes from humans’ natural inclination towards exploration, mastery and curiosity [35]. Deci and Ryan state that intrinsic motivation arises when tasks are perceived as challenging, leave room for your own interpretation and carried out in relation to people the individual is sympathetic to. These requirements are present in the case of applying to a graduate program, which is why we expect students to discuss their intrinsic motivation in their application essay. Several studies have demonstrated that learning form intrinsic motivation leads to better study results [3640], so we would expect students who a priori demonstrate their intrinsic motivation to have higher study success.

Motivation can also come from extrinsic sources. Extrinsic motivation is described as doing something because it leads to a separable outcome [35]. In other words, an activity is undertaken in order to achieve a goal which is not inherently tied to the activity [41]. For example: a child does the dishes in order to receive an allowance, not because doing dishes is an inherently rewarding and stimulating activity. One extrinsic reason for students to apply to a master’s program is to develop social connections and networks [38]. Because of this, the university lifestyle is attractive for students [42]. During their studies, students seek to engage in meaningful discussions with peers. Through this, many students look for and find a sense of belonging in the academic community [42]. Students are also known to include reasons for choosing a specific university within a specific country, often using phrases like ‘this institute is among the best in the world’ [43]. We expect students to cover these sources of extrinsic motivation in their essay, and we expect these topics to have a positive relationship with study success.

2.2.2. Qualifications for studying.

Qualifications for studying involves describing the students’ relevant academic and professional preparedness for the study program [26]. This topic is often considered the most important aspect of the application essay by both students and selectors [26, 27]. Applicants need to convince the reader that they possess the right qualifications for the study program they apply for, and to establish an identity as a competent academic [26, 44]. They will do this by listing their academic achievements so far, such as attained degrees, courses and grades or by highlighting specific projects and research experiences related to their field [26, 28]. Students’ qualifications are self-reported, which means this topic contains a risk of self-aggrandizement [44]. Despite this, Brown [31] reports that applicants that dedicate more space in their letter to describing their research experience and interests become more successful.

2.2.3. Future goals.

Future goals refer to the applicant’s intended career goals and desired personal development after graduation. This topic differs motivation for applying (section 2.2.1.) in the sense that students describe here what their future career goals are instead of arguing that studying will help them realize their career goals. Students will sometimes write about a specific profession they wish to fulfill in the future (for example: Doctor) and what the universities’ post-graduation employment prospects are [27, 43, 45]. Attaining a master’s degree is known to positively influences students’ career trajectory and their financial prospects [39] and students therefore expect an academic degree to speed up career development and lead to faster promotions [46]. It is likely that such reasons are discussed in students’ application essays.

In addition, students can also cover other future goals, such as societal contributions they wish to make (for example: combat climate change), or personal growth they want to achieve (for example: becoming a team worker) [27, 28]. Writing about future career plans, societal contributions, and goals for personal growth, shows that students are able to reflect upon themselves and their learning goals. Students with this ability are known to be more successful, which is why we could expect students who write more about this topic to have higher study success [16].

2.2.4. Personality and life experience.

With this topic, students explicitly describe their unique personality and life experience to distinguish themselves from the large pool of applicants [26]. Students can also write about their personal history, both related or unrelated to the study program [28]. For instance, students with a relative or close friend who followed the study program or has a career in the field will often mention this in their essay [28]. Another topic which can be covered is the students’ experience and affinity with teamwork [26]. Students also often discuss their extracurricular activities, (voluntary) work experience and community involvement that offers insight in their abilities and skills related to their field of study [26, 28, 31]. Little is known about how students’ descriptions of their personality relate to study success. However, empirical work shows some personality traits, especially conscientiousness, are positively related to study success [4749]. However, personal narratives and experiences which students describe in their application essays do not link strongly to their conscientiousness [26, 28].

2.3. Control variables

We also include six control variables that can, according to the existing literature, have an effect on our dependent and independent variables.

First, we controlled for the length of the application essay because literature shows that longer application essays tend to be more successful [26, 50]. Second, we controlled for the average bachelor grade of the student. We control for bachelor grades because they have a consistently positive effect on student success [5, 18, 51, 52]. This is because, independent of the content of the bachelor’s program, grades reflect the self-regulatory competencies that are needed for a student to successfully navigate a bachelor’s program [34]. Grades also reflect the students’ overall cognitive ability and intelligence [5]. Because grades reflect students cognitive and self-reflective ability, grades are also likely to influence the quality of the application essay. Our third and fourth control variables are the demographic variables age and sex. Age and sex have a well-reported influence on student success, with multiple studies finding that older students and female students perform better [5356].

Fifth, we controlled for the students’ international background. There is a lot of evidence that international students are at more risk of dropping out compared to their non-international peers [57]. This increased drop-out risk can be attributed to language differences and cultural barriers international students might have to overcome, [57]. Financial pressures and risks are also a common reason for study problems for international students [57]. These pressures can be caused by a precarious personal socio-economic position, coupled with high tuition rates compared to the students country of origin. Another factor is the sometimes unreliable financial support by the student’s country of origin [57]. All of these problems do not have the same intensity for every international student, as students from the Global South are more at risk than their Northern counterparts. Because of this, we included variables about the students’ international background as a control variable.

Finally, we controlled for the students choice of graduate program. Although the graduate programs in our sample are all from the same faculty, there are clear differences between them. It is possible that these different programs attract different types of students who require different types of skills. The topics from these students’ application essays might therefore also differ which makes this a suitable control variable.

3. Methods

To answer our research question, we identified topics in the application essays of master students using a Latent Dirichlet Allocation. We then related the probabilities that these topics occur in a letter to the grades achieved in the master. This paper therefore has an abductive research design.

3.1. Data description and collection

We collected data about application essays, acquired grades and other information from students at the Faculty of Geosciences at Utrecht University in the Netherlands. This multidisciplinary faculty offers research, impact and education “with regard to our Earth system and sustainability, from the local to the global scale” [58]). The faculty offers master programs via three education institutes that respectively focus on earth sciences, sustainable development and human geography. The master programs vary in their emphasis on the natural or social sciences, but all fit under the thematic umbrella of the faculty. To keep the master programs comparable, we look at the eleven two-year master programs of the faculty that are offered in English. These were at the time: [1] Earth Structure and Dynamics, [2] Earth Surface and Water, [3] Earth, Life and Climate; [4] Energy Science, [5] Geographical Information Systems, [6] Innovation Sciences, [7] Marine Sciences, [8] Sustainable Business and Innovation, [9] Sustainable Development, [10] Urban and Economic Geography, and [11] Water Science and Management. Even though every master’s program in the sample is unique in its content, their overall structure is similar. All programs fit in most of their coursework in the first year, whereas the second year consists mostly of internships and a thesis. This further contributes to the comparability of the programs.

The data is from four cohorts of master students, which started studying in 2014, 2015, 2016 and 2017. Data collection was approved by the faculty’s ethics committee, who also waived the need for obtaining consent from students. After data cleaning, the sample consists of 2701 students from all over the world of which 2051 graduated and 650 dropped out. To be admitted, students require a degree in a university bachelor program or a tailor-made premaster program that confirms that they have acquired the knowledge, insight and skills at the university Bachelor’s level with regards to the Master’s program. Students provide the admission office with a list of attained grades, their resume and a personal application essay. There is no minimum required grade, but students with an average grade of a seven or higher are usually admitted. For students with a lower average grade, the decision often comes down to a careful examination of the application essay. For the application essays, no specific requirements (including word count) were formulated. The only suggestion provided to the applicants was to ‘not repeat information from the students resume’. All admissions are made on a case-by-case basis. Despite our efforts, we were not able to fully complete the data for each student. This was mostly due to the fact that it was difficult to obtain an reliable average bachelor grade for international students. The number of students that entered the program and for which we’re able to reconstruct a complete admission file was 1307. The number of students that finished their master and for which all data was available is 820. These students were divided over the eleven separate master programs.

3.2. Measurement

3.2.1. Dependent variable: Student success.

We measured study success by looking at the students’ attained grades during the program. Grades are obtained throughout the entire educational program and can therefore be measured in various ways, such as the grade for the first or second semester, the entire first year, the last year, the finishing thesis, or most commonly, a student’s average grade over the entire program (For example, see [5961]. The choice of when to measure student grades matters because studies show that admission instruments such as bachelor’s program grades often predict grades in the first year of a program very well, but lose validity in the later years of a program [62, 63]. Students progress at different speeds throughout their program and often need to adjust to a new master’s program and institute [6466]. Therefore, it is important to measure grades at various points in the study program. As an indicator for grades, we used the average grade of the student for the entire master’s program and their average grades of the first year of the master’s program.

3.2.2. Independent variables: From essays to moves.

As independent variables, this study uses moves occurring within students’ application essays. A move is a functional unit of text, which has an identifiable purpose [26]. In our case, the purpose being to gain admission to a master program. To identify them, we applied Latent Dirichlet Allocation (LDA), which is a generative probabilistic model of a corpus. A corpus consists of a number of documents, which in turn consists of a number of terms. The LDA treats every application essay as a bag of terms where every term has its’ own probability of appearing in any of the topics. The LDA can therefore be used to distill topics from a corpus and identify the concepts that run through it [67]. LDA has been used earlier in the context of higher education. For example, Ogihare & Ren [68] used LDA to distill linguistic features from application essays to predict student retention. There are also several scholars that used LDA to automatically grade the quality of application essays [29, 30, 69].

To identify topics, we first prepared the application essays for analysis. Every term was stemmed and lemmatized with the porter algorithm [70, 71], and then transformed into lower case. Furthermore, all numbers and stop words were removed with the using SMART stopword list from the “tm” R-package. Second, we determined the optimal number of topics following the methods of Arun [72], Griffiths [73] and Cao [74] using the R package ‘ldatuning’. This is an important step as an LDA model with an insufficient number of topics is too coarse to identify accurate classifiers. On the other hand, an excessive number of topics could result in a model that is too complex, making interpretation difficult [75].

With this package, we can set a range for a number of topics, and plot the topic density for this range. The topic density indicates the specificity of terms to certain topics. The higher the topic density, the more specific terms are to that topic. Higher topic densities are preferable, because this makes the topics more easy to distinguish from each other. Fig 1 displays the results of this plot. In the bottom graph of Fig 1 a higher number on the y-axis represent a higher topic density. In the top graph the topic density is inverted, so a higher density leads to a lower number on the y-axis. We chose 30 as the final number of topics as this ensures a high topic density while maintaining a manageable number of topics for labeling. We rendered LDAs with several numbers of topics to check the face validity of the quality of the topics. We did so because we wanted to ensure that every individual master program had its’ own topic. Seeing as there are eleven master programs in the dataset, we required more topics than usually seen in topic modeling papers [7678]. The topic density could be marginally improved by creating more topics, but the higher the number of topics, the smaller the gain in density, so there would be relatively little advantage to doing this. In fact, there are occasions where an increase in the number of topics decreases the topic density, especially when the number of topics already exceeds 30. We then used a Gibbs sampling algorithm to run the LDA [79].

For every topic, the most common terms within the topic were extracted and used to label what the topic was about. Every co-author labeled the topics individually, after which the labels were compared. In the rare case that the co-authors disagreed about the label of a topic, a second round of labeling was done to increase the interrater reliability. Topics with highly similar or identical labels were then merged into moves. Merging the topics into moves was done separately by the first and second author. All three authors then checked the final labeling of the moves, after which definitive labels were chosen. We ended up distinguishing seven separate moves from the thirty topics. We discuss these moves, and link them to the themes found in chapter 2 in section 4.1. We schematically show this in Fig 2 below.

thumbnail
Fig 2. Schematic overview of topics, moves and themes and how moves are used as independent variable.

https://doi.org/10.1371/journal.pone.0304394.g002

For each application essay, the topic probability of every topic and move was calculated. The topic probability is a number between 0 and 1. If the topic probability is 1, it means that all words from the letter belong to that topic. If the topic probability is 0, it means that no words from the letter are from that topic. The probability of a move was calculated by adding up the probabilities of its’ underlying topics. These move probabilities were used as predictors of student success during the analysis.

3.2.3. Control variables.

The average bachelor grades of students who completed a Dutch bachelor’s program were extracted from the faculties’ data storage application. For the 843 international students the application data were stored in PDF files that listed the full academic transcript of each student. A research assistant manually examined every file and extracted the average grades in the bachelor’s program. Information from the Dutch organization Nuffic, which provides information on educational systems across the world, was used to transform international grades into a Dutch equivalent between 1 and 10. The length of the application essay was measured by counting the number of words.

We categorized the students’ international using the United Nations geoscheme [80], which classifies countries into regional and subregional groups. The advantage of this classification is that cultural aspect of entire regions are controlled for, without having to rely on very small sample sizes for some countries in our dataset. Unfortunately, for some of these regional and subregional groups the sample size was still very small (for several groups, N was < 5). We therefore manually added together these groups into larger regional or continental groups. In the regressions, we used the category ‘Europe’ as reference category. The age and sex of each student were known to the faculty as part of the admission process. We refer to a students’ biological sex as this is what they filled in their application form. Students have the option to not fill is this in (as one student did) in case they do want to for any reason, like their biological sex is not binary, or their biological sex does not align with their gender identity. Sex was measured as a binary variable, with male students coded as 1 and female as 2. The age of the students was measured by their age in years at the starting day of the master’s program. The choice of master program is extracted from the students’ application file. Tables 1 and 2 present the descriptive statistics and correlation matrix of our variables.

thumbnail
Table 1. Descriptive statistics of variables used in this study.

https://doi.org/10.1371/journal.pone.0304394.t001

thumbnail
Table 2. Correlation matrix of variables used in this study.

https://doi.org/10.1371/journal.pone.0304394.t002

3.3. Analysis

In the first step of the analyses, we estimated seven beta regression models with the goal of testing the influence our control variables have on the move probabilities. Beta regressions are commonly used when the dependent variable can take values between 0 and 1 [81], as is also the case here. We ran seven beta regressions, with our seven moves from application essays as the dependent variable and all control variables as independent variables (for clarity’s sake, we will continue to refer to these as control variables in text). To do this, we first scaled the move probabilities to a number between 0 and 1.

Next, we estimated our main models to predict student grades in their master program. For all models we estimated an ordinary least squares (OLS) multiple linear regression, as the dependent variables tested in these models are continuous. We ran four models, two for each of our dependent variables. The first model for each dependent variable only includes the control variables (models 1 and 3) while models 2 and 4 add the independent variable as main effect.

4. Results

4.1. Moves derived from topic modeling

The seven moves are shown on the right side of Fig 3. On the left side of the same figure are the four themes we distilled from literature (section 2.2). We will now discuss the contents of the seven moves we distilled from our topic model, and how we linked these to the themes from existing literature. The linkages between them can be seen in Fig 3. All the moves we derived from our topic model are connected to the themes from literature. Some moves are connected to multiple themes we derived from our literature review, but none are connected to no theme from the literature.

thumbnail
Fig 3. Connections between themes derived from literature and moves derived from topic modeling.

https://doi.org/10.1371/journal.pone.0304394.g003

Table 3 presents the results from our beta regressions. Each column presents the estimates of a control variable predicting the probability that a specific move is discussed. Based on the pseudo-R2, we conclude that the model fit varies quite a bit across the seven models. The pseudo-R2 is highest (0.514) for model 1, which measures ‘master specific’ indicating that the control variables explain the variance of the dependent variables very well. For models 2 and 4 the pseudo-R2 is also higher than 0.2, indicating a good model fit. For models 3, 5, 6 and 7 the pseudo-R2 is below that threshold, implying that the independent variables included do not explain much of the move. We discuss each move, and it’s predicting variables. We refrain from discussing explanations for the relationships we find, because such explanations would mostly be speculative.

thumbnail
Table 3. Beta regressions predicting move probabilities with control variables.

https://doi.org/10.1371/journal.pone.0304394.t003

  • The first move we distilled is called ‘master specific’. In this move, we found words that describe the contents of the master program. As this move contains the topic probabilities for every individual master program, we multiplied this value with the master program dummy variable. In that way, only the topic probability for the ‘master specific’ topic of the student’ master program of choice is used in the regression models. We connected this move to the ‘motivation for applying’ theme and the ‘qualifications’ theme because it captures both. After all, students wrote about master specific term to motivate how passionate they are for content of the master, but also to highlight relevant knowledge and experience they already possess on this. The first control (i.e. independent) variable that has a significant effect on this move is the word count of the application essay. This effect is negative, which means that students that have a longer essay write less about ‘master specific’ content. Second, students from Asia also have a negative significant effect which means they write less about this move compared to students from Europe.
  • The second move is called ‘research skills’. This move consists of terms that describe the students research and methodological skills. Some of the most common terms in this move are ‘research’, ‘design’ and ‘method’. We connected this move to the theme ‘qualifications’, because it described a skill that qualifies students for entering the master. The control variables word count of the application essay and sex have a negative significant effect on this move, meaning that students with a longer essays and female students write less about their research skills. Students from Asia and North America as well as students with a higher bachelor grade also write significantly more about their research skills.
  • The third move is called ‘prior education’. With this move, students described which education they have completed prior to applying. This can be about the student’s bachelor program, but also about educational institutes the student attended before that. Some of the most common terms in this move are ‘school, ‘bachelor and ‘studi’. We connected this move to the theme ‘qualifications’, because relevant prior education is an important qualification students highlight when they want to enter a master. There are three control variables that significantly negatively affect the probability of this move occurring: word count of the application essay, and having an international background in Asia or North America’. This means that students from these two continents, and students with a longer essay write less about their prior education.
  • The fourth move is called ‘societal impact’. This move consists of terms that describe the students desire to create societal impact during and after their master program. They wrote that they want to contribute to solving societal challenges and cover specific societal issues they want to contribute to, such as nature conservation or aiding developing countries. Some of the most common terms in this move are ‘develop’, ‘world’, ‘problem’ and ‘live’. We connected this move to the theme ‘future goals, because the societal impact students want to make during and after their master lies in the future. Control variables word count and age have a significant positive effect on this move occurring, showing that longer essays and essays written by older students contain more terms about ‘societal impact’. Furthermore, every international background has a more significant positive relationship compared with our reference category, Europe, except "Africa. This shows that students from Africa and Europe write less about Societal Impact than all other students.
  • The fifth move is called ‘interest to learn’. This is a move that described the students motivation for applying and studying. This mostly concerned their proclivity towards acquiring knowledge and learning but also to the attainment of degrees. Some of the most common terms in this move are ‘degr’, ‘work’, ‘learning’ and ‘knowledge’. We connected this move to the theme ‘motivation for applying’, because the desire for students to gain knowledge and learn is a strong motivation for studying. This move also connects to ‘future goals’ because the desired acquisition of knowledge is often a prerequisite for fulfilling the students’ goals in later life. We find significant negative effects for control variables average bachelor grade and ‘international background: South America’. This shows that students with a higher bachelor grade and students from South America write less about their interest in further learning.
  • The sixth move is called ‘city and university’. In this move, students wrote about the city the university lies in and the specific university in this city students’ applied to. They commonly write about how both city and the university are well-established and good places for studying. Students used this move to motivate their choice for this specific city and university to study. Some of the most common terms in this move are ‘univ, ‘name of university city’ and ‘studi’. We connected this move to the theme ‘motivation for applying’ because students use it to explain to motivation for studying in this specific city and at this particular university. Table 3 shows that the word count of the application essay has a significant negative effect, meaning that longer essays contain less terms about the city and university the student applied to. There are several other control variables with a significant effect such as sex, age and having international backgrounds in Africa, Asia, North America and South America’. Students from these backgrounds, female students and older students wrote more about this move.
  • The seventh move is called ‘extracurricular’. Students used it to highlight their extracurricular activities. The list of activities is quite broad, and includes student associations and boards, voluntary organizations and sports teams. Some of the most common terms in this move are ‘activ’, ‘associ’ and ‘team’. This move relates to the theme ‘personality and life experience’, because it captured personal experiences from the students life that take place outside a classroom. None of control variables included in our model influence the probability of this move occurring.

4.2. Predicting study success

Table 4 displays the results for our OLS models that predict study success. For all models the adjusted R2 was higher than 0.2, indicating good model fit. Moreover, the moves add a significant amount of explained variance to the models. An inspection of the model residuals revealed that these were normally distributed. We also made a scatterplot of the residual and predicted values of the all four models to evaluate them for heteroscedasticity. Based on these plots, there was no reason to assume that heteroscedasticity occurred. We used variance inflation factors (VIF) to check for multicollinearity. All VIFs were below 2.1, indicating no problematic levels of multicollinearity.

Model 1 and 2 have the average master grade as dependent variable and models 3 and 4 have first year master grade as dependent variable. Model 1 and 3 only include the control variables, whereas models 2 and 4 also add the independent variables. When looking at model 2 and model 4 we see several significant estimates.

First, the move ‘master specific’ has a significant positive effect on the average grade and first year grade. This means that students that either describe mastering these contents during their bachelor in specific terms, or students that specifically state their motivation to study this content during the master acquire higher grades compared to students that write less about this. The first interpretation of this finding reinforces the notion that students with existing knowledge about the contents of their program perform better [66, 82, 83]. The second interpretation reinforces the notion that intrinsic motivation is a strong driver of study success [35] as discussed in our theoretical chapter.

The move ‘research skills’ has a significant negative effect on the average grade and first year grade. This negative effect is possibly explained the fact that research skills described tend to be fairly generic, which are needed for any master. This is in line with literature that shows that specific skills are more often associated with study success [84]. Then again, these generic research skills are useful for the chosen master, which makes the negative effect somewhat surprising. Another possible explanation could be that these students overestimate their own research skills, as can sometimes be the case with self-reported skills [3, 85].

The move ‘prior education’ has no significant effect on both average master grade and first year master grade. The move ‘prior education’ relates to the students qualifications, just like ‘Master Specific’. However, ‘prior education’ is less specific about the contents and specific master program of choice, which is a likely reason that this move does not have a positive effect on study success. Further, prior education is also captured by the selection on bachelor grade, and the fact that students from only sufficiently related bachelor programs are admitted.

The move ‘societal Impact’ has a significant negative effect on the average grade and first year grade. Students using this move want to use knowledge gained in their master to make societal impact, which can be classified as a source of extrinsic motivation. For them, learning knowledge seems to be a means to an end, a separable outcome [35, 41] but not a goal in itself. An explanation for the negative effect has to do with our measure for study success. Grades are very good are measuring the acquisition and mastering of cognitive skills. However, making societal impact also requires some non-cognitive skills. The assessment methods of these skills are currently underdeveloped [86], which explain why there are not accounted for in the grading of the master program.

Then, the move ‘interest to Learn’ also has a significant positive effect on the average grade and first year grade. This effect can also be explained using motivation theory. After all, ‘interest to learn’ relates to the students’ intrinsic motivation to study the program. They consider learning contents of the study an inherently satisfying activity, without having a separable goal in mind.

The move ‘city and university’ has a significant negative effect on the average grade and first year grade. The motivation to study in a particular city is an extrinsic form of motivation because students apply to enjoy the city’s’ university lifestyle and social networks, rather than having a strong intrinsic reason for applying to their specific program. Furthermore, enthusiasm about the city and university is not a description of motivation for the specific master of choice. After all, there many possible masters to apply to in the city of Utrecht and it’s university.

The move ‘extracurricular’ has no significant effect on the average grade and first year grade. We can explain this in three ways. First, ‘extracurricular’ is also not closely related to the specific master program, which is in line with our notion that intrinsic motivation for the specific is a determinant of study success. Second, it also does not describe the students (extrinsic) motivation for following a master in their near future, because extracurricular activities describe the students’ past. Third, the skills needed to thrive in extracurricular activities might very well not be needed to thrive in an academic master program.

Looking at the control variables (Table 4, lower part), we see that the word count of the essay has a positive effect on both average and first year master grade, which corresponds with the findings of Ding (2007). The average bachelor grade also has a positive significant effect on both dependent variables. This means that students with a higher bachelor grade graduate faster, and with higher master grades. This is in accordance with existing literature on the predictive validity of grades [51, 52]. Additionally, female sex has a strongly positive effect on both average and year 1 master grade. Again, this is well in accordance with existing literature [87], likely because female students are more conscientious [88, 89]. Age of the student has no significant effect on average master grade or first year master grade. Finally, our results on the international background demonstrate that students from Africa, Asia, Central America and South America attain lower grades than students from Europe in all four models. This result corresponds with existing literature, which shows that international students, especially those from different continents can suffer from cultural bias, negatively impact their study success [90]. We also report that students from North America have significantly lower grades than European students, but only in models 1 and 3. This means this significant effect disappears when we add the independent variables to the model.

5. Conclusion and discussion

The aim of this paper was to answer the following research question: What is the effect of discussing different topics in students’ application essays on the student’s success in a master program? We used a Latent Dirichlet Allocation (LDA) to distill topics from students applying to a master. We distilled 30 topics from the corpus, which consists of 2701 data entries. These 30 topics were aggregated into 7 moves. We first predict the probability of each move occurring in the essay using control variables, and then we use the move probability as independent variables in our regression models. We found that five moves have a significant effect on student success when measured by grades. Students that write more about their interest to learn, and about the specific master of their choice attain higher grades. Students that write more often about their research skills, desire to make societal impact and the city and university where they intend to study attain worse grades. Writing about prior education and extracurricular activities has no significant effect on attained grades.

5.1. Theoretical implications

This paper provides valuable new knowledge to the student admission literature. First, we inductively study the topics and moves that are discussed in students’ application essays using LDA. There are have been other studies to list themes that occur in students’ application essays [26, 28], but these have been qualitative in nature. The moves we find using are text mining approach are similar to the themes discussed in existing work on applications essays (see Fig 3), and therefore provide quantitative validation of existing literature. Furthermore, existing papers on the themes occurring in application essays list four or five themes. We find seven distinct moves, which makes our list of moves more detailed. We therefore suggest future scholars who study the contents of application essays to use our seven moves.

Furthermore, we study relationships between these moves and study success. To the best of our knowledge, we are the first to do so. Five of our distilled seven moves have a significant effect on grades attained in the master. We find that students who write about the specific contents of their master of choice and their interest to learn attain higher grades. This confirms the notion that students who demonstrate a strong intrinsic motivation to learn the contents of the program in their application essay are more successful. We report that students who write about their research skills, a more generic skill that is useful in many different masters, attain lower grades. We also report that students who write about their desire to make societal impact and the city and institute they want to study attain lower grades, since this shows they are more extrinsically motivated. For them, making societal impact or immersing themselves in the academic life is the desired outcome of following a master program rather than obtaining high grades. They have less intrinsic motivation to learn master specific knowledge and therefore attain lower grades than students who write a lot about ‘Master Specific’ and ‘Interest to Learn’. This confirms existing theory on motivation which posits that intrinsic motivation is most effective driver of success. Second, our results lead to the conclusion that students writing about specific content and knowledge related to their master perform better than students writing about generic knowledge and skills. This finding fits well in existing literature. which shows that students with existing specific knowledge perform better [66, 91, 92].

5.2. Limitations

This study has three main limitations. First, the labeling of the initial thirty topics and the aggregation of these topics in moves was done manually by the three co-authors. While we used triangulation during the process, some subjectivity always remains. We encourage other authors to do replication studies, using a similar methodology, to further validate the moves occurring in application essays. The fact that our moves match well with the themes found in prior qualitative studies gives further confidence that our labeling process was robust.

Second, we did not have the application essay for every student that studied at the faculty during the period for which data collection was done. We therefore examined the composition of the groups of students with and without application essays and found no differences. This means that no type of student was over- or underrepresented in either group. Therefore, we are assured that this limitation had little effect on the outcomes of the study. Further, like most studies on student admission, we encountered admission bias. However, our data for rejected students shows that there is hardly any correlation between the probability to use any of the relevant moves in the application essay, and the average bachelor grade (Pearsons r = -0.06, n = 348), and the suitability of the bachelor program (Pearsons r = -0.08, n = 375). These correlations hardly differ from those of admitted students (average bachelor grade: Pearson’s r = -0.04, n = 1582 and suitability Pearsons r = -0.02, n = 2755). In a univariate binary logistic regression, we did find a positive significant relationship between the probability of using relevant moves and being admitted or not (odds ratio = 48.99). This result is theoretically expected as the use of the relevant moves is expected to contribute to study success. Based on these results, we see little indication of admission bias.

Finally, we collected data at a faculty which spans a wide spectrum of disciplines from the social and natural sciences. In total, we studies eleven distinct master programs. However, these programs are organized by this single faculty from a single University. This brings to mind the question of generalizability. However, given the multidisciplinary nature of said faculty we argue that our results are likely generalizable to other disciplines, faculties and institutes. Furthermore, we controlled for these disciplines with education program dummies, and found they do not significantly influence our main findings. However, further research is needed to confirm this suggestion.

5.3. Practical implications

We end this paper with some advice for admission committees on how to use application essays. First, we encourage committees not to dismiss application essays as admission instrument, as they are an relatively efficient tool, and because we find significant effects of several moves on student success. We argue that these moves should be carefully monitored while making the admission decision. Our most important recommendation concerns our finding that the moves we labeled ‘master specific’ and ‘interest to learn’ are positively related to study success. Based on this, we conclude that having an intrinsic motivation for the specific contents of the program is an important predictor of success. Second, the fact that ‘master specific’ is positively related to study success shows that discussing specific knowledge related to the program the student is applying to, as opposed to generic knowledge, is a predictor of study success. We suggest that application essays be used to evaluate both intrinsic motivation and specific knowledge. We also recommend to keep the essay in a relatively free format, as requiring the students to include specific moves will lead to a loss of discriminatory value of essays as admission instrument.

Furthermore, our study also shows that other moves that occur in application essays have no (‘extracurricular’ and ‘prior education’, or a negative (‘research skills, ‘societal impact’ and ‘city and university)’ significant effect on study success. We recommend that these moves should not be given much weight during the admission decision. Finally, the negative impact of societal impact on grades could be explained by the lack of well-developed assessment methods to grade this aspect of the program. Given the societal debate for universities to become societally relevant, we urge universities to develop more assessment methods for societal impact.

Supporting information

References

  1. 1. de Boer T, Van Rijnsoever F. In search of valid non-cognitive student selection criteria. Assess Eval High Educ. 2021 Aug;1–18.
  2. 2. Vulperhorst J, Lutz C, de Kleijn R, van Tartwijk J. Disentangling the predictive validity of high school grades for academic success in university. Assess Eval High Educ [Internet]. 2018 Apr 3 [cited 2021 Apr 28];43(3):399–414. Available from: https://www.tandfonline.com/doi/full/10.1080/02602938.2017.1353586
  3. 3. Kuncel NR, Credé M, Thomas LL. The Validity of Self-Reported Grade Point Averages, Class Ranks, and Test Scores: A Meta-Analysis and Review of the Literature [Internet]. Vol. 75, Source: Review of Educational Research. 2005 [cited 2018 Sep 13]. Available from: https://www.jstor.org/stable/pdf/3516080.pdf?refreqid=excelsior%3Ae3b600fc750bae83a1c4a89972b01feb
  4. 4. Niessen ASM, Meijer RR. On the Use of Broadened Admission Criteria in Higher Education. Perspect Psychol Sci [Internet]. 2017 May 25;12(3):436–48. Available from: http://journals.sagepub.com/doi/10.1177/1745691616683050 pmid:28544866
  5. 5. Kuncel NR, Credé M, Thomas LL. A meta-analysis of the predictive validity of the Graduate Management Admission Test (GMAT) and Undergraduate Grade Point Average (UGPA) for graduate student academic performance. Acad Manag Learn Educ. 2007 Nov 30;6(1):51–68.
  6. 6. Pau A, Jeevaratnam K, Chen YS, Fall AA, Khoo C, Nadarajah VD. The Multiple Mini-Interview (MMI) for student selection in health professions training–A systematic review. Med Teach [Internet]. 2013 Dec 20 [cited 2021 Jan 12];35(12):1027–41. Available from: http://www.tandfonline.com/doi/full/10.3109/0142159X.2013.829912 pmid:24050709
  7. 7. Thomas LL, Kuncel NR, Credé M. Noncognitive Variables in College Admissions The Case of the Non-Cognitive Questionnaire. Educ Psychol Meas [Internet]. 2007 [cited 2020 Apr 2];67:635–57. Available from: http://epm.sagepub.comhttp//online.sagepub.com
  8. 8. Wood PS, Smith WL, Altmaier EM, Tarico VS, Franken EA. A prospective study of cognitive and noncognitive selection criteria as predictors of resident performance. Invest Radiol [Internet]. 1990 Jul [cited 2019 Feb 12];25(7):855–9. Available from: http://www.ncbi.nlm.nih.gov/pubmed/2391201 pmid:2391201
  9. 9. Kusurkar RA, Ten Cate TJ, Van Asperen M, Croiset G. Motivation as an independent and a dependent variable in medical education: A review of the literature. https://doi.org/103109/0142159X2011558539 [Internet]. 2011 May [cited 2022 Jul 27];33(5). Available from: https://www.tandfonline.com/doi/abs/10.3109/0142159X.2011.558539
  10. 10. Kusurkar RA, Ten Cate TJ, Vos CMP, Westers P, Croiset G. How motivation affects academic performance: A structural equation modelling analysis. Adv Heal Sci Educ [Internet]. 2013 Mar 1 [cited 2023 Oct 5];18(1):57–69. Available from: https://link.springer.com/article/10.1007/s10459-012-9354-3 pmid:22354335
  11. 11. Pennebaker JW, Chung CK, Frazee J, Lavergne GM, Beaver DI. When Small Words Foretell Academic Success: The Case of College Admissions Essays. Gong Q, editor. PLoS One [Internet]. 2014 Dec 31 [cited 2018 Sep 13];9(12):e115844. Available from: pmid:25551217
  12. 12. Elam CL, Weaver AD, Whittler ET, Stratton TD, Asher LM, Scott KL, et al. Discerning applicants’ interests in rural medicine: a textual analysis of admission essays. 2015 [cited 2021 Jan 26]; Available from: https://www.tandfonline.com/action/journalInformation?journalCode=zmeo20
  13. 13. Woo R, Krawczyk Oman JA, Byrn L, Wakim NM, Dyne PL, Cheaito MA, et al. Writing the Curriculum Vitae and Personal Statement. J Emerg Med. 2019 Sep 1;57(3):411–4. pmid:31229304
  14. 14. Ostapenko L, Schonhardt-Bailey C, Sublette JW, Smink DS, Osman NY. Textual Analysis of General Surgery Residency Personal Statements: Topics and Gender Differences. J Surg Educ. 2018 May 1;75(3):573–81. pmid:29102557
  15. 15. Williams PE, Hellman CM. DIFFERENCES IN SELF-REGULATION FOR ONLINE LEARNING BETWEEN FIRST-AND SECOND-GENERATION COLLEGE STUDENTS. Res High Educ. 2004;45(1).
  16. 16. Stephens MB, Reamy B V., Anderson D, Olsen C, Hemmer PA, Durning SJ, et al. Writing, Self-Reflection, and Medical School Performance: The Human Context of Health Care. Mil Med [Internet]. 2012 Sep 1 [cited 2022 Aug 5];177(suppl_9):26–30. Available from: https://academic.oup.com/milmed/article/177/suppl_9/26/4345532 pmid:23029857
  17. 17. Alvero AJ, Giebel S, Gebre-Medhin B, Antonio AL, Stevens ML, Domingue BW. Essay content and style are strongly related to household income and SAT scores: Evidence from 60,000 undergraduate applications. Sci Adv. 2021;7(42):eabi9031. pmid:34644119
  18. 18. Kurysheva A, Koning N, Fox CM, van Rijen HVM, Dilaver G. Once the best student always the best student? Predicting graduate study success using undergraduate academic indicators: Evidence from research masters’ programs in the Netherlands. Int J Sel Assess [Internet]. 2022 Jul 27 [cited 2022 Aug 1]; Available from: https://onlinelibrary.wiley.com/doi/
  19. 19. Dong T, Kay A, Artino AR, Gilliland WR, Waechter DM, Cruess D, et al. Application Essays and Future Performance in Medical School: Are They Related? Teach Learn Med [Internet]. 2013 [cited 2021 Mar 9];25(1):55–8. Available from: https://www.tandfonline.com/action/journalInformation?journalCode=htlm20 pmid:23330895
  20. 20. Jones S. “Ensure That You Stand Out from the Crowd”: A Corpus-Based Analysis of Personal Statements according to Applicants’ School Type. Comp Educ Rev [Internet]. 2013 Aug [cited 2018 Sep 13];57(3):397–423. Available from: https://www.jstor.org/stable/pdf/10.1086/670666.pdf?refreqid=excelsior%3A3a9053bcbfe9bc6b5258bea26c233de8
  21. 21. Johnson RL, Penny J, Gordon B. Score Resolution and the Interrater Reliability of Holistic Scores in Rating Essays. Writ Commun [Internet]. 2001 Apr 17;18(2):229–49. Available from: http://journals.sagepub.com/doi/10.1177/0741088301018002003
  22. 22. Salvatori P. Reliability and Validity of Admissions Tools Used to Select Students for the Health Professions. Adv Heal Sci Educ. 2001;6:159–75. pmid:11435766
  23. 23. Kirchner GL, Stone RG, Holm MB. Use of Admission Criteria to Predict Performance of Students in an Entry-Level Master’s Program on Fieldwork Placements and in Academic Courses. Occup Ther Heal Care [Internet]. 2001 Jan 3 [cited 2018 Sep 6];13(1):1–10. Available from: http://www.tandfonline.com/action/journalInformation?journalCode=iohc20
  24. 24. Berchulc CM, Wade GA, Seidner KK. Predicting Academic Achievement of Undergraduate Occupational Therapy Students: Preliminary Results. Occup Ther J Res [Internet]. 1987 Jul 24 [cited 2022 Sep 2];7(4):245–8. Available from: https://journals.sagepub.com/doi/abs/10.1177/153944928700700406?journalCode=otja
  25. 25. White BAA, Sadoski M, Thomas S, Shabahang M. Is the evaluation of the personal statement a reliable component of the general surgery residency application? J Surg Educ. 2012 May 1;69(3):340–3. pmid:22483135
  26. 26. Ding H. Genre analysis of personal statements: Analysis of moves in application essays to medical and dental schools. English Specif Purp. 2007 Jan 1;26(3):368–92.
  27. 27. Chiu YLT. ‘Singing your tune’: Genre structure and writer identity in personal statements for doctoral applications. J English Acad Purp. 2016 Mar 1;21:48–59.
  28. 28. Wouters A, Bakker AH, Van Wijk IJ, Croiset G, Kusurkar RA. A qualitative analysis of statements on motivation of applicants for medical school. 2014 [cited 2022 Sep 6]; Available from: http://www.biomedcentral.com/1472-6920/14/200 pmid:25248374
  29. 29. Hoblos J. Experimenting with latent semantic analysis and latent dirichlet allocation on automated essay grading. 2020 7th Int Conf Soc Netw Anal Manag Secur SNAMS 2020. 2020 Dec 14;
  30. 30. Kakkonen T, Myller N, Sutinen E, Timonen J. Comparison of dimension reduction methods for automated essay grading. Educ Technol Soc. 2008;11(3):275–88.
  31. 31. Brown RM. Self-Composed. Writ Commun [Internet]. 2004 Jul 17 [cited 2022 Sep 7];21(3):242–60. Available from: http://journals.sagepub.com/doi/10.1177/0741088304264338
  32. 32. Kuh GD, Kinzie J, Buckley JA, Bridges BK, Hayek JC. Piecing Together the Student Success Puzzle: Research, Propositions, and…—George D. Kuh, Jillian Kinzie, Jennifer A. Buckley, Brian K. Bridges, John C. Hayek—Google Books [Internet]. 2007 [cited 2020 Jul 2]. Available from: https://books.google.nl/books?hl=en&lr=&id=E2Y15q5bpCoC&oi=fnd&pg=PR7&dq=student+success+defined&ots=ZxINljxrKN&sig=rvxsWCBH_MWjvBh4p4-oRMMQuwc#v=onepage&q=studentsuccessdefined&f=false
  33. 33. Kuh GD, Kinzie J, Schuh JH, Whitt EJ. Fostering Student Success in Hard Times. Chang Mag High Learn [Internet]. 2011 Jun 30 [cited 2020 Jul 2];43(4):13–9. Available from: https://www.tandfonline.com/doi/abs/10.1080/00091383.2011.585311
  34. 34. Galla BM, Shulman EP, Plummer BD, Gardner M, Hutt SJ, Goyer JP, et al. Why High School Grades Are Better Predictors of On-Time College Graduation Than Are Admissions Test Scores: The Roles of Self-Regulation and Cognitive Ability. Am Educ Res J [Internet]. 2019 Dec 3;56(6):2077–115. Available from: http://journals.sagepub.com/doi/10.3102/0002831219843292
  35. 35. Ryan RM, Deci EL. Intrinsic and Extrinsic Motivations: Classic Definitions and New Directions. Contemp Educ Psychol. 2000;25(1):54–67. pmid:10620381
  36. 36. Krapp A. Basic needs and the development of interest and intrinsic motivational orientations. Learn Instr. 2005 Oct 1;15(5):381–95.
  37. 37. Krapp A. Structural and dynamic aspects of interest development: theoretical considerations from an ontogenetic perspective. Learn Instr. 2002 Aug 1;12(4):383–409.
  38. 38. Mikkonen J, Ruohoniemi M, Lindblom-Ylänne S. The role of individual interest and future goals during the first years of university studies. Stud High Educ [Internet]. 2013 Feb [cited 2022 Aug 31];38(1):71–86. Available from: https://www.tandfonline.com/action/journalInformation?journalCode=cshe20
  39. 39. Janke S. Prospective effects of motivation for enrollment on well-being and motivation at university. Stud High Educ [Internet]. 2020 Dec 1 [cited 2022 Aug 31];45(12):2413–25. Available from: https://www.tandfonline.com/action/journalInformation?journalCode=cshe20
  40. 40. Hayamizu T. Between intrinsic and extrinsic motivation: Examination of reasons for academic study based on the theory of internalization. Jpn Psychol Res. 1997;39(2):98–108.
  41. 41. Legault L. Intrinsic and Extrinsic Motivation. In: Encyclopedia of Personality and Individual Differences [Internet]. Cham: Springer International Publishing; 2016 [cited 2023 Jan 20]. p. 1–4. Available from: https://www.researchgate.net/publication/311692691_Intrinsic_and_Extrinsic_Motivation
  42. 42. Kember D, Ho A, Hong C. Studies in Higher Education Initial motivational orientation of students enrolling in undergraduate degrees. 2010 [cited 2022 Aug 31]; Available from: https://www.tandfonline.com/action/journalInformation?journalCode=cshe20
  43. 43. Hemsley-Brown J. ‘The best education in the world’: reality, repetition or cliché? International students’ reasons for choosing an English university. Stud High Educ [Internet]. 2012 Dec [cited 2022 Sep 6];37(8):1005–22. Available from: https://www.tandfonline.com/doi/abs/10.1080/03075079.2011.562286
  44. 44. Hyland K, Guinda CS. Stance and voice in written academic genres. Stance Voice Writ Acad Genres. 2012 Sep 24;1–263.
  45. 45. Wouters A, Croiset G, Galindo-Garre F, Kusurkar RA. Motivation of medical students: Selection by motivation or motivation by selection Assessment and evaluation of admissions, knowledge, skills and attitudes. BMC Med Educ [Internet]. 2016 Jan 29 [cited 2022 Jul 27];16(1):1–9. Available from: https://bmcmededuc.biomedcentral.com/articles/10.1186/s12909-016-0560-1
  46. 46. Wellington J, Sikes P. ‘A doctorate in a tight compartment’: why do students choose a professional doctorate and what impact does it have on their personal and professional lives? https://doi-org.proxy.library.uu.nl/101080/03075070601004358 [Internet]. 2007 Dec 1 [cited 2022 Aug 31];31(6):723–34. Available from: https://www-tandfonline-com.proxy.library.uu.nl/doi/abs/10.1080/03075070601004358
  47. 47. Shen H, Comrey AL. Predicting medical students’ academic performances by their cognitive abilities and personality characteristics. Acad Med [Internet]. 1997 [cited 2022 Oct 18];72(9):781–6. Available from: /record/1997-42919-001 pmid:9311320
  48. 48. Conard MA. Aptitude is not enough: How personality and behavior predict academic performance. J Res Pers. 2006 Jun;40(3):339–46.
  49. 49. Furnham A, Chamorro-Premuzic T, McDougall F. Personality, cognitive ability, and beliefs about intelligence as predictors of academic performance. Learn Individ Differ. 2003;14(1):47–64.
  50. 50. Lemaire B, Dessus P. A SYSTEM TO ASSESS THE SEMANTIC CONTENT OF STUDENT ESSAYS. Vol. 24, J. EDUCATIONAL COMPUTING RESEARCH. 2001.
  51. 51. Kuncel NR, Credé M, Thomas LL, Klieger DM, Seiler SN, Woo SE. A meta-analysis of the validity of the Pharmacy College Admission Test (PCAT) and Grade Predictors of Pharmacy Student Performance. Am J Pharm Educ. 2005;69(3):339–47.
  52. 52. Westrick PA, Le H, Robbins SB, Radunzel JMR, Schmidt FL. College Performance and Retention: A Meta-Analysis of the Predictive Validities of ACT ® Scores, High School Grades, and SES. Educ Assess [Internet]. 2015 Jan 2 [cited 2020 Jun 9];20(1):23–45. Available from: https://www.tandfonline.com/action/journalInformation?journalCode=heda20
  53. 53. Ofori R. Age and ‘type’ of domain specific entry qualifications as predictors of student nurses’ performance in biological, social and behavioural sciences in nursing assessments. Nurse Educ Today [Internet]. 2000 May [cited 2020 Jul 15];20(4):298–310. Available from: https://reader.elsevier.com/reader/sd/pii/S0260691799903969?token=0A76AB7734DE52F2C695C2552F8A92909741FE0B0EE0A62A8A64764C5E0508049F64F1CE1BD45ED02BAF9C418D4DDF75 pmid:10827101
  54. 54. Guney Y. Exogenous and Endogenous Factors Influencing Students’ Performance in Undergraduate Accounting Modules. Account Educ [Internet]. 2009 Feb [cited 2020 Jul 15];18(1):51–73. Available from: https://www.tandfonline.com/action/journalInformation?journalCode=raed20
  55. 55. Adam J, Bore M, Childs R, Dunn J, Mckendree J, Munro D, et al. Predictors of professional behaviour and academic outcomes in a UK medical school: A longitudinal cohort study. Med Teach [Internet]. 2015 Sep 2 [cited 2020 Jul 15];37(9):868–80. Available from: pmid:25665628
  56. 56. Cuddy MM, Swanson DB, Clauser BE. A Multilevel Analysis of Examinee Gender and USMLE Step 1 Performance. Acad Med [Internet]. 2008 Oct [cited 2020 Jul 15];83(Supplement):S58–62. Available from: http://journals.lww.com/00001888-200810001-00015 pmid:18820503
  57. 57. Smith C. International Student Success. Strateg Enroll Manag Q [Internet]. 2016 Jul 1 [cited 2022 Oct 28];4(2):61–73. Available from: https://onlinelibrary.wiley.com/doi/full/10.1002/sem3.20084
  58. 58. Faculty of Geosciences. Utrecht University. 2023. p. https://www.uu.nl/en/organisation/faculty-of-geosc.
  59. 59. Ferrer FP, Ng Lungsod Ng Maynila P. Determinants of Performance in Thesis: Evidence from Selected Filipino Graduate Students [Internet]. Vol. 2, International Journal of Education and Research. 2014 [cited 2020 Aug 25]. Available from: www.ijern.com
  60. 60. Houglum JE, Aparasu RR, Delfinis TM. Predictors of Academic Success and Failure in a Pharmacy Professional Program. Am J Pharm Educ. 2005;69(3).
  61. 61. Berry CM, Sackett PR. Individual Differences in Course Choice Result in Underestimation of the Validity of College Admissions Systems. Psychol Sci. 2009;20(7):822–30. pmid:19493326
  62. 62. Wilkinson D, Zhang J, Byrne GJ, Luke H, Ozolins IZ, Parker MH, et al. Medical school selection criteria and the prediction of academic performance. Med J Aust [Internet]. 2008 Mar 17 [cited 2020 May 28];188(6):349–54. Available from: https://onlinelibrary.wiley.com/doi/abs/ pmid:18341459
  63. 63. Wilkinson D, Zhang J, Parker M. Predictive validity of the Undergraduate Medicine and Health Sciences Admission Test for medical students’ academic performance. Med J Aust [Internet]. 2011 Apr 4 [cited 2020 May 28];194(7):341–4. Available from: https://onlinelibrary.wiley.com/doi/abs/10.5694/j.1326-5377.2011.tb03002.x pmid:21470083
  64. 64. Myles J, Cheng L. The social and cultural life of non-native English speaking international graduate students at a Canadian university. J English Acad Purp [Internet]. 2003 Jan [cited 2020 Jul 14];2(3):247–63. Available from: www.elsevier.com/locate/jeap
  65. 65. Park SY. Returning students’ perspectives on adjusting to medical graduate school in Korea: an interview study 27(1):37–44 2015. Korean J Med Educ. 2015;27(1):37–44.
  66. 66. Arzuman H, Ja’afar R, Fakri NMRM The influence of pre-admission tracks on students’ academic performance in a medical programme: Universiti Sains Malaysia. Vol. 25, Education for Health: Change in Learning and Practice. 2012. p. 124–7.
  67. 67. Blei DM, Ng AY, Jordan MI. Latent dirichlet allocation. J Mach Learn Res. 2003;3(Jan):993–1022.
  68. 68. Ogihara M, Ren G. Student retention pattern prediction employing linguistic features extracted from admission application essays. Proc - 16th IEEE Int Conf Mach Learn Appl ICMLA 2017. 2017;2017-December:532–9.
  69. 69. Munnecom L, Pacheco MC de L. Exploration of an Automated Motivation Letter Scoring System to Emulate Human Judgement. 2020 [cited 2022 Aug 18]; Available from: http://urn.kb.se/resolve?urn=urn:nbn:se:du-34563
  70. 70. Porter MF. An algorithm for suffix stripping. Program. 1980 Mar 1;14(3):130–7.
  71. 71. Arianti ND, Irfan M, Syaripudin U, Mariana D, Rosmawarni N, Maylawati DS adillah. Porter Stemmer and Cosine Similarity for Automated Essay Assessment. 5th Int Conf Comput Eng Des ICCED 2019. 2019 Apr 1;
  72. 72. Arun R, Suresh V, Madhavan CEV, Murty MN. On finding the natural number of topics with Latent Dirichlet Allocation: Some observations. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) [Internet]. Springer, Berlin, Heidelberg; 2010 [cited 2020 Jul 7]. p. 391–402. Available from: https://link.springer.com/chapter/10.1007/978-3-642-13657-3_43
  73. 73. Blei DM, Griffiths TL, Jordan MI, Tenenbaum JB. Hierarchical Topic Models and the Nested Chinese Restaurant Process.
  74. 74. Cao J, Xia T, Li J, Zhang Y, Tang S. A density-based method for adaptive LDA model selection. Neurocomputing. 2009 Mar 1;72(7–9):1775–81.
  75. 75. Zhao W, Chen JJ, Perkins R, Liu Z, Ge W, Ding Y, et al. A heuristic approach to determine an appropriate number of topics in topic modeling. BMC Bioinformatics [Internet]. 2015 Sep 25 [cited 2022 Aug 24];16(13):1–10. Available from: https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-16-S13-S8 pmid:26424364
  76. 76. Leendertse J, Rijnsoever FJ, Eveleens CP. The sustainable start‐up paradox: Predicting the business and climate performance of start‐ups. Bus Strateg Environ. 2020;(October):1–18.
  77. 77. Arroyabe MF, Schumann M, Arranz CFA. Mapping the entrepreneurial university literature: a text mining approach. Stud High Educ [Internet]. 2022 May 4 [cited 2022 Aug 1];47(5):955–63. Available from: https://www.tandfonline.com/action/journalInformation?journalCode=cshe20
  78. 78. Perez-Encinas A, Rodriguez-Pomeda J, de Wit H. Factors influencing student mobility: a comparative European study. https://doi.org/101080/0307507920201725873 [Internet]. 2020 [cited 2023 Jan 6];46(12):2528–41. Available from: https://www.tandfonline.com/doi/abs/10.1080/03075079.2020.1725873
  79. 79. Blei DM. Probabilistic topic models. Commun ACM [Internet]. 2012 Apr 1 [cited 2022 Aug 22];55(4):77–84. Available from: https://dl.acm.org/doi/10.1145/2133806.2133826
  80. 80. United Nations. Geoscheme Methodology [Internet]. https://unstats.un.org/unsd/methodology/m49/. 2022. Available from: https://unstats.un.org/unsd/methodology/m49/
  81. 81. Cribari-Neto F, Zeileis A. Beta regression in R. J Stat Softw. 2010;34:1–24.
  82. 82. Dochy FJRCPrior knowledge and learning, International Encyclopedia of Education (Second edition, pp.4698–4702). Oxford New York: Pergamon Press. 1994. 4698–4702 p.
  83. 83. Hailikari T, Nevgi A, Lindblom-Ylänne S. EXPLORING ALTERNATIVE WAYS OF ASSESSING PRIOR KNOWLEDGE, ITS COMPONENTS AND THEIR RELATION TO STUDENT ACHIEVEMENT: A MATHEMATICS BASED CASE STUDY. Stud Educ Eval. 2007 Sep 1;33(3–4):320–37.
  84. 84. Niessen ASM, Meijer RR, Tendeiro JN. Predicting Performance in Higher Education Using Proximal Predictors. Smalheiser NR, editor. PLoS One [Internet]. 2016 Apr 13 [cited 2018 Sep 5];11(4):e0153663. Available from: http://hdl. pmid:27073859
  85. 85. Lauder W, Holland K, Roxburgh M, Topping K, Watson R, Johnson M, et al. Measuring competence, self-reported competence and self-efficacy in pre-registration students. Nurs Stand [Internet]. 2008 Jan 1 [cited 2023 Feb 24];22(20):35–43. Available from: https://europepmc.org/article/med/18284020 pmid:18284020
  86. 86. Redman A, Wiek A, Barth M. Current practice of assessing students’ sustainability competencies: A review of tools. Sustain Sci. 2021;16:117–35.
  87. 87. Adam J, Bore M, Childs R, Dunn J, Mckendree J, Munro D, et al. Predictors of professional behaviour and academic outcomes in a UK medical school: A longitudinal cohort study. Med Teach. 2015 Sep;37(9):868–80. pmid:25665628
  88. 88. Wiersma UJ, Kappe R. Selecting for extroversion but rewarding for conscientiousness. Eur J Work Organ Psychol [Internet]. 2017 [cited 2020 Jun 2];26(2):314–23. Available from: https://www.tandfonline.com/action/journalInformation?journalCode=pewo20
  89. 89. Richardson M, Abraham C. Conscientiousness and achievement motivation predict performance. https://doi.org/101002/per732 [Internet]. 2009 Nov 1 [cited 2023 Jan 20];23(7):589–605. Available from: https://journals.sagepub.com/doi/abs/10.1002/per.732
  90. 90. Coddington DMM, Karsten K. Eliminating cultural bias in nursing examinations. Teach Learn Nurs. 2014 Jul 1;9(3):134–8.
  91. 91. Hailikari T, Katajavuori N, Lindblom-Ylanne S. The relevance of prior knowledge in learning and instructional design. Am J Pharm Educ. 2008;72(5). pmid:19214267
  92. 92. Dochy F, de Rijdt C, Dyck W. Cognitive Prerequisites and Learning. Act Learn High Educ [Internet]. 2002 Nov 20 [cited 2021 Dec 7];3(3):265–84. Available from: http://journals.sagepub.com/doi/10.1177/1469787402003003006