Parental rights or parental wrongs: Parents’ metacognitive knowledge of the factors that influence their school choice decisions

Trent N. Cash; Daniel M. Oppenheimer

doi:10.1371/journal.pone.0301768

Abstract

School choice initiatives–which empower parents to choose which schools their children attend–are built on the assumptions that parents know what features of a school are most important to their family and that they are capable of focusing on the most important features when they make their decisions. However, decades of psychological research suggest that decision makers lack metacognitive knowledge of the factors that influence their decisions. We sought to reconcile this discrepancy between the policy assumptions and the psychological research. To do so, we asked participants to complete Choice-Based Conjoint surveys in which they made a series of choices between different hypothetical schools. We then asked participants to self-report the weight they placed on each attribute when making their choices. Across four studies, we found that participants did not know how much weight they had placed on various school attributes. Average correlations between stated and revealed weights ranged from r = .34–.54. Stated weights predicted different choices than revealed weights in 16.41–20.63% of decisions. These metacognitive limitations persisted regardless of whether the participants were parents or non-parents (Study 1a/1b), the nature of the attributes that participants used to evaluate alternatives (Study 2), and whether or not decision makers had access to school ratings that could be used as metacognitive aids (Study 3). In line with prior psychological research–and in contrast to policy assumptions–these findings demonstrate that decision makers do not have particularly strong metacognitive knowledge of the factors that influence their school choice decisions. As a result, parents making school choice decisions are likely to seek out and use the wrong information, thus leading to suboptimal school choices. Future research should replicate these results in more ecologically valid samples and test new approaches to school choice that account for these metacognitive limitations.

Citation: Cash TN, Oppenheimer DM (2024) Parental rights or parental wrongs: Parents’ metacognitive knowledge of the factors that influence their school choice decisions. PLoS ONE 19(4): e0301768. https://doi.org/10.1371/journal.pone.0301768

Editor: Roghieh Nooripour, Alzahra University, IRAN (ISLAMIC REPUBLIC OF)

Received: January 23, 2024; Accepted: March 21, 2024; Published: April 18, 2024

Copyright: © 2024 Cash, Oppenheimer. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Data, materials, and code for all studies reported in this manuscript are available on OSF at: https://osf.io/krxec/?view_only=8b829144f7e54518957d3334521c0774.

Funding: This work was supported by a Graduate Student Small Grant from Carnegie Mellon University’s Center for Behavioral and Decision Research (CBDR). No specific grant number was provided. The grant was awarded to TNC, under the supervision of DMO. CBDR website: https://www.cmu.edu/cbdr/. This work was also supported by an Academic Research Grant from Sawtooth Software, Inc. that provided the authors with free access to their Lighthouse Studio CBC software. No specific grant number was provided. The grant was awarded to TNC, under the supervision of DMO. Sawtooth Software, Inc. website: https://sawtoothsoftware.com/ The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. There was no additional external funding received for this study.

Competing interests: The authors have declared that no competing interests exist.

Introduction

School choice in America

A recent Gallup poll found that only about a third of Americans (36%) were satisfied with the quality of the American K-12 educational system [1]. This is, unfortunately, hardly an anomaly: since 1999, satisfaction levels have ranged from a low of 36% (2000; 2023) to a high of 53% (2004). Given these persistently low satisfaction levels, politicians [2] and advocacy organizations [3] have proposed a variety of educational reforms, collectively known as school choice, in which parents evaluate a variety of schools and choose the one that best fits their family’s needs, rather than sending their child to their neighborhood public school [4–6]. One reason for which school choice advocates argue that these policies will lead to better outcomes is that they believe parents have the most accurate knowledge of what their children need in a school [7].

School choice efficacy

To date, empirical scholarship regarding the efficacy of school choice has been mixed [8–12]. Recent reviews and meta-analyses of the extant literature have suggested that, on average, school choice policies have small, positive impacts on educational attainment [13], academic achievement [5], and disciplinary issues [10]. However, these positive effects have been found to be moderated by demographic characteristics [5] and highly variable based on the quality of available alternative schools [14]. Despite the heterogeneity of results, the literature clearly suggests that, although school choice policies may have some benefits, they have not been the panacea that proponents suggest they should be.

One theory to explain the limited efficacy of school choice is that its success is dependent on families having sufficient resources to make good decisions. This is perhaps best exemplified by studies demonstrating that, compared to high-SES families, low-SES families making school choice decisions limit their choice set to a smaller number of schools due to practical constraints, such as commute time [15, 16]. Many empirical accounts have also highlighted the fact that the complex nature of the school choice decision environment places a large burden on parents’ limited cognitive resources by providing too many options [7, 17], too much information [18], and information that is uninformative or difficult to understand [19, 20].

Metacognitive knowledge of attribute weights

An additional explanation that has received little empirical attention is the hypothesis that parents may lack sufficient knowledge about what they value in a school and thus struggle to choose a school that matches their priorities [2, 3, 6]. Because school choice is a multi-attribute decision, parents’ priorities are expressed through an attribute weighting process in which they determine the relative weight they should place on each attribute on which schools can be compared [21]. For example, parents must ask themselves which is more important: the quality of the math curriculum, the graduation rate, or the commute time to get to the school. A decision maker’s ability to generate and apply these weights in a way that truly matches their priorities reflects their ability to make decisions based on the factors that matter most to them [22].

To successfully weight attributes in a way that matches their true preferences, decision makers must monitor their beliefs and values [23, 24] and control the application of these beliefs and values during the decision process [25, 26]. As such, attribute weighting can be considered a metacognitive process [27–30]. For this reason, we will refer to a decision maker’s awareness of the influence that various factors have (and ought to have) on their school choice decisions as their metacognitive knowledge of attribute weights.

Benchmarking metacognitive knowledge of attribute weights

The extant literature has shown that decision makers often lack the metacognitive capacities to accurately and consistently self-report the reasons for which they make decisions [31]. Studies seeking to quantify this inconsistency have demonstrated that, on average, correlations between attribute weights elicited via different modalities (e.g., self-reported vs. revealed) typically fall within the range of r = .40 - .70 [32–38]. The range is wide because existing studies have implemented different preference elicitation methods, evaluated different domains, and induced different psychological goal states [32–38]. While these studies were designed to test (in)consistencies across different preference elicitation methods–not necessarily metacognitive knowledge itself–this range of typical correlations provides a useful benchmark against which to compare participants’ metacognitive knowledge of attribute weights, particularly because our paradigm compares self-reported weights to revealed weights.

Metacognitive knowledge of attribute weights in school choice

If the limited metacognitive ability demonstrated by decision makers in other domains holds true in school choice decisions, it would have implications for the success of school choice policies. If parents lack sufficient metacognitive knowledge of attribute weights, they may choose schools that do not actually align with their priorities. Similarly, parents who lack metacognitive knowledge of their educational priorities may be unable to accurately respond to polls or surveys about how they want local educational leaders to improve their schools [39–41]. For these reasons, it is imperative that we empirically investigate the degree to which parents have accurate metacognitive knowledge of their educational priorities.

Despite the evidence regarding decision makers’ lack of metacognitive knowledge in general [31, 36, 37], the high value that many individuals and communities place on education may make school choice a domain in which decision makers have a uniquely high level of metacognitive knowledge. School choice priorities are also highly influenced by cultural norms [42, 43] that may provide decision makers with easily accessible scripts about what ‘people like them’ value [44, 45], thus granting an avenue for decision makers to be metacognitively aware of the factors that influence their decisions. As such, metacognitive knowledge may be greater in school choice than other domains that have typically been studied.

The current study

To date, however, empirical studies of metacognitive knowledge in the school choice domain are lacking. To rectify this, we conducted four experiments in which participants were tasked with making school choice decisions and then reporting on how heavily they weighted various attributes when making their decisions. In Studies 1a and 1b, we showed that participants were unable to accurately report the weight they placed on various attributes, regardless of whether they were a general online sample (1a) or a sample of parents of high school-aged children (1b). In Study 2, we replicated this finding using a different set of attributes, some of which were non-academic in nature. Finally, in Study 3, we further replicated this result while giving participants access to metacognitive aids in the form of aggregate school ratings that graded each alternative school on an A-F scale. Across studies, our results suggest that metacognitive limitations should be considered as a potential roadblock to the success of school choice policies and should be accounted for in policy design.

All participants affirmed their written informed consent to participate at the beginning of each study and all procedures were conducted in compliance with and approved by the Carnegie Mellon University Institutional Review Board (IRB). Data, materials, and code for all studies reported in this manuscript are available at: https://osf.io/krxec/?view_only=8b829144f7e54518957d3334521c0774

Study 1a

Study 1a overview

Study 1a aimed to evaluate the accuracy of participants’ metacognitive knowledge of the weight they placed on various attributes when making school choice decisions. Participants completed a Choice Based Conjoint (CBC) survey in which they made a series of hypothetical choices among high schools for their children’s education based on a given set of attributes. Participants then reported the weight that they had placed on each attribute and how important each attribute was to them. Alignment between these measures was then used to evaluate participants’ metacognitive knowledge.

Study 1a methods

Participants.

An analysis of simulated respondents suggested that our Choice-Based Conjoint analysis would require approximately 200 responses to reduce the standard error of our utility estimates to less than 0.05 if they were estimated using logistic regression. This is the standard method of sample size estimation for CBC [46] and represents an upper bound of the necessary sample size when using more precise utility estimation methods, such as Hierarchical Bayes, which we used. To ensure that our sample would be sufficiently large after removing participants who failed a bot check, we recruited 210 participants from MTurk via CloudResearch [47]. In this bot check, participants were asked to explain why a simple, pun-based cartoon was funny. Participants were marked as bots if they provided nonsense responses or responses that were totally unrelated to the cartoon. Data were collected February 11–16, 2021. Nine participants were excluded for failing the bot check, leaving 201 participants. Due to a programming error, ten of these participants were unable to complete two portions of the study (details below), leaving a total of 191 participants (86 females, 161 White participants, 78 parents, M_age = 38.6 years) who completed the entire study. Full demographic characteristics are reported in the S1 Table.

CBC survey.

Participants were asked to imagine that they were parents picking among high schools for their children to attend. They then completed a Choice Based Conjoint (CBC) survey in which they were tasked with picking between sets of three high schools with different scores on seven attributes. CBC is an established and well-validated tool from the marketing literature that researchers have used for decades to estimate the degree to which participants care about various attributes by having them repeatedly choose between alternatives that systematically vary across values of attributes [48–52]. CBC has been implemented in a wide variety of policy-relevant domains including medicine [53], transportation [54], nutrition [55], electoral politics [56], and many others. Discrete choice tasks like CBC have been used in the literature to assess school choice preferences [40], but have not been used to assess metacognitive knowledge of these preferences. The CBC survey was designed using Sawtooth Software Inc.’s Lighthouse Studio, V 9.15.4 [57].

Attribute selection.

To ensure that our study included attributes that are normally accessible to parents, we first began by gathering all of the attributes used by three school rating websites: U.S. News & World Report [58], Niche [59], and GreatSchools [60]. We then reduced the set to only consider attributes that were given a weight of at least 10% by one of the school rating websites. We then selected attributes that we believed captured different aspects of school performance and would be easy for participants to understand and compare. In line with the literature on what attributes parents value in schools, we primarily focused on measures of academic performance and diversity [40].

The seven attributes that we ultimately decided to use were: 1) Graduation Rate; 2) Percent of Students Who Pass State Tests (henceforth, State Test Pass Rate); 3) The Gap in State Test Pass-Rates Between (Economically and Racially) Advantaged and Disadvantaged Students (Disadvantaged Student Gap); 4) Average ACT score; 5) Average Rating Given to the School by Parents (Average Parent Rating); 6) Percent of Seniors Who Take at Least One AP course (AP Enrollment); and 7) Percent of Students Who Are a Racial/Ethnic Minority (Percent Minority Students). Some of the other attributes that were used by the school rating websites that we chose not to use in our CBC included: Growth ratings, Number of AP courses offered, AP exam performance, and Extracurricular participation rates [58–60]. Most CBC studies include between 3 and 8 attributes, so having 7 attributes put us within the standard range [61].

There were five possible levels for each attribute, which were selected to reflect the range of real schools’ performances on these metrics. In line with CBC conventions, we chose to have only five levels per attribute to reduce decision complexity for participants [61]. To ensure that participants understood the attributes, participants were provided with: 1) A brief overview of what the attribute measured; 2) an explanation of the five possible levels of the attribute; and 3) data regarding real world average scores on the attribute, which were based on publicly available state or national data. For complete attribute descriptions, see S1 Appendix.

CBC survey procedure.

During the CBC task, participants made 14 choices between sets of three hypothetical schools. This is typical of CBC studies, most of which ask participants to make 15 or fewer choices [61]. The attribute levels for the hypothetical schools were randomly generated for each participant. To avoid school options that seemed unrealistic, we prohibited the following attribute levels from co-occurring: 1) The lowest (highest) level of the State Test Pass Rate attribute (20% and 80%, respectively) could not be paired with the highest (lowest) level of the Average ACT Score attribute (27 and 15, respectively); and 2) The highest level of the AP Enrollment attribute (60%) could not be paired with the lowest level of the State Test Pass Rate attribute (20%) or the lowest level of the Average ACT Score attribute (15). The order of the seven attributes was held constant across participants. A sample task is depicted in Fig 1.
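The two prohibition rules can be illustrated with a short sketch. It makes several assumptions that are not taken from the study materials: levels are coded as indices 1 (lowest) to 5 (highest), profiles are drawn uniformly at random, and prohibited profiles are simply redrawn (rejection sampling). Lighthouse Studio's own design algorithm may differ, and the function names are hypothetical.

```python
import random

# Assumption: every attribute's five levels are coded 1 (lowest) to 5 (highest).
ATTRIBUTES = [
    "Graduation Rate",
    "State Test Pass Rate",
    "Disadvantaged Student Gap",
    "Average ACT Score",
    "Average Parent Rating",
    "AP Enrollment",
    "Percent Minority Students",
]
LOWEST, HIGHEST = 1, 5


def is_allowed(school):
    """Return True if a school profile avoids the prohibited level pairings."""
    test = school["State Test Pass Rate"]
    act = school["Average ACT Score"]
    ap = school["AP Enrollment"]
    # Rule 1: an extreme pass rate cannot co-occur with the opposite ACT extreme.
    if (test == LOWEST and act == HIGHEST) or (test == HIGHEST and act == LOWEST):
        return False
    # Rule 2: the highest AP enrollment cannot co-occur with the lowest
    # pass rate or the lowest average ACT score.
    if ap == HIGHEST and (test == LOWEST or act == LOWEST):
        return False
    return True


def random_school(rng):
    """Draw level indices uniformly, redrawing until the profile is allowed."""
    while True:
        school = {name: rng.randint(LOWEST, HIGHEST) for name in ATTRIBUTES}
        if is_allowed(school):
            return school


def random_task(rng, n_alternatives=3):
    """Build one choice task made up of several hypothetical schools."""
    return [random_school(rng) for _ in range(n_alternatives)]


rng = random.Random(0)  # fixed seed only so the illustration is repeatable
tasks = [random_task(rng) for _ in range(14)]  # 14 tasks of three schools each
```

Rejection sampling is used here only because it is the simplest way to honor the prohibitions; it does not attempt the level balancing that dedicated CBC design software typically provides.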

Fig 1. Sample CBC task from Study 1a.

https://doi.org/10.1371/journal.pone.0301768.g001

Self-reported weights procedure.

After completing the CBC survey, participants were shown a list of the seven attributes and asked to rate how important each attribute was to them on a scale of 1 (“Not at All Important”) to 9 (“Extremely Important”). We refer to these ratings as Attribute Importance Ratings (AIRs). Participants were then shown the list of attributes once more and asked to identify what percentage of their decisions had depended on the schools’ scores on each given attribute. We refer to these percentages as Stated Attribute Weights (SAWs). Mean SAWs and AIRs for each attribute are reported in the S2 Table. Finally, participants completed a brief demographic survey. Due to a programming error, the AIR and SAW items did not appear for 10 participants, leaving only 191 participants who completed all parts of the study.

Study 1a analysis & results

Analytical decisions.

The following decisions were made regarding how to conduct analyses for all studies presented in this manuscript. Outliers were not excluded. Participants with missing data were excluded in a listwise manner. A significance threshold of α = .05 was applied for all analyses. Analyses were conducted using Lighthouse Studio, V 9.15.4 [57] and R Statistical Software, V 4.3.2 [62]. Materials, data, and analysis code are available at: https://osf.io/krxec/?view_only=8b829144f7e54518957d3334521c0774

Estimating utilities.

We first converted the CBC choice data into Revealed Attribute Weights (RAWs). To estimate these RAWs, we used Hierarchical Bayes Estimation (HB) to estimate utilities that captured the relative value that each participant placed on each level of each attribute (i.e., part-worth utilities) [49, 61, 63]. We began the HB estimation process with conservative priors of 0 for all part-worth utility parameters and updated the parameters over 20,000 iterations (10,000 to reach convergence, 10,000 retained and averaged for point estimates) using Markov Chain Monte Carlo. Utility estimates for each iteration were generated using a Metropolis-Hastings algorithm (for a technical overview, see [64]). For ease of replicability, all settings were left at the default provided by Lighthouse Studio, V 9.15.4 [57].

We chose to use HB as our estimation method because it is considered the gold standard for estimating utilities from CBC data and because it allows for the generation of utility estimates at the individual-level, rather than the sample level [49, 61]. Additionally, HB uses sample-level utility estimates to inform individual-level estimates, thus generating more accurate utility estimates from relatively little data than older methods of assessing CBC data, like logistic regression [48–52, 61]. Our CBC survey adhered to standard conventions and parameters by having a relatively small number of attributes, choices, and levels [61]. As such, there is no reason to believe that HB would be an invalid or unreliable way to estimate utilities in the context of our study. We are further convinced of the reliability and validity of our HB estimation procedure because we achieve such similar results across the studies presented here.

Estimating RAWs from utilities.

We then converted the individual utilities into RAWs, which are estimated as percentages and can be interpreted as the weight that each participant placed on each attribute. In adherence to standard conventions [61, 63], RAWs for each attribute for each participant were calculated according to the following equation, where U_j is a vector containing the utility values for the five levels of attribute j, U_i is a vector containing the utility values for the five levels of attribute i, and the set from which attribute i is pulled includes all seven attributes from the CBC, including j:

$$\mathrm{RAW}_j = \frac{\max(U_j) - \min(U_j)}{\sum_{i=1}^{7}\left[\max(U_i) - \min(U_i)\right]} \times 100$$

The best and worst levels of each attribute were determined separately for each participant, thus allowing us to capture participants’ heterogeneous preferences. RAWs were calculated for all 201 participants, but all analyses were conducted based on the reduced sample of 191 participants. Mean RAWs for each attribute are reported in the S2 Table. Both the HB estimations and the RAW calculations were conducted using Sawtooth Software Inc.’s Lighthouse Studio, V 9.15.4 [57].
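As an illustration of this conversion, the sketch below assumes the conventional conjoint importance calculation: each attribute's utility range (best level minus worst level) divided by the sum of the ranges across attributes, expressed as a percentage. The function name and the utility values are hypothetical, and only four attributes are shown for brevity.

```python
def revealed_attribute_weights(part_worths):
    """Convert one respondent's part-worth utilities into percentage weights.

    part_worths maps an attribute name to the list of utilities for its levels.
    The best and worst levels are found per respondent via max() and min(),
    so non-monotonic preferences are handled without extra assumptions.
    """
    ranges = {name: max(u) - min(u) for name, u in part_worths.items()}
    total = sum(ranges.values())
    if total == 0:
        raise ValueError("All utilities are flat; weights are undefined.")
    return {name: 100.0 * r / total for name, r in ranges.items()}


# Hypothetical utilities for a single respondent (not data from the study).
example_utilities = {
    "Graduation Rate": [-2.0, -1.0, 0.0, 1.0, 2.0],            # range 4
    "Average ACT Score": [-1.5, -0.5, 0.0, 0.5, 1.5],          # range 3
    "Percent Minority Students": [-0.5, 0.5, 1.0, 0.0, -1.0],  # range 2
    "AP Enrollment": [0.0, 0.25, 0.5, 0.75, 1.0],              # range 1
}
raws = revealed_attribute_weights(example_utilities)
```

In this example the ranges sum to 10, so the four weights come out to 40%, 30%, 20%, and 10%. The third attribute peaks at its middle level, which shows why the best and worst levels must be identified separately for each respondent.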

Estimating perfect metacognitive knowledge.

We next sought to establish what would constitute perfect metacognitive knowledge of attribute weights. To estimate this upper bound, we conducted a simulation in which simulated respondents (n = 191) completed the same CBC survey as the human participants. Each simulated respondent was yoked to one human participant’s set of SAWs. For each choice task, the value of each alternative was calculated by multiplying the assigned SAW for each attribute by the level of that attribute (scored as 1–5) and summing these values across the 7 attributes. The simulated respondent then selected the alternative with the highest sum value. We then used HB to calculate the RAWs for the simulated respondents.
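The decision rule followed by each simulated respondent can be sketched as follows. The weights and the choice task are hypothetical, only three attributes are shown, and ties are broken in favor of the first alternative, which is an assumption because the tie-breaking rule is not described above.

```python
def alternative_value(weights, levels):
    """Sum of (stated weight x level score) across attributes for one school."""
    return sum(weights[name] * levels[name] for name in weights)


def simulated_choice(weights, task):
    """Return the index of the alternative with the highest summed value.

    Assumption: on ties, the first of the tied alternatives is selected.
    """
    values = [alternative_value(weights, alternative) for alternative in task]
    return max(range(len(task)), key=values.__getitem__)


# Hypothetical SAWs (percentages) yoked to one simulated respondent.
saws = {"Graduation Rate": 50, "Average ACT Score": 30, "AP Enrollment": 20}

# One hypothetical task: three schools, each attribute scored 1-5.
task = [
    {"Graduation Rate": 3, "Average ACT Score": 5, "AP Enrollment": 1},  # 320
    {"Graduation Rate": 5, "Average ACT Score": 2, "AP Enrollment": 2},  # 350
    {"Graduation Rate": 1, "Average ACT Score": 4, "AP Enrollment": 5},  # 270
]
choice = simulated_choice(saws, task)  # index 1, the second school
```

Repeating this rule over every task yields a complete set of simulated choices, which can then be passed to the same utility estimation procedure that was applied to the human data.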

The correlations between simulated respondents’ RAWs and assigned SAWs were then calculated for each attribute. These correlations ranged from r = .64 - .90 (see Fig 2) and the mean of the seven correlation coefficients was r = .79. Since simulated respondents’ assigned SAWs were yoked to human participants’ SAWs, these correlations approximated the correlations that participants would be expected to produce between RAWs and SAWs if they had perfect metacognitive knowledge of their attribute weights. These simulations reflect a fairer standard of comparison than perfection (r = 1.00), as some of participants’ miscalibration (i.e., the gap between r = .79 and r = 1.00) arises from the noise that is inherent to comparisons of weights generated via different measurement modalities. By running simulations instead of assuming perfection, we can determine if participants’ metacognitive knowledge is sub-optimal, even after removing the degree of miscalibration that is due to measurement error.

Fig 2. Study 1a RAW-AIR, RAW-SAW, and simulated RAW-SAW correlations, by attribute.

Error bars reflect 95% confidence intervals.

https://doi.org/10.1371/journal.pone.0301768.g002

RAW-SAW correlations.

To evaluate participants’ metacognitive knowledge of their attribute weights, we first evaluated the correlations between participants’ Revealed Attribute Weights (RAWs) and Stated Attribute Weights (SAWs) for each attribute. Correlations for each attribute ranged from r = .17 - .68 (see Fig 2), and the average of these seven correlation coefficients was only r = .52. This moderate correlation was typical of the degree of miscalibration found across preference elicitation modalities in the literature (r = .40 - .70) [37], suggesting that decision makers’ metacognitive knowledge of attribute weights is as limited in school choice as it is in other domains.

To test whether the correlations were significantly different from one another [65], we conducted a series of Fisher’s r to z transformations in which we compared the correlations between RAWs and assigned SAWs for the simulated respondents to the RAW-SAW correlations achieved by the human participants for each of the seven attributes (see Fig 2). The human participants achieved significantly lower correlations than the simulated respondents for six of the seven attributes (ns = 191, zs = -7.95 to -3.24, ps ≤ .001). For State Test Pass Rate, the difference in correlations was marginally significant (ns = 191, z = -1.95, p = .05). The average of the human participants’ seven correlation coefficients (r = .52, n = 191) was significantly lower than the average of the simulated respondents’ seven correlation coefficients (r = .79, n = 191, z = -4.93, p < .001). These comparisons suggest that participants’ metacognitive knowledge of their own attribute weights was limited, even accounting for the noise that is inherent to comparisons across measurement modalities.
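A comparison of this kind can be sketched with the textbook Fisher r-to-z test for two correlations. The sketch assumes the correlations are treated as coming from independent samples and that the test is two-tailed; whether the published analysis used exactly this variant is not stated above, and the function name is hypothetical.

```python
import math


def fisher_r_to_z_test(r1, n1, r2, n2):
    """Compare two correlations using Fisher's r-to-z transformation.

    Returns the z statistic and its two-tailed p value, assuming the two
    correlations come from independent samples of size n1 and n2.
    """
    z1 = math.atanh(r1)  # Fisher transformation: 0.5 * ln((1 + r) / (1 - r))
    z2 = math.atanh(r2)
    standard_error = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = (z1 - z2) / standard_error
    # Two-tailed p value from the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p


# Mean human RAW-SAW correlation versus mean simulated correlation.
z_stat, p_value = fisher_r_to_z_test(0.52, 191, 0.79, 191)
```

Because the inputs here are the rounded coefficients reported in the text, the resulting statistic should be close to, but not necessarily identical to, the published value.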

RAW-SAW different choice predictions.

As a final test of the alignment between participants’ RAWs and SAWs, we estimated how often participants would have made different choices if they had based their decisions on their SAWs as opposed to their RAWs. To do so, we separately analyzed each choice task completed by each participant (n = 2,674 tasks) and estimated the utility that the participant would have assigned to each of the three schools by multiplying their SAW or RAW for each attribute by the level of the attribute (scored as 1–5) and summing across the seven attributes. We then assumed that the participant would select the alternative with the highest utility and calculated the percentage of tasks for which weighting by SAWs led to different choices than weighting by RAWs. We found that weighting by SAWs led to different choices than weighting by RAWs in 18.36% (491/2,674; κ = .72) of choice tasks.
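Once the alternative predicted by each weighting scheme is known for every task, the disagreement rate and the agreement statistic can be computed as sketched below. The sketch assumes that κ refers to Cohen's kappa; the function names and the example predictions are hypothetical.

```python
from collections import Counter


def disagreement_rate(choices_a, choices_b):
    """Percentage of tasks on which two sets of predicted choices differ."""
    differing = sum(a != b for a, b in zip(choices_a, choices_b))
    return 100.0 * differing / len(choices_a)


def cohens_kappa(choices_a, choices_b):
    """Agreement between two sets of predictions, corrected for chance."""
    n = len(choices_a)
    observed = sum(a == b for a, b in zip(choices_a, choices_b)) / n
    counts_a, counts_b = Counter(choices_a), Counter(choices_b)
    # Chance agreement: product of the marginal proportions per alternative.
    expected = sum(counts_a[k] * counts_b[k] for k in counts_a) / (n * n)
    return (observed - expected) / (1.0 - expected)


# Hypothetical predicted choices (index of the chosen school) for eight tasks.
saw_predictions = [0, 1, 2, 1, 0, 2, 1, 0]
raw_predictions = [0, 1, 2, 0, 0, 2, 1, 1]

rate = disagreement_rate(saw_predictions, raw_predictions)  # 2 of 8 tasks differ
kappa = cohens_kappa(saw_predictions, raw_predictions)
```

Kappa is informative alongside the raw percentage because, with three alternatives per task, two unrelated sets of predictions would still agree on roughly a third of tasks by chance.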

RAW-AIR correlations.

We also explored the correlations between participants’ RAWs and their attribute importance ratings (AIRs). Correlation coefficients for each attribute ranged from r = .18 - .60 (see Fig 2), and the average of the seven correlation coefficients was only r = .40. This low-to-moderate correlation falls at the bottom end of the benchmark range (r = .40 - .70) [37] for correlations between attribute weights measured via different elicitation methods, thus providing further evidence that decision makers lack metacognitive knowledge of the attribute weights they use when making school choice decisions, just as they do in other domains. A two-tailed Fisher’s r to z transformation indicated that the mean correlation between RAWs and AIRs was not significantly different from the mean correlation between RAWs and SAWs (n = 191, z = -1.42, p = .16).

Demographic differences in metacognitive knowledge.

We next explored possible demographic differences in metacognitive calibration. To do so, we first calculated the absolute difference between each participant’s RAW and SAW for each attribute, and then took the average of these values across attributes to generate a measure of each participant’s metacognitive knowledge of the attribute weights they had used. We then regressed this Average RAW-SAW Difference variable (descriptive statistics reported in the S2 Table) on six demographic factors: gender (male vs. female), parental status (parents vs. non-parents), educational attainment (Bachelor’s vs. no Bachelor’s), urban status (urban vs. suburban vs. rural), age, and income. Only the coefficient for income was significant (B = -0.38, SE = 0.15, p = .01). The full regression is reported in Table 1.
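The Average RAW-SAW Difference for a single participant can be sketched as below. Both sets of weights are assumed to be expressed as percentages, and the values are hypothetical, with only four attributes shown.

```python
def average_raw_saw_difference(raws, saws):
    """Mean absolute gap between revealed and stated weights across attributes."""
    gaps = [abs(raws[name] - saws[name]) for name in raws]
    return sum(gaps) / len(gaps)


# Hypothetical weights (percentages) for one participant.
participant_raws = {
    "Graduation Rate": 40.0,
    "Average ACT Score": 30.0,
    "AP Enrollment": 20.0,
    "Percent Minority Students": 10.0,
}
participant_saws = {
    "Graduation Rate": 25.0,
    "Average ACT Score": 35.0,
    "AP Enrollment": 30.0,
    "Percent Minority Students": 10.0,
}

# Absolute gaps are 15, 5, 10, and 0, so the average difference is 7.5.
avg_diff = average_raw_saw_difference(participant_raws, participant_saws)
```

Larger values indicate poorer calibration, so a negative regression coefficient on this measure corresponds to smaller gaps between stated and revealed weights.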

Table 1. Study 1a demographic factors predicting average RAW-SAW differences.

https://doi.org/10.1371/journal.pone.0301768.t001

Study 1a discussion

In Study 1a, participants were unable to accurately self-report the weight that they had placed on various attributes when making school choice decisions. The degree of miscalibration was similar to results reported in the literature for other domains. This pattern persisted regardless of self-report format, and metacognitive knowledge was not associated with demographic characteristics other than income. In conjunction, these results suggest that decision makers may lack the metacognitive sophistication to make school choice decisions that match their true preferences. Indeed, we found that over 18% of choices would have been different if participants had applied the weighting functions that they claimed to have preferred, rather than the weighting functions they revealed through their choices.

Notably, however, the participants in Study 1a were a combination of parents and non-parents, which could mask the possibility that parents may have unique metacognitive knowledge of their school choice attribute weights because they are likely to have more experience with the education system. Although Study 1a found no significant difference between parents’ and non-parents’ Average RAW-SAW Differences, the sample did not include enough parents to conduct the full set of analyses on parents alone. To rectify this limitation and fully address the possibility that parents could have different levels of metacognitive knowledge than non-parents, Study 1b replicated Study 1a with a participant sample consisting entirely of parents of high school-aged children.

Study 1b

Study 1b overview

Study 1b was conducted as a direct replication of Study 1a, except that all participants were parents whose oldest children were high school-aged (14–19 years old) at the time of study completion. The purpose of Study 1b was to test whether parents of high school-aged children have greater metacognitive knowledge of attribute weights in school choice decisions than the general population. Participants completed the same tasks as in Study 1a.