Risk Factors for Children's Receptive Vocabulary Development from Four to Eight Years in the Longitudinal Study of Australian Children

Receptive vocabulary develops rapidly in early childhood and builds the foundation for language acquisition and literacy. Variation in receptive vocabulary ability is associated with variation in children's school achievement, and low receptive vocabulary ability is a risk factor for under-achievement at school. In this study, bivariate and multivariate growth curve modelling was used to estimate trajectories of receptive vocabulary development in relation to a wide range of candidate child, maternal and family level influences on receptive vocabulary development from 4–8 years. The study sample comprised 4332 children from the first nationally representative Longitudinal Study of Australian Children (LSAC). Predictors were modeled as risk variables with the lowest level of risk as the reference category. In the multivariate model, risks for receptive vocabulary delay at 4 years, in order of magnitude, were: Maternal Non- English Speaking Background (NESB), low school readiness, child not read to at home, four or more siblings, low family income, low birthweight, low maternal education, maternal mental health distress, low maternal parenting consistency, and high child temperament reactivity. None of these risks were associated with a lower rate of growth from 4–8 years. Instead, maternal NESB, low school readiness and maternal mental health distress were associated with a higher rate of growth, although not sufficient to close the receptive vocabulary gap for children with and without these risks at 8 years. Socio-economic area disadvantage, was not a risk for low receptive vocabulary ability at 4 years but was the only risk associated with a lower rate of growth in receptive vocabulary ability. At 8 years, the gap between children with and without socio-economic area disadvantage was equivalent to eight months of receptive vocabulary growth. These results are consistent with other studies that have shown that social gradients in children's developmental outcomes increase over time.


Introduction
Our uniquely human capacity for language is one of the most important developmental accomplishments of childhood. Language enables literacy, education, and employment [1,2,3,4,5,6]and is one of the major pathways that supports human capability formation [7]. School is regarded as the single best investment in human capability formation in individuals and populations [8], however, the foundations for success at school are built early. Increasingly, early childhood (birth to 4 years) is the focus of global and national public policy frameworks for human capability expansion [9,10,11].
It is well known that language acquisition is not robust for all children and that disparities in language acquisition emerge early and foretell persistent low levels of language abilities. Longitudinal studies have shown a consistent pattern of early emergence of disparities in language acquisition that persist over time and influence disparities in later educational achievement [2,4,6,12,13]. Results from the first Australian national census of child development conducted in 2009 illustrated the extent of disparities in language abilities in children in the first year of formal school at 5 years. The results of the 2009 Australian Early Development Index (AEDI) showed that 23% of children were vulnerable in language and cognitive skills and that 25% were vulnerable in communication skills [14,15].
Receptive language ability is a dimension of language that emerges in the first year of life and tracks strongly to children's literacy abilities [16,17]. Our receptive vocabulary comprises words we know. Children show reliably that they understand the meaning of words they hear from around 8 months, well before they say their first words around their first birthday [18,19,20]. From onset in infancy, receptive vocabulary develops rapidly in the preschool and school years, from around 200 words in the second year [21], to 20,000 words at 8 years [22]. These crosssectional studies of receptive vocabulary abilities have shown striking individual differences in receptive vocabulary size at different ages.
Population studies that have identified children in the low range of language ability at an early age and followed them up at a later age have revealed a mixed picture of persistent low language ability for some children and catch-up for others [23,24]. These studies point to differences in the rate of language growth between ages, and prompt the question, what factors influence language growth between ages?
Guided by bioecological [25]and transactional [26] models of child development, studies that have used direct observation of parent-child interaction have necessarily investigated the influence of a restricted set of candidate predictors on receptive vocabulary development. This is because data collection, using direct observation, is so resource intensive [27]. Large-scale prospective population level cohort studies permit investigation of a comprehensive range of child, maternal and family level influences on receptive vocabulary development. For resource reasons, proxy, rather than direct measures of home environmental support for language learning are used in large-scale studies such as LSAC [28].

Child, Maternal and Family Influences on Children's Language Acquisition in LSAC
LSAC is an ongoing life course epidemiological study designed to answer questions about how a child's individual characteristics interact with parental, family, community and school characteristics to shape development. The overarching aim of LSAC is to provide a contemporary evidence base for policy, prevention and intervention initiatives in Australia. Data collection for the study is led by a consortium of expert Australian researchers [29]. Guided by a bioecological model of child development [25], data are collected on child, parental, family, community and school characteristics that influence children's development at different ages (i.e., a developmental pathways approach [26,30]. The measurement framework is comparable to indicator frameworks used internationally [31]. Indicator frameworks group variables that influence child development into key domains. For example, time, income, human capital, psychological capital and social capital [32]. This has produced a comprehensive (and expanding) set of independent variables that researchers can select from, to model in relation to specific developmental outcomes, which in this study, is language development [29].
The focus of this study is on factors that influence variation in receptive vocabulary abilities in the preschool and school years in an unselected population level sample. Population level research in language acquisition is rare, meaning there were few precedents for selecting candidate predictors of variation in language acquisition for this study. We selected child, maternal and family variables for our models based on two criteria: (1) Evidence of an independent association with English language abilities in an unselected population level sample of preschool and school age children; or (2) conceptual relevance to language abilities, in the absence of empirical evidence.
This study permitted the rare opportunity to bring together all of these predictors in the same models and to multivariately adjust for the independent influence of these predictors on receptive vocabulary growth from 4-8 years, in an unselected population level sample. Child characteristics. The child characteristics in our models were gender, birthweight, ethnicity, temperament, school readiness and ear infections. Male gender, low birthweight [23], and minority race [28] have been identified as independent risk factors for low receptive language abilities. Temperament was included in our models because of the plausible relationship between a child's temperament characteristics, parent-child interaction and word learning. Low persistence is a candidate risk factor for low receptive vocabulary ability, based on evidence that low persistence was more prevalent in four-year-old children with low language abilities, relative to children with typical language abilities [33]. We included school readiness in our models because disparities in foundational literacy and numeracy skills have been shown to emerge as early as 22 months and to persist through the school years and into adulthood, especially for children with low socioeconomic status [1,13]. Ear infections were included in the models, although surprisingly, this risk factor has not previously been examined in an unselected population level sample of preschool or school age children. What is known, is that in the LSAC sample, ear infections at four years were associated with a four-fold risk for hearing impairment at eight years [34]; and children with hearing impairment had lower levels of receptive vocabulary ability compared to children with normal hearing [35].
Maternal characteristics. We modeled maternal characteristics from three key resource domains for child development; human capital, psychological capital and time [36]. Our measures of human capital were age at the birth of the child, education and parenting. Our measures of psychological capital were mental health distress, problematic alcohol use and smoking. Our measure of time was hours of paid employment.
The most extensively studied variable is maternal education, with low levels of maternal education consistently associated with low receptive language ability [23,37]. Parenting was included in our model as proximal measures [25] of human capital. High levels of positive parent-child interactions are associated with high levels of receptive vocabulary ability [38]. Maternal age was included in the model as an additional measure of human capital. Evidence from the only study, to examine the associations between maternal age, education and low receptive language ability, showed that low levels of education were associated with low receptive vocabulary ability and that young maternal age was not an independent risk for low receptive vocabulary ability. The same study included maternal mental health distress as a measure of psychological capital and reported no association with low receptive language ability [23]. We included mental health distress in our model as well as two health risk behaviors, problematic alcohol use and smoking, that are associated with mental health distress [39]. Full-time maternal employment has been associated with lower language abilities in preschool age children although the effect size was negligible [40] Family characteristics. The family characteristics in our models were: Non-English Speaking Background (NESB), family structure, sibship size, income, health care card, financial hardship, socioeconomic disadvantage, reading to the study child, playgroup and child care. Four measures of socioeconomic status (SES) were included in our model: Income, health care card, financial hardship and socioeconomic disadvantage. NESB, family structure, sibship size, SES and reading to the study child are indicators of family resources for language development. Playgroup attendance and hours in child care are measures of the child's wider ecology, that is contexts other than the family. NESB and socioeconomic disadvantage are independent risk factors for low receptive language abilities [23,38]. Single-parent family structure has also been associated with lower receptive vocabulary ability, relative to children in nuclear families [38]. Being an only child is associated with higher verbal abilities compared to children lower in the birth order [41] and large sibship size is associated with lower verbal and nonverbal abilities and lower academic achievement [42]. Receptive vocabulary abilities vary with SES, with lower levels of SES associated with lower receptive vocabulary abilities. This pattern is found when individual [23] and composite [28] SES measures are used. Book reading is a proximal measure of the home learning environment and is associated with variation in language abilities and foundational literacy and numeracy abilities [43] [38]. Playgroup attendance was included based on prior evidence of an association between attending playgroup and better learning (including verbal) outcomes for children growing up in disadvantaged families [44]. Hours in child care was included in our model because a high proportion of children (28%) in our sample spent more than 4 hours a day in non-parental care, during the working week. It was of interest to determine whether the amount of hours in non-parental care was an independent influence on children's receptive language abilities, over and above the other child, maternal and family characteristics in our models. Prior evidence has suggested that hours spent in non-parental care was not associated with variation in children's language abilities [45]. The present study investigated receptive vocabulary growth from 4-8 years in the nationally representative Longitudinal Study of Australian Children (LSAC). The specific aims were to (1) identify child, maternal and family risks for low receptive vocabulary ability at 4 years and (2) identify child, maternal and family risks for growth in receptive vocabulary ability from 4-8 years. Of particular interest, was whether or not the risks for a low intercept at 4 years were associated with a lower rate of growth from 4-8 years.

Ethics Statement
The Longitudinal Study of Australian Children (LSAC) is conducted in a partnership between the Department of Families, Housing, Community Services and Indigenous Affairs (FaHCSIA), the Australian Institute of Family Studies (AIFS) and the Australian Bureau of Statistics (ABS). The study has ethics approval from the Australian Institute of Family Studies Ethics Committee. The Ethics Committee is registered with the Australian Health Ethics Committee, a subcommittee of the National Health and Medical Research Council (NHMRC). As the study children were all minors at the time these data were collected, written informed consent was obtained from the caregiver on behalf of each of the study children. The signed consent forms are retained by the field agency (ABS).

Access and Use of LSAC Data
LSAC data are publicly available. Researchers can apply to the Commonwealth of Australia Department of Families, Housing, Community Services and Indigenous Affairs (FaHCSIA) for permission to access and use Longitudinal Study of Australian Children (LSAC) data (FaHCSIA website. Available: http://www. fahcsia.gov.au/our-responsibilities/families-and-children/programsservices/growing-up-in-australia-the-longitudinal-study-of-australianchildren-lsac. Accessed 2013 Aug 2).

Study Design
The Longitudinal Study of Australian Children (LSAC) is a national longitudinal study that commenced in 2004. The study uses a cross-sequential design of biennial face-to-face visits with the family and study child. In this study we used the first three waves of the child cohort which comprised 4,983 children at wave 1 (age 4-5 years), 4,464 children at wave 2 (age 6-7 years) and 4,331 children at wave 3 (age 8-9 years).
The LSAC sampling frame was extracted from the Medicare Australia enrolment database, which was validated to ensure coverage of Australian children within the target age-range. The initial study sample was aimed at being representative of Australian children within the selected age cohort, proportional to the regional distribution of children in the Australian population. An initial sample size of 5,000 was chosen as to ensure there would still be a sufficient sample for detailed analysis after attrition over the number of years of the longitudinal study.
The study entailed a two-stage clustered design, first selecting postcodes then children within postcodes. Stratification was used to ensure proportional geographic representation for states/ territories and capital city statistical division/rest of state areas. Cluster sampling was utilised because it provides a cost effective way to conduct face-to-face interviews, as well as an opportunity to collect and analyse community-level effects. Postcodes were selected with probability proportional to size selection where possible, and with equal probability for small population postcodes. Children were selected from 311 postcodes [46,47].
Analyses show that the initial sample was broadly representative of the general Australian population when compared with 2001 Census data, but slightly under-representative of families who were single-parent, non-English speaking and living in rental properties. Attrition somewhat increased these biases. For example, the overall attrition rate between Waves 1 and 3 was 13%, but children with mothers classified as Non-English speaking background decreased from 15.7% at Wave 1 to 13.8% at Wave 3, an attrition rate of 23%. The proportion of mothers who had a year 11 or less education as at Wave 1 decreased from 39.2% at Wave 1 to 36.5% at Wave 3, an attrition rate of 19% [48]. Such attrition is typical in longitudinal studies. The growth curve modelling used in this study utilises maximum likelihood estimation; this technique makes full use of available data, which minimises the effects of item-non response within the study although does not fully adjust for missing data.

Measures
A bioecological model of child development [25,49] guided the selection of measures for LSAC. The conceptual model posited multiple domains of proximal and distal influences on child development. Among these domains are characteristics related to the child, the mother, and the family home environment. Many of the measures are benchmarked against Australian census collections while still others are referenced to large scale Australian and international child development studies. For ease of summary the measures used in the growth modeling for this study are classified into the above domains and with respect to their function (i.e., response, time and predictors).
Response variable. The response variable was the children's performance on the Adapted Peabody Picture Vocabulary Test-III (PPVT-III). The children were assessed by the interviewer with a number of tests, including the Adapted PPVT-III, a test of receptive vocabulary designed for the LSAC study [50]. The Adapted PPVT-III is a shortened version of the PPVT-III [51]. The Adapted PPVT-III was administered directly to each child during the home interview. For each word presented, the child was shown a card containing four pictures and was asked to point to the picture corresponding to the word (e.g., ''Show me wrapping''). Scaled scores for the Adapted PPVT-III were used in all analyses. While the full details of the development of this shortened version are available elsewhere, briefly the Wave 1 calibration entailed an independently drawn sample of 215 children aged from 41 to 66 months (mean = 54.7 months) who were given the full PPVT-III with test administrators following standard procedures. After testing, a one-parameter (Rasch) item response model was fitted to the data, which consisted of correct and incorrect responses. The person separation reliability was 0.88. After determining the 'best' 40 items for use in a shortened version, the remaining items were then fit again to a oneparameter item response model; the person separation reliability was 0.78. In this way a short version of the PPVT-III was developed suitable for administration to the child in a household survey. The Pearson product-moment correlation between the full PPVT-III and the Adapted PPVT-III was 0.93 for all children [50]. This procedure was carried out at each Wave with appropriately drawn calibration samples. Thus the Adapted PPVT scores used in this report reflect a continuous range of ability. Table 1 contains the median ages and age ranges in months for the study children at each Wave along with the mean Adapted PPVT-III Rasch scaled score with SDs and associated ranges. For economy of expression, the Adapted PPVT-III is referred to as the PPVT.
Time. The child's age in months was used as the measure of time. There were approximately 24 months between each wave of the LSAC, with the age of children within each wave varying around a median age (see Table 1). This distribution of ages allowed a detailed month-by-month analysis of growth in receptive vocabulary ability over time.
Child, maternal and family candidate predictors. A total of 28 candidate predictor variables were selected for models. These were grouped into child, maternal, family and home environment characteristics (see Table 2). Child characteristics. The child characteristics in our models were: Gender, ethnicity, birthweight, ear infections, school readiness and temperament. There were equal proportions of girls and boys in the sample. A small proportion of children (n = 187; 3.8%) were of Aboriginal and/or Torres Strait Islander decent and were coded to distinguish them from those who were not. Primary carers were asked to report their child's birthweight which was subsequently coded into those children who were born with low birthweight (,2500 grams; 6.5%) and those who weighed more than this (. = 2500 grams). Mothers were asked if their child had a range of ongoing health problems. Because of the known association between hearing loss and language emergence a single item indicator of ongoing ear infections at Wave 1 was included. In addition to the Adapted PPVT-III, each study child was directly assessed at Wave 1 using the Who Am I? [52]. This is a measure of school readiness and comprises 11 items in which children write their names, copy shapes and write words and numbers. It has been extensively calibrated for use in the LSAC and has well demonstrated item characteristics, high internal reliability (0.89), and excellent distributional properties [52]. In this report, study children have been grouped into quintiles of performance based on the total Who Am I? score with high quintiles representing higher levels of performance.
Child temperament was measured at wave 1 of the LSAC by administering the Short Temperament Scale for Children (STSC; [53]) to the person designated as Parent 1. The STSC measures three dimensions of temperament: persistence, reactivity and sociability. Persistence refers to a child's ability to stay focussed on tasks, reactivity refers to refers to irritability, negative mood and high-intensity negative reactions, and sociability refers to a child's tendency to approach novel situations and people or conversely to withdraw and be wary. Each temperament dimension was assessed through parent report using four items, rating the frequency of the behaviours on a 6-point Likert scale of occurrence from ''almost never'' to ''almost always''. Where data were missing for any of the items making up a dimension of temperament respondents were coded as missing for that variable. Four composites were constructed based on the respective items and each was then divided into quintiles with higher quintiles representing the positive aspects of each dimension.
Maternal characteristics. The maternal characteristics in our models were: Age at the birth of the child, problematic alcohol use, smoking, mental health distress, education, hours of paid employment and parenting. The biological mother's age at the birth of the child was grouped into categories representing teen birth (,age 20 years), 20-39 years and 40 or more years at birth with the vast majority of mothers (93.9%) of study children in the age range 20-39 years.
Information on current tobacco and alcohol at Wave 1 was gathered from the mothers. We defined problematic alcohol use where women reported their daily alcohol consumption to exceed 2 standard drinks and/or where they reported frequent binge drinking of 5 or more alcoholic drinks at least 2-3 times per month with 12.1% being so classified. Study children's mothers were asked about tobacco use and also categorised as either current smokers (23.1%) or not current smokers.
In this study, we used the Kessler-6 (K-6) to measure maternal non-specific psychological distress. Women with scores of 8 or more were classified as having symptomatic psychological distress. This threshold is consistent with other studies [54,55] using the K-6. The scale has robust characteristics as an indicator of mental health with recent Australian findings [39] that 53% of Australian adults with a score of 8-12 on the K6 had an ICD-10 anxiety or depressive disorder as diagnosed by the Composite International Diagnostic Interview. Sixteen percent of mothers reported symptomatic psychological distress.
In Australia, at the time of this study, 10 years of education was compulsorily mandated. Maternal education in years was grouped into three levels according to those who had completed 11 years (39.2%), 12 years (32.3%), and those who had completed more than 12 years (i.e. University education) (28.5%).
Mothers were variously employed at the time when the children were first measured. We used total hours of paid maternal employment to distinguish mothers who were not in paid employment (0 hours), from those in part time paid employment (1-37 hours; Mean = 17.8) and in full time paid employment (. = 38 hours; Mean = 44.9). Equal proportions of women (43%) were either not in paid employment or working part time with the remainder (13%) working full time.
The parenting characteristics of both parents were measured in a self-complete form, using four measures of parenting warmth, hostility, consistency and inductive reasoning developed for the LSAC [54]. We use the mother's responses in this report. Responses to each item were on a 5-point Likert scale, ranging from ''almost never'' to ''always/almost always''. Items for each measure were summed to create a composite score with higher levels representing more positive parenting characteristics. Item and scale properties for the LSAC parenting measures have been extensively documented [54]. Ordinal scale reliabilities [55] were 0.72 for maternal hostility, 0.82 for consistency and .93 and .94 for warmth and inductive reasoning respectively.
Family characteristics. The characteristics of the family home environment in our models were: NESB, family structure,  sibship size, income, health care card, financial hardship, socioeconomic disadvantage, reading to the study child, playgroup and child care. As the focus of this study is explicitly on English language development and because language development is known to vary where more than one language is spoken in the home, we used the mother's NESB as a general indicator for language other than English spoken in the household at Wave 1. About 16% of mothers were predominately non-English speaking at the time of the interview. With respect to family composition, two variables were selected as candidate predictors of vocabulary development: Family structure (sole parent vs. other) and number of siblings (0, 1, 2, 3, 4+). About 13% of the study children were living in single mother families and the majority had one sibling (48.4%) or were singletons (11.4%) at the time of the Wave 1 interview. The study design did not permit the establishment of birth order.
Families were asked to report their total weekly family income from all sources. Responses were partitioned into relatively equal quintiles: those families earning under $600, $600-$999, $1000-$1499, $1500-$1999, and $2000 or more per week. In Australia, there is universal health coverage and where income falls below a defined threshold and/or certain hardship criteria are met families also qualify for a health care card. About 22.0% of LSAC families had a health care card and this is used as an indicator of financial need in the LSAC families. Additionally, an indicator of family hardship was also derived where families reported, due to shortage of money over the last 12 months that: they had not been able to pay gas, electricity or telephone bills on time; they had not been able to pay the mortgage or rent on time; adults or children had gone without meals; they family had been unable to heat or cool their home; they had pawned or sold something; or sought assistance from a welfare or community organisation. About one third of families reported at least one of these occurrences in the previous 12 months to the Wave 1 interview.
An area measure of socioeconomic disadvantage was also estimated for each participating family. The family home was coded with Socio-Economic Indicators for Area (SEIFA) disadvantage, indexed in quintiles-higher quintiles represent greater levels of disadvantage. The neighbourhood SEIFA disadvantage index summarizes information from the Australian Census of Population and Housing as this relates to economic and social disadvantage in small areas, such as low income, low educational attainment and high unemployment [56]. This data was linked at either the Statistical Local Area (SLA) level or, where this was not available, the child's postcode. Several indicators of the child's learning environment were gathered. The frequency with which the primary caregiver read to the study child was assessed via face-to-face interview. A total of 182 (3.6%) of parents reported not reading to the child at all, 970 (19.5%) reported reading 1 or 2 days a week, 1478 (29.7%) reported reading to the child 3-5 days a week, and 2350 (47.2%) reported reading to the child daily. Mothers were asked if their child had attended a playgroup in the period 12 months prior to the Wave 1 interview with about one third indicating this to be the case. Finally, hours a week in care/early education were coded by asking the parent ''how many hours a week on average does the child go to school, kindergarten, pre-school, and/or day-care?'' A total of 298 (6.3%) attended 8 or less hours a week, 3114 (65.5%) attended 9-20 hours a week, 950 (20.0%) attended 21-30 hours a week, and 389 (8.2%) attended 31+ hours a week.

Data Analysis
The data were analysed using growth curve models [57]providing a clearer view of the trajectories of vocabulary growth. These models estimate intercept and slope effects of receptive vocabulary development in relation to the candidate predictors. Predictors may be examined in both bivariate and multivariate models. In multivariate models the predictor set is simultaneously adjusted permitting an estimate of the relative (e.g. unique) contribution of each of the predictors when the remaining predictors are at their reference levels. A desirable feature of growth curve analysis is its use of unevenly spaced measurements and its use of all available data in the presence of some level of missing data [53].
Our growth curve modelling utilises a two-level nested structure. Level 1 is the within-person model while level 2 is the between-person model. The within-person component allows each child to have a unique receptive vocabulary growth trajectory and the between-person component of the models represents variation in receptive vocabulary growth parameters among children with similar characteristics. The Proc Mixed procedure in SAS 9.2 was used to fit these models. As we are primarily interested in the fixedeffects portions of these models, we have used Maximum Likelihood as the estimation method.
Our analyses proceeded in three steps. First, an unconditional growth model was estimated to gain an overall model of receptive vocabulary development across the sample. The only predictor variable in this unconditional growth model was time (i.e., age). We made the decision to input time to the model parameterized as the child's age minus 50 months. Thus each child's age was modelled as his or her age at each wave minus 50 (i.e. age at wave 1-50, age at wave 2-50, age at wave 3-50). This set the intercept (i.e. the start of the growth trajectory) such that the youngest child in the study had an intercept of zero months at Wave 1.
Receptive vocabulary development was then modelled as a linear function of children's initial receptive vocabulary (intercept), growth per month in receptive vocabulary (linear slope), and random error. Following Singer and Willett [57], the unconditional growth model is represented by the following equation: That is, PPVT ij = (c 00 + f 0i ) + (c 10 + f 1i ) (AGE ij 2 50) + e ii Or alternatively, PPVT ij = c 00 + c 10 (AGE ij 2 50) + f 0i + f 1i ) (AGE ij 2 50) + e ii In this model PPVT ij represents child i's PPVT score at time j, p oj represents child i's PPVT score at an initial intercept (time zero), p 1i represents a linear function of PPVT score at child's age in months, AGE ij represents child i's age at each time j and e ii represents unique error associated with that child at time j. c 00 and c 10 represent systematic between-person differences in intercept and slope. f 0i and f 1i represent the residuals unexplained by the between-person parameters. These residuals represent the amount of variation left over after accounting for the model's predictors, in this case, the child's age.
In this model we presume that a linear trajectory adequately describes growth over time in this study. This assumption was supported by an extensive visual screening of initial growth plots. Figure 1 depicts the empirical growth plots for a random sample of 50 children. There were no major departures from linearity, and with a maximum of three time points per child, a linear model was determined to be adequate to describe each child's trajectory.
Second, we developed a series of conditional models that included child, maternal and family predictors. Initially, each predictor was considered individually. Each sub-model fitted into the above model as follows: In these models the terms c 00 , c 01 , c 10 , and c 11 , represent fixed effects. These are the systematic between-individual differences in intercepts and slopes according to the predictors in the model. c 01 represented the difference between the reference group and the group of interest at intercept and c 11 represented the difference between the reference group and the group of interest in slope. As before, f 0i and f 1i represented the residuals that were not explained by the predictors.
Our fixed effects were typically parameterized such that the lowest risk (or most advantaged) group represented our reference group. For each variable, the percentage of the sample in each of the reference categories, along with the percentage of the sample in each of the other categories are shown in Table 2. For example, in examining birthweight we used children of 'normal' birthweight as our reference group. For school readiness, we used children in the quintile with the highest Who Am I? scores as our reference group. Thus, we estimated the bivariate impact on the child's PPVT score at intercept (i.e. at the start of the growth trajectory) and slope (i.e., monthly rate of growth in PPVT score) for each candidate predictor. To establish the effect size for the various predictors we standardised the regression coefficients from the growth curve models into standard deviation units (Cohen's d) cohen [58].
Following Singer and Willett [57] we calculated a pseudo-R 2 measure of model fit by squaring the correlation between observed and predicted values for PPVT, as an indicator of model fit.
Third and finally, we tested an overall multivariate model by selecting candidate predictors from these bivariate estimates on the basis of their Cohen's d. Cohen provides some general guidance on what constitutes a negligible (d,0.2), small (d = 0.2), medium (d = 0.5) and large (d = 0.8) effect sizes. We were able to calculate effect sizes across a range of values of change in the PPVT score and chose d = 0.3 as the threshold for including a predictor in the multivariate model. We chose this threshold because an effect size of this magnitude corresponds to about 6 months of development with respect to change in the PPVT score. Our rationale was that a six-month lag in receptive vocabulary ability at 42 months constitutes a developmentally meaningful gap. It was of interest to determine the extent to which children with risk factors associated with a 6 month lag in receptive vocabulary abilities at 4 years were able to close the gap at 8 years. We acknowledge that there is no precedent for setting a threshold of d = 0.3 as a developmentally meaningful effect size in relation to language ability. However, we were guided by a meta-analysis of meta-analyses of child and family influences on children's educational achievement that identified d = 0.31 as the average effect size for child influences and d = 0.40 for family influences [59]. Our rationale was not to reduce the predictor set a priori, but to select predictors for multivariate modelling associated with a developmentally meaningful lag in receptive vocabulary ability at the intercept. Using the threshold of d = 0.3, a subset of 16 predictors from the 28 were selected for entry into multivariate modelling, allowing estimation of their independent effects on the PPVT score intercept and slope. We examined model diagnostics and noted that after accounting for the predictors in the multivariate model, random slope (i.e. individual rate of growth in PPVT scores) no longer accounted for any substantive variation in PPVT score, to the point that there was not enough slope variance to model. As a result, random slope was removed from the final model. So the final model included a random intercept term, and co-efficients for different rates of growth for each level of each predictor variable.

Results
Growth modelling permits several views of vocabulary change over time. Table 3 contains the estimates of the unconditional growth model and the bivariate growth model along with the effects for each predictor on the initial PPVT score (the intercept) and the effect of the predictor on PPVT growth (the slope). These effects include the predictor's effect on the PPVT score at intercept, its effect size -Cohen's d (d i ), and the number of months this effect represents relative to the reference group for the given predictor variable. Additionally we provide estimates of each predictor's effect on growth in the PPVT score over time (i.e., slope), the effect size for the slope calculated as Cohen's d (d s ), and then the projected change in PPVT from 50 months onward to 57, 82 and 105 months (the median ages at each wave of the study). The final columns in this table provide the estimated difference in PPVT points and associated difference in developmental age in months, along with model fit indices.

The Unconditional Growth Model
The unconditional growth model showed that at a starting age benchmarked to 50 months, the initial PPVT score was 62.88, with a growth rate from the period 50 to 119 months of 0.29 PPVT points per month (p = ,0.0001). This model has an overall fit (pseudo R 2 ) of 0.53. This showed that more than half of the variance in receptive vocabulary growth was explained by time (i.e., age).

Conditional Growth Models
Bivariate growth modelling. We tested bivariate associations with PPVT intercept and slope for each of the 8 child, 10 maternal, and 10 family variables in a series of conditional growth models (see Table 3).
For the child variables, most bivariate effects with PPVT were in the small to moderate range. The group of children performing in the lowest quintile on the Who am I? test of school readiness, exhibited an effect size bordering on large (d i = 0.73). These children with low school readiness had PPVT performances that started 4.5 points, or 14.5 months lower relative to those children in the highest Who am I? quintile. However, their onward vocabulary growth differential was among the highest observed (0.037; d s = 0.30) allowing these children to halve this initial gap to 2.5 PPVT points, representing a subsequent decrement of about 8 months in vocabulary growth at 105 months relative to the group in the highest Who am I? quintile. Within the quintile categories of the Who am I? there were differential effects on both the PPVT intercept and the slope. Progressively poorer Who am I? performances were associated with progressively poorer PPVT initial vocabulary performance. However, those children in the lowest two quintile groups on the Who am I? were the ones that exhibited more rapid growth in onward vocabulary over time relative to the children in the highest quintile group.
Among the child variables the candidate predictors for persistent and reactive temperament and Aboriginal status showed the next highest effects on vocabulary (d i = 0.42, 0.41. 0.40 respectively). The PPVT performance of children with low persistence started about 2.6 PPVT points or 8.8 months lower than those children with high persistence. The difference in the rate of change in vocabulary for those children with low persistence was essentially negligible and they were about 6.3 months behind in their PPVT performance 105 months later relative to children with high persistence. A similar pattern was evident in those children with high levels of temperamental reactivity. These children started about 2.5 PPVT points or 8.5 months lower than those children with low reactivity. These children evinced slightly more positive vocabulary change over time relative to those children with low reactivity, but none-theless remained about 4.7 months behind at 105 months. We would note here that the third measure of temperament -sociabilityshowed weaker associations with initial vocabulary performance (d i = 0.22). Relative to children with high sociability, those with low sociability started about 1.4 PPVT points or 4.8 months lower. Their rate of vocabulary change was not different from children with high sociability (d s = 0.03) and they were about 1.2 PPVT points, or 4.1 months behind in their vocabulary performance at 105 months.
Aboriginal children started about 2.5 PPVT points, or 8.5 months below non-Aboriginal children (d i = 0.40) and displayed a negligible difference in growth rate (d s = 0.05) over the period to Smaller effects on the vocabulary intercept were observed for low birthweight status (d i = 0.34). These children started about 2.1 PPVT points lower, or 7 months behind children with normal birthweight. This initial difference in PPVT vocabulary performance was approximately halved over time by a slightly more rapid change in vocabulary over time (d s = 0.14) in the low birthweight group and their performance was about 1.2 PPVT points, or 3.9 months behind at 105 months relative to normal birthweight children.
Finally, we found negligible effects on the PPVT intercept and slope for both gender and ear infections.
There were 10 maternal predictor variables assessed at this stage. In relative terms, the largest effect size on the vocabulary intercept was for maternal parenting consistency (d i = 0.55). That is, a medium effect size. Children who experienced a low level of parenting consistency started 3.5 PPVT points, or about 11.4 months behind in their vocabulary performance relative to those children who experienced high levels of parenting consistency. The rate of change of their PPVT over time was the largest among the quintiles of parenting consistency (d s = 0.22), and by 105 months they had almost halved the initial PPVT gap to about 2 points, or 6.5 months lower than those children who experienced high levels of parenting consistency. As levels of parenting consistency declined, so too did the PPVT intercept.
The effect of maternal inductive reasoning at intercept was less marked (d i = 0.20), with children who experienced a low level of maternal inductive reasoning starting 1.3 PPVT points, or about 4.2 months behind in their vocabulary performance relative to those children who experienced high levels of maternal inductive reasoning. The growth differential was negligible (0.006, d s = 0.05), with the initial PPVT gap persisting over time.
Maternal education showed predominant intercept effects. Relative to mothers with post-school education, the PPVT intercept of those children whose mothers had Year 11 or lower levels of education was 3 points or 10.3 months lower (d i = 0.48). The rate of change in the PPVT performance was negligible (d i = 0.04) and by 105 months their PPVT performance was 2.7 points or 9.4 months lower than children whose mothers had postschool education. The overall effect of changing levels of maternal education was confined to the PPVT initial performance (i.e., the intercept) rather than on producing onward change in PPVT over time (i.e. the slope).
Poor maternal mental health was associated with lower initial PPVT performance (d i = 0.42). Where mothers reported symptomatic mental health distress their children's initial PPVT performance was 2.6 points, or about 8.4 months lower than the performance of children whose mothers were not classified as symptomatically distressed. Over time the children whose mothers were mentally distressed showed more rapid change in their PPVT score relative to those whose mothers were not distressed (d s = 0.24) and at 105 months their PPVT score was about 1 point, or 3.1 months lower.
The effect of maternal employment on PPVT performance was small and observed in children whose mothers were not working relative to children whose mothers were working part-time. Children whose mothers were not working started about 2.1 PPVT points, or about 7 months behind in their vocabulary performance relative to those children whose mothers were working part-time. This initial difference was almost halved (d s = 0.15) and their PPVT score at 105 months was about 1 point, or 3.5 months lower than those children whose mother was working part time. The PPVT performances of children whose mothers were working full-time were indistinguishable from those children whose mothers were working part-time.
With respect to maternal age at birth, the only effect that was notable was a small effect (d i = 0.32) for the children whose mothers were teenagers at the time of the birth of the child. Relative to the children of older mothers, the initial PPVT performance of children whose mothers were teens was about 2 points, or 6.7 months lower. The rate of change in their PPVT performance was negligible (d s = 0.05) and at 105 months they remained 1.6 points, or 5.6 months behind the children of mothers aged 20-39 years. The PPVT performances of children of mothers aged 20-39 years was not different from children with mothers aged over 40 years at birth.
The effect of maternal smoking on PPVT performance was small (d i = 0.25), with the children of smokers starting 1.6 PPVT points or 5.3 months behind at intercept The growth differential was negligible (d s = 0.04), with the initial PPVT gap persisting over time.
Finally, the effects on child PPVT performances of maternal alcohol consumption, were very small and inconsistent. So too were the effects of low maternal warmth and high hostility.
Among the family variables measured, reading to the study child showed the single largest effect on the PPVT intercept. Relative to children who were read to daily, diminishing PPVT performances were associated with lower levels of parental reading to the child. So, relative to those children who were read to daily, the PPVT scores of children who were not read to at all was 6.1 points, or 18.6 months, lower (d i = 0.98). Importantly, the rate of change in the PPVT performance of children where were not read to was among the largest observed (d s = 0.38) allowing them to almost halve the gap between those who received daily reading. By 105 months their PPVT performance was 3.5 points, or 10.6 months lower than those children who were read to daily.
A large effect size was also observed on the initial PPVT performance of NESB children (d i = 0.74). The PPVT scores of NESB children were 4.6 points, or 16.3 months lower than the performance of children from English speaking households. The results for NESB children also showed the largest effect size observed for change in PPVT performance over time (d s = 0.49). Thus the 16.3 month gap between their PPVT performance relative to English speaking children was reduced to 1.3 PPVT points, or 4.5 months at 105 months.
Large effects were also observed on initial PPVT performance for low family income and increasing family size. Relative to families earning $2000 per week, the initial PPVT performance of children in families earning $600 per week was 4.3 points, or 13.9 months lower (d i = 0.69) and by 105 months the rate of change in their PPVT score (d s = 0.24) had reduced this gap to about 2.7 PPVT points, or 8.5 months.
This same pattern of effects was paralleled with respect to family size. Relative to only children, the initial PPVT performance of those children who had four or more siblings was 3.9 points, or 12.7 months lower (d i = 0.63). We observed linear effects on the PPVT intercept with decreasing PPVT performance as the number of siblings increased. These effects appeared confined to the intercept with the PPVT performances remaining in relative position over the period to 105 months. There were no differential slope effects.
Small effects were observed for financial distress as measured by the need for an Australian Health Care Card, area disadvantage (SEIFA), financial hardship and single mother status.
The PPVT initial performance of children in families needing a Health Care Card was 2.2 points or 7.5 months lower than the performance of children in families that did not need a Health Care Card (d i = 0.36). At 105 months these children were 1.7 points or 5.8 months lower in their PPVT performance with no differential rate of change (d s = 0.08).
The initial PPVT performance of children living in the areas of high social and economic disadvantage (i.e. lowest SEIFA quintile) was about 2 points or 6.8 months lower than those children living in the highest SEIFA quintile. Moreover, area disadvantage had a negative effect on PPVT differential, with children in the most disadvantaged areas falling further behind (d s = 0.11) such that by 105 months these children had a PPVT score 2.7 points or 9.5 months lower. Financial hardship and sole mother status were near the threshold for small effects on initial PPVT-III scores (both d i = 0.29). Both showed virtually identical effects on initial PPVT scores: Children in families with financial hardship had initial PPVT scores about 1.8 points or 6 months lower than in families without hardship. There was a negligible rate of change in PPVT performance by 105 months and these children's scores were 1.1 points and 3.7 months lower. The results for children of sole mother families were the same.
Finally the effects of playgroup attendance and hours of week in care on initial PPVT scores and on the rate of PPVT growth were negligible.
Multivariate growth modelling. Table 4 contains the 16 predictor variables with initial effects d. = 0.30 selected for multivariate modelling. We tested independent associations with PPVT intercept and slope for each of the 5 child, 5 maternal and 6 family variables. This model has an overall fit (pseudo R 2 ) of 0.6082. Thus the predictors accounted for around an additional 7% of the variance beyond the child's age, compared with the unconditional growth model (pseudo R 2 of 0.53). While the age of the child was, overwhelmingly, the most important predictor of receptive vocabulary growth, the effects of other predictors with an effect size of d. = 0.3 equated to a 6 month difference in receptive vocabulary at the intercept.
Of the sixteen variables in the multivariate model, NESB and Who am I? showed medium effects on initial adjusted PPVT performance (d i = 0.68 and d i = 058 respectively) and reading to the child and number of siblings showed small effect sizes on initial adjusted PPVT performance (d i = 0.45 and d i = 0.35 respectively).
In the multivariate model, NESB showed the largest effect on initial PPVT performance with PPVT scores 4.2 points or about 16 months behind families who were of English speaking status. In the multivariate model, this initial gap in PPVT performance had closed to 1.0 point or 3.9 months at 105 months.
Poor performances on the Who am I? showed effects on the initial PPVT performance, evident also in the performances of children at 105 months. Relative to children in the highest quintile of the Who am I?, those in the lowest quintile had initial PPVT scores 3.6 points or about 15 months lower. The rate of change in the PPVT score for those children in the lowest quintile of Who am I? narrowed the initial gap in performance to 1.5 PPVT points or 6.1 months.
The effects of daily reading to the child remained evident but reduced in the multivariate model. These effects were confined to the initial PPVT performance (i.e. the intercept) and resulted in children maintaining relative position with respect to PPVT performance at 105 months. When compared with those children who were read to daily, the initial PPVT performance of those children who were not read to at all was 2.8 points or 12.1 months lower. The rate of change in PPVT was essentially negligible for all levels of reading from the initial measure to the measure at 105 months. So, by 105 months the group of children who were reported as not being read to at the Wave 1 interview were 1.6 PPVT points, or 6.7 months lower than those who had been read to daily.
Sibling effects, while small, were apparent -particularly for those children with 2, 3 or 4 siblings, these effects being confined to the initial performance (i.e., intercept). So for example, relative to those children with no siblings, the initial PPVT performance for those with 4 siblings was 2.2 points or about 10 months lower. There were no apparent differential effects in the rate of PPVT change over time and these children were 1.7 points or 7.8 months lower at 105 months.
Of the remaining eight predictors, income, low birthweight and maternal education showed the next largest effects (d i = 0.26, 0.22 0.21 respectively). These variables demonstrated effects on PPVT initial performance (i.e. intercept). There were no differential rates of change in PPVT performance over time associated with income and mothers education. Relative to children in high income families, the initial PPVT performance of those in low income families was 1.6 points or 7.1 months lower and at 105 months this gap had diminished to 0.7 adjusted PPVT points, or 3.2 months. Low birthweight children started 1.4 PPVT points or 6.4 months lower, relative to normal birthweight children. The growth rate differential was negligible (d s = 0.07) and by 105 months the gap remained 0.9 PPVT points or 4.2 months. For children in families with mothers who had 11 or less years of school, their initial PPVT score was 1.3 points or about 6 months lower relative to children from families where mother had some level of post-school education. No differential rates of change in PPVT scores was observed over time, and this group persisted with relatively lower PPVT performance with PPVT scores of 1.3 points or about 6 months at 105 months.
After accounting for the other variables in the multivariate model, there was no independent effect of area disadvantage on initial PPVT, with the initial PPVT of children living in the areas of high social and economic disadvantage (i.e. lowest SEIFA quintile) about 0.05 points or 0.2 months lower than those children living in the highest SEIFA quintile. However, while the effect on the intercept was negligible, the negative receptive vocabulary growth differential associated with these areas (D s = 0.21) persisted, so by 105 months children in the areas with the greatest social and economic disadvantage were 1.5 PPVT points or 8.1 months behind.
Finally, effects for Aboriginal status, temperament variables, maternal age, maternal mental health, maternal work hours, parenting consistency, and Health Care Card status showed extremely small and inconsistent effects well below criterion threshold (i.e., d = .0.30).

Discussion
The aims of this study were to investigate trajectories of receptive vocabulary growth from 4-8 years and to identify child, maternal and family predictors of variation in the intercept and slope of these trajectories.
The results showed substantial variation in receptive vocabulary ability at 4, 6 and 8 years as well as in the rate of growth between 4-8 years. These results mirror studies of the onset and growth of receptive and expressive vocabulary development in the first three years of life [60,61]. This suggests that receptive vocabulary acquisition continues to be characterised by variability in the preschool and school years and is not just a pattern observed in very young children at the onset of language acquisition. This pattern of variability rather than convergence over time is not surprising given that receptive vocabulary growth does not have a developmental endpoint, unlike the acquisition of speech sounds or grammar.

Bivariate and Multivariate Growth Model Effects for Receptive Vocabulary
The child, maternal and family variables that were included in both the bivariate and multivariate models were those with a bivariate effect size d. = 0.30 (see Table 5). The child characteristics in both models were ethnicity, birthweight, school readiness and temperament. The maternal characteristics in both models were age, mental health distress, education, work hours, and parenting consistency. The family characteristics in both models were NESB, sibship size, income, health care card, socio-economic area disadvantage and book reading. Once multivariately adjusted, the child, maternal and family risk factors for low receptive vocabulary ability at four years were low birthweight, low school readiness, teenage motherhood, mental health distress, low maternal education, low maternal consistency, NESB, one or more siblings, low income, health care card, socio-economic area disadvantage and book reading. Except for socio-economic disadvantage, these risk factors were associated with a positive growth differential between 4 -8 years. The positive growth differential associated with low birthweight, low school readiness, teenage motherhood, mental health distress, low maternal education, low maternal consistency, NESB, one or more siblings, low income, health care card, and book reading, meant that children with these risk factors narrowed the gap in receptive vocabulary ability at the intercept, to varying degrees (see Table 5).
The adjusted effects are important to understand with respect to multiple risk. For most predictors, adjusted effect sizes are negligible to small. The total amount of variance that the predictors account for is an additional 7% after adjusting for advancing age, which accounts for another 52% of the variance in vocabulary development. This 7% is a surprisingly small percentage of increase in variance accounted for over and above that accounted for by age. This is consistent with other population level studies that have reported that child, maternal and family characteristics explain little of the variation observed in young children's language acquisition [62]. These adjusted estimates provide an indication of the independent contribution that the predictors make so it is important to remember that risks occur multiply and that the adjusted effect sizes are in this sense additive.
Based on effect sizes, this study failed to reveal strong explanatory factors for variable growth in children's receptive vocabulary abilities, apart from the child's age. That is, the effect sizes for risks associated with low receptive vocabulary ability were mostly medium to small. This is entirely consistent with the results of other studies of children's language acquisition that have shown that prediction of variability in language acquisition is modest at best [62,63,64]. However, when children's receptive vocabulary scores were equated to months, the impact of adjusted risk factors with medium to small effect sizes equated to noteworthy developmental differences in months of receptive vocabulary growth. For example, the risk for low receptive vocabulary associated with not reading to the study child had a small effect size at intercept (d = 20.45). However, this equated to a 12-month gap in receptive vocabulary development for children who were not read to at all, relative to children who were read to daily.
Of particular interest, was the extent to which risks associated with a low intercept at 4 years were also associated with a lower rate of growth from 4-8 years. From an initial set of 28 candidate predictors, 10 risks were associated with intercept effects. These risks in order of magnitude were: Maternal NESB, low school readiness, child not read to at home, four or more siblings, low family income, low birthweight, low maternal education, maternal mental health distress, low maternal parenting consistency and high child temperament reactivity. Notably, none of the risks that influenced the intercept at 4 years were associated with a lower rate of vocabulary growth from 4-8 years. Instead, maternal NESB, low school readiness and maternal mental health distress were associated with a higher rate of growth, although not sufficient to close the receptive vocabulary gap for children with and without these risks at 8 years. Male gender had negligible effects on receptive vocabulary ability at intercept and slope. The results of meta-analyses concur with the results of the present study, that gender differences in language ability are negligible [59]. Convergent evidence for negligible gender differences in language ability is that the prevalence of language impairment in the general population is 8% for boys and 6% for girls and not significantly different [65]. The only risk factor associated with a slower rate of growth was socioeconomic area disadvantage. At 8 years, the gap between children with and without area disadvantage was equivalent to eight months of receptive vocabulary growth. Thus the gap between children with and without area disadvantage widened in the early years of school. The higher rate of growth for children with maternal NESB and maternal mental health distress narrowed the receptive vocabulary gap at 8 years to less than 4 months. The higher rate of growth for children with low school readiness between 4-8 years more than halved the gap of 15 months at 4 years compared to a six-month gap at 8 years. The remaining risks associated with lower receptive vocabulary ability at 4 years; low birthweight, low maternal education, low family income, not read to at home, four or more siblings, low maternal parenting consistency and high child temperament reactivity were not associated with higher or lower receptive vocabulary growth from 4-8 years. For children with these risk factors, the receptive vocabulary gaps apparent at 4 years were still evident at 8 years. These results showed that receptive vocabulary disparities evident before the start of formal school persisted over time, although the extent of the disparities at 8 years varied across risk types. The largest gap in receptive vocabulary abilities at 8 years was between children with and without socio-economic area disadvantage. The concerning pattern of change over time for these children was slower growth and increasing disparity. This is consistent with the findings from a study of 10,366 children in the U.S. Children of the National Longitudinal Survey of Youth 1979 (CNLSY). The study used growth curve models to investigate social class and race differences in receptive vocabulary growth trajectories from 3 -13 years. The highest rate of receptive vocabulary growth was in the preschool years and social class and race disparities in receptive vocabulary abilities that were evident in the preschool years did not close (or widen) in the school years. Their findings pointed towards the preschool years as the critical time for intervention because this was the period of highest growth and also the period when disparities emerged [28]. Even with pre-school education between 3-5 years, the SES-gap in children's language abilities has been shown to persist. While pre-school education did not close the gap, the SES-gap widened for low-SES children who did not participate in pre-school education [37].

Implications for Prevention and Intervention
With important exceptions, these being low school readiness and socio-economic area disadvantage, none of the remaining predictors would suggest that a preventive strategy or intervention to address vocabulary development (or language development more widely) should focus on a single risk exposure. Instead, the pattern of weak adjusted effect sizes would suggest a broad-based or universal ''scaffold'' of developmental opportunity and interventions able to address multiple risks would have greater merit. With respect to intensity of intervention, like the predictors, these opportunities and interventions would need to operate close to the child, be available fairly regularly and over an extended period of time. In this regard, we suspect consistency of regular but modest participation in these developmental opportunities is more critical than short, high intensity interventions. The issue to then consider is the extent to which such developmental strategies are adjusted and made universally proportionate (or not) to address risks associated with socioeconomic area disadvantage and low school readiness.

Future Research
The results of this study showed a mixed picture of risks for low receptive vocabulary ability between 4-8 years. The widest receptive vocabulary gaps between risk exposed and non-risk exposed children were evident at 4 years, and with the exception of socio-economic area disadvantage, the gaps narrowed through the early years of school. As the children in this study progress through the school grades, it will be important for future research will determine: (1) Risk factors for persistently low receptive vocabulary ability from 4 -8 years; (2) the extent to which low receptive vocabulary ability projects to low literacy ability and (3) and the configuration of risks for low literacy ability.