Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The INTERGROWTH-21st Project Neurodevelopment Package: A Novel Method for the Multi-Dimensional Assessment of Neurodevelopment in Pre-School Age Children

  • Michelle Fernandes ,

    Affiliation Nuffield Department of Obstetrics & Gynaecology, and Oxford Maternal & Perinatal Health Institute, Green Templeton College, University of Oxford, Oxford, United Kingdom

  • Alan Stein,

    Affiliation Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, United Kingdom

  • Charles R. Newton,

    Affiliations Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, United Kingdom, KEMRI/Wellcome Trust Research Programme, Kilifi, Kenya

  • Leila Cheikh-Ismail,

    Affiliation Nuffield Department of Obstetrics & Gynaecology, and Oxford Maternal & Perinatal Health Institute, Green Templeton College, University of Oxford, Oxford, United Kingdom

  • Michael Kihara,

    Affiliations Department of Psychology, United States International University, Nairobi, Kenya, KEMRI/Wellcome Trust Research Programme, Kilifi, Kenya

  • Katharina Wulff,

    Affiliation Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, United Kingdom

  • Enrique de León Quintana,

    Affiliation Centro de Tecnología e Innovación, Mexico City, Mexico

  • Luis Aranzeta,

    Affiliation Centro de Tecnología e Innovación, Mexico City, Mexico

  • Aureli Soria-Frisch,

    Affiliation Neuroscience Business Unit, Starlab Barcelona, SL, Barcelona, Spain

  • Javier Acedo,

    Affiliation Neuroscience Business Unit, Starlab Barcelona, SL, Barcelona, Spain

  • David Ibanez,

    Affiliation Neuroscience Business Unit, Starlab Barcelona, SL, Barcelona, Spain

  • Amina Abubakar,

    Affiliations Department of Psychology, Lancaster University, Lancaster, United Kingdom, KEMRI/Wellcome Trust Research Programme, Kilifi, Kenya

  • Francesca Giuliani,

    Affiliation Dipartimento di Scienze Pediatriche e dell' Adolescenza, SCDU Neonatologia, Università di Torino, Turin, Italy

  • Tamsin Lewis,

    Affiliation Nuffield Department of Obstetrics & Gynaecology, and Oxford Maternal & Perinatal Health Institute, Green Templeton College, University of Oxford, Oxford, United Kingdom

  • Stephen Kennedy,

    Affiliation Nuffield Department of Obstetrics & Gynaecology, and Oxford Maternal & Perinatal Health Institute, Green Templeton College, University of Oxford, Oxford, United Kingdom

  • Jose Villar,

    Affiliation Nuffield Department of Obstetrics & Gynaecology, and Oxford Maternal & Perinatal Health Institute, Green Templeton College, University of Oxford, Oxford, United Kingdom

  •  [ ... ],
  • for the International Fetal and Newborn Growth Consortium for the 21st Century (INTERGROWTH-21st)

    Membership of the International Fetal and Newborn Growth Consortium for the 21st Century (INTERGROWTH-21st) is provided in the Acknowledgments.

  • [ view all ]
  • [ view less ]

The INTERGROWTH-21st Project Neurodevelopment Package: A Novel Method for the Multi-Dimensional Assessment of Neurodevelopment in Pre-School Age Children

  • Michelle Fernandes, 
  • Alan Stein, 
  • Charles R. Newton, 
  • Leila Cheikh-Ismail, 
  • Michael Kihara, 
  • Katharina Wulff, 
  • Enrique de León Quintana, 
  • Luis Aranzeta, 
  • Aureli Soria-Frisch, 
  • Javier Acedo



The International Fetal and Newborn Growth Consortium for the 21st Century (INTERGROWTH-21st) Project is a population-based, longitudinal study describing early growth and development in an optimally healthy cohort of 4607 mothers and newborns. At 24 months, children are assessed for neurodevelopmental outcomes with the INTERGROWTH-21st Neurodevelopment Package. This paper describes neurodevelopment tools for preschoolers and the systematic approach leading to the development of the Package.


An advisory panel shortlisted project-specific criteria (such as multi-dimensional assessments and suitability for international populations) to be fulfilled by a neurodevelopment instrument. A literature review of well-established tools for preschoolers revealed 47 candidates, none of which fulfilled all the project's criteria. A multi-dimensional assessment was, therefore, compiled using a package-based approach by: (i) categorizing desired outcomes into domains, (ii) devising domain-specific criteria for tool selection, and (iii) selecting the most appropriate measure for each domain.


The Package measures vision (Cardiff tests); cortical auditory processing (auditory evoked potentials to a novelty oddball paradigm); and cognition, language skills, behavior, motor skills and attention (the INTERGROWTH-21st Neurodevelopment Assessment) in 35–45 minutes. Sleep-wake patterns (actigraphy) are also assessed. Tablet-based applications with integrated quality checks and automated, wireless electroencephalography make the Package easy to administer in the field by non-specialist staff. The Package is in use in Brazil, India, Italy, Kenya and the United Kingdom.


The INTERGROWTH-21st Neurodevelopment Package is a multi-dimensional instrument measuring early child development (ECD). Its developmental approach may be useful to those involved in large-scale ECD research and surveillance efforts.


Approximately one in ten children have impairments in neurodevelopment, which manifest as disturbances in cognition, auditory processing, vision, behavior, language, attention, motor skills and sleep [1]. Research into the epidemiology of childhood neurodisability has revealed three consistent global trends. First, the prevalence of childhood neurodisability varies widely across geographical locations with higher prevalences reported from low- and middle-income countries (LMICs; 5.3 to 24.3 per 1,000 children) than high-income countries (HICs; 2.0 to 4.5 per 1,000 children) [2]. Second, the reported prevalence of mild levels of neurodisability is consistently higher than that of severe neurodisability in both LMICs and HICs [2]. Third, the prevalence of childhood neurodisability in LMICs is much more variable than in HICs. Among HICs, rates of severe childhood neurodisability range between 2 and 5 per 1,000 children; in LMICs they range from 5.9 per 1,000 in Dhaka, Bangladesh to 24.3 per 1,000 in Karachi, Pakistan [2] (Fig. 1).

Figure 1. Distribution of the prevalence of severe neurodevelopmental impairment per 1,000 children in select countries.

Distribution of prevalence of severe neurodevelopmental impairment per 1,000 children in select countries. Taken from Durkin, The Epidemiology of Developmental Disabilities in Low-Income Countries, Mental Retardation and Developmental Disabilities Research Reviews p. 207 (2002) [2].

A number of genetic, biological and environmental risk factors, operating in the pre- or postnatal period, or both, have been proposed [3]. These are summarized in Table 1. More often than not, these factors do not act in isolation but interact with each other resulting in a multi-factorial etiological matrix in which some, but not all, risk factors may be modifiable. Irrespective of its precise etiological determinants, childhood neurodisability is one of the most important precursors of psychopathology, poor social functioning and educational disadvantage in later life [3], [4]. Attention and behavior problems in early childhood have been associated with higher rates of oppositional/defiant behavior, conduct problems, personality disorder, depression, antisocial lifestyles and suicide attempts in adolescence and adulthood [4][7]. Longitudinal studies have also found children with neurodisability to be less likely to be living independently, in paid employment and have cohabitating relationships as adults compared to controls [8]. These effects are most pronounced in those born extremely preterm, those with very low birth weights (<1,000 g) and those with severe neurodevelopmental impairments manifesting early in life [9], [10].

Table 1. Summary of major risk factors for early childhood neurodisability.

Despite the long-term significance of childhood neurodisability and benefit of early diagnosis and treatment in young children, there is limited scientific and clinical agreement on two issues central to the topic: (i) international standards of optimal early child development (ECD) and (ii) standardized, internationally applicable tools to measure these outcomes in young children [11], [12]. Although a sizable body of literature on ECD and childhood neurodisability exists, there is limited comparability among the many studies because of the heterogeneity in methods employed in assessments, the wide ranges of ages assessed, and the focus on specific domains of neurodevelopment rather than global outcomes [11], [12]. Two important reasons for this limited comparability among studies are the lack of: (i) a standardized, robust methodology that is easy and rapid to administer in the field, and that uses a combination of psychological, clinical and neurophysiological tests to measure multiple dimensions of neurodevelopment, and (ii) large, international, prospective cohorts on whom information about potential risk and protective factors for ECD (such as fetal growth, home environment, maternal health and early nutrition) is available.

In the context of ECD research, it is important to highlight the many conceptual and methodological challenges in investigating the epidemiology of neurodevelopmental disturbances in young children in a large, international cohort (Fig. 2). First, little is known about childhood neurodisability in LMICs despite the high prevalence of risk factors. In addition, the children most at risk in these settings are unlikely to have been assessed and, therefore, may not be represented in prevalence estimates [2]. Second, although severe disorders may be recognized during infancy, it is difficult to reliably diagnose impairments in speech, cognition or behavior before 3 to 4 years of age [2]. Nevertheless, there is increasing evidence that the earlier the identification and treatment of childhood neurodisability, the better the opportunity for developmental change [2]. Third, the wide normal variation in neurodevelopment among children, simultaneous delays in multiple areas of development, and the logistic implications of carrying out long-term surveillance makes the selection and implementation of an assessment complicated [13]. Fourth, assessments that yield information of clinical prognostic importance are often lengthy, require specialist training and contain culture-specific items making them difficult to implement in the field, especially in international settings [2], [13]. Fifth, studies employing neuroimaging and neurophysiological measurements (e.g. electroencephalography, magnetoencephalography, functional magnetic resonance imaging and eye tracking), although free from cultural biases and of sufficient sensitivity to measure subtle differences in neurodevelopment, are often limited to one domain, are resource intensive and are extremely difficult to implement in the field and in very young children.

Figure 2. Conceptual and methodological challenges in the assessment of early child development in the context of the INTERGROWTH-21st Project.

Conceptual and methodological challenges in the multi-dimensional assessment of early childhood neurodevelopment across low-, middle- and high-income settings (black boxes) and the unique opportunity provided by the INTERGROWTH-21st Project to explore some of these issues (red boxes).

Background: The INTERGROWTH-21st Project

INTERGROWTH-21st (The International Fetal and Newborn Growth Consortium for the 21st Century) is a multicenter, multiethnic, population-based project, conducted between 2008 and 2013, in eight study sites: the cities of Pelotas (Brazil), Turin (Italy), Muscat (Oman), Oxford (UK) and Seattle (USA); Shunyi County, Beijing (China); the central area of Nagpur (India), and the Parklands suburb of Nairobi (Kenya) [14]. Its primary aim was to study growth, health, nutrition and neurodevelopment from <14+0 weeks of gestation to 2 years of age, using the same conceptual framework as the WHO Multicentre Growth Reference Study [15], so as to produce prescriptive, international growth standards and a new phenotypic classification of the intrauterine growth restriction and preterm birth syndromes.

The women whose babies contributed to the construction of the international standards describing optimal fetal and newborn growth were selected on the basis of two factors: they were living in environments where the exposure to risk factors known to affect fetal growth was minimal, and they themselves were healthy, non-obese and similarly free of any factors or conditions affecting fetal growth. The study methodology has been published elsewhere in detail [14]. In brief, clinical, nutrition and anthropometric outcomes were measured in mothers and children during pregnancy and at birth, during the first year of life, and at 24 months after birth. The inclusion and exclusion criteria, study protocol, ultrasound manual and other documents are available at The recruitment of a large, optimally healthy, international, population-based sample; the use of robust, standardized procedures in all assessments (ultrasound, clinical and anthropometric) including a system of random evaluation and repetition to achieve quality control, and the use of a centralized data management system make the INTERGROWTH-21st birth cohort distinct in its methodology and scope.

In an extension to the 24 month follow-up of the INTERGROWTH-21st Project, an assessment of early child neurodevelopment was included based on the rationale that the Project provided a unique opportunity to profile neurodevelopment outcomes in an optimally healthy sample of children whose nutritional and health needs had been fully met during the first 1,000 days of life (Fig. 2). It is envisioned that the data from this study will add to the current understanding of neurodevelopment trajectories during early childhood by comparing different neurodevelopment functions across a range of contexts and settings as well as relating these functions to intrauterine growth. The Project also provided an opportunity to develop and test a novel, mixed methodology, global approach to the assessment of early child development using objective, robust techniques, which are quick and easy to administer in both high- and low-resource settings by non-professional staff.

Towards this end, a multi-dimensional, mixed methodology, screening package to assess neurodevelopment in international populations of preschoolers was developed. The Package was designed to be easy to administer in the field, in 35 to 45 minutes, by non-specialist research staff. In keeping with the methodology and scope of the INTERGROWTH-21st Project, the Package was designed to measure neurodevelopment in children in an ‘optimally healthy' cohort in accordance with the methodological concepts of standardization, integrated quality assessments and centralized data management. The Package is in use in study sites in Brazil, Kenya, India, Italy and the UK where more than 700 children have been assessed to date. The objectives of this paper are to: (1) outline the methodology leading to the development of the Package and the challenges involved in this process and (2) describe the characteristics of the INTERGROWTH-21st Neurodevelopment Package.

Materials and Methods

Study sites and participants

Five INTERGROWTH-21st Project sites are implementing the INTERGROWTH-21st Neurodevelopment Package. These are the Universidade Federal de Pelotas, Pelotas, Brazil; Aga Khan University Hospital, Nairobi, Kenya; Ketkar Hospital, Nagpur, India; Universita' di Torino, Turin, Italy; and the John Radcliffe Hospital, Oxford, UK. A total of 4607 optimally healthy mothers were recruited from these sites [16], [17], their characteristics are presented in Table 2 [17].

Table 2. Inclusion criteria to be met by mothers at booking for recruitment in the INTERGROWTH-21st Project.


Participants in the INTERGROWTH-21st Project were informed about the study procedures, and voluntarily consented to participation. Written consent was obtained from all adult participants. Parents/guardians provided written informed consent on behalf of their children enrolled in the Project. Participants (or their parents/guardians in the case of children) provided written consent for their clinical records to be used in the Project. The INTERGROWTH-21st Project was approved by the Universidade Federal de Pelotas, Faculdade de Medicina comitê de ética em pesquisa (Ref: OF.051/09), Beijing Obstetric and Gynaecology Hospital, Capital Medical University; the Indian Council of Medical Research, the Indian Ministry of Health and Family Welfare and the Institutional Ethics Committee, Ketkar Hospital, Nagpur (Ref: 5/7/314/2008-RHN); Servizio Sanitario Nazionale – Regione Piemonte, Aziende Ospedaliere OIRM/S.Anna, Oridine Mauriziano di Torino, Comitato Etico Interaziendale (Ref: G9947/CEI/C.27.2); the Aga Khan University Health Research Ethics Committee (Ref: AKU-09-106), the Oman Research and Clinical Studies Committee (Ref: MH/DGHA/DNCD/R&C/47/09), the Oxfordshire Research Ethics Committee ‘C’, United Kingdom (Ref: 08/H0606/139) and the University of Washington, Human Subjects Division, Human Subjects Review Committee (Ref: 36341).

Conceptual issues guiding the INTERGROWTH-21st Project Neurodevelopment Assessment

Figure 2 illustrates the conceptual framework behind the rationale to include an assessment of neurodevelopment in the INTERGROWTH-21st Project. Prior to the selection of a neurodevelopment measure for the Project, an Infant Development Group consisting of international experts in pediatric medicine, pediatric neurology, child development, child psychology, neuroscience, electroencephalography, vision, obstetric medicine and epidemiology was constituted. The Group's remit was to review the cohort's characteristics and the requirements of the Project so as to develop a set of criteria for a neurodevelopment measure to meet the Project's needs (see Table 3). As a preliminary step, the Group identified a number of conceptual, methodological and logistical challenges to the selection, design or development of a neurodevelopment measure in a large, multi-site study of young children. These are listed in Table 4.

Table 3. Criteria to be met by a neurodevelopment measure for inclusion in the INTERGROWTH-21st Project.

Table 4. Challenges to the selection, design and development of a neurodevelopment assessment for use in young children and in multiple international settings in the INTERGROWTH-21st Project.

Selection of a Neurodevelopment Tool for the INTERGROWTH-21st Project

The Infant Development Group selected a measure of neurodevelopment at 24 months using two sequential approaches, and the results of approach 1 informed the design and results of approach 2. The methods and results of each approach are described in the sections that follow.

Approach 1: Review of existing psychometric tools for assessing neurodevelopment at 24 months.

The Infant Development Group compiled a list of the most well-established psychometric tools to measure neurodevelopment outcomes, including (but not limited to) cognition, language skills, motor skills and behavior, at 24 months in an international sample. This age was selected as it was found to be the earliest at which: (i) neurodevelopment is not confounded by transient neurological syndromes of prematurity and (ii) conventionally used developmental instruments, such as the Bayley Scales of Infant Development, have been found to possess an acceptable level of medium and long term predictive validity [18]. The list was compiled in two phases.

Phase 1 involved a search of online databases (PubMed, PsycINFO and Embase) using search terms for cognition, language, motor skills and behavior. These are listed in Table S1 in File S1.

During Phase 2, the Infant Development Group contacted a number of child development experts and authors of child psychometric measures from Bangladesh, Brazil, India, Italy, Kenya, the Netherlands, UK and USA. The experts were asked to list the available measures they were familiar with to assess cognition, language, motor skills and behavior at 24 months.

Phases 1 and 2 (Figure S1 in File S1) resulted in a list of 47 tools (Table S2 in File S1). Their purpose, the age ranges at which they may be used, the domains assessed and the languages the tools are currently available in are summarized in Table S2 in File S1. The Infant Development Group critically analyzed these tools against the criteria listed in Table 3. Thirteen fulfilled some of the Project's criteria. Information on the characteristics, purpose, and strengths and weaknesses (in to the context of the INTERGROWTH-21st Project) of each tool is presented in Table S3 in File S1.

The 13 tools were critically analyzed against the Project's criteria by a scientific advisory panel of independent international experts in the fields of neurodevelopment, pediatric neurology, child psychiatry, child psychology, vision, neuroscience, ophthalmology, sleep research, perinatal medicine and epidemiology at a meeting organized by the Infant Development Group.

Results of Approach 1: The findings of Phases 1 and 2, as reviewed by the scientific advisory panel, resulted in four key conclusions. First, the panel acknowledged that no one tool fulfilled all the Project's criteria. Second, the panel found 5 of the 13 tools to individually fulfill most of the Project's criteria and recommended the use of these tools in their most sensitive and reliable domains of neurodevelopment (Table 5). These were the Bayley Scales of Infant Development – Third Edition (BSID) [19], [20], the Rapid Neurodevelopment Assessment (RNDA) [21], the Malawi Development Assessment Tool (MDAT) [22], the Griffiths Mental Development Scale [23], [24] and the Child Behaviour Checklist (CBCL) [25], [26]. Third, the panel recommended that while it is almost impossible for any psychometric instrument measuring ECD to be free from cultural biases, it is possible to undertake a decentering approach by selecting items that are suitable for middle- and upper-class families in high-, middle- and low-income contexts [27]. In this approach, no single cultural group provides the initial items sets. Instead the test development process sets out to select items that are equally familiar to the different target groups and excludes any materials or content that seems unfamiliar to any of the target groups. The rationale for this approach was that in the current global situation, children from most middle- and upper-class families across settings are exposed to a set of similar resources (such as cups, spoons, dolls, crayons, shoes, stacking blocks) and therefore it is possible to pool this common set of exposures to select a set of items that most middle- and upper-class children across high-, middle- and low-income settings can perform. Finally, the panel suggested a package-based approach combining the use of psychometric tools with neurophysiological measures. The panel recommended that the methodology, scope and sensitivity of such a package must be tailored to the needs of the INTERGROWTH-21st Project.

Table 5. Neurodevelopment tools selected following review of candidate tools by scientific advisory panel.

Approach 2: A multi-dimensional package for the assessment of neurodevelopment at 24 months.

Based on the recommendations of the scientific advisory panel, the Infant Development Group undertook a structured, package-based approach to construct a neurodevelopment assessment protocol customized to the needs of the Project and the characteristics of the study population. The design of the Package was also guided by its potential for use as a population-based screening instrument for ECD in the context of LMICs and HICs. The process leading to the development of the Package was undertaken in five stages: (1) the neurodevelopment outcomes we wished to assess were grouped into similar domains; (2) a set of domain-specific criteria to be fulfilled by each neurodevelopment measure were developed; (3) the candidate measures that fulfilled the criteria for each domain were listed, based on the results of the systematic review, and discussions with experts and the scientific advisory panel; (4) the list of domain-specific, candidate measures was critically analyzed against the criteria for each domain and the most appropriate measure for each domain was selected by the scientific advisory panel and the Infant Development Group; and (5) the various measures were incorporated into a singular assessment package with domain-specific protocols.

Results of Approach 2: 1) Categorization of desired neurodevelopment outcomes into domains

Ten neurodevelopment outcomes were listed as priorities for the Project: cognition, language, gross motor skills, fine motor skills, behavior, attention, emotional reactivity, overall visual function, cortical auditory processing, and sleep. The rationale for the selection of these outcomes was each outcome has been previously found to have long-term predictive ability [7], [8], [28][30] and well-established psychometric, neurophysiological and/or clinical tests are available to measure them. The 10 outcomes were grouped into four domains based upon the balance between the technical, conceptual and logistical feasibility of assessing each domain on an international sample in accordance with the criteria and requirements of the INTERGROWTH-21st Project. The domains are:

  1. Vision: Visual acuity and contrast sensitivity
  2. Neuropsychological function: Cognition, language, gross motor skills, fine motor skills, behavior, attention, emotional reactivity
  3. Auditory function
  4. Sleep

Language was included in domain 2 (neuropsychological function) rather than domain 3 (auditory function) in keeping with the contextual framework of the candidate measures. Furthermore, the expert panel felt it would be logistically more feasible to administer items on receptive and expressive language together with the cognitive, motor and behavior items and that this practice would contribute to a greater fluidity and continuity of the assessment for both the child and the assessor.

2) Domain-specific test selection criteria

A set of criteria to be fulfilled by a measure (psychometric, neurophysiological, clinical or a combination of these) for each domain was compiled by the Infant Development Group and approved by the scientific advisory panel. The test selection criteria for each domain are conceptually similar to the overall Project criteria depicted in Table 3. These are listed in detail in Tables S5–S8 in File S1.

3) Candidate measures for each domain

3.1) Vision

Conventional measures of vision in adults, such as the Snellen chart, require levels of comprehension, concentration and verbal skills that a young child may not possess. Therefore, in addition to the criteria listed in Table S4 in File S1, attention was paid to the candidate measures below being suitable for young children and free from biases introduced due to potential cognitive and language difficulties. These candidate measures, grouped as tests of visual acuity and contrast sensitivity, are listed below:

  1. Measures of Visual Acuity

Tests of visual acuity measure the child's ability to discern objects at a given distance according to a fixed standard. These include:

    1. Tests using subjective/behavioral methods based on the forced choice preferential looking (FPL) method, conceived by David Teller [31]. In this method, the observer presents the child with a display, half of which is plain while the other half contains a pattern. The child tends to look at the pattern if he/she can resolve it. This technique becomes a “forced choice” when the observer has to decide, based on the child's head and eye movements, where the stimulus is located [32]. The threshold is defined as when the child makes a “clear” look. These methods consist of measures of resolution acuity (Teller acuity test [31], Keeler acuity test [33], Cardiff test [34], and grating and operant preferential looking techniques [35]) and measures of recognition acuity (Kay pictures, LEA symbols, Cambridge crowding cards, Landolt C-s and crowded Landolt C-s) [36], [37].
    2. Experiments using objective methods involving the measurement of pattern steady-state visual evoked potentials (VEP) and variations of the same (for example, sweep VEP) using electroencephalographic techniques [36].
  1. Measures of Contrast Sensitivity

Tests of contrast sensitivity assess the minimum contrast necessary for a subject to detect sine wave gratings of different spatial frequencies [35]. The measurement of contrast sensitivity is thought to reflect the integrity of the entire visual pathway, and is considered the “single most complete measure of human spatial vision” [37]. The measures include:

    1. Tests using subjective/behavioral methods (Cardiff tests, Alley-running 2AFT staircase, Vistech charts and Pelli-Robinson Charts) [37].
    2. Tests using objective methods, namely those measuring VEPs [36].

3.2) Neuropsychological function:

The findings of approach 1 were applied to the selection of candidate tools for the assessment of cognition, language, gross motor skills, fine motor skills, behavior, attention and emotional reactivity. Approach 1 has been described in detail above and the results of this process are enumerated in Tables S3 and S4 in File S1, and Table 5. Of the 47 measures initially identified, 13 were found to fulfill some of the Project's criteria, the top 5 of which were selected as candidate measures for use in the INTERGROWTH-21st Project. These were the BSID [19], [20], the RNDA [21], the MDAT [22], the Griffiths scale [23], [24] and the CBCL [25] and were selected on the basis of their functionality to sensitively assess cognition, language, gross motor skills, fine motor skills and behavior at 24 months of age.

3.3) Auditory function:

There were five candidate tests for the assessment of auditory function as per the domain-specific criteria listed in Table S6 in File S1. These were audiometry, acoustic reflex threshold measurement, otoacoustic emission measurement, communication inventories and the measurement of evoked response potentials (ERPs) to auditory stimuli [38].

3.4) Sleep:

The candidate tools for sleep were classified by recording technique, as follows:

  1. Caregiver reports: These include sleep diaries and questionnaires. In the former, the caregiver maintains a diary recording of the child's activities, sleep and wake times and napping [39]. Sleep questionnaires assess the quality and frequency of the child's sleep [39], [40].
  2. Neurophysiological techniques: These include polysomnography (PSG) and actigraphy. PSG is the comprehensive measurement of the various biophysical changes that occur during sleep and involves the simultaneous recording of brain and heart signals, eye movements and muscle activity from multiple channels placed on the participant's head, face, limbs and chest [41]. The test is conventionally conducted in a sleep laboratory. Actigraphs are watch-like devices placed on the wrist or ankle to record movement [42]. The data are analyzed for periods of activity and inactivity to estimate sleep patterns and circadian rhythms [43].

4) Selection of test for each domain

4.1) Vision

The candidate tools for visual acuity and contrast sensitivity were assessed according to the specific criteria listed in4 in File S1. FPL techniques, particularly Cardiff Tests, Teller cards and Keeler cards, met the Project's criteria better than objective (neurophysiological) techniques (Table 6). A further analysis of the Cardiff, Teller and Keeler tests, presented in Table 7, showed the Cardiff tests to be the most appropriate for the purposes of the INTERGROWTH-21st Project because (1) they measure both visual acuity and contrast sensitivity and (2) the images are more attention-grabbing and engaging for toddlers than the black and white pictures of the Teller/Keeler cards [33], [34], [44]. The Cardiff acuity and contrast sensitivity tests [34], [45] were, therefore, selected as the vision assessment of choice for the neurodevelopment package of the INTERGROWTH-21st Project.

Table 6. Analysis of candidate vision tests against selection criteria.

Table 7. Analysis of candidate FPL techniques to assess vision in young children.

4.2) Neuropsychological function

The analysis of the 5 candidate measures; namely the BSID [19], the RNDA [21], the MDAT [22], the Griffiths Scale [23], [24] and the CBCL [25]; against the domain specific criteria for the INTERGROWTH-21st Project (Table S5 in File S1) is presented in Table 8. As none of the measures were found to meet all the Project's criteria, the scientific advisory panel and the Infant Development Group concluded that the Project should use a combination of the candidate tools, by integrating the strengths of some measures to complement the weaknesses of others. A 53-item tool, the INTERGROWTH-21st Neurodevelopment Assessment (INTER-NDA; Table 9), was designed by including 30 cognitive, language and motor items; 6 behavioral items and 17 items measuring attention and emotional reactivity. These items were selected by international experts in child development who had extensive experience with the candidate measures and with the development of psychometric tools. The rationale for the selection of each individual item was based on the experts' agreement on (i) its suitability for the 22–26 month age group; (ii) its appropriateness for use in international populations using the decentering approach described earlier; (iii) its ability to be administered reliably and (iv) its ability to be adapted across cultural contexts. The administration and scoring of the items was adapted for the purpose of the Project and care was taken to ensure conceptual equivalence was maintained between the items of the INTER-NDA and those of the 5 candidate measures of neuropsychological function. A 5-point Likert scale was selected to objectively record the child's performance on each item with a view to characteristic levels of achievement for each item across a spectrum (rather than a binary pass/fail outcome measure). As raw scores for the candidate measures were not freely available it was not possible to select items following a statistical analysis of item weights. The assessment of behavior encompasses the assessor's observation of child's overall behavior during the assessment, and does not include reports on specific constructs of behavior such as temperament and empathy. The operation manual and training materials of the INTER-NDA are freely available at

Table 8. Analysis of candidate cognitive, language, motor, behavior and attention tests against selection criteria.

Table 9. Constituent items and scoring sheet of the INTERGROWTH-21st Neurodevelopment Assessment.

4.3) Auditory function

The candidate tools for auditory function were analyzed according to the domain specific criteria (Table 10, Table S6 in File S1). Audiometry and the measurement of auditory ERPs were the only tests found to: (i) be sensitive enough to detect subtle differences in auditory function in a cohort of healthy children and (ii) assess the entire auditory pathway [46]. Audiometry was, however, found to be difficult to perform reliably in children less than three years of age [46], [47]. Although the measurement of acoustic reflex thresholds and otoacoustic emissions detect the most common causes of hearing impairment in children they solely assess middle ear function, and not the entire auditory pathway [48]. In addition, these tests require a probe to be placed in the child's ear for up to 3 minutes, which would pose considerable difficulties with compliance in the field [48].

Table 10. Analysis of candidate auditory function tests against selection criteria.

The measurement of auditory ERPs using the novelty oddball ERP task [49] was the method of choice for the Project. This technique has been validated in young children and tests their ability to discriminate novel stimuli embedded in a train of frequent stimuli [49]. It involves the presentation of frequent, infrequent and novel auditory stimuli, and the measurement of the amplitude and latencies of distinct EEG waveforms to each stimulus. The waveforms of interest include the P1 (a positive peak around 100 ms after stimulus onset), the N2 (a negative peak around 200 ms) and the P3a (a positive peak around 250–350 ms).

However, measuring auditory ERPs involves recording EEG signals from the child's scalp. This is complicated because it requires specialist training, sophisticated equipment, long set-up times and possibly sedation of the child [46]. The Infant Development Group had specific concerns about the co-operation of children, the consent of mothers and the ease of implementation of this technique in the field, especially in low resource settings, with conventional EEG apparatus and methods. To meet these challenges, the Group designed a novel method of measuring auditory ERPs in children employing a ‘strap and click’ design using a wireless EEG cap and dry electrodes [50].

4.4) Sleep

The candidate tools to measure sleep in children were analyzed according to domain specific criteria (Table S7 in File S1, Table 11). Actigraphy was found to be the only method of assessment that was: (i) of sufficient sensitivity to measure subtle differences in sleep in a healthy cohort of children, (ii) objective, (iii) minimally disruptive to the daily routine of the mother and child, and (iv) able to record sleep patterns and daytime physical activity in a natural, home based setting [42], [43]. However, leading actigraphy and sleep experts on the scientific advisory panel felt that actigraphy alone may not adequately capture the child's sleeping pattern as the information collected is limited by the number of days that the child wears the actiwatch. It was therefore decided that the use of a well-established maternally reported sleep questionnaire, such as the Brief Infant Sleep Questionnaire [51], in conjunction with actigraphy may be a more reliable method of assessing sleep and daytime activity patterns in children than either alone.

Table 11. Analysis of candidate sleep measures against selection criteria.

Results and Discussion

The INTERGROWTH-21st Neurodevelopment Package

The INTERGROWTH-21st Neurodevelopment Package describes a multi-dimensional, mixed-methodology approach to assessing neurodevelopment in preschoolers. The Package is the result of the combined efforts of the Infant Development Group and the scientific advisory panel to develop a holistic, scalable instrument to measure ECD that is easy to implement across different contexts and settings for population-based research and screening purposes. The Package, in its entirety, meets all the Project's requirements listed in Table 3, while each individual component of the Package fulfills domain specific criteria listed in Tables 6, 8, 10 and 11.

Characteristics of the Package

The package consists of four constituent assessments (Fig. 3). These include a test of vision; a measure of cognition, language, motor skills, behavior, attention and emotional reactivity; a test of cortical auditory processing and a measure of sleep patterns and daytime physical activity. The package takes between 35 and 45 minutes to administer and consists of a combination of psychological, neurophysiological and clinical tests, and caregiver reports.

Key Features

The important features of the Package are that it:

  1. Is a multi-dimensional assessment of neurodevelopment in preschoolers using a mixed methodology approach
  2. Was developed for, and is currently in use in, middle- and upper-class children across low-, middle- and high- income settings
  3. Can be delivered in the field by non-specialist staff with minimal training
  4. Yields objective, rather than subjective, information about a child's performance
  5. Incorporates the assessment of entire vision and auditory pathways rather than individual anatomical/physiological components of these pathways.

Implementation of the INTERGROWTH-21st Neurodevelopment Package


A number of components of the Package were automated to facilitate rapid administration, real-time data upload and integrated quality checks in the data collection system. A tablet-based data collection application, the NeuroApp, was designed for this purpose. The NeuroApp is an intuitive user interface with integrated operation manuals. It consists of: (a) a biography page, (b) a neurodevelopment home page providing access to the tests for the four constituent domains, and (c) vision, INTER-NDA, cortical auditory processing, and sleep pages on which the child's performance on each individual test are recorded. Integrated quality checks ensure that no more than one option is selected for each item and any items with missing data are flagged before the data upload stage.

To ensure that non-specialist research staff can carry out the cortical auditory processing experiment, a ‘strap-and-click’ design was developed. This method involved hardware and software customization of the EEG system (Enobio, Neuroelectrics, Barcelona) to automate the presentation of the auditory stimuli and the recording, termination and upload of the corresponding electrophysiological data following a simple initial set-up.

The Package's tri-axial actigraph of choice, marketed as the Motionwatch 8 [52], is reliable under harsh climate conditions, waterproof, easy to set up and use, and has a relatively long battery life. It does not require calibration and data are easy to download onto a computer using the Motionware software. The data are uploaded onto a dedicated sleep section on the Project's centralized database using a standard file transfer protocol.

In order to ensure that non-specialist research staff involved in the study are capable of conducting psychological, neurophysiological and clinical tests to the same degree of proficiency as specialist research staff a comparison of the data quality obtained between the two will be carried out following the conclusion of data collection. Although there is considerable doubt among the scientific community about the validity of neurodevelopment assessments being carried out by non-specialist research staff with minimal training, the INTERGROWTH-21st Project offers the opportunity to test the hypothesis that with targeted training, user-friendly apparatus and a high degree of automation (including integrated quality checks), non-specialist field workers may be able to achieve levels of proficiency at power with specialist research staff. We believe this would be an important step ahead in the field where the lack of trained personnel is often the rate-limiting step in the scalability of ECD screening packages, especially in LMICs.

Database organization.

Neurodevelopment data from all study sites are stored securely in a dedicated neurodevelopment section on the Project's centralized, cloud-based database. Neuropsychological and vision data (in.txt format) are uploaded instantaneously from the NeuroApp following the completion of an assessment. EEG data (, info and.easy formats) are uploaded automatically from the EEG software. Actigraphy data (in.awd format) are uploaded through a standard file transfer protocol. The database for the Project was designed, and is maintained, by MedSciNet, (Stockholm, Sweden) and managed by local and coordinating unit data managers with varying levels of access and administrative rights. Information on the organization of the database, data access and security is available elsewhere [53].


The Infant Development Group devised the training strategy for the Package by taking into consideration the requirements of each site. The training session, which lasts 3½ days, is conducted at each site by the coordinator of the Infant Development Group. Day 1 consists of 6 sessions where trainees are introduced to, and trained in, the various theoretical and practical aspects of Package using a combination of direct instruction, small group interaction, role-play and problem-based learning. Day 2 consists of 9 sessions – 6 refresher sessions, a demonstration and 2 live sessions. Day 3 consists of 3–4 live sessions and a video based standardization session to assess inter-rater and test-re test reliability. The half-day consists of a further live session, written test and discussion about the standardization session. At the end of every live assessment, the trainees are encouraged to discuss the assessment with the trainer and give feedback. Specific attention is paid to problem areas and site-specific cultural issues at this point. The rationale for conducting on-site, rather than centralized, training sessions is to enable the trainer to help the teams with site-specific issues and to test the apparatus. It also provides an opportunity for the trainer to liaise with the IT and data experts at the sites and to troubleshoot initial problems.

Cultural customization.

At each training session, the site team together with the coordinator of the Infant Development Group compiles a list of item-based culture-specific issues for the site and discusses solutions to address these problems. The issues largely concern the amendment of phrases during the administration of the INTER-NDA to more culturally-appropriate conceptual equivalents (for example, in item 30 on combining a word and gesture, it was found that certain phrases such as ‘show me Gota’ in Kenya, and ‘do Namaste’ in India, worked better than English versions such as ‘blow a kiss’ or ‘wave bye’). For maternally reported items on attention and emotional reactivity, local language versions (Brazilian Portuguese, Hindi, Italian, Kiswahili and Marathi) from validated CBCL translations were made available to the site staff.

Reliability Analysis and Comparison with Other Neurodevelopment Scales

The inter-rater and test-re test reliability of the INTER-NDA was determined across 21 assessors in Brazil, India, Italy, Kenya and the UK. This was carried out using video recordings of INTER-NDA assessments on four children performed by an expert assessor (MF). All assessors were instructed to simultaneously score each child according to a standardized reliability protocol. Cohen's kappa coefficients of reliability were calculated. These were found to be substantial for both inter-rater reliability (k = 0.70, 95% CI: 0.47–0.88) and test-re test reliability (k = 0.79, 95%CI: 0.48–0.96).

A study to evaluate the agreement between the INTER-NDA and the BSID-III is currently underway at our Oxford site. The results of this analysis will be published separately.

Progress to date

The INTERGROWTH-21st Project teams in Brazil, Kenya, India, Italy and the UK were trained between November 2012 and September 2013. Implementation of the Package commenced in the UK in February 2013, followed by Kenya, India and Italy in April 2013 and Brazil in November 2013. To date, 829 children in these sites have been assessed.

The Package takes between 35 and 45 minutes to administer in the field. The duration of each component of the Package is as follows: vision, 5 minutes; INTER-NDA, 15–20 minutes; cortical auditory processing, 12 minutes, and sleep, 3–5 minutes. The mean administration times in each site, rounded to the nearest 5 minutes, are as follows, – Brazil: 40 minutes, India: 45 minutes, Italy: 45 minutes, Kenya: 40 minutes and the UK: 35 minutes.

The Package was found to be easy to administer in all sites. Technical difficulties including electricity failures during ERP recordings and Internet connectivity problems occurred to a small extent in all sites. The HIC sites witnessed greater difficulty in persuading children to keep the wireless EEG cap on for the entire duration of 10 minutes as compared to the LMIC sites. The return of the actiwatch to the study center by post proved to be difficult in the Indian, Italian and Kenyan sites and alternative methods of delivery, such as courier services, were therefore organized.


The INTERGROWTH-21st Neurodevelopment Package describes, to the best of our knowledge, the first multi-dimensional instrument for early child development, for both research and screening purposes, that is designed to be carried out by non-specialist assessors and that has been developed for use in children from middle- and upper class families across low-, middle- and high-income settings. The Package assesses visual acuity and contrast sensitivity (with the Cardiff tests); cognition, language, motor skills, behavior, attention and emotional reactivity (the INTER-NDA); cortical auditory processing (measuring auditory ERPs to a novelty oddball paradigm) and sleep-wake patterns and daytime physical actigraphy (actigraphy and maternal reports). The Package is in use in the INTERGROWTH-21st study sites in Asia, Africa, Europe and South America where more than 700 children have been assessed. The operation manuals and protocols are freely available at This paper describes the conceptual and methodological challenges encountered during the selection of multi-dimensional ECD measures and outlines the methodological process of systematic evaluation and tool selection undertaken by the INTERGROWTH-21st Project towards the development of the Package. The paper also presents an overview of, to our knowledge, the most well-established psychometric measures of ECD for preschoolers. We believe that both the description of the systematic approach leading to the development of the Package, and the findings of the overview of psychometric tools to assess ECD in preschoolers, may be of use to researchers, policy makers and organizations involved in ECD research and surveillance at local, national and global levels when selecting and/or designing multi-dimensional, context-dependent, population-based instruments for measuring ECD.

The Package has a number of strengths. First, is its ability to assess multiple dimensions of neurodevelopment in preschoolers without the need for specialist staff or infrastructure. Second, the package was designed to reduce cultural and language biases in its administration as much as possible, thus making it suitable for use among children from middle- and upper class families across low-, middle- and high-income settings. This was done by selecting a pool of items using a decentering approach which, to the best of the Group's knowledge, involved resources that were familiar to children from middle- and upper-class families across settings and were not affected by cultural and language biases. The Group also adopted a scoring system based on objective observations of the child's performance rather than subjective judgments and undertook a process of cultural customization during on-site training sessions. Third, non-specialist field workers can easily deliver the Package in 35–45 minutes, and integrated quality checks in tablet-based application ensure that data can be collected reliably across diverse settings and contexts. An added feature of the Package is that its constituent assessments measure the function of entire vision and auditory pathways and not merely specific components.

There are a number of methodological and conceptual limitations to consider. First, the selection of tools for the Package was based on an overview of existing well-established instruments to assess ECD in preschoolers. This process consisted of a systematic review of the literature and discussions with experts, and not on a meta-analysis of existing tools, which might have produced a more objective assessment. The results of this process resulted in a list of, to our knowledge, the most well-established psychometric tools to assess children at 2 years of age but we acknowledge that this list is not exhaustive. In addition, the results of the systematic review were not evaluated against the PRISMA or STROBE guidelines [54], [55] because our primary concern was not the quality of the individual studies but the suitability of the instrument for the Project. Second, it may appear that the INTER-NDA is based on a variety of items from a number of different tools. However, all items are conceptually equivalent to those in the candidate measures and were carefully selected by international experts for the purpose of the study and the tool's future potential as a population-based screening instrument of ECD. Third, the INTER-NDA is not yet available in all the languages of the participating study sites, although translations are being prepared in accordance with the WHO Mental Health Initiative translation guidelines [56]. The Package was designed specifically for the 22 to 26 month age group and should not be used, therefore, in other age groups. Fourth, the tool was designed for use among children from middle- and upper class families, and its usefulness needs to be assessed in children from poorer backgrounds. Lastly, the Package does not assess the nature and degree of the concerns of caregivers about their children in detail.


To the best of our knowledge, the Package represents the first attempt at a concise, yet comprehensive, multi-dimensional instrument to measure early child development by non-specialist field workers across low- middle- and high-income settings. The comprehensiveness of the Package, its mixed methodology approach and ease of use make it a good candidate for scalability in the context of population-based research, as well as for screening purposes. This paper describes the current landscape of well-established psychometric tools to measure ECD in preschoolers and outlines the methodological approach of systematic evaluation and tool selection, which led to the development of the Package. We believe this approach may be of use to persons and organizations involved in large-scale efforts in ECD research and surveillance when selecting and/or developing measures of ECD.

Supporting Information

File S1.

Combined supporting information file containing all supporting figures and tables. Figure S1, Process and results of the systematic literature review of existing psychometric tools for assessment of neurodevelopment at 24 months. Table S1, List of search terms to identify the most well-established psychometric tools in a search of online databases (Approach 1, Phase 1). Table S2, Summary of well-established neuropsychological measures to assess neurodevelopment in children at 24 months. Table S3, Shortlisted neurodevelopmental assessment tools for use in children at 24 months. Table S4, Selection criteria for vision tests. Table S5, Selection criteria for neuropsychological tests (cognition, language, motor skills, behavior, attention and emotional reactivity). Table S6, Selection criteria for auditory function tests. Table S7, Selection criteria for sleep tests.



The authors are grateful to Kenny McCormick, Georg Hollander, Peter Sullivan, Rebeccah Slater, Stuart Peirson, Morten Kringelbach, Russell Foster, Michelle de Haan, Carol Hogue, Naila Khan, Elizabeth Murray, Aaron Martinez, Stephen Dunne and Guillem Mitjà for their intellectual and technical contribution to the Package. The authors wish to thank all the mothers and children who kindly took part in the INTERGROWTH-21st Project, as well as the Health Authorities in Pelotas, Brazil; Nagpur, India; Turin, Italy; Nairobi, Kenya and Oxford, UK, who facilitated this component of the project by allowing these study sites to participate as collaborating centers. This project was supported by a generous grant from the Bill & Melinda Gates Foundation to the University of Oxford, for which we are very grateful.

Members of the International Fetal and Newborn Growth Consortium for the 21st Century (INTERGROWTH-21st)

Scientific Advisory Committee

M Katz (Chair from January 2011), MK Bhan, C Garza, S Zaidi, A Langer, PM Rothwell (from February 2011), Sir D Weatherall (Chair until December 2010).

Steering Committee

ZA Bhutta (Chair), J Villar (Principal Investigator), S Kennedy (Project Director), DG Altman, FC Barros, E Bertino, F Burton, M Carvalho, L Cheikh Ismail, WC Chumlea, MG Gravett, YA Jaffer, A Lambert, P Lumbiganon, JA Noble, RY Pang, AT Papageorghiou, M Purwar, J Rivera, C Victora.

Executive Committee

J Villar (Chair), DG Altman, ZA Bhutta, L Cheikh Ismail, S Kennedy, A Lambert, JA Noble, AT Papageorghiou.

Study Co-ordinating Unit

J Villar (Head), S Kennedy, L Cheikh Ismail, A Lambert, AT Papageorghiou, M Shorten, L Hoch (until May 2011), HE Knight (until August 2011), EO Ohuma (from September 2010), C Cosgrove (from July 2011), I Blakey (from March 2011).

Data Analysis Group

DG Altman (Head), EO Ohuma (from September 2010), J Villar.

Data Management Group

DG Altman (Head), F Roseman, N Kunnawar, SH Gu, JH Wang, MH Wu, M Domingues, P Gilli, L Juodvirsiene, L Hoch (until May 2011), N Musee (until June 2011), H Al-Jabri (until October 2010), S Waller (until June 2011), C Cosgrove (until April 2013), EO Ohuma (from September 2010), D Yellappan (from November 2010), A Carter (from July 2011), D Muninzwa (from October 2011), D Reade (from June 2012), R Miller (from June 2012).

Ultrasound Group

AT Papageorghiou (Head), L Salomon (Senior external advisor), A Leston, A Mitidieri, F Al-Aamri, W Paulsene, J Sande, WKS Al-Zadjali, C Batiuk, S Bornemeier, M Carvalho, M Dighe, P Gaglioti, N Jacinta, S Jaiswal, JA Noble, K Oas, M Oberto, E Olearo, MG Owende, J Shah, S Sohoni, T Todros, M Venkataraman, S Vinayak, L Wang, D Wilson, QQ Wu, S Zaidi, Y Zhang, P Chamberlain (until September 2012), D Danelon (until July 2010), I Sarris (until June 2010), J Dhami (until July 2011), C Ioannou (until February 2012), CL Knight (from October 2010), R Napolitano (from July 2011), C Pace (from January 2011), V Mkrtychyan (from June 2012).

Anthropometry Group

L Cheikh Ismail (Head), WC Chumlea (Senior external advisor), F Al-Habsi, ZA Bhutta, A Carter, M Alija, JM Jimenez-Bustos, J Kizidio, F Puglia, N Kunnawar, H Liu, S Lloyd, D Mota, R Ochieng, C Rossi, M Sanchez Luna, YJ Shen, HE Knight (until August 2011), DA Rocco (from June 2012), IO Frederick (from June 2012).

Neonatal Group

ZA Bhutta (Head), E Albernaz, M Batra, BA Bhat, E Bertino, P Di Nicola, F Giuliani, I Rovelli, K McCormick, R Ochieng, RY Pang, V Paul, V Rajan, A Wilkinson, A Varalda (from September 2012).

Environmental Health Group

B Eskenazi (Head), LA Corra, H Dolk, J Golding, A Matijasevich, T de Wet, JJ Zhang, A Bradman, D Finkton, O Burnham, F Farhi.

Infant Development Group

A Stein (Head), M Fernandes (Co-ordinator), C Newton (Senior advisor), M Kihara, A Abubakar, K Wulff, M de-Haan, A Soria-Frisch, J Acedo, D Ibanez, C Hogue, E de-Leon, L Aranzeta, T Lewis, A Varalda, I Rovelli, N Sudhir, D Muninzwa, J Kizidio, E Albernaz, F Giuliani, S Savini, E Murray.

Participating countries and local investigators

Brazil: FC Barros (Principal Investigator), M Domingues, S Fonseca, A Leston, A Mitidieri, D Mota, IK Sclowitz, MF da Silveira.

China: RY Pang (Principal Investigator), YP He, Y Pan, YJ Shen, MH Wu, QQ Wu, JH Wang, Y Yuan, Y Zhang.

India: M Purwar (Principal Investigator), A Choudhary, S Choudhary, S Deshmukh, D Dongaonkar, M Ketkar, V Khedikar, N Kunnawar, C Mahorkar, I Mulik, K Saboo, C Shembekar, A Singh, V Taori, K Tayade, A Somani.

Italy: E Bertino (Principal Investigator), P Di Nicola, M Frigerio, G Gilli, P Gilli, M Giolito, F Giuliani, M Oberto, L Occhi, C Rossi, I Rovelli, F Signorile, T Todros.

Kenya: M Carvalho (Principal Investigator), J Kizidio, R Ochieng, J Shah, W Stones, S Vinayak, N Musee (until June 2011), C Kisiang'ani (until July 2011), D Muninzwa (from August 2011).

Oman: YA Jaffer (Principal Investigator), J Al-Abri, J Al-Abduwani, FM Al-Habsi, H Al-Lawatiya, B Al-Rashidiya, WKS Al-Zadjali, FR Juangco, M Venkataraman, H Al-Jabri (until October 2010), D Yellappan (from November 2010).

UK: S Kennedy (Principal Investigator), L Cheikh Ismail, AT Papageorghiou, F Roseman, A Lambert, S Lloyd, R Napolitano (from July 2011), C Ioannou (until February 2012), I Sarris (until June 2010).

USA: MG Gravett (Principal Investigator), C Batiuk, M Batra, S Bornemeier, M Dighe, K Oas, W Paulsene, D Wilson, IO Frederick, HF Andersen, SE Abbott, AA Carter, H Algren, DA Rocco, TK Sorensen, D Enquobahrie, S Waller (until June 2011).

Author Contributions

Conceived and designed the experiments: MF AS CN LCI MK KW EL LA ASF DI JA AA FG SK JV. Performed the experiments: MF AS CN SK JV LCI TL AA. Analyzed the data: MF AS CN SK JV AA MK KW DI ASF JA. Contributed reagents/materials/analysis tools: MF CN AS AA LA EL MK KW ASF DI JA. Wrote the paper: MF CN AS SK JV TL AA KW ASF MK.


  1. 1. First LR, Palfrey JS (1994) The Infant or young child with developmental delay. The New England Journal of Medicine 330:478–483.
  2. 2. Durkin M (2002) The epidemiology of developmental disabilities in low-income countries. Mental Retardation and Developmental Disabilities Research Reviews 8:206–211.
  3. 3. Olness K (2003) Effects on Brain Development Leading to Cognitive Impairment: A Worldwide Epidemic. Journal of Developmental & Behavioral Pediatrics 24:120–130.
  4. 4. Rutter M, Kim-Cohen J, Maughan B (2006) Continuities and discontinuities in psychopathology between childhood and adult life. Journal of Child Psychology and Psychiatry 47:276–295.
  5. 5. Mannuzza S, Klein RG (2000) Long-term prognosis in attention-deficit/hyperactivity disorder. Child and Adolescent Psychiatric Clinics of North America 9:711–726.
  6. 6. Caspi A, Harrington H, Milne B, Amell JW, Theodore RF, et al. (2003) Children's Behavioral Styles at Age 3 Are Linked to Their Adult Personality Traits at Age 26. Journal of Personality 71:495–514.
  7. 7. Caspi A, Moffitt TE, Newman DL, Silva PA (1996) Behavioral observations at age 3 years predict adult psychiatric disorders: Longitudinal evidence from a birth cohort. Archives of General Psychiatry 53:1033–1039.
  8. 8. Moffitt TE, Arseneault L, Belsky D, Dickson N, Hancox RJ, et al. (2011) A gradient of childhood self-control predicts health, wealth, and public safety. Proceedings of the National Academy of Sciences 108:2693–2698.
  9. 9. Hack M, Flannery DJ, Schluchter M, Cartar L, Borawski E, et al. (2002) Outcomes in Young Adulthood for Very-Low-Birth-Weight Infants. New England Journal of Medicine 346:149–157.
  10. 10. Doyle LW, Anderson PJ (2010) Adult Outcome of Extremely Preterm Infants. Pediatrics 126:342–351.
  11. 11. Sutton PS, Darmstadt GL (2013) Preterm Birth and Neurodevelopment: A Review of Outcomes and Recommendations for Early Identification and Cost-effective Interventions. Journal of Tropical Pediatrics.
  12. 12. Behrman RE, Butler AS (2007) Neurodevelopmental, Health, and Family Outcomes for Infants Born Preterm. Preterm Birth: Causes, Consequences, and Prevention. Washington (DC): National Academies Press (US) and Institute of Medicine (US) Committee on Understanding Premature Birth and Assuring Healthy Outcomes pp.346–397.
  13. 13. Fernald LCH, Kariger P, Engle P, Raikes A (2009) Examining Early Child Development in Low-Income Countries: A Toolkit for the Assessment of Children in the First Five Years of Life. Washington DC 20433.
  14. 14. Villar J, Altman DG, Purwar M, Noble JA, Knight HE, et al. (2013) The objectives, design and implementation of the INTERGROWTH-21st Project. BJOG 120 Suppl 29–26.
  15. 15. Garza C, de Onis M (2004) Rationale for developing a new international growth reference. Food Nutr Bull 25:S5–14.
  16. 16. Villar J, Altman DG, Purwar M, Noble JA, Knight HE, et al. (2013) The objectives, design and implementation of the INTERGROWTH-21st Project. British Journal of Obstetrics and Gynaecology 120:9–26.
  17. 17. The International Fetal and Newborn Growth Consortium for the 21st Century (INTERGROWTH-21st) Study Group. International standards for newborn weight, length and head circumference according to gestational age and sex (submitted) 2014 Lancet.
  18. 18. Marlow N, Abbott J, Field D, Johnson S, Huertas A, et al. (2008) Report of a BAPM/RCPCH Working Group: Classification of health status at 2 years as a perinatal outcome. London, UK: British Association of Perinatal Medicine. 23 p.
  19. 19. Bayley N (2006) Bayley Scales of Infant Development Manual. Antonio, Texas: The Psychological Corporation.
  20. 20. Bayley N, Schaefer ES (1964) Correlations of Maternal and Child Behaviors with the Development of Mental Abilities: Data from the Berkeley Growth Study. Monographs of the Society for Research in Child Development 29:1–80.
  21. 21. Khan NZ, Muslima H, Begum D, Shilpi AB, Akhter S, et al. (2010) Validation of Rapid Neurodevelopmental Assessment Instrument for Under-Two-Year-Old Children in Bangladesh. Pediatrics 125:e755–e762.
  22. 22. Gladstone MJ, Lancaster GA, Umar E, Nyirenda M, Kayira E, et al. (2010) The Malawi Developmental Assessment Tool (MDAT): The Creation, Validation, and Reliability of a Tool to Assess Child Development in Rural African Settings. PLoS Medicine.
  23. 23. Griffiths R (1954) The abilities of babies: a study in mental measurement. New York, NY, US: McGraw-Hill.
  24. 24. Griffiths R (The 1996 Revision) Manual: The Griffiths Mental Development Scales from Birth to 2 years. UK: Association for Research in Infant and Child Development.
  25. 25. Koot H, Van Den Oord ECG, Verhulst F, Boomsma D (1997) Behavioral and Emotional Problems in Young Preschoolers: Cross-Cultural Testing of the Validity of the Child Behavior Checklist/2–3. Journal of Abnormal Child Psychology 25:183–196.
  26. 26. Ostrander R, Weinfurt KP, Yarnold PR, August GJ (1998) Diagnosing attention deficit disorders with the Behavioral Assessment System for Children and the Child Behavior Checklist: Test and construct validity analyses using optimal discriminant classification trees. Journal of Consulting and Clinical Psychology 66:660–672.
  27. 27. Eremenco SL, Cella D, Arnold BJ (2005) A Comprehensive Method for the Translation and Cross-Cultural Validation of Health Status Questionnaires. Evaluation & the Health Professionals 28:212–232.
  28. 28. Whitney MP, Thoman EB (1993) Early Sleep Patterns of Premature Infants Are Differentially Related to Later Developmental Disabilities. Journal of Developmental & Behavioral Pediatrics 14:71–80.
  29. 29. Benasich AA, Tallal P (2002) Infant discrimination of rapid auditory cues predicts later language impairment. Behavioural Brain Research 136:31–49.
  30. 30. Henry B, Caspi A, Moffitt TE, Silva PA (1996) Temperamental and familial predictors of violent and nonviolent criminal convictions: Age 3 to age 18. Developmental Psychology 32:614–623.
  31. 31. Teller DY, McDonald MA, Preston K, Sebris SL, Dobson V (1986) Assessment of Visual Acuity in Infants and Children; The Acuity Card Procedure. Developmental Medicine & Child Neurology 28:779–789.
  32. 32. Teller DY (1979) The forced-choice preferential looking procedure: A psychophysical technique for use with human infants. Infant Behavior and Development 2:135–153.
  33. 33. Neu B, Sireteanu R (1997) Monocular acuity in preschool children: Assessment with the Teller and Keeler acuity cards in comparison to the C-test. Strabismus 5:185–202.
  34. 34. Adoh TO, Woodhouse JM, Oduwaiye KA (1992) The Cardiff Test: A New Visual Acuity Test for Toddlers and Children with Intellectual Impairment. A Preliminary Report. Optometry & Vision Science 69:427–432.
  35. 35. Adams RJ, Mercer ME, Courage ML (1992) A new technique to measure contrast sensitivity in human infants. Optometry and vision science: official publication of the American Academy of Optometry 69:440–446.
  36. 36. Verweyen P (2004) Measuring Vision in Children. Community Eye Health 17:27–29.
  37. 37. Adams RJ, Courage ML (2002) Using a single test to measure human contrast sensitivity from early childhood to maturity. Vision Research 42:1205–1210.
  38. 38. Cunningham M, Cox EO, Practice Co, Medicine A, Otolaryngology So, et al. (2003) Hearing Assessment in Infants and Children: Recommendations Beyond Neonatal Screening. Pediatrics 111:436–440.
  39. 39. Lewandowski AS, Toliver-Sokol M, Palermo TM (2011) Evidence-Based Review of Subjective Pediatric Sleep Measures. Journal of Pediatric Psychology 36:780–793.
  40. 40. Spruyt K, Gozal D (2011) Pediatric sleep questionnaires as diagnostic or epidemiological tools: A review of currently available instruments. Sleep Medicine Reviews 15:19–32.
  41. 41. Jon C (2009) Polysomnography in Children. In: Pereira KD, Mitchell RB, editors. Pediatric Otolaryngology for the Clinician: Humana Press. pp.35–47.
  42. 42. So K, Michael Adamson T, Horne RSC (2007) The use of actigraphy for assessment of the development of sleep/wake patterns in infants during the first 12 months of life. Journal of Sleep Research 16:181–187.
  43. 43. Sadeh A, Lavie P, Scher A, Tirosh E, Epstein R (1991) Actigraphic Home-Monitoring Sleep-Disturbed and Control Infants and Young Children: A New Method for Pediatric Assessment of Sleep-Wake Patterns. Pediatrics 87:494–499.
  44. 44. Adoh TO, Woodhouse JM (1994) The Cardiff acuity test used for measuring visual acuity development in toddlers. Vision Research 34:555–560.
  45. 45. Fogarty Associates; The Cardiff Contrast Test.
  46. 46. Marcus RE (1951) Hearing and speech problems in children: Observations and use of electroencephalography. AMA Archives of Otolaryngology 53:134–146.
  47. 47. Suzuki T, Ogiba Y (1961) Conditioned orientation reflex audiometry: A new technique for pure-tone audiometry in young children under 3 years of age. Archives of Otolaryngology 74:192–198.
  48. 48. Northern JL, Downes MP (2002) Hearing in children. Pennsylvania, USA: Lippincott Williams & Wilkins.
  49. 49. Kihara M, Hogan AM, Newton CR, Garrashi HH, Neville BR, et al. (2010) Auditory and visual novelty processing in normally-developing Kenyan children. Clinical Neurophysiology 121:564–576.
  50. 50. Starlab & Neuroelectrics, Barcelona, Spain. Accessed 2014 Oct 31.
  51. 51. Sadeh A (2004) A Brief Screening Questionnaire for Infant Sleep Problems: Validation and Findings for an Internet Sample. Pediatrics 113:e570–e577.
  52. 52. CamNtech Ltd and CamNtech Inc, Cambridge, UK. Accessed 2014 Oct 31.
  53. 53. Ohuma EO, Hoch L, Cosgrove C, Knight HE, Cheikh Ismail L, et al. (2013) Managing data for the international, multicentre INTERGROWTH-21st Project. BJOG: An International Journal of Obstetrics & Gynaecology 120:64–70.
  54. 54. von Elm E, Altman D, Egger M, Pocock S, Gøtzsche P, et al. (2007) The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) Statement: Guidelines for reporting observational studies. Preventive Medicine 45:247–251.
  55. 55. Moher D, Liberati A, Tetzlaff J, Altman DG (2009) Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. Annals of Internal Medicine 151:264–269.
  56. 56. Picton TW, Bentin S, Berg P, Donchin E, Hillyard SA, et al. (2000) Guidelines for using human event-related potentials to study cognition: Recording standards and publication criteria. Psychophysiology 37:127–152.