Comparison of Fecal Microbiota in Children with Autism Spectrum Disorders and Neurotypical Siblings in the Simons Simplex Collection

In order to assess potential associations between autism spectrum disorder (ASD) phenotype, functional GI disorders and fecal microbiota, we recruited simplex families, which had only a single ASD proband and neurotypical (NT) siblings, through the Simons Simplex Community at the Interactive Autism Network (SSC@IAN). Fecal samples and metadata related to functional GI disorders and diet were collected from ASD probands and NT siblings of ASD probands (age 7–14). Functional gastrointestinal disorders (FGID) were assessed using the parent-completed ROME III questionnaire for pediatric FGIDs, and problem behaviors were assessed using the Child Behavior Check List (CBCL). Targeted quantitative polymerase chain reaction (qPCR) assays were conducted on selected taxa implicated in ASD, including Sutterella spp., Bacteroidetes spp. and Prevotella spp. Illumina sequencing of the V1V2 and the V1V3 regions of the bacterial 16S rRNA genes from fecal DNA was performed to an average depth of 208,000 and 107,000 high-quality reads respectively. Twenty-five of 59 ASD children and 13 of 44 NT siblings met ROME III criteria for at least one FGID. Functional constipation was more prevalent in ASD (17 of 59) compared to NT siblings (6 of 44, P = 0.035). The mean CBCL scores in NT siblings with FGID, ASD children with FGID and ASD without FGID were comparably higher (58–62 vs. 44, P < 0.0001) when compared to NT children without FGID. There was no significant difference in macronutrient intake between ASD and NT siblings. There was no significant difference in ASD severity scores between ASD children with and without FGID. No significant difference in diversity or overall microbial composition was detected between ASD children with NT siblings. Exploratory analysis of the 16S rRNA sequencing data, however, identified several low abundance taxa binned at the genus level that were associated with ASD and/or first order ASD*FGID interactions (FDR <0.1).


Introduction
Autism spectrum disorders (ASD) are neurodevelopmental disorders characterized primarily by alterations in reciprocal social interactions and communication as well as repetitive behavior. The reported prevalence of ASD has increased and affects 1 in 68 children in the USA [1]. This may be due in part to heightened awareness for these disorders [2,3]. The diagnosis of ASD is based on specially trained physicians and psychologists administering autism-specific behavioral evaluations. Twin studies provide support for both genetic and environmental contributions to ASD pathogenesis [2,3].
Alterations in the composition of resident gut microbial communities have been reported for an increasing number of human disorders, including ASD and functional GI disorders (FGID) . The term "functional" in FGIDs refers to the absence of structural abnormalities in the gut that can be identified by blood, radiographic or endoscopic tests [25,26]. Consequently, FGIDs are diagnosed based on assessment of GI symptoms, and the exclusion of other GI diseases (e.g. celiac disease, inflammatory bowel diseases). FGIDs are very common in the general population and comprise a major portion of gastroenterology and primary care practices. Abnormalities in the neural regulation of gut function and sensation are implicated in the pathogenesis of FGIDs [20,25,26].
Functional GI symptoms have been frequently reported in ASD children as a non-neurological co-morbidity [27][28][29][30][31]. However a major barrier to integrating the ASD studies on GI symptoms with those conducted in the general population has been that the instruments used to assess functional GI symptoms in ASD differed from the ROME III process, which is the current international standard by which functional GI disorders (FGID) are diagnosed and classified [24].
In 2010, the Simons Foundation Autism Research Initiative (SFARI) began assembling the Simons Simplex Collection (SSC) as a resource for ASD research [32]. The SSC includes over 2000 simplex families, each with a single ASD proband but with unaffected parents and siblings. The simplex families were recruited through 12 university affiliated research clinics located in Canada and the United States. Upon recruitment to the SSC, the ASD probands (age [4][5][6][7][8][9][10][11][12][13][14][15][16][17][18] were evaluated with an extensive battery of diagnostic measures including the Autism Diagnostic Observation Schedule (ADOS) using the same diagnostic criteria across centers. Medical data, including GI symptoms, were collected on the ASD probands, but not on the NT siblings, using an extensive SSC Medical History form. Families were excluded if the proband did not meet criteria for ASD, had medically significant perinatal incidents (e.g. perinatal asphyxia), low mental age, or had primary relatives on the autism spectrum. The SSC thus represents a "unique, well-described sample of able children and adolescents with relatively severe ASD, as indicated by ADI-R and ADOS Calibrated Severity Scores (ADOS-CSS)" [32]. In order to assess the associations between ASD phenotype, FGIDs and fecal microbiota, we recruited a subset of the SSC families with ASD probands and/or NT siblings (age [7][8][9][10][11][12][13][14], to participate in a study collecting fecal samples and metadata related to functional GI disorders and diet.

Patient recruitment
Families were recruited via a registry called the Simons Simplex Community through the Interactive Autism Network (SSC@IAN), which is composed of simplex families originally recruited to the Simons Simplex Collection that were willing to be contacted through the Interactive Autism Network for additional studies. The SSC@IAN provided us with contact information for 245 potentially eligible simplex families that expressed their willingness to be contacted about this study . Of the 245 potentially eligible SSC families with 339 children, 107 families  with 145 children consented to participate with parental written consents and written child  assents for all participating children, and 66 families with 103 children (66 ASD probands and  37 NT siblings) completed the study. Exclusion criteria for ASD probands and NT siblings in this study were age <7 or age >14 years, a past or current history of GI diseases or other serious medical problems, and use of antibiotics within one month of collection of the stool sample. The procedures of the present study were reviewed and approved by the Institutional Review Board of Stony Brook University (IRB 313466) Families who consented to participate in the study were asked to complete the Pediatric ROME III Version (QPGS-RIII), the GI symptoms form in the Simon Simplex Collection Medical History (v 4.0) [31], the Child Behavior Checklist (CBCL/6-18) [33][34][35][36][37], a one-week food diary for the ASD proband and/or the NT sibling prior to collecting a fecal sample, and provide current information on the child's height and weight. The families were asked to submit the stool samples after the subjects were off probiotics and antibiotics for at least one month.
Pediatric ROME III process of diagnosing FGID and the GI symptoms section of the SSC medical history form Functional GI disorders (FGID) were assessed using the ROME III process [25]. Information on GI symptoms in both ASD and NT siblings was also collected using the GI symptoms section of the SSC Medical History form [32]. The GI symptoms section of the SSC Medical History form included the seven symptom categories: constipation, diarrhea, abnormal stool smell, bloating, excessive gas, severe abdominal pain, and vomiting. The parents were asked to rank whether these categories were problematic as "yes", "not sure" and "no". Many of these symptom categories in the SSC form overlap with the following symptoms included in the calculation of the GI severity index score [27]: constipation, diarrhea, stool smell, flatulence and abdominal pain, but did not include "unexplained daytime irritability", "nighttime awakening" or "abdominal tenderness during exam".

Child Behavior Checklist (CBCL)
The CBCL/6-18 is a parent report form to screen for emotional, behavioral, and social problems in children ages 6-18 [33]. This instrument as well as the earlier version of this instrument has been previously used by several research groups studying school-aged children with ASD [34][35][36][37]. The data from ASD and NT siblings was entered and analyzed using the AGES 6-18 ASEBA-PC MODULE (ASEBA) to yield norm-referenced T-scores (M = 50, SD = 10). The CBCL contains eight syndrome scales: Anxious/Depressed, Withdrawn/Depressed, Somatic Complaints, Rule Breaking Behavior, Aggressive Behavior. The Internalizing Domain score is a measure of emotional problems and contains three syndrome scales: Anxious/Depressed, Withdrawn/Depressed, and Somatic Complaints. The Externalizing Domain score is a measure of behavioral problems and contains the Rule Breaking Behavior and Aggressive Behavior syndrome scales. total protein g/day, total carbohydrates g/day, total fat g/day, total dietary fiber/day, total sugar g /day.

Fecal sample collection and DNA extraction
Fecal samples were collected in a stool "hat" placed in the toilet and following evacuation 2 ml of stool were immediately transferred, using the spoon built into the lid into a Para-Pak Clean vial (Meridian Bioscience, Cincinnati, OH) that contained 10 ml of RNAlater for metagenomics studies and a Para-Pak Clean vial with no preservative. The fecal samples were shipped in cold packs overnight to our laboratory. Fecal DNA was extracted from stool samples immediately upon arrival using ZR Fecal DNA MiniPre (Zymo Research Corporation, Irvine, CA) and following manufacturer's protocol. Fecal DNA and stool samples were stored in -80°C.

16S rRNA amplicon library construction and Illumina V1V2 and V1V3 sequencing analysis
Fecal bacterial profiles were determined using broad-range amplification of the~300 base pair (bp) V1V2 variable region using primers 27FYM (AGAGTTTGATYMTGGCTCAG) and 338R (TGCTGCCTCCCGTAGGAGT) and the~500 bp V1V3 region using primers 27FYM (AGAGTTTGATYMTGGCTCAG) and 534R (ATTACCGCGGCKGCTGG) [43,44]. Sequencing was performed on the Illumina MiSeq platform using 600-cycle version 3 reagent kits. Demultiplexed, paired end sequence data were deposited in the NCBI Short Read Archive under BioProject Accession Number: PRJNA282013 (www.ncbi.nlm.nih.gov/bioproject/ PRJNA2282203). The sorted paired reads were assembled using phrap [45] and paired reads that did not assemble were discarded. Assembled sequence ends were trimmed over a moving window of 5 nucleotides until average quality met or exceeded 20. Trimmed contigs with more than 1 ambiguity or shorter than 200 nucleotides were discarded. Potential chimeras identified with Uchime (usearch6.0.203_i86linux32) [46] using the Schloss [47] Silva reference sequences were removed from subsequent analyses. Assembled sequences were aligned and classified with SINA (1.2.11) using the 418,497 bacterial sequences in Silva 115NR99 as reference configured to yield the Silva taxonomy [48,49]. Operational taxonomic units (OTUs) were produced by clustering sequences with identical taxonomic assignments. OTU counts were normalized between samples by dividing sequence counts by the total number of sequences generated per sample to calculate the relative abundance. Phylum-level OTU tables were generated by collapsing lower level OTUs into higher-level categories. OTUs with a maximal relative abundance <0.0001 and with a prevalence <0.01 were culled. The remaining OTU relative abundances were then transformed using the square root function.

Statistical analysis
Comparisons of gender, FGIDs, and SSC GI symptoms between ASD and NT siblings were conducted using the unconditional Fisher exact test with one tail [50]. The threshold significance was set at P < 0.05.
The ASD and NT siblings were further subdivided with respect to whether they also carried a diagnosis of an FGID based on the ROME III process. Study characteristics including body mass index (BMI), CBCL scores, dietary scores and the qPCR ΔCt scores were compared between the following four groups: 1) ASD with FGID, 2) ASD without FGID, 3) NT siblings with FGID and 4) NT siblings without FGID. Outcome variables including CBCL, dietary macronutrients, qPCR ΔCt were also modeled using mixed effect models with ASD, FGID and ASD Ã FGID as the three fixed effects, and family as random effects. Each continuous variable was fitted to a linear model while each categorical variable was fitted to a logistic model. Comparison of continuous variables between family matched ASD and NT siblings was conducted using two-tailed Student's T test or the Wilcoxon signed rank test with the threshold of significance set as P < 0.05.
Beta diversity was compared between ASD and NT siblings using the adonis function in the R vegan package as previously described [52,53] at the phyla, family and genus level. This function uses a non-parametric multivariate analysis of variance test (PERMANOVA) [54], and was applied using Bray-Curtis, Jaccard and Morisita-Horn indices as distance measurements [55]. Family (i.e. whether ASD or NT siblings belonged to the same family) was selected as a strata variable to restrict permutations as previously described [56]. Overall microbial composition binned as phyla, was also compared between family matched ASD and NT siblings using the permutation Hotelling T test [57].
Paired two class comparisons of the transformed (square root) relative abundances of individual OTUs (phyla and genera level) between ASD probands and NT siblings from the same family were conducted using the Wilcoxon signed rank test with the threshold set as FDR < 0.05.
Because over-dispersion is often observed in microbiome sequence count data [58], a negative binomial regression model was also used to assess the effect of the ASD phenotype, FGID phenotype and first order interactions between ASD and FGID (ASD Ã FGID) on each individual OTU, in both paired and unpaired ASD and NT samples as follows: where Y ijk denotes the raw sequence count of OTU k for child j in family i. In this model we treat family, which may reflect genetic and possibly dietary factors, as a zero mean random coefficient. Coefficients b 0 , β 1 , β 2 and β 3 are fixed constants representing grand mean, the effect of ASD, FGID and the interaction of ASD and FGID respectively. The log total sequence count (Y ij. = ∑ k Y ijk ) for each individual is considered as an offset. Because the distribution of the raw sequence count is often skewed by a large proportion of zero counts, a zero-inflated version of the negative binomial model was also fitted to each individual OTU: The final model was selected based on the Akaike Information Criterion (AIC). The p-values for each OTU were adjusted for multiple comparisons to calculate a false discovery rate (FDR) using the Benjamini-Hochberg procedure [59]. The threshold of significance for ASD, FGID or ASD Ã FGID first order interactions was set at FDR < 0.1. The logistic and linear mixed models were built with R package glmmADMB [60] and lme4-R [61] respectively.
Comparative analysis of the parallel square root transformed relative abundance [62] measurements of Bacteroidetes phylum, the Sutterella genus and the Prevotella genus by MiSeq V1V2, MiSeq V1V3 and targeted qPCR was conducted using latent structural equation modeling [63]. Because of the large proportion of zero counts for Sutterella and Provetella genera, samples with zero counts were excluded from the analysis. To avoid the complication caused by correlated samples, SEM models were fitted with grouping variable ASD and the constraint that loadings are equal for different groups. The corresponding model for each group is: The corresponding model for each group is: Here ξ ij is the underlying true bacteria relative abundance in the square root scale, while δ 1 , δ 2 and δ 3 are the corresponding measurement errors. In order to render the model identifiable, we chose qPCR as the reference level with a 1 = 0 and b 1 = 1. The model was fitted with the R package lavaan [64].

Demographics and functional GI disorders (FGID) in ASD and NT siblings
A total of 59 ASD children and 44 NT siblings from 66 families completed the study ( Table 1). Matched ASD and NT sibling pairs were from 37 families. In some instances NT siblings of ASD probands were recruited, but not the ASD proband s because their age exceeded 14 years. The ASD children were predominantly male. The gender distribution for the NT siblings was roughly equal. The mean age in both ASD children and NT siblings was 10 years, and they were predominantly of Caucasian descent. There are three major groups of FGIDs (see Table 1) [24]: H1) Vomiting and aerophagia, H2). Abdominal-pain related FGIDs, and H3) Constipation and fecal incontinence. Group H1 is further subdivided into three FGIDs: H1a) Adolescent rumination syndrome; H1b) Cyclic vomiting syndrome; H1c) Aerophagia. Group H2 is further subdivided into four FGIDS: H2a) Functional dyspepsia, H2b) Irritable bowel syndrome (IBS), H2c) abdominal migraine, H2d) Childhood functional abdominal pain with a subset termed H2d1. Childhood functional abdominal pain syndrome. Subjects classified within the H2d1 subset have additional somatic symptoms such as headache, limb pain or difficulty sleeping.
Twenty-five of 59 (42%) ASD children and 13 of 44 (30%) NT siblings were diagnosed with at least one FGID. Fifteen ASD children and 13 NT siblings in 37 families met ROME III criteria for at least one FGID, with concordance for FGIDs in 6 (18%) ASD/NT sibling pairs. Some ASD and NT siblings were diagnosed with more than one FGID. ASD children exhibited a higher prevalence of functional constipation in compared to NT siblings (P = 0.035) when both matched and unmatched individuals were included in the analysis. We detected no significant difference in the prevalence of children with at least one FGID in family matched ASD/ NT sibling pairs. We also detected no significant difference in the prevalence of specific FGIDs such as functional constipation in the family matched ASD/NT sibling pairs.
When the SSC registry was established, medical history forms including one on GI symptoms were completed by the parents on the ASD children but not the NT siblings. The same SSC GI symptoms form was then completed (~2-3 years later) by the parents for the ASD and NT children for the current study. As shown in Table 2, there was no significant change in the prevalence of the GI symptoms occurred over time. Furthermore, the SSC results indicate that ASD children exhibited a higher prevalence of constipation compared to NT siblings (P = 0.05).

Comparison of CBCL and macronutrient intake
As shown in Table 3 and S1 Table, the ASD severity score, ADOS-CSS, did not differ significantly between the ASD w FGID group and the ASD w/o FGID group. Mean total, internalizing, externalizing and subscale CBCL scores were significantly increased (p < 0.05) in the ASD children in both the ASD/NT sibling pairs and in comparison of all the ASD with NT children (matched and unmatched). These CBCL scores were also significantly increased (p < 0.05) in NT siblings with FGID compared to NT siblings without FGID ( Table 3). There was no significant difference in the mean CBCL scores between ASD children with FGID compared with ASD children without FGID. No significant differences (p < 0.05) between the four study groups with respect to daily intake of macronutrients (calories, protein, fats, carbohydrates, sugars or dietary fiber) were noted ( Table 3). Only 4 children were reported to adhere to a gluten and/or casein free diet. The study participants had BMIs that were within the normal range and were not obese. Table 2. SSC problematic GI symptoms results. The previous and current percent of ASD children and NT siblings having a problematic GI symptom is listed below. The previous percent were based on parental responses to the SSC medical form for ASD children only on first entering the SSC registry. The current percent were based on parental responses to the same forms for both ASD and NT siblings during the current study~2-3 years later. The P-values reflect comparison of the current responses of the ASD and NT siblings. There was no significant change in the prevalence of any of the seven problematic GI symptoms in the ASD children at the two time points. No associations (FDR < 0.1) detected in targeted bacterial subgroups between ASD, FGID and/or first order ASD*FGID by QPCR Targeted qPCR assays of fecal samples were conducted for Sutterella, Prevotella and total Bacteroidetes subgroups, because of previous reports that the relative abundance of these subgroups differed between ASD and NT children [7,8,[10][11][12][13][14]. QPCR assays targeting the C. coccoides-E-rectales group, Faecalibacterium prausnitzii and Escherichia coli were also performed because altered relative abundances have been reported previously in GI disorders such as inflammatory bowel diseases [23]. The mean ΔCt values of these subgroups, which represent a log 2 transformation of the relative abundance, for 1) ASD w FGID, 2) ASD w/o FGID, 3) NT w FGID and 4) NT w/o FGID are shown in Table 4. No significant difference was observed for any of these bacterial subgroups in the 37 ASD/NT sibling pairs (see Methods). Furthermore, in mixed models, no significant effect of ASD, FGID or ASD Ã FGID was observed in an analysis of all 59 ASD and 44 NT siblings.

16S rRNA sequence analysis of ASD children and NT siblings
We . No significant effect of ASD, FGID or ASD Ã FGID was observed on OTU complexity or richness, two measures of α-diversity, Shannon diversity and Chao 1 (see Table 5 for mean values). No significant difference (p < 0.05) was noted in β-diversity between ASD and NT sibling bacterial communities analyzed at the phyla, family or genus level and using Bray-Curtis, Jaccard or Morisita-Horn indices as dissimilarity measurements. The relative abundances of the major four phyla, (Firmicutes, Bacteroidetes, Actinobacteria, Proteobacteria) for the ASD children with FGID, ASD children without FGID, NT siblings with FGID and NT siblings without FGID groups for the V1V2 and V1V3 datasets are shown in Fig 1 (see S2 Table). Phyla of lower abundance, including RF3, Tenericutes, Cyanobacteria, Verrucomicrobia, Lentisphaerae, and Fusobacterium were also detected (see S3 Table). Permutational Hotelling T test revealed no significant difference in the overall microbial composition when binned at the phyla level, between ASD children and NT siblings. With the exception of Table 4. QPCR comparisons of selected bacterial subgroups in ASD children with FGID, ASD children without FGID, NT siblings with FGID and NT siblings without FGID. The qPCR assays were conducted using established primers as described in Methods. The mean ΔCt values (~Log2 relative abundance of targeted bacterial subgroups) ± standard deviations are listed. The data from family matched and unmatched ASD and NT siblings are included.  the Verrucomicrobia phylum, the relative abundances of the phyla were similar between the V1V2 and V1V3 data sets. Exploratory analyses of individual phyla detected no significant effect (FDR <0.1) of ASD, FGID or ASD Ã FGID first order interactions for any of the individual phyla, including Bacteroidetes, using a negative binomial model. At the genus level, 359 operational taxonomic units (OTU) were detected for the V1V2 dataset and 189 OTUs for the V1-V3 datasets, respectively. No significant associations were detected between the relative abundance of the Sutterella or Prevotella genera and ASD, FGID or ASD Ã FGID first order interactions (see Table 6) in either the V1V2 or V1V3 datasets. Exploratory analyses of individual genera revealed a significant effect (FDR < 0.1) of ASD and ASD Ã FGID first order interactions on the Cyanobacteria/Chloroplast genus for both the V1V2 and/or V1V3 datasets using a zero-inflated negative binomial model (see Table 6). The mean relative abundance of this genus was increased in the ASD-with FGID group compared to those of the other three  categories (ASD without FGID, NT siblings with FGID and NT siblings without FGID, see S4 Table for mean relative abundances). 16S RNA sequence analyses of the human gut microbiota have previously detected sequences binned within the Cyanobacteria phylum. These sequences could represent chloroplast 16S rRNA genomic sequences present in dietary plant foods or novel uncultivated bacterial species. Metagenomic studies have detected a novel nonphotosynthetic taxa termed "Melainabacteria" present in human gut flora that represents either a sibling phylum or a class within the Cyanobacteria phylum [65,66]. The Silva database used in the current study, however, binned the Melainabacteria sequences in a taxon that was distinct from the Chloroplast taxon. Inspection of the daily food diaries of all the subjects revealed that two ASD children with FGID (functional constipation), who had high relative abundances of this genus and consequently dominated the signal associated with ASD with FGID category, were fed chia seeds added to smoothies. None of the other subjects consumed chia seeds except one ASD child who ingested a serving of mixed grain bread that included chia grain. Of note, chia seeds have been touted as an autism cure on the internet [67,68]. For the V1V2 dataset only, significant associations were observed for 1) ASD and ASD Ã F-GID first order interactions with Firmicutes/Asteroleplasma, 2) ASD, FGID and ASD Ã FGID first order interactions with Proteobacteria/Thalassospira, and 3) FGID and ASD Ã FGID first order interactions with Proteobacteria/Burkholderia (see Table 5 and S4 Table). The Firmicutes/Asteroleplasma genus appears to represent a group of uncultured wall-less mycobacterium like organisms within the class Erysipelotrichia [69]. The Proteobacteria/Thalassospira genus appears to represent alpha-proteobacteria present in bacterioplankton and a number of uncultivated organisms [70]. The sequences binned in this Proteobacteria/Burkholderia taxon represent unclassified sequences within the Burkholderia order that did not bin within the Comamonadaceae family (see below).
For the V1V3 dataset only, significant associations were observed for 1) ASD and ASD Ã F-GID first order interactions with the Proteobacteria/Comamonadaceae, 2) ASD, FGID and ASD Ã FGID first order interactions with Fusobacteria.Fusobacteriales, 3) ASD Ã FGID first order interactions with Bacteroidetes/Prevotellaceae, and 4) ASD and ASD Ã FGID interactions with Actinobacteria/Mobiluncus. These associations, however, did not reach significance in the V1V2 dataset. Proteobacteria/Comamonadaceae is a large and diverse bacterial family within the Burkholderiales order (see above). The sequences binned within this Fusobacteria/ Table 6. Low abundance taxa binned at the genera level that are significantly associated with ASD, FGID or ASD*FGID first order interactions. The FDRs for each low abundance taxa are listed below for the V1V2 and V1V3 datasets, with the threshold of significant <0.1. N.D. is not detected based on filtering criteria. The FDRs are also listed for two genera, Sutterella and Provetella, which have been previously reported to be increased and decreased, respectively, in ASD children compared to unrelated control children [10,11]. Fusobacteriales taxon represent unclassified sequences within the Fusobacteriales family that did not bin within the Fusobacterium genus. The sequences binned within the Bacteroidetes/ Prevotellaceae taxon represent unclassified sequences within the Prevotellaceae family that did not bin within the Prevotella genus. The Actinobacteria/Mobiluncus genus has previously been associated with bacterial vaginosis [71].
Comparison of MiSeq V1V2, V1V3 and qPCR measurements of selected bacterial categories Comparisons of the relative abundances of the Bacteroidetes phylum, and the Sutterella and Prevotella genera reveal high correlations between MiSeq V1V2 and V1V3 (ρ%0.9) and small proportional biases (S5 Table and S1 Fig). The different reverse primers, 338R and 534R, used for amplifying respectively the V1V2 and V1V3 regions, might have subtle differences in specificity. While there were generally positively correlated measurements between the MiSeq and qPCR platforms, we observed large systematic biases particularly with the two relative low abundance genera, Sutterella and Provetella. This may reflect at least in part that the depth of sequencing was not sufficient to accurately measure relatively low abundance bacterial categories.

Discussion
This study of 59 ASD children and 44 NT siblings in the Simons Simplex Community through the Interactive Autism Network (SSC@IAN) registry, was motivated by previous reports linking ASD, GI symptoms with alterations in the gut microbiota (see Fig 2) and some simply linking ASD with the gut microbiota (see Fig 3). These previous studies of gut microbial communities in ASD children used both unrelated control children and NT siblings of ASD children. One study also included related disorders, such as Aspergers or Pervasive developmental disorder not otherwise specified (PDD-NOS) as the non-ASD control population. Our current study compared microbial composition and diversity between ASD children with only NT siblings. Of the 44 NT siblings, 37 were family matched to the ASD children in the study and the remaining seven NT children were siblings of ASD children who were not in the current study, in some cases because the ASD child was older than 14 years of age. This study did not replicate previously reported differences in the relative abundances of bacterial categories, including the Bacteroidetes phylum [12][13][14], the Sutterella genus [7,8,10] and the Prevotella genus [11] between ASD and unrelated control children. This may reflect the smaller number of subjects enrolled in the previous studies, possibly resulting in Type 1 errors. One study used intestinal mucosal [10] rather than fecal samples. Of note, previous studies [6][7][8]12,14], with the exception of one study [13], failed to detect differences in microbial composition between ASD and NT siblings [11]. Our study also detected no differences in either α or β diversity between ASD and NT siblings. In this study, 16S rRNA sequencing using MiSeq was carried out to a greater depth than previous studies utilizing pyrosequencing. Both platforms detected an increased relative abundance of the low abundance Chloroplast genus in ASD compared to NT siblings. Inspection of the daily dietary logs of the ASD children with high counts of Chloroplast sequences revealed that these particular children were fed shakes containing chia seeds. We speculate that the observed increase in the relative abundance of Chloroplast sequences could reflect ingestion of chia seeds by these children.
An alternative explanation for why a study comparing ASD children with NT siblings did not replicate differences reported between ASD children with unrelated controls, is that NT siblings of ASD children have altered microbiomes compared to that of unrelated children [24]. Studies tracking siblings of ASD children have detected vulnerabilities in some neurocognitive domains in the absence of an ASD diagnosis [72] suggesting that some of the NT siblings may exhibit a broader subclinical ASD phenotype.
Review of previous studies suggest that GI symptoms are more common in ASD children than in NT children, but these studies suffer from methodological differences in assessing GI symptoms [28][29][30][31]. Our assessment of GI symptoms differs from previous studies in that we used the international standard for assessing FGIDs in the general pediatric population. Because of potential overlaps between ROME III FGID diagnoses, we subdivided the ASD children and NT siblings each into two categories 1) those with at least one FGID and 2) those without any FGID. A high prevalence of children diagnosed with at least one FGID was observed in both the ASD (~40%) and NT siblings (~30%) in the current study. Previous epidemiological studies using the ROME III process to survey the prevalence of a child having at least one FGID in general pediatric clinics reported a prevalence of 2% and 16%, respectively [73,74], suggesting that NT siblings may also have a higher prevalence of FGID than the general pediatric population.
The current study detected an increased prevalence of functional constipation FGID in ASD children compared to NT siblings reached significance (see Tables 1 and 2). Constipation has been previously recognized as a significant comorbidity in ASD children [28][29][30][31]. Further studies on a larger sample of ASD and NT siblings (preferably family matched) as well as unrelated NT children will need to be conducted to determine 1) whether FGIDs, particularly functional constipation, are more prevalent in ASD children than their NT siblings and 2) whether FGIDs, particularly functional constipation, are more prevalent in NT siblings of ASD children than unrelated control children.
It has also been proposed that ASD children with GI symptoms may differ from ASD children without GI symptoms. Adams et al. [15] reported that GI symptoms, which were assessed by the modified 6-GSI score, correlated with ASD severity that was assessed by the parental completed Autism Treatment Evaluation Checklist (ATEC) [15]. However, in our study, the mean ADOS-CSS severity scores, which are based on physician rather than parental Summary of studies comparing gut microbiota in ASD and NT children with and without GI symptoms. ASD w GI, ASD children with GI symptoms, ASD w/o GI, ASD children without GI symptoms, NT Sib w GI, NT siblings with GI symptoms; NT sib w/o GI, NT siblings without GI symptoms, NT Un Con w GI, unrelated controls with GI symptoms; Un Con w/o GI, unrelated controls without GI symptoms; PDD-NoS, pervasive developmental disorder not otherwise specified.
doi:10.1371/journal.pone.0137725.g002 evaluations, were not significantly different between the ASD children with at least one FGID and those without any FGID.
To assess the overall emotional and behavioral status of ASD and NT siblings we analyzed parental completed CBCL6/18 normalized T scores. The results of this study demonstrated that ASD children with or without FGID exhibited significantly higher scores than the NT siblings without FGID, consistent with previous studies reporting higher scores in ASD children compared to control children [34][35][36][37]. However, this study demonstrated that the NT siblings with at least one FGID had normalized CBCL6/18 T-scores that approximated those of ASD children with or without at least one FGID, and were also significantly higher than NT siblings without any FGIDs.
Although this is one of the larger studies comparing ASD, FGID and the gut microbiota in ASD children and NT siblings, our sample size is still modest given the number of variables included. Recruitment of additional ASD children and their NT siblings (preferably family matched), as well as unrelated control children that use the same phenotyping tools will need to be conducted to confirm the findings reported in this study. Furthermore, it may be necessary to possibly restrict dietary intake of plant-derived supplements, such as chia seeds, to determine whether the Cyanobacteria/Chloroplast signal is of biological significance with respect to ASD pathophysiology.  Table. P-values for pairwise comparisons of CBCL scores (See Table 2) between ASD w FGID, ASD w/o FGID, NT sib w FGID, and NT sib w/o FGID. (DOCX) S2 Table. Relative abundances of major phyla for ASD children with FGID, ASD children without FGID, NT siblings with FGID, and NT siblings without FGID. The mean relative abundance (sequence count for phylum/total sequence count) ± standard deviation for the V1V2 and V1V3 datasets are listed below. The data from family matched and unmatched ASD children and NT siblings are included. (DOCX) S3 Table. Relative abundances of low abundance phyla for ASD w FGID, ASD w/o FGID, NT w FGID, and NT w/o FGID, based on the V1V2 and V1V3 datasets. The mean relative abundance (sequence count for phylum/total sequence count) ± standard deviation are listed for each phylum. The data from family matched ASD children and NR siblings are included. (DOCX) S4 Table. Relative abundance of selected taxa in ASD children with FGID, ASD w/o FGID, NT siblings with FGID and NT siblings without FGID. The mean relative abundances (sequence count for genera/total sequence count) ± standard deviations are listed for the V1V2 and the V1V3 datasets. N.D. is not detected based on filtering criteria. The taxa that exhibited significant effects of ASD, FGID and/or ASD Ã FGID first order interactions are bolded. The data from family matched and unmatched ASD children are included. (DOCX) S5 Table. Measurement platform comparison for MiSeq V1V2, MiSeq V1V3 and qPCR. The 95% CI of estimated coefficients in the proposed SEM model are listed in a pairwise manner. For two perfectly consistent platforms, the slope is 1 and the intercept is 0. (DOCX)