Figure 1.

Classifications of the 33 training samples by gene expression.

(A) Unsupervised clustering of 17 healthy controls (blue dots) and 16 severe CdLS probands (red dots) by principal component analysis (PCA) of the 27,995 probe sets actively transcribed in LCLs. The separation between the training groups indicates that controls and probands have different gene expression patterns. (B) Heatmap showing that the 420 identified probe sets (FDR<0.01) differ markedly in expression between CdLS probands (PT) and healthy controls (N). Red represents upregulated genes and blue represents downregulated genes. The left 17 columns represent control samples, and the right 16 columns represent proband samples. Rows display gene expression levels. (C) Nearest centroid classifications of the 33 training samples and six testing samples. Among the training samples, two healthy controls and one CdLS proband were misclassified after leave-one-out cross-validation. Among the testing samples, the CdLS probands and two RBS probands were classified into the CdLS group, whereas the one healthy control and two probands with AGS were classified into the control group.
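
The nearest centroid classification with leave-one-out cross-validation mentioned in (C) can be illustrated with a minimal sketch. It assumes plain Euclidean distance to unshrunken class centroids, which the caption does not specify. The function names and the two-feature toy values are hypothetical and are not the study's data or its actual pipeline.

```python
import math
from typing import List, Sequence


def centroid(rows: Sequence[Sequence[float]]) -> List[float]:
    """Per-feature mean of a group of expression profiles."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def nearest_centroid_predict(train_x, train_y, sample):
    """Assign `sample` to the class whose centroid is closest.

    Assumption: Euclidean distance, no centroid shrinkage.
    """
    best_label, best_dist = None, math.inf
    for label in sorted(set(train_y)):
        rows = [x for x, y in zip(train_x, train_y) if y == label]
        dist = math.dist(sample, centroid(rows))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def leave_one_out_errors(xs, ys):
    """Indices of samples misclassified when each one is held out in turn."""
    xs, ys = list(xs), list(ys)
    errors = []
    for i in range(len(xs)):
        # Centroids are rebuilt without sample i before it is classified.
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        if nearest_centroid_predict(train_x, train_y, xs[i]) != ys[i]:
            errors.append(i)
    return errors


# Hypothetical toy profiles (two features per sample), for illustration only.
toy_x = [[1.0, 1.2], [0.8, 1.0], [1.1, 0.9],
         [3.0, 3.2], [2.9, 3.1], [3.2, 2.8]]
toy_y = ["control"] * 3 + ["CdLS"] * 3

misclassified = leave_one_out_errors(toy_x, toy_y)
```

In this sketch each held-out sample is classified against centroids computed from the remaining samples only, so the misclassification count reflects samples the classifier has not seen.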

Figure 2.

Classifier genes are identified for CdLS. A clear progression of the discriminant score (DS) from low to high correlates with phenotype, from unaffected → mild → moderate → severely affected with CdLS.

(A) The 23-gene classifier separates CdLS probands with NIPBL mutations from the rest of the individuals. Healthy controls, probands with other genetic disorders, CdLS probands with SMC1A mutations, and CdLS probands with no gene mutation identified are distinctly separated from each other in a progressive manner correlated with phenotypic severity. (B) The ten-gene classifier distinguishes all CdLS probands from non-CdLS individuals, and the plotted scores correlate with the severity of the CdLS probands. Healthy controls are labeled “Control”; disease severity is described as “Mild,” “Moderate,” or “Severe”; and CdLS probands with NIPBL mutations, SMC1A mutations, or no identified gene mutation are labeled “NIPBL,” “SMC1A,” or “No,” respectively. *, training samples; **, the number of mild cases with an NIPBL mutation was reduced from 26 in (A) to 11 in (B) because the other 15 cases were used as training samples.

Figure 3.

Cohesin binding analyzed in 15,162 unique transcripts demonstrates preferential binding to TSSs and TTSs.

(A) The frequency of cohesin binding has a sharp peak around the TSS and falls to the background level upstream of this peak. (B) The frequency of cohesin binding has another peak around the TTS. The height of this peak is about half that of the peak at the TSS. Similarly, the regions downstream of this peak have a cohesin binding frequency close to the background level.

Figure 4.

Frequency of cohesin binding around the TSS as related to transcriptional status in LCLs.

Group A (silver), nontranscribed silent genes in LCLs (4,784 unique RefSeq mRNAs); group B (yellow), genes without expression alterations between controls and CdLS probands (9,199 unique RefSeq mRNAs); group C (red), genes differentially expressed in CdLS (FDR < 0.05) (1,179 unique RefSeq mRNAs). (A) The frequency of cohesin binding at the TSS of group C genes is much lower in CdLS than in control. Group B genes show a moderate reduction, and group A genes show little change. Overall, cohesin binding around the TSS is greatest for genes that are actively transcribed in LCLs, especially those that are misexpressed in CdLS. (B) Within the intragenic regions, the 5′-UTRs of the actively transcribed genes (groups B and C) have a higher cohesin binding frequency in control than other intragenic regions, whereas group A genes have a frequency close to the background level in all regions. In CdLS, the frequency dropped in all three gene groups, and the differences among gene groups and regions tend to diminish. (C) Cohesin binding within 2 kb around the TSS is enriched in differentially expressed genes. The 10,378 unique genes expressed in LCLs are ranked by their F scores. The reference enrichment is the percentage of genes having cohesin binding within 2 kb (+/− 1 kb) around the TSS. The relative enrichment is the cohesin binding enrichment in the top-ranked genes divided by the reference enrichment. Each point gives the relative enrichment for all genes ranked up to that position on the x-axis. The numbers on the x-axis denote the statistical ranks. The curves are smoothed by the LOWESS algorithm.
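
The relative enrichment described in (C) can be sketched as a cumulative calculation over the ranked gene list. This is a minimal illustration under stated assumptions: genes are ranked from highest to lowest F score, the reference enrichment is taken over all ranked genes, and LOWESS smoothing is left out. The function name and the six-gene toy input are hypothetical, not values from the study.

```python
from typing import List, Sequence


def relative_enrichment(f_scores: Sequence[float],
                        bound: Sequence[bool]) -> List[float]:
    """Relative enrichment of cohesin binding at each rank cutoff.

    f_scores: one F score per gene (assumed: higher score ranks first).
    bound:    True if the gene has cohesin within +/- 1 kb of its TSS.
    Returns one value per rank k: the bound fraction among the top k
    genes divided by the bound fraction among all genes.
    """
    if len(f_scores) != len(bound):
        raise ValueError("f_scores and bound must have the same length")
    n = len(f_scores)
    reference = sum(bound) / n  # reference enrichment over all genes
    if reference == 0:
        raise ValueError("no gene has cohesin binding; ratio undefined")

    # Gene indices ordered by decreasing F score.
    order = sorted(range(n), key=lambda i: f_scores[i], reverse=True)

    curve = []
    hits = 0
    for k, idx in enumerate(order, start=1):
        hits += bool(bound[idx])              # bound genes among top k
        curve.append((hits / k) / reference)  # unsmoothed ratio
    return curve


# Hypothetical toy input: six genes, three with cohesin near the TSS.
toy_f = [9.0, 7.5, 4.0, 2.0, 1.0, 0.5]
toy_bound = [True, True, False, True, False, False]
toy_curve = relative_enrichment(toy_f, toy_bound)
```

By construction the last point of the unsmoothed curve equals 1, because the top-ranked set then contains every gene and its bound fraction is the reference enrichment itself. Values above 1 at low ranks indicate enrichment among the most differentially expressed genes.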

Table 1.

Cohesin association within +/− 1 kb of TSSs among three different groups of genes in control and CdLS LCLs.

Table 2.

Differentially expressed genes tend to lose their cohesin binding at TSSs in CdLS samples.

Figure 5.

Cohesin and CTCF colocalize and separate the active chromatin region from the repressive chromatin region.

The cohesin site at this position is lost in CdLS, so the silencing epigenetic signal from region 3 is able to migrate into region 2, which harbors ATP11A, and downregulate its transcription. (A) Screenshot of the ENCODE ENr132 region from the UCSC Genome Browser displaying histone methylation and acetylation status, CTCF binding sites, and DNaseI sensitivity sites in this region in GM06990 cells (from Sanger Institute and University of Washington databases, respectively). hSCC1-Control and hSCC1-CdLS are custom tracks. The hSCC1-Control track shows the results of whole genome cohesin binding analysis in LCLs from controls, whereas the hSCC1-CdLS track shows the results of whole genome cohesin binding analysis in LCLs from the CdLS proband; data on the CTCF_Bcell2_8 track are adapted from Wendt et al. [10]. (B) Schematic of the ENr132 locus as in (A). Five genes located in three regions are displayed. Two cohesin and six CTCF binding sites are shown. Cohesin and CTCF colocalize at Chromosome 13: 112,645,000–112,645,600, which separates region 2 from region 3. Cohesin binding at this position was lost in the CdLS proband. (C) ChIP-qPCR validation in three different healthy controls (“Normal,” “N6,” and “N12”) and three additional CdLS probands (“PT2,” “PT12,” and “CDL-017”). Cohesin binding at this locus was dramatically reduced among CdLS probands, including a proband with an SMC1A mutation (CDL-017). Sites 1 and 2 are positive controls; site 8 spans Chromosome 13: 112,645,000–112,645,600.

Figure 6.

Proposed working models for cohesin and NIPBL.

(A) Cohesin's canonical role in regulating sister chromatid cohesion, with NIPBL acting to facilitate the loading and unloading of the cohesin complex onto the chromosomes. It is not known whether NIPBL directly interacts with chromatin. This model was described by Haering et al. [54]. (B) Cohesin loading model: NIPBL loads cohesin onto chromatin at promoters or cis-regulatory elements, after which cohesin regulates transcription without the direct involvement of NIPBL. (C) Cohesin and NIPBL collaborative model: Cohesin and NIPBL form a protein complex that binds to promoters or cis-regulatory elements. The functional integrity of this complex is required for transcriptional regulation of target genes. (D) NIPBL chromatin remodeling model: NIPBL may affect the accessibility of chromatin elements to cohesin, e.g., by changing chromatin structures, through as-yet unknown pathways. Transcriptional regulation through cohesin is secondarily affected.
