Evidence of Influence of Genomic DNA Sequence on Human X Chromosome Inactivation
Figure 4
The Significance of the 12 Selected Features
(A) The three best principal components (PC1–PC3) among all 5,596 features for 50-kb and 100-kb windows (left) and the selected 12 features (right) for the 82 nonborder genes are shown projected onto a 3-D graph. Escaping genes are represented as blue circles and subject genes as red circles.
(B) These histograms show the distribution of XAR leave-one-out CV and XCR prediction rates by SVM models constructed using 1,000 random 12-feature sets taken from the 5,596 features for 50-kb and 100-kb windows. Black dots represent mean values, flanked by 95% confidence intervals denoted by error bars representing two standard deviations (2SD). Both the XAR CV rate and XCR prediction rate achieved by the selected 12 features (black arrows) exceed 2SD, and their p values calculated based on these random trials are shown.