Using machine learning to predict and analyze complex trait diseases: Lessons from a simple abstract model
Fig 5
Risk allele histograms for the core Crohn’s disease (CD, 3004 individuals) and Control populations (1949 individuals) in the 48 loci (left), and in a simulated data set (right).
The top, middle, and lower panels show the distribution of individuals carrying 0, 1, and 2 risk alleles, respectively. The CD population had a slightly higher fraction of risk alleles than the Control one. These trends are reproduced in the simulated data.