Univariate/Multivariate Genome-Wide Association Scans Using Data from Families and Unrelated Samples

As genome-wide association studies (GWAS) are becoming more popular, two approaches, among others, could be considered in order to improve statistical power for identifying genes contributing subtle to moderate effects to human diseases. The first approach is to increase sample size, which could be achieved by combining both unrelated and familial subjects together. The second approach is to jointly analyze multiple correlated traits. In this study, by extending generalized estimating equations (GEEs), we propose a simple approach for performing univariate or multivariate association tests for the combined data of unrelated subjects and nuclear families. In particular, we correct for population stratification by integrating principal component analysis and transmission disequilibrium test strategies. The proposed method allows for multiple siblings as well as missing parental information. Simulation studies show that the proposed test has improved power compared to two popular methods, EIGENSTRAT and FBAT, by analyzing the combined data, while correcting for population stratification. In addition, joint analysis of bivariate traits has improved power over univariate analysis when pleiotropic effects are present. Application to the Genetic Analysis Workshop 16 (GAW16) data sets attests to the feasibility and applicability of the proposed method.


Introduction
Genetic association analysis relies on linkage disequilibrium (LD) between alleles at two tightly linked loci [1]. With the availability of high-density maps of single nucleotide polymorphisms (SNPs), association studies have become popular tools for identifying genes underlying complex human traits and diseases [2]. It is now practical to perform genome-wide association studies (GWAS) with hundreds of thousands of SNPs in samples containing large numbers of individuals.
A common design for association studies is population-based, where unrelated subjects are collected and examined for the association between genetic variants and traits. Population-based studies are popular due to the relative ease in recruiting unrelated subjects. However, when samples are of different ethnic ancestries, population-based association studies may produce spurious associations due to population stratification, resulting in excess false positive or negative rates [3,4]. Several methods have been proposed to deal with population stratification [5][6][7][8][9][10][11].
An alternative design uses family-based studies, where family members are collected for association analyses [12]. The application of transmission disequilibrium tests (TDT) [13], and its various extensions to a variety of genetic models for both quantitative [14][15][16][17][18][19] and qualitative traits [20][21][22][23][24], form the basis of family-based association tests. In these tests, the association between phenotypic traits and transmission of alleles from parents to offspring is of primary interest. TDT-based methods possess an intrinsic property of protecting against population stratification, even when only one marker is examined. However, compared with population-based samples, recruiting family members tends to be more time consuming and costly.
For most current population-and family-based GWAS, statistical power is usually limited due to the complex interplay among factors that influence the etiology of diseases [25]. A variety of approaches, e.g., increasing sample size, population selection on the degree of LD, and selecting informative tagSNPs, can improve the power for detecting association. Sample size is often restricted due to genotyping costs and limited sample resources. However, a large sample size is required to ensure sufficient statistical power to detect genes contributing subtle to moderate effects to phenotypic traits. Several recent studies that have combined unrelated subjects and nuclear families to form an enlarged sample [26][27][28][29][30][31] have demonstrated that analyzing combined samples can be more powerful than analyzing individual samples separately.
The problem of population stratification can arise again when analyzing combined samples, however, since neither the aforementioned correction methods for unrelated sample nor the TDTbased methods for families can be naively applied to the combined data. Thus, previous studies require a preliminary step to test whether samples from different studies can be combined. When samples are from different ethnic groups they typically fail this test [26][27][28][29], so an obvious limitation for these methods is that they cannot use samples from different ethnic populations. To circumvent this limitation, Zhu et al. [30] proposed to correct for population stratification in the combined sample by using principal coordinate analysis (PCoA) [8,30,32]. PCoA calculates principal components on individuals, and retrieves information equal to that retrieved by PCA [33]. However, when large numbers of markers (e.g. GWAS data) are involved, the calculation of PCoA by ordinary singular value decomposition (SVD) algorithms can be quite demanding in terms of both computation and computer memory. Recent work on fast matrix approximation may help speed up these calculations and save memory capacities [34,35]. We recently proposed an extension of the method of Price et al. [6] to include familial data [36]. Compared to the method of Zhu et al. [30], this extended method can be applied to large data sets without additional demand for computation costs and computer memory.
In addition to combining samples, another approach to increasing association test power is to perform joint analysis of multiple correlated phenotypes. For many common multifactorial traits, several correlated phenotypes are usually recorded for each individual during sample collection. Joint analysis of these correlated phenotypes can theoretically provide greater power than that provided by analysis of individual phenotypes [37]. Multivariate analysis can also improve the ability to detect susceptible genetic variants whose effects are too small to be detected in univariate analysis [38], and the literature contains multiple applications of this approach to linkage studies [37][38][39][40][41][42]. For genetic association studies, methods have also been proposed for performing multivariate association tests on unrelated samples [43] and on families [44], separately. However, studies using multivariate analysis on combined samples are rare and further investigations using this approach are warranted [31].
In this study, we propose to perform univariate/multivariate association analysis for a combined sample of nuclear families and unrelated subjects. By use of generalized estimating equations (GEEs) [45][46][47], the proposed method assumes no specific distributions on phenotypes. Specifically, we adjust for population stratification for the combined sample by integrating principal component analysis and transmission disequilibrium test strategies. In addition, the proposed method accounts for the data of multiple siblings as well as missing parents. We evaluate the statistical properties of the proposed test through simulation studies, and demonstrate its efficacy by applying it to genetic analysis workshop 16 (GAW16) data sets.

Results
In this section, we will evaluate the performance of the proposed method under a variety of situations by simulation. We include two methods, EIGENSTRAT [6] and FBAT [48], for comparison. EIGENSTRAT implements the method of Price et al. [6], and performs univariate association tests (continuous or binary) for unrelated samples. FBAT implements the method of Laird et al. [48] and performs family-based association tests. EIGENSTRAT and FBAT are typically used in population-and family-based association analyses, respectively, when protecting against population stratification. To make results from separate data comparable to that from the combined data, we perform the fisher product test to combine p-values from EIGENSTRAT and FBAT together to form a uniform p-value. While FBAT can perform univariate or multivariate association tests, EIGENSTRAT can only perform univariate tests. Thus, we only report the fisher product test [49] for univariate tests. We notice that a similar method of Zhu et al. [30] can also perform association tests for binary traits in combined samples. However, their current implementation requires that all nuclear families have the same structure, (e.g., the same number of offspring), a significant limitation which prevented comparison of their method to ours in simulation studies. Table 1 lists type I error rates of the various tests when unrelated individuals and nuclear families are sampled. We also present results of the proposed test when analyzing unrelated samples and nuclear families separately. It is shown that the proposed test has correct type 1 error rates in all population structures when performing both univariate and bivariate analyses. Its application to unrelated samples and nuclear families, separately, also demonstrates correct error rates. The error rates for EIGENSTRAT and FBAT are also close to target levels regardless of the presence of population stratification. Thus, all tests considered here can correct for population stratification in both univariate and bivariate analyses. Table 2 lists type I error rates of the various methods when unrelated individuals and sib pairs are considered. All the tests again have correct error rates that are close to target levels. Thus, the proposed test is also robust to population stratification in applications with missing parental information. Table 3 lists univariate power estimations for binary phenotypes when unrelated individuals and nuclear families are sampled. When analyzing combined data, the proposed test has the highest power in all genetic models. When analyzing unrelated samples alone, the power of the proposed test is approximately equal to that for EIGENSTRAT. When analyzing nuclear families alone, the power of the proposed test is significantly improved compared to FBAT. We note that parental information in FBAT is used to control population stratification, but does not contribute to the association statistic. On the other hand, parental information in the proposed method can be used to both control population stratification, and to test the association. The power improvement demonstrates that parental data are informative for testing the association. Table 4 lists univariate power estimations for continuous phenotypes. Again, the proposed test analyzing the combined data provides the highest power. Similarly, the proposed test has an approximately equal power to EIGENSTRAT when analyzing only unrelated samples, and has improved power over FBAT when analyzing only nuclear families.

Power Estimates
Power estimations when sib pairs, instead of nuclear families, are considered are listed in Table 5 and 6 for binary and continuous phenotypes, respectively. The results are similar to those generated previously with nuclear families. Note that when analyzing only the data of sib pairs, the proposed test has similar power to FBAT. Table 7 lists the gain of power by bivariate analysis for a binary trait and a continuous trait. Obviously, power for bivariate analysis is higher than both univariate analyses under all population structures. The analyses on two continuous phenotypes have similar patterns, as listed in Table 8.
We also evaluate the loss of power of bivariate analyses in cases where pleiotropic effects are not present. Figure 1 displays the loss of power when the causal site contributes only to a binary trait (left panel) or only to a continuous trait (right panel), under the various population structures. Obviously, bivariate analysis in such cases is inferior to univariate analysis, but the power loss is relatively minor.

Application to GAW16 Data Sets
We apply the proposed method to GAW16 simulated data sets as described in the Methods section. Figures 2A-2C display the results of whole-genome scans by FBAT, EIGENSTRAT, and the proposed method, respectively, when analyzing the trait HDL. The most significant SNP identified by the proposed method, rs10820738, reaches a p-value 8.68E-13. This SNP corresponds to the major contributing gene, ABCA1, which explains 1.0% of HDL phenotypic variation in the GAW16 simulation. EIGEN-STRAT and FBAT have p-values 2.30E-5 and 6.57E-4, respectively, at this SNP; neither of these methods reaches a genome-wide significant level. At the other four major genes, the proposed test also has more significant p-values than both EIGENSTRAT and FBAT, as listed in Table 9. Figure 2D (left) displays a Quantile-Quantile (QQ) plot of the proposed method. It is obvious that p-values from the proposed method distribute uniformly between 0.0 and 1.0, demonstrating the validity of the proposed method. Most of the outliers in a logQQ plot ( Figure 2D, right) correspond to susceptible loci and/or their nearby SNPs.
We then perform bivariate analysis on the traits HDL and TG. One of the two major genes presenting pleiotropic effects, rs3200218, has a lower p-value (3.05E-07) in bivariate than in univariate analyses (7.20E-06 for HDL and 0.043 for TG). At the other major gene, rs8192719, bivariate analysis has a p-value (4.48E-04) that is approximately equal to that obtained by univariate HDL analysis (3.58E-04) ( Table 10). For those loci that did not exert pleiotropic effects, however, bivariate analyses generally produce results that are of lower significance than results generated by univariate analyses.

Discussion
In this study, we propose a simple approach to perform univariate or multivariate association tests when correcting for population stratification with data generated by combining unrelated samples and nuclear families. Simulation studies showed that the proposed test had improved power over tests typically used to analyze family and unrelated samples separately. Further, joint analysis of bivariate traits had improved power over univariate analysis when pleiotropic effects were present. Application of the proposed method to GAW16 data sets verified its practical applicability.
By combining population-and family-based tests together, the proposed test provides flexibility in integrating technologies of family based association tests. Here, we extend the proposed model to include sib pair data with missing parental information. It is relatively straightforward to extend the proposed test to include data of general pedigrees with arbitrary structures [50]. When applied to pedigree data only, the proposed method still may have improved power over traditional TDT-based methods, as shown by analyses using the software FBAT. The power gain is attributable to the fact In homogeneous and admixture population settings, we sampled 400 unrelated subjects and 200 nuclear families when the binary trait was not involved, and sampled 200 unrelated cases, 200 controls, and 200 nuclear families with at least one affected child when the binary trait was involved. In stratified population settings, we sampled 250 unrelated subjects and 150 nuclear families from population A, and 150 unrelated subjects and 50 nuclear families from population B when the binary trait was not involved. When the binary trait was involved, we sampled 150 cases, 100 controls and 150 nuclear families with at least one affected child from population A, and 50 cases, 100 controls and 50 nuclear families from population B. Type I error rates for univariate and bivariate analyses are estimated for the combined data of unrelated samples and nuclear families under homogeneous, stratified, and admixed populations. The error rates are estimated on 1,000 replicates. a: ''-'', for EIGENSTRAT, only univariate analyses are available. Abbreviations: T, the proposed test applied to the combined sample; T U , the proposed test applied to unrelated sample only; T F , the proposed test applied to nuclear families only; ESTRAT, the method proposed by Price et al. [6] and implemented in the software EIGENSTRAT, applied to unrelated sample only; FBAT, the program FBAT [48]; Fisher, the fisher product test on the outputs from EIGENSTRAT and FBAT. doi:10.1371/journal.pone.0006502.t001 that the proposed method can use parental information for association tests, while the alternate methods cannot.
Combining unrelated samples and nuclear families for genetic association studies has been a focus of research for several years. Nagelkerke et al. [29] proposed using a logistic-regression model for combining case-control subjects and case-parents trios to increase statistical power. Kazeem and Farrall [28] proposed combining results of case-control tests and TDT to obtain a weighted odds ratio for a given genetic marker. Epstein et al. [27] modified the work of Nagelkerke et al. with a likelihood-based approach to allow for more flexible genetic models, such as lessrestrictive assumptions of Hardy-Weinberg equilibrium (HWE) and of random mating. Chen and Lin [26] further extended the work of Epstein et al. to scenarios relaxing assumptions and  estimations on mating-type distributions using a weighted leastsquares approach. Jung et al. [31] recently proposed performing combined linkage and association tests for bivariate quantitative traits using a variance-component model. Despite their potential advantages, all of these methods have a requirement that both case-control subjects and case-parents come from a homogeneous population. This requirement substantially narrows the context to which these methods can be applied. The method we propose has a significant advantage in that it is robust to population stratification. Our simulation results show that the proposed test remains valid when applied to stratified or admixed populations.
We note that a similar method proposed by Zhu et al. [30] can also perform association tests on combined data when correcting for population stratification. However, their current program implementation, FamCC, can only handle nuclear families with both parents available and with equal numbers of children, which rarely occurs with real data. Additionally, analyses of their method with multivariate and quantitative traits are quite limited. Another feature of the proposed method that can improve statistical power is the ability to perform multivariate association tests. Compared with univariate models, multivariate models can be more powerful in cases where multiple traits are influenced by a     In summary, we have developed a simple and novel method for performing univariate/multivariate association tests while correcting for population stratification, in samples combining nuclear families and unrelated subjects. The proposed method is computationally effective and can complete a typical GWAS scan within minutes. The java program implementing the proposed method, Genetic Association analysis Platform (GAP), is freely available from the authors' website (http://sites.google.com/site/ zhangleira/GAP).

Methods
We first describe our method on a combined dataset of an unrelated sample and a collection of nuclear families with both parents available. Then we extend the method to include sib pairs with missing parents, to incorporate covariates, and to correct for population stratification.

Definitions
Assume that there are N f nuclear families and there are n i (i = 1, …, N f ) members in the ith family with the first two individuals being parents. In addition, a random sample with N c unrelated individuals is also assumed available. For simplicity, we take each individual in the random sample as a separate family with size 1 so that n i = 1 for i = N f +1, …,, N f +N c . Thus, the total number of individuals is N~P i n i , and the total number of unrelated individuals (including random sample and the two parents in each family) is N u = 2N f +N c . Assume that K phenotypes are available for each individual, and let y ij = (y ij1 , …, y ijK ) ' be the vector of phenotypic values for the jth (j = 1, …, n i ) individual in the ith family. Further assume that genotype data for M SNP markers are available for all individuals. A score g ijm at the mth SNP with alleles ''1'' and ''2'' is defined as 0, 1, and 2 for genotypes ''11'', ''12'', and ''22'', respectively, for the jth individual in the ith family.

Models
We construct our test statistic by use of previous work of score test. [17,20,23,44,48] For an individual phenotype indexed by k, we extend the previous work of Lunetta et al. [23] in the generalized linear model (GLM) framework, to model the association between genotype scores and phenotypes using. For a tested marker indexed by m, GLM relates phenotypes and genotypes by a link function (We omit the index m for simplicity) where L ijk is the link function for m ijk , the expected value of y ijk ; b 0k  and b 1k represent population mean and genotypic effect, respectively. The natural link function is the identity for continuous phenotypes, and is the logit-function for binary phenotypes.

Defining the Score Statistic
Given genotypes, phenotypes among unrelated individuals and family members are assumed independently distributed. The loglikelihood for the sample can then be expressed as where a L ijk À Á is a function of L ijk with the property La L ijk À Á LL ijk m ijk , i = 1, …, N f +N c , j = 1, …, n i . The derivation of the loglikelihood with respect to b 1k yields the score where t ijk~yijk {m ijk . Under the null hypothesis H 0 of no association (b 1k~0 ), m ijk is identical to all subjects, that is, m ijk~mk . When multiple correlated phenotypes are simultaneously modeled, it is difficult to specify the log-likelihood function (2), since joint distribution of phenotypes cannot be explicitly specified except for multivariate Gaussian distributions. For multivariate data with arbitrary distributions, Liang and Zeger [46] proposed an extension of GLM, termed generalized estimating equations (GEEs), to estimate model parameters while accounting for correlations among variables. Lange [44] further applied GEEs to genetic association analysis. Following the work of Lange, we define a multivariate score as where D ij is a diagonal matrix depending on the underlying GEE model, and t ij~tij1 ,:::,t ijK À Á 0 is a vector that codes phenotypes.
Under the null hypothesis H 0 of no association, D ij and Var t ij are identical to all subjects and they will vanish in the normalization of the test statistic. The resulting score under H 0 is then Obviously, S 1 is a special case of S where only one phenotype is modeled.

Distribution of the Test Statistic
The score test statistic is defined as where E(S) and Var(S) are the mean and variance of the score, respectively. Under the null hypothesis H 0 , the statistic T will asymptotically follow a chi-square distribution with degree of freedom being the rank of Var(S). For simplicity, let Z = S-E(S), so that Z and Var(S) are estimated by conditioning the distribution of genotype on traits, e.g., t ij is fixed as constant, so that To obtain estimations for above variables, we divide the total sample into two complement sets U and R, where U contains N u unrelated individuals, and R contains the remaining N -N u related offspring in each family. For the set U, population genotype mean and variance, denoted by g g and v(g) respectively, are estimated. For each individual in the set R, its genotype mean and variance are estimated from its parents' genotypes according to the Mendel's law. Note that as the estimation on offspring is conditional on the parental genotypes, there will be no genotypic correlations between offspring and parents and between offspring themselves. Thus, Z and Var(S) are expressible as  where I gi1 and I gi2 are the indicators of heterozygous genotypes for the two parents in the ith family. The above expressions of Z and Var(S) render intuitive interpretations: the first part in each expression attributes to unrelated individuals, and the second part attributes to related individuals in each family. We note that when considering only related individuals in the set R, the second parts of Z and Var(S) constitute a family-based test statistic proposed by Rabinowitz and Laird [50] and is implemented in the software FBAT [48]. When considering only unrelated individuals in the set U, the first parts of Z and Var(S) constitute a valid score test in an apparent manner for random samples. Thus, the proposed test T can be regarded as the uniform integration of population-and family-based association tests. This characteristic allows a great deal of flexibility in including nuclear families with various structures.

Including Data with Missing Parents
When parental genotype information is missing, the conditional means and variances for offspring genotypes in the set R can no longer be estimated from their parents. For families with incomplete parental data, Rabinowitz and Laird [50] propose to obtain the conditional distributions of offspring genotypes via the sufficient statistic of missing parental genotypes, which is derived from offspring genotypes and partially observed parental genotypes. By using sufficient statistic, the distributions of test statistics remain valid in the presence of population stratification. Application of the method of Rabinowitz and Laird to the proposed test with missing parents is straightforward. We replace the conditional expectations and variances of offspring genotypes in the second parts of Z and Var(S) by the ones that are estimated by conditioning on sufficient statistic of missing parental genotypes. Note that correlations among offspring in such circumstances would not vanish and they will be included in the test statistic.

Incorporation of Covariates
When covariates are strongly predictive factors of phenotypes, incorporating them into the model can increase test efficiency. Let W ijk be the vector of covariates at the k-th phenotype for the j-th individual in the i-th family. The link function (1) modeling covariates under H 0 will turn to L ijk~b0k zb Wk 0 W ijk : Estimation of b 0k and b Wk is used for construction of test statistic in follow-up steps.
We first adjust phenotypes by covariates and then construct the test statistic with the residual of phenotypes. Let y ij * be the residual phenotype for the jth individual in the ith family so that t ij = y ij * 2m codes phenotypic information of subjects. The resulting test statistic T depends on the nuisance parameter m. Though the statistic T remains valid regardless of the choices of m, a good choice of m can improve test efficiency [23]. In theory, m is the population mean of phenotypes. In cases where ascertainment depends upon phenotypes, such as in case-control and caseparents designs, m cannot be appropriately estimated from the sample. A variety of strategies have been proposed for different choices of m to improve test efficiency [17,20,23], among which is the one that minimizes Var(S) [23]. For the kth phenotype, it is obvious that Var(S k ) is the quadric form of m k , and is minimized For multivariate test, we select individual m in turn for single phenotype to obtain approximate performance.

Correcting for Population Stratification
When population stratification exists, the above test statistic may no longer be valid since the evaluations of the variance for the set of unrelated individuals are sensitive to population stratification. The adjustment for population stratification is straightforward by using our previously proposed extension [36] of PCAbased adjustment [6] that includes data with nuclear families. Briefly, we apply PCA to all unrelated individuals to calculate for each of them a vector of principal component. Individual genotypes and phenotypes are then adjusted through linear regression on principal components. For those related individuals, we propose a TDT-like strategy to infer their principal components as well as to adjust their genotypes and phenotypes. We denote the genotype score and the phenotype coding vector after adjustment as g ij * and t ij * , respectively, for the jth individual in the ith family, and denote population genotype mean and variance after adjustment as g gÃ and v(g * ), respectively. In the appendix S1, we show that Z and Var(S) have the following forms (assuming parental information is available) Note that the genotype deviation and variance in the second parts of Z and Var(S) are invariant to the adjustment by PCA. This is intuitively interpreted since the second parts are not affected by population stratification during construction.

Data Simulation
To evaluate the performance of the proposed test, we conducted a variety of simulation studies. In all simulations, we simulated two SNPs with specified allele frequencies, one as causal site and the other as test site that was both tightly linked to and strongly associated with the causal site. We also simulated additional 998 random SNP markers to conduct principal component analysis, resulting in a total number of 1,000 SNPs. Both binary and continuous traits were simulated. Although the proposed test was applicable to multivariate analyses with arbitrary number of traits, we only considered bivariate situations for simplicity. For bivariate simulation, we simulated a binary trait and a continuous trait or two continuous traits. Samples were generated under one of three following population structures: a homogeneous population, two discrete populations, and an admixture population with two ancestral populations.

Simulation 1. Homogeneous Population
In the homogeneous population structure, minor allele frequency (MAF) for both the causal and the test SNP was set to 0.2, and the allele frequency for each random SNP was drawn from a uniform distribution U(0.1, 0.9). Genotypes for unrelated individuals and parents were generated based on the corresponding allele frequencies with an assumption of linkage equilibrium between adjacent random markers, and genotypes for children in each family were generated according to parental genotypes with recombination rate 0.01 between the causal and the test sites. The number of children in each family was drawn from a Poisson distribution with mean 2.
When binary trait was not involved, we sampled 400 unrelated individuals and 200 nuclear families. When binary trait was involved, we sampled 200 cases, 200 controls and 200 nuclear families with at least one affected child. The disease prevalence was set to 30%, which was used to assign an individual's disease status under the null hypothesis. Under the alternative hypothesis, the probability of an individual being affected was calculated using the logistic regression model where Logit() was the logistic function; OR was the specified odds ratio and cons was a constant rendering the disease prevalence. g ij was the genotype code under recessive, additive, or dominant modes of inheritance. Unless otherwise specified, we set OR to be 1.5 for all simulations under alternative hypothesis. Continuous phenotypes were drawn from normal distributions with uniform phenotypic mean and variance. Background polygenic effects were assumed to account for 40% of the phenotypic variability in simulating phenotypes for nuclear families. Under the alternative hypothesis, the causal site was assumed to explain a specified proportion of phenotypic variability under recessive, additive, or dominant modes of inheritance. Unless otherwise specified, we set the proportion explained by the causal site to 1% for all simulations under alternative hypothesis.

Simulation 2. Two Discrete Populations
In discrete population structure, MAF at both the causal and the test site was set to 0.2 and 0.4 for two populations A and B, respectively. Allele frequencies at random markers for the two populations were generated using the Balding-Nichols model [51]. Briefly, for each marker an ancestry allele frequency p was drawn from the uniform distribution U(0.1, 0.9). The allele frequencies for the two populations were then drawn from a beta distribution with parameters p(1-F ST )/F ST and (1-p) (1-F ST )/F ST , where F ST was a measure of genetic distance between the two populations [52]. We set F ST to 0.05 to simulate moderate population stratification.
When binary trait was involved, we sampled 150 cases, 100 controls and 150 nuclear families from population A, and 50 cases, 100 controls and 50 nuclear families from population B. The disease prevalence in populations A and B was set to 30% and 10%, respectively, to produce the confounding effect due to population stratification. Population mean (m A and m B ) of continuous phenotype also varied between population A and B. m A and m B were set such that a proportion of 20% of phenotypic variation was explained by population stratification.
Under the alternative hypothesis, phenotypes were again simulated conditional on the causal site under recessive, additive, and dominant modes of inheritance.
When the binary trait was not involved, we sampled 250 unrelated individuals and 150 nuclear families from population A, and 150 unrelated individuals and 50 nuclear families from population B. Simulation of continuous phenotypes was the same as the above.

Simulation 3. Admixed Population with Two Ancestral Populations
In admixed population structure, we first generated two discrete populations A and B as in Simulation 2. We then adopted a continuous gene flow (CGF) model [53] to generate an admixed population from A and B. Specifically, an initial generation was produced by sampling 20,000 unrelated individuals from population A. To produce the second generation, a proportion (l) of randomly selected individuals from initial population mated to individuals drawn from population B, and the remaining proportion (1-l) mated among themselves. The number of children for each mating was drawn from a Poisson distribution with mean 2, and children from all marriages comprised the second generation. The second generation repeated the same process to produce the third generation, the forth, and so on. We set l to 0.1 and repeated the process 5 times, resulting in the current admixed population of approximately 60%/40% of ancestry from population A/B. We sampled the same number of unrelated individuals and nuclear families as in Simulation 1. On producing binary trait, the probability of being affected for an individual was set to 0.3a+0.1(1-a), where a was the ancestral proportion of population A for the individual. Similarly, phenotypic mean was set to m A a+m B (1-a) when simulating continuous traits.
Again, under the alternative hypothesis, phenotypes were simulated conditional on genotype scores at the causal site.
Besides nuclear families, we also simulated samples of sib pairs with missing parental information, which were obtained by deleting the two parents from each family after all the above simulations.

GAW16 Simulated Data Sets
As an application, we analyzed the Genetic Analysis Workshop 16 (GAW16) Problem 3 data sets with the proposed test. The GAW16 data sets consist of 6,476 subjects from Framingham Heart Study (FHS), where each subject has real genotypes at approximately 550,000 SNP markers and simulated phenotypes. Subjects are distributed among 3 generations and singletons. After dividing large families into smaller nuclear families and applying some quality controls to the data (for example, as the proposed test cannot analyze half-sibs, we deleted one of sibs from the data), we finally identified 5,942 subjects for analysis, 5,456 of which are family members from a total of 1,815 nuclear families and the remaining 486 are singletons. When analyzing unrelated sample, we also included parents in each family besides the 486 singletons, resulting in a total of 1,480 unrelated subjects.
A total of six correlated traits, termed HDL, LDL, TG, CHOL, CAC, and MI, respectively, are simulated on the observed genetic variation in order to mimic the lipid pathway underlying the development of cardiovascular disease [54]. Phenotype data are simulated at three pseudo-visits with 10 years apart to mimic the context of longitudinal study, and at each visit, 200 simulated data sets are replicated. We analyzed only the data set from the first replicate of the first visit, as suggested by the workshop. For univariate analysis, we focused on the trait HDL, which is influenced by five major genes each contributing 0.3% to 1% to the phenotypic variation. For bivariate analysis, we included the trait TG as well. TG is influenced by three major genes contributing to 0.3% or 0.4% to the phenotypic variation. Two major genes affecting TG also present pleiotropic effects to HDL. Both phenotypes were adjusted by age and sex.