Primary open-angle glaucoma (POAG) is the most common form of glaucoma and one of the leading causes of vision loss worldwide. The genetic etiology of POAG is complex and poorly understood. The purpose of this work is to identify genomic regions of interest linked to POAG. This study is the largest genetic linkage study of POAG performed to date: genomic DNA samples from 786 subjects (538 Caucasian ancestry, 248 African ancestry) were genotyped using either the Illumina GoldenGate Linkage 4 Panel or the Illumina Infinium Human Linkage-12 Panel. A total of 5233 SNPs was analyzed in 134 multiplex POAG families (89 Caucasian ancestry, 45 African ancestry). Parametric and non-parametric linkage analyses were performed on the overall dataset and within race-specific datasets (Caucasian ancestry and African ancestry). Ordered subset analysis was used to stratify the data on the basis of age of glaucoma diagnosis. Novel linkage regions were identified on chromosomes 1 and 20, and two previously described loci—GLC1D on chromosome 8 and GLC1I on chromosome 15—were replicated. These data will prove valuable in the context of interpreting results from genome-wide association studies for POAG.
Citation:Crooks KR, Allingham RR, Qin X, Liu Y, Gibson JR, et al. (2011) Genome-Wide Linkage Scan for Primary Open Angle Glaucoma: Influences of Ancestry and Age at Diagnosis. PLoS ONE 6(7): e21967. doi:10.1371/journal.pone.0021967
Editor: Amanda Ewart Toland, Ohio State University Medical Center, United States of America
Received: February 1, 2011; Accepted: June 15, 2011; Published: July 12, 2011
Copyright: © 2011 Crooks et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding:The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. This work was supported in part by a generous donation by Roger Milliken, Spartanburg, South Carolina and by the research infrastructure of the Duke Center for Human Genetics. This work was also supported by CTSA grant 1 UL1 RR024128-01 from NCRR/NIH, NIH grants R01EY013315 (MAH), R01EY019126 (MAH), RO1EY015543 (RRA), R01EY015872 (JLW), and P30EY014104 (Core support at the Massachusetts Eye and Ear Infirmary).
Competing interests: The authors have declared that no competing interests exist.
Glaucoma comprises a group of disorders that are characterized by retinal ganglion cell death and a characteristic pattern of progressive vision loss. POAG is the most common type of glaucoma globally , and it is estimated that by 2020 the number of people diagnosed with POAG in the United States alone will total more than 3 million . It has long been recognized that there is a heterogeneous genetic component to POAG. Genome-wide linkage analyses have identified 14 loci, designated GLC1A-N, which are thought to contribute to POAG risk –. Causative mutations have been identified in genes within three of these loci: Myocilin (MYOC) on 1q24.3 (GLC1A) , optineurin (OPTN) on 10p15-14 (GLC1E) , and WD40-repeat 36 on 5q22.1 (GLC1G) . Together, mutations in these three genes account for less than 10% of POAG cases . Thus, the majority of the genetic etiology of POAG remains to be discovered.
Among the challenges in the study of POAG is genetic and phenotypic heterogeneity of study subjects. For example, while mutations in OPTN or MYOC both result in POAG, a missense change in OPTN (E50K) causes an adult-onset form of POAG that is characterized by normal intraocular pressures , whereas MYOC mutations can cause either adult-onset or juvenile-onset disease with highly elevated intraocular pressures . Reducing this genetic variability in the study population is essential for identifying variants that contribute to POAG risk, and it can be achieved by phenotypic stratification. In the study of POAG and other complex diseases, phenotypic stratification by ordered subset analysis (OSA)  has been particularly successful in identifying genetically homogeneous subsets of families with increased evidence for linkage and in reducing linkage intervals for follow-up analysis. In OSA, families are ranked according to a phenotypic variable. In this study, families were sorted from lowest to highest average age at diagnosis (AAD, see Methods) of POAG in affected relatives. We chose this variable based on previous linkage analyses by our group  and others  that established AAD as an important source of genetic heterogeneity.
In this study we report the results of the largest SNP-based genome-wide POAG linkage study performed to date. Using both standard linkage methodology and OSA to account for genetic heterogeneity, our study identified global as well as ancestry-specific and phenotype-specific genomic regions that may harbor POAG susceptibility variants.
Clinical data summary
Table 1 summarizes sample size and clinical characteristics of the study population. After exclusion of families segregating MYOC mutations, 786 sampled subjects from 134 multiplex families were analyzed. Clinical characteristics were similar among subjects of African ancestry and Caucasian ancestry. As expected, intraocular pressure (IOP) was clinically elevated in affected members of both ancestry groups, and pressures in those of African ancestry were significantly higher than those of Caucasian ancestry (31.5±1.0 versus 28.0±0.4 mmHg, p≤0.003). Both groups had an age at diagnosis (AAD, see Methods) that averaged in the 50 s and ranged from the 20 s to the 80 s. Slightly more than half the affected study subjects were female in both groups.
Whole genome linkage analysis
Figure 1 shows results of the multipoint linkage analyses for the overall dataset, based on 5,233 SNPs with an average intermarker distance of 0.68 cM. Parametric linkage analyses were performed using both dominant and recessive models. We found the strongest evidence for linkage at 20q13.12–13.13, with the peak marker rs911411 (multipoint HLOD = 2.3, 75.8 cM, dominant model). The one-lod unit support interval comprises the region between the markers rs765147 and rs718630.
Multipoint linkage scores for parametric dominant (red), parametric recessive (green), and nonparametric (blue) models are plotted for the combined dataset. Symbols indicate two-point lod scores ≥1.5 in the parametric dominant (red) and recessive (green) models.
Results of the two-point and multipoint linkage analyses for the Caucasian ancestry dataset and African ancestry dataset are shown in Figures 2 and 3, respectively. Among families of Caucasian ancestry, the best evidence for linkage was identified at 1q22–23.3, with the peak marker rs876537 (multipoint HLOD = 2.03, 154.5 cM, recessive model), and the one-lod unit support interval between markers rs2066981 and rs836. Among families of African ancestry, the strongest linkage was found at 20ptel-13, with the peak marker rs1342137 (multipoint HLOD = 2.09, 0.3 cM, dominant model), and the centromeric one-lod unit support boundary at rs600832.
Multipoint linkage scores for parametric dominant (red), parametric recessive (green), and nonparametric (blue) models are plotted for the Caucasian ancestry dataset. Symbols indicate two-point lod scores ≥1.5 in the parametric dominant (red) and recessive (green) models.
Multipoint linkage scores for parametric dominant (red), parametric recessive (green), and nonparametric (blue) models are plotted for the African ancestry dataset. Symbols indicate two-point lod scores ≥1.5 in the parametric dominant (red) and recessive (green) models.
The most notable linked regions for each dataset are shown in Table 2. In the combined dataset, there were four regions with multipoint lod scores >1.5. For the ancestry-specific datasets, all linked regions with both a multipoint lod score >1.5 and at least one two-point lod >1.5 within the one-lod support interval are shown. There were four such regions of interest in the Caucasian ancestry dataset, and two in the African ancestry dataset.
Ordered subset analysis
To examine whether AAD was a significant modifier of POAG linkage evidence, we performed OSA on the overall dataset, using ascending family mean AAD (lowest to highest age at diagnosis) as the covariate. We found significantly increased evidence for linkage to chromosomes 8 and 15 (empirical p≤0.05, with 10,000 permutations, Figure 4).
Ordered subset analysis (red line) indicates improved evidence for linkage to chromosome 8 (left panel) among families with a mean age of onset ≤49 years. The black line plots non-parametric linkage analysis in all families in the combined dataset. Improved evidence for linkage to chromosome 15 (right panel) was demonstrated among Caucasian families with a mean age of onset ≤52 years. The black line plots non-parametric linkage analysis in all Caucasian families.
On chromosome 8 (Figure 4, left panel), the one-lod unit support interval comprised 22.1 cM with a maximum multipoint lod score of 2.03 at 117.2 cM (empirical p = 0.04). This region includes the previously reported POAG locus GLC1D . All 26 families (13 Caucasian ancestry, 13 African ancestry) in the OSA subset had an average AAD below 50 years (mean 41, range: 28 to 49). In the complementary subset of 102 families (78 Caucasian ancestry, 34 African ancestry), the mean AAD was 62 years (range: 49 to 78).
The linkage region on chromosome 15 was found in the Caucasian early-onset dataset at 15q11–12 (Figure 4, right panel), comprising 13.6 cM with a maximum lod score of 3.17 at 3.2 cM (empirical p = 7×10−5), replicating our previously described GLC1I locus . There were 19 families in this OSA subset, including the previously screened 11 families, all of which had an AAD below 53 years (mean: 45 years, range: 30 to 52 years). The complementary group of 68 families had an average age of onset of 63 years (range 52–78 years).
We have conducted family-based linkage analysis to identify regions of the genome that may harbor POAG susceptibility variants. These linkage regions provide distinct and complementary data that can assist in the interpretation of genome-wide association studies. The overall dataset and the African ancestry dataset linked POAG to non-overlapping intervals on chromosome 20. As these regions appear to be distinct from the JOAG-linked locus GLC1K, located between D20S846 and rs6081603 , there may now be as many as three discrete regions of interest on this chromosome alone. The Caucasian ancestry dataset linked POAG to a novel locus on chromosome 1.
Two of the regions with evidence for linkage, 14q11.2 in the combined and Caucasian ancestry datasets and 20p13 in the African ancestry dataset, are telomeric. It is generally accepted that telomeric regions may give rise to false positive linkage peaks at a higher frequency than other regions of the chromosome. In the absence of confirmation, these regions of interest should be considered with caution and follow-up analysis should be delayed until the findings are replicated.
Using OSA, we found increased evidence for linkage in families with adult early-onset on chromosomes 8 and 15, replicating previous findings. The linkage region on chromosome 8 includes the GLC1D locus, which was first reported in a single family with an apparently Mendelian form of glaucoma . It is not surprising that the one-lod unit support interval calculated here is somewhat larger than the reported GLC1D locus, considering the differences between a nonparametric linkage analysis of a complex phenotype in a larger family collection, compared to recombination-based linkage mapping within a single large pedigree. Interestingly, the age-at-onset of glaucoma in that pedigree is reported as “within the third to fourth decade of life” which is consistent with the AAD of the OSA-identified subset of families reported here. Our finding suggests the possibility that one or more variants in the GLC1D region may give rise to both rare Mendelian and more common non-Mendelian forms of early-onset POAG.
We have previously reported microsatellite linkage to proximal 15q in a collection of 15 (11 Caucasian and 4 African American) early onset families . Our current dataset comprises 34 early-onset families (19 Caucasian ancestry, 15 African ancestry). With the additional families, we were able to divide the dataset based on ancestry and analyze the two groups separately. We replicated linkage to GLC1I in the Caucasian dataset (19 families, including the 11 families reported earlier). In the current report, the one-lod unit support interval is larger and the peak lod score marginally lower than in the previous report. This likely reflects the nature of the individually more informative microsatellite markers compared to SNPs, rather than different underlying genes in the two OSA subsets. There was no evidence for linkage to POAG in the African ancestry early-onset dataset.
In conclusion, we have reported results of the first SNP-based genome-wide linkage analysis of POAG. We identified regions of interest for further investigation in a dataset of African ancestry, a dataset of Caucasian ancestry, and in the combined dataset. We also replicated two previously-reported early-onset POAG loci, strengthening the case that these regions may harbor one or more genes that are either causative for early-onset glaucoma or that modulate the age at which symptoms are first evident. We expect that the results reported here will prove useful in the context of interpreting and strengthening results from genome-wide association studies and will complement efforts to better understand the complex genetic etiology of glaucoma.
Materials and Methods
This study adhered to the tenets of the Declaration of Helsinki. Written informed consent was obtained from all participating individuals. Caucasian and African-American subjects were recruited at the Duke Eye Center (Durham, NC). Caucasian subjects were also enrolled at the Massachusetts Eye and Ear Infirmary (Boston, MA). African subjects were enrolled at the University of Ghana. The research was reviewed and approved by the Institutional Review Board from all participating institutions, including Duke University Medical Center, the Massachusetts Eye and Ear Infirmary and the Noguchi Memorial Institute of Medical Research of the College of Health Sciences, University of Ghana.
POAG probands were unrelated and met the following three inclusion criteria: (1) intraocular pressure greater than 22 mm Hg in both eyes without medications or greater than 19 mm Hg with two or more medications; (2) glaucomatous optic neuropathy in both eyes; and (3) visual field loss consistent with optic nerve damage in at least one eye. Other affected members met at least 2 of these criteria. Glaucomatous optic neuropathy was defined as cup-to-disc ratio higher than 0.7 or focal loss of the nerve fiber layer (notch). Visual fields were performed by using standard automated perimetry. Exclusion criteria included the presence of any secondary form of glaucoma. Consenting family members who did not meet the above criteria were enrolled, but their genotypes were used only to establish linkage phase. The MYOC gene was sequenced in all probands. Families with disease-associated MYOC mutations were excluded from analysis. Age at diagnosis (AAD) was self-reported by POAG cases as the age at which they were first told by an eye specialist that they 1) had elevated intraocular pressure, 2) were prescribed IOP-lowering eye medications, or 3) had glaucoma.
Sample preparation and genotyping
To conduct the genome screen, samples were genotyped using the Illumina GoldenGate Linkage 4 Panel or the Illumina Infinium Human Linkage-12 Panel. DNA from two CEPH individuals and two quality control samples were included in each 96-well plate used for genotyping, and a 100% match of these samples was required for inclusion of a marker in the analysis. A minimum genotyping efficiency of 95% was required for each marker.
Pedigree relationships were tested with RELPAIR , which uses identity-by-descent allele sharing estimates for statistical inference of biological relationships within and across the specified family structures. Discrepancies between specified and inferred relationships, typically due to sample switches, were addressed by removing four unresolved individuals and two families from the analysis. Genotypes that were inconsistent with Mendelian inheritance were identified with the program PEDCHECK  and removed prior to the linkage analysis.
Whole genome linkage analysis
The final linkage analysis included 5233 single nucleotide polymorphisms (SNPs). Marker order and intermarker distances (in cM) were derived from the Decode linkage maps . For markers not included in that panel, genetic distances were interpolated based on physical distances (1 cM~1 Mb). The software MERLIN  was used to calculate nonparametric two-point and multipoint lod scores, using the exponential model and Spairs allele sharing statistic . Parametric affecteds-only heterogeneity lod scores (HLODs) assuming a dominant (disease allele frequency 0.01) or recessive (disease allele frequency 0.2) model were also computed with MERLIN. For the separate analysis of Caucasian and African ancestry datasets, ethnicity-specific marker allele frequencies were estimated from all genotyped individuals . To avoid an inflation of lod scores due to misspecified allele frequencies, particularly for markers with rare minor alleles in one of the two ethnicities, we also used these ethnicity-specific marker allele frequencies in the overall analysis. This was done by modifying the analysis files to include two “dummy markers” at the same map position as the real marker. The dummy markers had ethnicity-specific allele frequencies, with observed genotypes for samples of one ethnicity, but missing genotypes for the other. This computational approach allowed for the appropriate calculation of joint multipoint lod scores, which combine linkage information across map positions (i.e., at and in-between genotyped markers); joint two-point lod scores could not be calculated. To avoid an inflation of linkage evidence due to inter-marker LD in the absence of parental genotypes, we estimated haplotype frequencies of SNP clusters in high pairwise LD, using a threshold of r2 = 0.16 to define these clusters , .
Ordered subset analysis (OSA)
Based on previous POAG linkage analyses , , we used OSA  to test whether AAD was a significant source of linkage heterogeneity in our dataset. Families were sorted by increasing average AAD in affected relatives and nonparametric linkage analysis was performed one family at a time in the AAD-based order until the family subset generating the maximum lod score anywhere on the given chromosome was identified. This maximum could occur at different map positions for different family subsets. Permutation testing was employed to test whether the observed increase in linkage evidence in this family subset was greater than expected by chance. To this end, families were randomly ordered and the lod score maximization procedure was repeated 10,000 times to calculate the proportion of random permutations with a maximum subset-based lod score at least as large as the observed one (empirical p-value). The OSA null hypothesis specifies no relationship between family-specific covariate and family-specific linkage evidence. Rejection of this null hypothesis suggests that the covariate, here AAD, is a statistically significant (empirical p-value≤0.05) source of linkage heterogeneity.
We would like to extend special thanks to Daniel Weeks for statistical assistance in calculating joint multipoint lod scores using multiple ethnicities.
Conceived and designed the experiments: MAH SS RRA JRG. Performed the experiments: KRL-A YL JRG. Analyzed the data: XQ SS JRG. Contributed reagents/materials/analysis tools: JLW PC LWH SA CS-T EDB JRG. Wrote the paper: KRC RRA MAH SS JRG.
- 1. Quigley HA (1996) Number of people with glaucoma worldwide. Br J Ophthalmol 80: 389–393.
- 2. Friedman DS,Wolfs RC,O'Colmain BJ,Klein BE,Taylor HR,et al. (2004) Prevalence of open-angle glaucoma among adults in the United States. Arch Ophthalmol 122: 532–538.
- 3. Allingham RR,Wiggs JL,Hauser ER,Larocque-Abramson KR,Santiago-Turla C,et al. (2005) Early Adult-Onset POAG Linked to 15q11–13 Using Ordered Subset Analysis. Invest Ophthalmol Vis Sci 46: 2002–2005.
- 4. Baird PN,Foote SJ,Mackey DA,Craig J,Speed TP,et al. (2005) Evidence for a novel glaucoma locus at chromosome 3p21–22. Hum Genet 117: 249–257.
- 5. Lin Y,Liu T,Li J,Yang J,Du Q,et al. (2008) A genome-wide scan maps a novel autosomal dominant juvenile-onset open-angle glaucoma locus to 2p15–16. Mol Vis 14: 739–744.
- 6. Monemi S,Spaeth G,Dasilva A,Popinchalk S,Ilitchev E,et al. (2005) Identification of a novel adult-onset primary open-angle glaucoma (POAG) gene on 5q22.1. Hum Mol Genet 14: 725–733.
- 7. Pang CP,Fan BJ,Canlas O,Wang DY,Dubois S,et al. (2006) A genome-wide scan maps a novel juvenile-onset primary open angle glaucoma locus to chromosome 5q. Mol Vis 12: 85–92.
- 8. Rezaie T,Child A,Hitchings R,Brice G,Miller L,et al. (2002) Adult-onset primary open-angle glaucoma caused by mutations in optineurin. Science 295: 1077–1079.
- 9. Sarfarazi M,Child A,Stoilova D,Brice G,Desai T,et al. (1998) Localization of the fourth locus (GLC1E) for adult-onset primary open-angle glaucoma to the 10p15-p14 region. Am J Hum Genet 62: 641–652.
- 10. Sheffield VC,Stone EM,Alward WL,Drack AV,Johnson AT,et al. (1993) Genetic linkage of familial open angle glaucoma to chromosome 1q21–q31. Nat Genet 4: 47–50.
- 11. Stoilova D,Child A,Trifan OC,Crick RP,Coakes RL,et al. (1996) Localization of a locus (GLC1B) for adult-onset primary open angle glaucoma to the 2cen-q13 region. Genomics 36: 142–150.
- 12. Stone EM,Fingert JH,Alward WLM,Nguyen TD,Polansky JR,et al. (1997) Identification of a gene that causes primary open angle glaucoma. Science 275: 668–670.
- 13. Suriyapperuma SP,Child A,Desai T,Brice G,Kerr A,et al. (2007) A new locus (GLC1H) for adult-onset primary open-angle glaucoma maps to the 2p15–p16 region. Arch Ophthalmol 125: 86–92.
- 14. Trifan OC,Traboulsi EI,Stoilova D,Alozie I,Nguyen R,et al. (1998) A third locus (GLC1D) for adult-onset primary open-angle glaucoma maps to the 8q23 region. Am J Ophthalmol 126: 17–28.
- 15. Wang DY,Fan BJ,Chua JK,Tam PO,Leung CK,et al. (2006) A genome-wide scan maps a novel juvenile-onset primary open-angle glaucoma locus to 15q. Invest Ophthalmol Vis Sci 47: 5315–5321.
- 16. Wiggs JL,Lynch S,Ynagi G,Maselli M,Auguste J,et al. (2004) A genomewide scan identifies novel early-onset primary open-angle glaucoma loci on 9q22 and 20p12. Am J Hum Genet 74: 1314–1320.
- 17. Wirtz MK,Samples JR,Kramer PL,Rust K,Topinka JR,et al. (1997) Mapping a gene for adult-onset primary open-angle glaucoma to chromosome 3q. Am J Hum Genet 60: 296–304.
- 18. Wirtz MK,Samples JR,Rust K,Lie J,Nordling L,et al. (1999) GLC1F, a new primary open-angle glaucoma locus, maps to 7q35–q36. Arch Ophthalmol 117: 237–241.
- 19. Allingham RR,Liu Y,Rhee DJ (2009) The genetics of primary open-angle glaucoma: a review. Exp Eye Res 88: 837–844.
- 20. Alward WL,Fingert JH,Coote MA,Johnson AT,Lerner SF,et al. (1998) Clinical features associated with mutations in the chromosome 1 open-angle glaucoma gene (GLC1A). N Engl J Med 338: 1022–1027.
- 21. Hauser ER,Watanabe RM,Duren WL,Bass MP,Langefeld CD,et al. (2004) Ordered subset analysis in genetic linkage mapping of complex traits. Genet Epidemiol 27: 53–63.
- 22. Woodroffe A,Krafchak CM,Fuse N,Lichter PR,Moroi SE,et al. (2006) Ordered subset analysis supports a glaucoma locus at GLC1I on chromosome 15 in families with earlier adult age at diagnosis. Exp Eye Res 82: 1068–1074.
- 23. Sud A,Del Bono EA,Haines JL,Wiggs JL (2008) Fine mapping of the GLC1K juvenile primary open-angle glaucoma locus and exclusion of candidate genes. Mol Vis 14: 1319–1326.
- 24. Epstein MP,Duren WL,Boehnke M (2000) Improved inference of relationship for pairs of individuals. Am J Hum Genet 67: 1219–1231.
- 25. O'Connell JR,Weeks DE (1998) PedCheck: a program for identification of genotype incompatibilities in linkage analysis. Am J Hum Genet 63: 259–266.
- 26. Kong X,Murphy K,Raj T,He C,White PS,et al. (2004) A combined linkage-physical map of the human genome. Am J Hum Genet 75: 1143–1148.
- 27. Abecasis GR,Cherny SS,Cookson WO,Cardon LR (2002) Merlin–rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet 30: 97–101.
- 28. Kong A,Cox NJ (1997) Allele-sharing models: LOD scores and accurate linkage tests. Am J Hum Genet 61: 1179–1188.
- 29. Broman KW (2001) Estimation of allele frequencies with data on sibships. Genet Epidemiol 20: 307–315.
- 30. Abecasis GR,Wigginton JE (2005) Handling marker-marker linkage disequilibrium: pedigree analysis with clustered markers. Am J Hum Genet 77: 754–767.
- 31. Boyles AL,Scott WK,Martin ER,Schmidt S,Li YJ,et al. (2005) Linkage disequilibrium inflates type I error rates in multipoint linkage analysis when parental genotypes are missing. Hum Hered 59: 220–227.