Identification of Genomic Loci Associated with Rhodococcus equi Susceptibility in Foals

Pneumonia caused by Rhodococcus equi is a common cause of disease and death in foals. Although agent and environmental factors contribute to the incidence of this disease, the genetic factors influencing the clinical outcomes of R. equi pneumonia are ill-defined. Here, we performed independent single nucleotide polymorphism (SNP)- and copy number variant (CNV)-based genome-wide association studies to identify genomic loci associated with R. equi pneumonia in foals. Foals at a large Quarter Horse breeding farm were categorized into 3 groups: 1) foals with R. equi pneumonia (clinical group [N = 43]); 2) foals with ultrasonographic evidence of pulmonary lesions that never developed clinical signs of pneumonia (subclinical group [N = 156]); and, 3) foals without clinical signs or ultrasonographic evidence of pneumonia (unaffected group [N = 49]). From each group, 24 foals were randomly selected and used for independent SNP- and CNV-based genome-wide association studies (GWAS). The SNP-based GWAS identified a region on chromosome 26 that had moderate evidence of association with R. equi pneumonia when comparing clinical and subclinical foals. A joint analysis including all study foals revealed a 3- to 4-fold increase in odds of disease for a homozygous SNP within the associated region when comparing the clinical group with either of the other 2 groups of foals or their combination. The region contains the transient receptor potential cation channel, subfamily M, member 2 (TRPM2) gene, which is involved in neutrophil function. No associations were identified in the CNV-based GWAS. Collectively, these data identify a region on chromosome 26 associated with R. equi pneumonia in foals, providing evidence that genetic factors may indeed contribute to this important disease of foals.


Introduction
Rhodococcus equi is an important intracellular pathogen affecting horses, most commonly among foals in which it causes chronic, suppurative bronchopneumonia [1], as well as extrapulmonary disorders [2]. The cumulative incidence of pneumonia caused by R. equi may be high at breeding farms with affected foals, and this disease may adversely impact future racing performance [3]. At affected farms, a varying proportion of foals will develop clinical signs of pneumonia while the other foals remain free of the disease; however, subclinical pneumonia can occur following either experimental or natural infection with R. equi [4][5][6].
Although the factors contributing to R. equi pneumonia are complex, recent evidence suggests that some horses may be genetically predisposed to this condition [1,7,8]. Identifying the genetic and biological basis of susceptibility, or perhaps resistance, to R. equi pneumonia in foals is important, because it might lead to the development of diagnostic and therapeutic tools to manage atrisk foals on breeding farms and might shed light on critical host defense mechanisms. Currently, single nucleotide polymorphism (SNP)-based genotyping platforms are available for performing genome-wide association studies (GWAS) in horses [9]. Use of SNP-based genotyping platforms to identify genomic regions associated with particular phenotypes in animals is growing at a rapid pace [9][10][11][12][13][14]. As a result, researchers, veterinarians, and producers increasingly rely on data from these studies to make important production and management decisions [15,16].
Although high-density SNP arrays are powerful tools for performing association studies, they are often inadequate for examining structurally complex regions, particularly those enriched with copy number variants (CNVs) [17]. Results from the 1000 Genomes project estimate approximately 20% of CNVs are not in linkage disequilibrium with flanking or tagging SNPs [18], indicating that additional testing is required to accurately genotype these variants. The identification of CNVs is further complicated by the probe placement and design of most commercial SNP arrays [17][18][19]. Comparative genomic hybridization (CGH) arrays are optimized for genotyping CNVs. Using SNP and CGH arrays together may, in some instances, increase the power of a GWAS by expanding the number of informative markers, particularly within structurally complex regions of the genome [19].
In horses, CNVs are present in genes involved in many biological processes and may underlie or modify many common and disease traits [20][21][22]. Of the CNVs in horses identified to date, most are enriched in genes involved in sensory perception, signal transduction, and metabolism [22]. In other animal species, including horses, CNVs often affect genes regulating the immune system, particularly the MHC; they may also be causative or modifying variants of many immune related conditions [23][24][25][26][27][28][29][30].
The genetic basis of susceptibility or resistance to R. equi pneumonia has not been explored on a genome-wide basis. Here, we describe independent SNP-and CNV-based GWAS to identify genomic loci associated with R. equi pneumonia in Quarter Horse foals. We identified a number of regions associated with R. equi pneumonia, including a region on chromosome 26. Located within this region is the transient receptor potential cation channel, subfamily M, member 2 (TRPM2) gene that encodes a protein associated with neutrophil function.

Ethics statement
All protocols for this study were reviewed and approved by the Clinical Research Review Committee (CRRC Protocol 10-12), College of Veterinary Medicine & Biomedical Sciences, Texas A&M University. This study was carried out on private land (33u379140N 100u199220W) and specific permissions for use were granted by GPB. During the time this study was conducted, research involving client-owned animals at Texas A&M University was not subject to review by the Institutional Animal Care and Use Committee. Written informed consent for participation was obtained for all foals included in the study, and the 6666 Ranch provided access to the foals included in this project. This study did not involve any endangered or protected species.

Study population
The 6666 Ranch was selected as the site for this study because it agreed to provide access to foals, had history of R. equi pneumonia among foals with a cumulative incidence of $15% for the preceding 3 years, and because the farm's veterinarian/general manager (GPB) was conducting a separate study during 2011 evaluating screening tests for R. equi pneumonia in foals, which was directed by one of the authors (MKC). The screening test evaluation required that treatment was not initiated for any foal on the basis of screening test results alone, and that the veterinarians making decisions about diagnosis and treatment of R. equi pneumonia were not informed of the results of screening tests. Each foal at the farm underwent bilateral thoracic ultrasonographic examination at 2-week intervals, beginning at 3 weeks of age either until 19 weeks of age or until the foal developed clinical signs of pneumonia (as described below). Ultrasonographic examinations were performed by a veterinarian who did not participate in diagnosis or treatment of R. equi pneumonia. The anatomic location (left versus right hemithorax; intercostal space; and, dorsal, middle, or ventral region) and maximal diameter of any areas of pulmonary abscesses or consolidation were recorded. In addition, the total number of lesions was counted.
All foals [N = 248] born at the farm during 2011 were eligible to be included in the study. All foals were monitored daily by farm personnel for clinical signs of pneumonia until 20 weeks of age. Clinical signs suggestive of pneumonia included fever, lethargy, signs of depressed attitude, cough, nasal discharge, polysynovitis, tachypnea, increased respiratory effort, respiratory distress, and detection of a tracheal rattle or pulmonary crackles or wheezes via thoracic auscultation. For each foal that developed clinical signs of pneumonia, thoracic ultrasonography and collection of a transendoscopic tracheobronchial aspirate (TBA) sample with a commercially available triple-guarded catheter (Triple stage tracheal wash catheter, MILA International Inc., Erlanger, KY) were performed. Between uses, the endoscope was disinfected with a 3.4% glutaraldehyde solution (CIDEX-PLUS, Advanced Sterilization Products, Irvine, CA) following a standard protocol used in our laboratory and known to be microbicidal against R. equi. Each sample of TBA fluid was submitted for microbiologic culture and cytologic evaluation to the Texas Veterinary Medical Diagnostic Laboratory in College Station, Texas.
Foals with R. equi pneumonia (clinical group; N = 43 [17%]) were defined as those having signs of pneumonia at 3 to 20 weeks of age, ultrasonographic evidence of peripheral pulmonary consolidation or abscesses at the time of examination for clinical signs of pneumonia, and R. equi detected in TBA fluid via microbiologic culture, and cytological evidence of gram-positive intracellular coccobacilli in the TBA sample. Subclinical foals (N = 156 [63%]) were defined as those having ultrasonographic evidence of peripheral pulmonary consolidation or abscesses, but lacking clinical signs of pneumonia [31]. Unaffected foals (N = 49 [20%]) were classified as having no clinical signs of pneumonia and no ultrasonographic evidence of pulmonary consolidation or abscessation. From each of the 3 groups of foals (i.e., clinical, subclinical, and unaffected), 24 foals were selected randomly for the SNP-and CNV-based genome-wide association studies. The rationale for including 24 foals was based on funding available to conduct the study (rather than an a priori sample size calculation).

DNA samples and isolation
A blood sample (4 mL) was collected by jugular venipuncture into a tube containing acid citrate dextrose (ACD) as an anticoagulant from the first (i.e., age 3 weeks) blood sample obtained from each foal. Genomic DNA was isolated using a standard phenol-chloroform isoamyl extraction protocol from these blood samples from each foal [22].

SNP genotyping and data analysis
The SNP genotyping was performed at Gene Seek (Neogen, Lincoln, NE) using the EquineSNP70 BeadChip Array (Illumina, San Diego, CA). The resulting SNP genotypes were analyzed using the PLINK analysis package [32]. Genotypes were determined for each animal and then filtered (i.e., excluded) on the basis of missingness per individual (.10%), missingness per SNP (.10%) minor allele frequency (,5%), and absence of Hardy-Weinberg equilibrium (P,0.001), as described by Raudsepp et al. [12]. A standard chi-square association test (Max(T) permutations [N = 10,000]) based on a binary outcome of disease status using a case-control design was performed using PLINK [12,33]. Genotype ped files were loaded into PLINK and foals were assigned a phenotypic status of either affected or unaffected (case/control). A P value of P,1610 25 was considered evidence of association [34]. Population stratification was determined using plots of the observed versus the expected -log 10 P values of Cochran-Armitage trend tests and by determining the genomic inflation factor, l, using the R package GenABEL [35]. Using the R package pedigreemm [36], mixed-effects logistic regression with sire modeled as a random effect was used for the association test in comparisons showing evidence of population stratification; SNPs with any genotype represented fewer than 10 times were removed from analysis to permit model convergence [35][36][37]. All SNP array data have been deposited in NCBI's Gene Expression Omnibus (GEO) [38] and are accessible through GEO Series accession number GSE57510 (http://www.ncbi.nlm.nih.gov/ geo/query/acc.cgi?acc = GSE57510).
Genotype data from the joint analysis were analyzed using logistic regression on the basis of the binary outcome of disease (pneumonia versus each of the respective comparison groups [i.e., clinical foals, subclinical foals, and unaffected foals]). The association of disease with genotype for the SNP was expressed as the odds ratio (OR), estimated from logistic regression modeling; 95% confidence intervals were estimated using maximum likelihood methods. Models were fit for comparisons of clinical versus subclinical foals, clinical versus the combination of subclinical and unaffected foals, and clinical versus unaffected foals. Models were fit using S-PLUS statistical software (Version 8.2, TIBCO, Inc., Seattle, WA). A significance level of P,0.05 was used for the analyses.

CNV detection and analyses
Copy number variants were identified using a previously reported equine exome array for comparative genomic hybridization (CGH) [41]. An individual Quarter Horse mare was used as the reference sample for each CGH experiment [22]. Array CGH was performed using methods described by Doan et al. [41]. Briefly, genomic DNA was sonicated and then labeled with the Cy5 (experimental) and Cy3 (reference) AlexaFluor dyes using the BioPrime Plus labeling kit (Invitrogen, Carlsbad, CA). Two mg of reference and experimental DNA were hybridized onto the arrays (Agilent Technologies, Santa Clara, CA). The arrays were scanned using an Agilent DNA Microarray Scanner (2-mm settings and 0.05 XDR). Fluorescent intensity values were calculated using Agilent's Feature Extraction 10.5 software (Agilent). Copy number variants, including their corresponding log 2 ratios, were identified using Agilent's Genomics Workbench 7.
Copy number variants were called using the ADM-2 algorithm and the following filters: minimum probe span $3 and average log 2 ratio $0.5, removal of probes with $3 standard deviations above or below the mean log 2 fluorescent intensity.
Logistic regression modeling was used to perform 2 separate CNV-based GWAS. The first approach modeled the association of the binary outcome of 2 groups (e.g., clinical versus subclinical) with the log 2 ratio of intensity values (a continuous variable) for each CNV. The second approach modeled the association of the binary outcome of 2 groups with the presence or absence of a CNV (a binary categorical variable) within a CNV region. For the first approach, CNV regions (CNVRs) were determined for the foals examined. The CNVRs were then filtered to include only CNVs identified in at least 3 foals. The log 2 ratios of probes within each CNVR were then averaged to calculate a single log 2 (CNVRlog 2 ratio) for each CNVR for each foal. The CNVR-log 2 values were used in a logistic regression model to identify associated CNVs among the pairwise comparisons of the 3 groups (casecontrol design described above). For the second approach, a logistic regression model involving the binary outcome of presence (or absence) of a CNV within each CNVR was used to identify associated CNVs among the pairwise comparisons of the 3 groups. For both approaches, the generated P values from linear modeling or regression analyses were corrected for multiple comparisons using the method outlined by Hochberg et al. [42]. Statistical analyses were performed using R (Version 3.0.1; R Statistical Project). All CNV data have been deposited in NCBI's Gene Expression Omnibus (GEO) [38] and are accessible through GEO Series accession number GSE57510 (http://www.ncbi.nlm.nih. gov/geo/query/acc.cgi?acc = GSE57510).

SNP-based GWAS
Three case-control GWAS were performed among the 24 randomly selected foals representing each group (clinical group, subclinical group, and unaffected group; Figure 1). The number of SNPs excluded on the basis of missingness per individual, missingness per SNP, and minor allele frequency were 0, 1,292, and 11,157, respectively (12,449 total SNPs). For comparisons 1, 2, and 3 (Figure 1), the number of SNPs excluded on the basis of Hardy-Weinberg equilibrium were 0, 83, and 54, respectively. After filtering, the total genotyping rate of the foals was estimated at 99.  Figure S1A]) identified 10 regions with moderate association ( Figure 2B and Table 1 Figure S1B]) identified 2 regions with moderate association ( Figure 2C and Table 1). Results from each GWAS comparison are provided in Table S1.
The l value of comparison 3 (1.44) suggested evidence of confounding population structure from 1 of the groups, thus mixed-effects modeling with sire as the random effect term was used as an additional association test. There were 23,318 SNPs filtered due to failure to converge in mixed modeling, leaving 40,843 SNPs for evaluation in comparison 3. The logistic mixedeffects modeling reduced the l from 1.44 to 1.10 ( Figure S1D), and inspection of the observed versus expected P value plot indicated that none of the smallest P values [highest -log 10 P values] observed were greater than expected ( Figure 2D and Table S2). The mixed-effects model analysis identified 1 region with modest evidence of association (BIEC2_284540; chr15:7,394,044; P = 9.0610 5 ). The l values of each comparison suggested the population structure was attributable to foals in the unaffected group because the magnitude of l was greatest for the comparison of clinical versus unaffected groups (i.e., strongest evidence of population stratification), was 1.00 for clinical versus subclinical groups (i.e., absence of population stratification), and 1.14 when the clinical group was compared with the combination  Table 2). Considering just the AA genotype relative to the other genotypes, the odds of disease were approximately 3.7-fold greater for foals with the AA genotype ( Table 2; P = 0.0006). Using an additive model, there was a significant (P = 0.0014) association of the A allele in comparisons between the clinical and other groups, with an estimated odds of a clinical classification being increased nearly 3fold for each copy of the A allele. Examination of the genotype data, however, suggested an additive model was unlikely: the ORs for the AB and BB types relative to the AA genotype were similar. The AA genotype of the TRPM2 gene also was significantly associated with increased odds of R. equi pneumonia when considering the results of comparisons of the clinical foals versus all foals (comparison 1; Table S2) and for clinical foals versus healthy foals (comparison 3; Table S3), irrespective of the genetic model.

CNV-based GWAS
Next, array aCGH was performed to genotype CNVs in the 72 foals examined in the SNP-based GWAS. Two reactions failed to meet the minimum quality scores for CNV detection, so they were excluded from the study (foals 153 and 278). Collectively, 6,727 CNVs were identified among the 70 foals ( Table 3 and Table  S3). Merging shared CNVs yielded 2,350 CNV regions (CNVR) that were present at 3,492 Ensembl annotated genes (3,442 protein-coding and 50 RNA-coding). The lengths of CNVs ranged from 197 base-pairs (bp) to 7,229.5 kilo-bp (kb), with a mean length of 97.7 kb, median length of 4.4 kb, and mode length of 960 bp.
The association between disease status (the outcome variable) and individual CNVs (dependent variable) was assessed using logistic regression analysis. Because CNV genotypes reflect differences in DNA content between 2 individuals and are expressed as normalized log 2 ratios representative of varying degrees of copy number gains and losses, separate logistic regression modeling was performed using CNVs as the dependent variable as either 1) continuous variables representing average log 2 ratios of CNVRs or 2) a binary variable representing the presence or absence of a given CNV (see Materials and Methods section). Comparisons among groups of foals were made in the CNV-based GWAS as described above (Figure 1). Correction for multiple comparisons revealed no significant (P,0.05) associations of disease with CNVs when considered as continuous log 2 ratios ( Table 4) or as the binary outcome for presence or absence of a CNV. No association with clinical status grouping was detected on the basis of the total number of CNVs for individuals ( Table 3).

Discussion
Rhoddococcus equi is an important cause of disease and death in young foals [44]. Multiple factors such as age, environmental conditions including level of exposure to virulent organisms, and genetic background appear to play a role in the occurrence of this  complex disease [45][46][47]. The purpose of this study was to better characterize the genetic basis of susceptibility to R. equi pneumonia. One special feature of this study was the phenotypic characterization of foals into those that remained unaffected (i.e., free of both clinical signs and ultrasonographic evidence of disease) through weaning, those that had subclinical pneumonia (i.e., absence of clinical signs but ultrasonographic evidence of pulmonary lesions), and those that developed R. equi pneumonia.
At most farms where screening is performed, foals with evidence of subclinical pneumonia receive treatment or other interventions that precludes one from differentiating foals that would have progressed to clinical disease from those that would have remained subclinical. At the farm described in this report, however, foals with subclinical pneumonia were not treated or otherwise managed differently than unaffected foals providing us with the exceptional opportunity to conduct GWAS's with 3 clinically important phenotypes. A SNP-based GWAS revealed a region on chromosome 26 (chr26:39640172-39867963) that was positively associated with disease. This region contains the TRPM2 gene, which is associated with neutrophil function. Although, TRPM2 is an ideal candidate based on its known biological function, an adjacent SNP located within the KRTAP gene was more strongly associated than the SNP located with TRPM2. Further, investigation of this region (e.g., fine-mapping of the region) is needed before any conclusion can be made regarding the role of TRPM2 in R. equi pneumonia. Nevertheless, TRPM2 is of interest because it has been demonstrated in mice to play a role in neutrophil-mediated tissue damage [43]. Neutrophils have been shown to play an important role in the outcome of R. equi infection. Neutrophil-depleted mice had significantly heavier tissue burdens of R. equi following experimental infection than non-depleted mice, documenting a protective role for neutrophils [48]. The neutrophil concentrations at 2 and 4 weeks of age were significantly lower among foals that subsequently developed R. equi pneumonia than among age-matched foals that did not develop pneumonia [49]. Similar protective roles for neutrophils have been documented for other intracellular pulmonary pathogens [48,50,51]. Moreover, R. equi has an agedependent distribution (i.e., foals are usually affected and adults are generally resistant to infection), and age-related differences in neutrophil responses to R. equi have been documented [52][53][54]. Although neutrophils play a role in protecting against R. equi infection, they also contribute to lung parenchymal damage of this pyogranulomatous disease [44,55]. In a mouse model of ulcerative colitis, the over-abundance of neutrophil invasion into tissue mediated by TRPM2 expression, led to increased colonocyte death [43]. Thus, variation in TRPM2 expression could influence the extent to which neutrophil-induced pulmonary damage occurs following infection with R. equi, and this variation could be a crucial determinant of the clinical outcome of infection with R. equi and the progression from subclinical to clinical pneumonia. If the TRPM2 allele implicated in our study were associated with increased TRPM2 expression, it might consequently be associated with greater likelihood of pneumonia development as a result of greater neutrophil invasion. The functional effects of this TRPM2 genotype are unknown, however. Interestingly, expression of a splice variant of TRPM2 was demonstrated to inhibit death of several cultured cell lines [56]. Further evaluation of the functional differences in neutrophilic responses among TRPM2 genotypes is warranted.
A CNV-based GWAS was conducted and revealed no significant association with disease status. Despite the negative findings, these results are of interest with regard to better characterizing genetic variation in horses and using CNVs for GWAS with disease outcomes. A search of PubMed reveals no other attempt to perform a CNV GWAS via aCGH in horses and to the authors' knowledge no others have been done. Valuable information was gained in terms of the analysis of CNV data generated from a GWAS using aCGH. The identification of CNVs is based on log 2 ratios of intensity signals that are generated between the reference and sample DNA. The results may be interpreted either as presence or absence of a CNV, based on a threshold intensity value [41], or the actual intensity values themselves. As observed in this study, these 2 outcomes for CNVcalling yielded differing results. We propose that utilizing average log 2 ratios across CNV regions is superior because classification of CNVRs by presence or absence does not further characterize a  CNV, whereas analysis of the log 2 ratios allows for the identification of whether the CNV involved gains or losses, and for description of the magnitude of the gain or loss. We failed to identify significant association with candidate genes previously associated with R. equi pneumonia in other breeds of foals [1,7,8]. This may have been attributable to differences among populations of foals studied (e.g, breeds) or study methodology (e.g, case definitions, methods for detecting polymorphisms, etc.). Nevertheless, a commonality among these studies can be found in their identification of genes pertaining to host defenses against infectious pathogens, such as iron transport and innate immune responses. Conceivably, these apparently discrepant findings may converge on critical biological pathways or processes that influence susceptibility to infection with R. equi (and other intracellular pathogens).
This study had a number of limitations. First, it was likely underpowered for both the SNP-and CNV-based GWAS portions of the study: we only had the opportunity to find SNPs or CNVs with large effects. This lack of power is probably why we did not identify highly significant associations in the SNP-based GWAS for 2 of our 3 comparisons (Figure 1), and only moderate significance for the allele identified in comparison 2. Although sample sizes may be calculated for human studies, methods for incorporating crucial determinants of sample size such as the impact of the relatively longer length of linkage disequilibrium in horses relative to humans remain to be defined. Moreover, the cost of GWAS studies and the limited funding available for equine research can be restrictive. It is worth noting that our SNP-based GWAS did provide sufficient power to identify a SNP associated with R. equi that was subsequently substantiated by findings of the joint analysis using PCR testing of additional foals from this population for all 3 comparisons. Nevertheless, larger scale studies are indicated. For example, 1 locus on chromosome 15 (in the promoter region of a chemokine) was weakly associated in both the mixed-effects modeling and standard GWAS analyses for comparison 3: this finding could be simply attributable to chance but also could represent an underpowered association that merits further investigation.
A limitation of this study is that it was restricted to foals of a single breed at a single farm. Further studies are indicated to substantiate whether the observed association holds among other Quarter Horse foals at other farms, and among foals of other breeds. Another limitation was that although the mixed modeling for comparison 3 yielded a marked reduction in the estimated value of l (from 1.44 to 1.10), the mixed modeling value of l suggested residual confounding from population structure. Graphical analysis of the results of mixed modeling, however, suggested this was not the case: the plot of observed versus expected P values indicated that fewer than expected small P values were identified and more than expected large P values were observed following mixed-effects modeling, and Manhattan plots of the standard and mixed-effects model GWAS for comparison 3 revealed a shift to larger P values following mixed-effects modeling ( Figure S1). Moreover, the linear regression-based estimate of l had a poor fit because the data were non-linear in the range of the high P values (data not shown). We interpreted these results to indicate that the mixed-effects modeling had largely corrected for population stratification attributable to sire, and that the study was underpowered. As noted, a limitation of this study is that we lack fine-mapping of the region to determine whether the TRPM2 allele is a causal variant or simply in linkage disequilibrium with another gene or genes that contribute to susceptibility to R. equi. Finally, it must be noted that although the estimated OR for the TRMP2 SNP was relatively large, it is not of sufficient magnitude to be clinically useful for screening purposes.
Despite the aforementioned limitations, this study has identified a region on chromosome 26 that was associated with R. equi pneumonia in foals, and the largest scale GWAS in foals reported to date. Furthermore, it extends current knowledge of equine CNVs and analysis of data from GWAS using aCGH in horses.

Conclusions
In conclusion, the results of this study identify a locus and gene potentially involved in the development of R. equi pneumonia in foals. Future studies are warranted to substantiate the association of TRPM2 gene (and related pathways) with R. equi pneumonia and to provide fine-mapping of the region on chromosome 26 implicated in this GWAS.

Acknowledgments
We acknowledge the 6666 Ranch for providing access to the foals studied; Ms. Ellen Ruth Alexander for help with the PCR genotyping; and, Mr. Nick Culliton for assisting with the DNA extractions. Figure S1 Quantile-quantile (QQ) plots of expected chisquared significance values plotted against the observed Table 4. Top 5 CNV regions identified using logistic regression for the association of R. equi with either the binary variable presence or absence of a CNV identified in the region (Presence columns) or the log 2 ratio of intensity values of the CNVs (Intensity columns).

Presence
Intensity (    Author Contributions