Skip to main content
Advertisement

< Back to Article

Fig 1.

SigNet overview.

Population cohorts (top) are genotyped and phenotyped in a genome-wide association study (GWAS). The study identifies genetic variants, usually single-nucleotide polymorphisms (SNPs, indicated by vertical bars overlayed on double-stranded DNA), that are associated with the phenotype at genome-wide significance. These SNPs occur throughout the genome, and each SNP defines a genomic region, or locus, that likely contains a gene with a causal relationship with the phenotype. Each locus may contain several genes (arrows above and below the double helix indicate genes on the positive and negative strand), and three loci are depicted. The SigNet method integrates within-locus and between-locus information from DNA-based, RNA-based, and protein-based evidence to select the most likely causal gene at each locus. Locus 1 (red): a SNP in a protein-coding region may change the amino acid sequence of the encoded protein, indicated by the star overlaying the gene symbol and protein. Similarly, a gene in the region may be known to cause a Mendelian disease related to the GWAS phenotype, indicated as a familial case. At this locus, the red gene is selected as most likely. Locus 2 (orange): a SNP may affect the transcriptional regulation of a nearby gene, indicated by the orange arrow from the SNP to the gene transcription start site. The corresponding mRNA transcript may have altered abundance, indicated by the multiple transcripts. These SNPs are expression quantitative trait loci (eQTL), and colocalization of a GWAS association with an eQTL association provides evidence for the most likely causal gene. Methods such as transcriptome-wide association studies (TWAS) provide a similar type of evidence. Locus 3 (green): Many loci are information-poor, with no within-locus evidence and a default approach of selecting the gene closest to the SNP. The SigNet method adds between-locus information using a probability model for the network formed by protein-protein interactions and gene-regulatory interaction of the genes selected at each locus. The green gene product interacts with proteins encoded by genes selected at the other loci, and its causal likelihood is calculated to be higher than the other genes in the locus, including the gene closest to the GWAS SNP.

More »

Fig 1 Expand

Table 1.

Cardiovascular GWAS data.

More »

Table 1 Expand

Table 2.

Cardiovascular GWAS loci.

More »

Table 2 Expand

Table 3.

Cardiovascular Mendelian genes.

More »

Table 3 Expand

Table 4.

Functional evidence.

More »

Table 4 Expand

Fig 2.

Selection frequency: Fraction of SigNet runs where a gene was selected as the active gene within its locus, averaged over 100 runs.

Gene weight: Bayesian scores expressed as gene weights, as defined by Eq (41), averaged over final values from the same 100 runs.

More »

Fig 2 Expand

Table 5.

Parameter values.

More »

Table 5 Expand

Table 6.

Selection of genes by level of information at the GWAS locus.

More »

Table 6 Expand

Table 7.

Selection of genes by level of information for the gene.

More »

Table 7 Expand

Fig 3.

Distribution of the signed distance from a GWAS SNP to the transcription start site of the active gene selected at each locus.

Distributions are shown for genes selected by a minimum distance criterion, by best guess initialization, and by SigNet. Learned distribution: exponential distribution with converged distance parameter 161.3 kb used by SigNet.

More »

Fig 3 Expand

Table 8.

Number recovered, using information: Number of genes in the specified category selected by SigNet most often over 100 runs.

Number recovered, hiding information: Number of genes in the specified category selected by SigNet, hiding the functional information and performing 100 runs for each of the 37 genes in turn.

More »

Table 8 Expand

Table 9.

Cardiovascular pathway enrichment, all 245 loci.

More »

Table 9 Expand

Table 10.

Cardiovascular pathway enrichment, all 245 loci.

More »

Table 10 Expand

Fig 4.

Density plot of gene scores from SigNet compared with PoPS for the PoPS phenotypes AFib (left) and Cardio (right).

More saturated colors indicate higher density, with contour lines from kernel density estimation.

More »

Fig 4 Expand

Table 11.

Comparison of SigNet and PoPS gene rankings.

More »

Table 11 Expand

Table 12.

Genes with strong functional evidence found by SigNet but not by PoPS.

More »

Table 12 Expand

Fig 5.

GWAS loci with Mendelian evidence.

SigNet selects the gene with Mendelian evidence (green rectangle) over the closest gene in the locus to a GWAS SNP (gray rectangle). Pink ovals represent genes with Mendelian evidence; yellow ovals represent colocalized genes; white ovals represent information-poor genes; and gray lines represent protein-protein interactions. Networks are shown for three individual loci, highlighting the gene selected by SigNet: (a) CACNA1C, (b) KCNQ1, (c) SLC4A3.

More »

Fig 5 Expand

Fig 6.

GWAS loci with exome-chip or colocalization evidence.

SigNet selects the gene with exome-chip or colocalization evidence (green rectangle) over the closest gene in the locus to a GWAS SNP (gray rectangle). Pink ovals represent genes with Mendelian evidence; white ovals represent information-poor genes; gray lines represent protein-protein interactions; and green arrow represents gene-regulatory interaction. Networks are shown for two individual loci, highlighting the gene selected by SigNet: (a) CASR, (b) CAV1.

More »

Fig 6 Expand

Fig 7.

GWAS loci with no functional evidence.

SigNet selects the gene (green rectangle) based on network connectivity with genes selected at other loci, over the closest gene in the locus to a GWAS SNP (gray rectangle). Pink ovals represent genes with Mendelian evidence; orange ovals represent exome-chip evidence; yellow ovals represent colocalized genes; white ovals represent information-poor genes; and gray lines represent protein-protein interactions. Networks are shown for loci containing (a) STK38, (b) PMP22.

More »

Fig 7 Expand

Fig 8.

GWAS loci where multiple genes may be causal.

SigNet selects the gene (green rectangle) based on within locus and across loci evidence. SigNet+ augments the selection with other genes in the locus that have functional evidence (gray rectangle). Pink ovals represent genes with Mendelian evidence; orange ovals represent exome-chip evidence; yellow ovals represent colocalized genes; white ovals represent information-poor genes; gray lines represent protein-protein interactions; and green arrows represent gene-regulatory interactions. Networks are shown for loci containing (a) KCNE1, (b) SCN10A, (c) JOSD1, (d) ATP2A2, (e) NKX2–5.

More »

Fig 8 Expand