Heterogeneity analysis provides evidence for a genetically homogeneous subtype of bipolar-disorder

Caroline C. McGrouther; Aaditya V. Rangan; Arianna Di Florio; Jeremy A. Elman; Nicholas J. Schork; John Kelsoe; Bipolar Disorder Working Group of the Psychiatric Genomics Consortium

doi:10.1371/journal.pone.0314288

Abstract

Background

Bipolar Disorder (BD) is a complex disease. It is heterogeneous, both at the phenotypic and genetic level, although the extent and impact of this heterogeneity is not fully understood. One way to assess this heterogeneity is to look for patterns in the subphenotype data. Because of the variability in how phenotypic data was collected by the various BD studies over the years, homogenizing this subphenotypic data is a challenging task, and so is replication. An alternative methodology, taken here, is to set aside the intricacies of subphenotype and allow the genetic data itself to determine which subjects define a homogeneous genetic subgroup (termed ‘bicluster’ below).

Results

In this paper, we leverage recent advances in heterogeneity analysis to look for genetically-driven subgroups (i.e., biclusters) within the broad phenotype of Bipolar Disorder. We first apply this covariate-corrected biclustering algorithm to a cohort of 2524 BD cases and 4106 controls from the Bipolar Disease Research Network (BDRN) within the Psychiatric Genomics Consortium (PGC). We find evidence of genetic heterogeneity delineating a statistically significant bicluster comprising a subset of BD cases which exhibits a disease-specific pattern of differential-expression across a subset of SNPs. This disease-specific genetic pattern (i.e., ‘genetic subgroup’) replicates across the remaining data-sets collected by the PGC containing 5781/8289, 3581/7591, and 6825/9752 cases/controls, respectively. This genetic subgroup (discovered without using any BD subtype information) was more prevalent in Bipolar type-I than in Bipolar type-II.

Conclusions

Our methodology has successfully identified a replicable homogeneous genetic subgroup of bipolar disorder. This subgroup may represent a collection of correlated genetic risk-factors for BDI. By investigating the subgroup’s bicluster-informed polygenic-risk-scoring (PRS), we find that the disease-specific pattern highlighted by the bicluster can be leveraged to eliminate noise from our GWAS analyses and improve risk prediction. This improvement is particularly notable when using only a relatively small subset of the available SNPs, implying improved SNP replication. Though our primary focus is only the analysis of disease-related signal, we also identify replicable control-related heterogeneity.

Citation: McGrouther CC, Rangan AV, Di Florio A, Elman JA, Schork NJ, Kelsoe J, et al. (2025) Heterogeneity analysis provides evidence for a genetically homogeneous subtype of bipolar-disorder. PLoS ONE 20(1): e0314288. https://doi.org/10.1371/journal.pone.0314288

Editor: Wan-Tien Chiang, Augusta University, TAIWAN

Received: June 25, 2024; Accepted: November 7, 2024; Published: January 29, 2025

Copyright: © 2025 McGrouther et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: There are no primary data in the paper; all of the code is available at https://github.com/adirangan/ The data which is analyzed can be requested from the Psychiatric Genomics Consortium at: https://pgc.unc.edu/.

Funding: AVR and NJS are supported in part in part by the National Institutes of Health grant: U19AG023122. The funders did not play any role in the study design, data collection and analysis, decision to publish or preparation of this manuscript.

Competing interests: The authors have declared that no competing interests exist.

Background

Overview

Bipolar disorder (BD) is a brain disorder characterized by shifts in mood, energy and attention/focus [1]. BD affects roughly 50 million people across the world, with a mean age of onset of 20 years and an estimated lifetime prevalence of ∼1% [2–5]. BD is also highly heritable [6], with heritability estimates of 40% or higher [7–11] and evidence of increased risk when family-members exhibit other psychiatric disorders [7, 10, 11].

There is growing consensus that BD is heterogeneous, both at the phenotypic and genetic level [12–23]. For example, diagnostic systems usually consider at least two subtypes of bipolar disorder: bipolar I and bipolar II. The diagnostic criteria for bipolar I require the presence of at least one manic episode, while those for bipolar II require at least one hypomanic and one major depressive episode [1]. Response to medication (such as lithium) is highly heterogeneous across patients, and genetic predictors of drug-response have been difficult to clearly determine and replicate [24–28].

The high degree of heterogeneity for BD at the clinical and phenotypic level may make it more difficult to identify genetic risk-factors for BD. To briefly summarize: while the overall heritability of BD is estimated at ∼40%, the overall single-nucleotide-polymorphism (SNP) heritability is only ∼ 18.6% [29], which is moderate when compared to many other psychiatric and neurological disorders [6, 30–38]. Recent genome-wide association studies (GWASs) have been used to identify several (i.e. ∼ 100) independent loci associated with BDI and BDII, with the overall variance explained by SNPs reaching ∼ 15 − 18% [29]. However, many of the loci that seem promising in one cohort fail to replicate in other cohorts [23, 39, 40]. Studies attempting to uncover gene-environment interactions in BD have also encountered challenges finding replicable signals [41–45].

Rather than focusing on small sets of loci, one can also consider collections of SNPs which individually may not be of genome-wide significance. Along this vein, Polygenic-risk-scores (PRSs), which are usually weighted sums of genetic variants, have been used to summarize the genome-wide risk for BD [46]. These PRSs may provide an estimate of overall risk and/or severity: those individuals with PRSs in the top 90% were 3.62 times more likely to be a case than those with average PRSs. These PRSs also contain information regarding multiple phenotypic traits, including the risk of other psychiatric disorders, psychopathology, educational attainment and more [47–55]. Despite these successes, to the best of our knowledge, no individual PRS has yet been able to explain a large fraction of the variation between the main bipolar subtypes.

The high degree of heterogeneity within BD poses a challenge to understanding its etiology and developing new interventions. Ultimately, a comprehensive depiction of the landscape of BD will involve clear descriptions of the heterogeneity at the phenotypic level, as well as at the genetic level.

To date, the main research efforts aimed at understanding the genetic heterogeneity underlying BD have focused on (i) increasing the power of BD meta-GWAS, (ii) running subphenotypic-specific meta-GWAS, and (iii) performing pathway-specific analyses [56–58]. These research efforts are non-trivial and in some cases require insights we do not yet have. Generally speaking, recruiting, assessing, and genotyping new subjects is expensive; there is often a trade-off between the quantity of subjects that can be recruited and the ‘quality’ or accuracy with which their data is processed. For example, one promising resource for genotyped data is 23andMe, but many of the data-sets available through this resource rely on self-reported diagnoses [59]. Consequently, any synchronization effort involves the integration and harmonization of data collected using different phenotypic instruments or genotyping methods and may inadvertently introduce non-disease-related signal. Furthermore, in many cases, the relevant subphenotypic information was not collected at all, forcing interested researchers to contact prior participants or lose those data points entirely. Finally, even when promising results are obtained, it is not always easy to find an appropriate replication sample [60]. Since we do not yet know which trait or combination of subphenotypic traits (if any) is responsible for BD genetic heterogeneity, it is not always clear how best to proceed.

Contribution

Ultimately, we seek to investigate the genetic heterogeneity of BD by using an approach which does not require the user to provide pathways or subphenotypes. As described below, we introduce a methodology which first uses the genotyped data to identify a genetic subgroup within BD, and then uses that genetic subgroup for downstream analyses (in this case risk prediction). To briefly summarize: we use a covariate-corrected biclustering algorithm to search for statistically significant biclusters comprising subsets of BD cases which exhibit disease-specific patterns of differential-expression across subsets of SNPs. In this study we find one statistically-significant disease-specific structure, which is limited to only a fraction of the case-subjects. These case-subjects collectively exhibit a shared pattern of differential-expression—i.e., a form of genetic homogeneity—which is not shared by the other BD-cases nor by the control-subjects; we refer to this bicluster as a ‘genetic subgroup’. We then demonstrate that this genetic subgroup is useful for risk-prediction.

In more detail, our analysis begins by collecting data within which to search for genetic subgroups of BD. As members of the Psychiatric Genomics Consortium (PGC), we had access to the raw genotypes of ∼ 18K BD cases and ∼ 30K controls. This data was generated by 27 studies and genotyped on a variety of platforms (OMEX, Affymetrix, Illumina). When the PGC analyzed this data [20], they synchronized the data using imputation. We were not certain how imputation might impact the potentially subtle relationships between BD cases, and therefore decided to limit our analysis to the available raw genotyped data [60]. This choice to limit ourselves to raw genotyped data placed constraints on our choices for the training and testing data sets, as the various genotyping platforms types emphasize different SNP sets (see Fig 1).

Download:

Fig 1.

In this figure we illustrate the absolute (right) and relative (left) snp overlap between the studies available to us. The relative-overlap is calculated using the Szymkiewicz–Simpson coefficient (i.e., the overlap-coefficient between sets X and Y is |X ∩ Y|/min(|X|, |Y|)). Guided by the relative-overlap and genotyping platform used, we divided the studies into four arms (shown along the coordinate axes). The first arm contains only the single ‘BDRN’ data-set, which we use as a training/discovery set to search for heterogeneity (see Methods). We reserve the remaining studies (organized into three arms) for replication. Note that the training-set overlaps strongly with arm-2, and less strongly with arm-3 and arm-4. The magnitude of this overlap will constrain how faithfully any patterns of differential-expression found in arm-1 can possibly manifest within the other arms (see Figs 3–5).

https://doi.org/10.1371/journal.pone.0314288.g001

In order to minimize batch-effects and reduce the chances of spurious false-positives, we chose to initially focus our primary analysis on a relatively large curated study from the Bipolar Disorder Research Network (BDRN) comprising raw genotyped data collected across 2524 BD cases and 4106 controls (OMEX platform) [18]. We use this BDRN study as our training-arm, and set aside the remaining independent data for subsequent replication analyses (i.e., our replication-arms). We grouped all the BD cases in our training-arm together and searched within the training-arm for any subsets of subjects which exhibited a distinct genetic signature (i.e., differential expression) across a subset of SNPs. Any such subset of subjects along with the associated subset of differentially-expressed SNPs is referred to as a ‘bicluster’, or a ‘genetic subgroup’.

As described in [61, 62], many commonly used biclustering approaches suffer from two methodological issues. First, a bicluster that is found within the case-population may not be disease-related, as a similar signal may be found within the control-population (e.g., a bicluster representing non-disease-specific heterogeneity). Second, many biclustering algorithms proceed under the assumption that biclusters exist, often identifying ‘false-positive’ structures that are not statistically-significant.

To address these issues we searched for biclusters using the ‘half-loop’ algorithm of [63, 64]. As described in [64], this algorithm ensures that the pattern of differential-expression within the bicluster is not similarly present within the control-population, reducing the likelihood that we highlight structures unrelated to disease status. Second, the half-loop algorithm uses a permutation-test to estimate the p-value of each bicluster found, allowing us to test against the null hypothesis that no bicluster exists. Finally, the half-loop algorithm also allows us to correct for other covariates, such as proxies for genetic-ancestry (see Methods). While our approach is much simpler than some of the more recent machine-learning approaches, our biclusters are directly associated with subject- and SNP-subsets, which can be directly interpreted and assessed for homogeneity and/or used in downstream analyses.

Using the relatively conservative half-loop method mentioned above, we found strong evidence for genetic heterogeneity. We discovered one bicluster which is statistically significant and which replicates in all three other data-sets. This primary bicluster was enriched for (but not completely driven by) BDI over BDII. After removing this bicluster we saw further evidence of residual heterogeneity, but our training data-set was not sufficiently powered to clearly identify a secondary bicluster.

We then assessed the role of our bicluster in risk-prediction. We found that the subset of case-subjects highlighted by the bicluster can be used to improve the performance of a PRS. This advantage was more pronounced when (i) the SNPs included in the PRS were limited to those of high estimated significance, and (ii) the case-population was limited to those diagnosed with BDI. These observations suggest that focusing on genetically identifiable subgroups of BD-subjects might improve overall risk-prediction and enhance replication across the top SNPs.

Finally, we also ran a simple gene-set over-representation analysis, revealing that the genetic subgroup identified above (i.e., the bicluster) is significantly enriched for many pathways associated with neuronal development and maintenance.

In summary, we find strong evidence for the genetic-heterogeneity of BD in the form of a bicluster. Notably, BD subphenotype information was not required to identify this signature, nor were rare-variants (i.e., we relied only on common SNPs with maf greater than 25% within the training-arm). The signature of this bicluster has the potential to refine downstream analyses (e.g., improving genome-wide risk-prediction), and the associated gene-enrichment suggests an association with certain mechanisms of neuronal development.

Methods

In this section we describe several aspects of our methodology. An even more detailed description, including an outline of the steps involved and the considerations we made along the way, is available in S1 Text.

Data

We make use of data from 27 of the cohorts described in [20]. These cohorts have been curated as described in [20] and its supplementary information, and include de-identified subjects from several countries in Europe, North America and Australia, totaling over 18000 cases and 29000 controls of European descent. Case-subjects were required to meet international consensus criteria (DSM-IV, ICD-9, or ICD-10) for a lifetime diagnosis of BD established using structured diagnostic instruments from assessments by trained interviewers, clinician-administered checklists, or medical record review. Control-subjects in most samples were screened for the absence of lifetime psychiatric disorders, as indicated. For each of the 27 cohorts, we had access to both the raw genotypes and the imputed data generated by Stahl et al. using the 1000 Genomes (1KG) European reference-panel (see [20]).

Due to the details of our heterogeneity analysis (described further below), we make three additional choices. First, for our primary analysis we use only the raw genotyped data within each cohort, but not the imputed data. This is because we want to avoid any concerns of spurious correlations that might arise from imputation [60]. Second, when running our biclustering algorithm we do not explicitly correct for linkage-disequilibrium (LD) between genotyped SNPs at the level of the data-set itself (e.g., by eliminating SNPs in strong LD with other SNPs). Instead, we implicitly correct for LD within our biclustering algorithm by contrasting cases against controls. Third, it is typically quite difficult to reliably detect signal associated with rare variants (i.e., SNPs with a low minor-allele-frequency, a.k.a. ‘maf’), especially when the power of the data-set is low. This difficulty is compounded when searching for heterogeneity, as the effective sample-size (e.g., the number of subjects in a bicluster) is further reduced—often only a fraction of the total subject-population [64]. Thus, in order to avoid spurious results associated with rare-variants, we limit our analysis to common variants (i.e., SNPs with maf greater than 25%). This high maf-threshold has the added benefit that the signals that we do find are described in terms of common variants, which will hopefully be easier to access in future studies.

As shown in Fig 1, the common genotyped SNP-overlap between the cohorts varies significantly. Cohorts that were genotyped using similar platforms tend to have large SNP-overlaps, while those genotyped on different platforms tend to have smaller SNP-overlaps. After clustering the cohorts by platform (and removing any duplicate subjects across cohorts) we defined four ‘arms’, as shown along the axes in Fig 1. Arm-1 consists of the single cohort labeled ‘BDRN’ (2524 cases, 4106 controls, OMEX). Arm-2 includes cohorts ‘may1’ through ‘rom3’ (5781 cases, 8289 controls, OMEX). Arm-3 includes cohorts ‘bonn’ through ‘bmpo’ (3581 cases, 7591 controls, Illumina). Arm-4 includes cohorts ‘dub1’ through ‘gain’ (6825 cases, 9752 controls, Affymetrix).

The first arm (comprising the single cohort ‘BDRN’) is relatively large and collected within the UK, comprising case-subjects of European descent over the age of 17 (see [18, 65, 66] for details). As a result, we expect this cohort to be less susceptible to spurious heterogeneity associated with batch-effects, and we use this cohort as a ‘training’ or ‘discovery’ arm, reserving the other three independent data-sets for validation (i.e., ‘replication’ arms). This training-arm has a large SNP-overlap of ∼ 85% with arm-2, and a smaller SNP-overlap with arms 3 and 4 (i.e., ∼ 50% and ∼ 30%, respectively). Correspondingly, we expect that any signal involving a multi-SNP-pattern found in arm-1 will only have an opportunity to replicate strongly in arm-2, and will not have the opportunity to replicate as strongly in arms 3 and 4 (as we will have fewer SNPs to use for validation).

Ethics statement.

We first obtained access to this data on 2013–11-25, and we have never had access to any information that could identify individual participants during or after data collection. This study was approved by the institutional review boards (IRBs) at University of California, San Diego as well as New York University. Because all the data we were working with had been de-identified, both IRBs certified our study as exempt from review and continuing review. The metadata for the subjects included genome-wide principal-components, which we used as a proxy for ancestry and corrected for in our primary- and secondary analyses, as described below. The metadata also included sex, whose effect we assessed a-posteriori (see Fig 17 in S1 Text). We did not have access to confounding variables such as socioeconomic status, nutrition, environmental exposures, or other similar factors, and could not correct for these in our analysis.

Correcting for ancestry

We use the genome-wide principal-components calculated by Stahl et al. to assess relatedness and correct for ancestry. Of the first 20 principal-components, denoted {U₁, …, U₂₀}, Stahl et al. determined that the first six principal-components (U₁-U₆) and U₁₉ showed significant correlation with the main phenotype across the studies considered in [20].

In the discovery phase of our analysis we are restricted to the training-arm (arm-1). For this sample we use an F-test applied to a nested logistic model, which selects only the first two principal-components (i.e., U₁ and U₂) as significantly related to case-control status in arm-1. Therefore, to mimic the analyses one might conduct with access only to arm-1, we correct our biclustering algorithm for these two principal-components, under the assumption that they are a proxy for ancestry.

In all but this initial biclustering analysis on arm-1, we remain consistent with [20] and correct for principal-components U₁ through U₆, as well as U₁₉. This includes both the calculation of A U C in the subsequent replication studies (e.g., A(i) and A′(i) in Fig 3) as well as the PRS-analysis described below.

Biclustering

For our initial biclustering of arm-1 we use the half-loop method of [64]. To briefly summarize the method, we first introduce some notation. Assume that the data-set contains M_D case-subjects, and M_X control-subjects, each measured across N allele-combinations (note, each SNP is associated with three allele combinations: heterozygous and homozygous dominant and recessive). We denote the array of case-subjects by D, with D(j_D, k) referring to allele-combination-k in case-subject-j_D. Similarly, we denote the array of control-subjects by X, with X(j_X, k) referring to allele-combination-k in control-subject-j_X. We’ll use the generic subject-index j to refer to both the j_D and the j_X.

In its most basic form, the half-loop algorithm proceeds as follows:

Step-0 First we load/initialize the data-arrays D and X.
Step-1 For each case j_D and allele-combination k, we measure the fraction of other cases in D which share that allele-combination, denoted by [D ← D](j_D, k). Similarly, we measure the fraction of controls in X which share that allele-combination, denoted by [D ← X](j_D, k). The difference between these two values, denoted by Q(j_D, k) = [D ← D](j_D, k) − [D ← X](j_D, k) is a measure of differential-expression.
Step-2 After calculating Q(j_D, k), we form the ‘row-scores’ Q^row(j_D) = ∑_k Q(j_D, k), as well as the ‘column-scores’ and the ‘trace’ . The row- and column-scores measure how strongly each case-subject and allele-combination contribute to the trace, which is itself a measure of the overall differential-expression exhibited between D and X.
Step-3 We remove a small fraction of case-subjects and allele-combinations from D with the lowest row- and column-scores.
Step-4 We return to Step-1, iterating until there are no more case-subjects within D.

The algorithm proceeds iteratively; at each iteration i we remove a small fraction γ of the remaining case-subjects and allele-combinations. In this analysis we choose γ = 0.5⁸ ∼ 0.004, which is sufficiently small that we expect statistical convergence of the algorithm’s accuracy (see Fig 32 in supplementary section 7.3 in [64]). After each iteration i, a subset comprising M(i) case-subjects and a subset comprising N(i) allele-combinations remain, together forming an M(i) × N(i) sub-array D(i) of the original D. If the case-array D were to contain a bicluster with a sufficiently strong signal, then the rows and columns of that bicluster would be retained until the end, with the other rows and columns eliminated earlier.

This half-loop method has detection-thresholds similar to spectral-clustering and message-passing [67, 68], but has several additional useful features. First, the half-loop method allows us to search for disease-specific heterogeneity by directly correcting for control-subjects. This case-control correction also motivates the null-hypothesis H0 described below; the permutation-test allows us to avoid spurious structures that are unrelated to the disease-label. Second, the half-loop scores in Step-1 allow us to (implicitly) correct for linkage-disequilibrium (LD). More specifically, subsets of SNPs which are in equally strong LD in both the case- and control-populations will be excluded as the algorithm proceeds, unless some of those SNPs are involved in a pattern of differential-expression specific to the remaining case-subjects, in which case they will be retained (as desired). Third, the method also allows us to correct for continuous covariates. This covariate-correction is described in detail in [64], but essentially amounts to a reweighting of the Q(j, k) in Step-1 to reduce the overall level of differential-expression contributed by structures which are not evenly distributed in covariate-space. Finally, the method itself is rather straightforward and does not require the fine-tuning of parameters.

As mentioned in Step-2, the overall level of differential-expression between D(i) and X at each iteration is recorded as the trace . The significance-level of is determined with respect to a null hypothesis (H0) which assumes that the heterogeneity is independent of case- and control-labels. Samples from H0 are drawn by randomly permuting the case- and control-labels in arm-1 (i.e., randomly interchanging rows of D and X) while respecting proximity in covariate-space. By comparing the values of the from the original data to the distribution of associated with the null-hypothesis, we assign an (empirical) training-p-value to the individual for each iteration i. Similarly, we calculate an overall empirical training-p-value (across all iterations), which estimates the probability that the trace from the original data-set could be drawn from the null-hypothesis.

Within this context, the detection of a disease-specific bicluster corresponds to an elevated (i.e., statistically-significant) value of . The case-subjects and allele-combinations comprising the bicluster can then be approximated by the subsets and for those i.

Replication

When discussing any particular replication-arm (e.g., arm-2), we will use primed indices (e.g., cases and controls will be indexed via and ). To assess replication we first consider the set of allele-combinations available within the replication-arm. This subset will limit the alleles we can use from within the original training-arm (i.e., arm-1). For any iteration i, we select the allele-subset from the training-data-set, and then construct the intersection . For the replication-arm arm-2 the allele set will have a size N′(i), which is typically around 85% of N(i) (i.e., 85% of the full size of ). For the other replication-arms (i.e., arms 3 and 4) the overlap will be lower. Using as well as the case-subject subset , we define the M(i) × N′(i) submatrix D′(i) within the training data (note D′(i) is a submatrix of the M(i) × N(i) submatrix D(i) defined above). We then calculate the dominant SNP-wise principal-component of D′(i).

We project each subject within the training-data-set onto v(i), producing a ‘bicluster-score’ (i.e., a single number) for each case-subject in the training-data-set, and for each control-subject in the training-data-set (recall that j_D and j_X index the case- and control-subjects in the training-data-set). Based on the definition of the bicluster, we expect that the typical values of will be larger than the typical values of . We measure this difference by calculating the area under the receiver-operator-characteristic curve (A U C) between the sets and ; we refer to this A U C as A(i). When calculating A(i) we correct for the same ancestry-related covariates as in [20] (see Methods and [60]).

We also project each subject in the replication-arm onto the same vector v(i), producing bicluster-scores for each case-subject in the replication-arm, and for each control-subject in the replication-arm. Once again, we expect that the typical values of will be larger than the typical values of in the replication-arm. We measure this difference by calculating the A U C A′(i), once again correcting for the ancestry-related covariates.

We assess the overall significance of the replication by considering a null-hypothesis where the structure of the replication-arm is independent of disease-status. We can draw a sample from this null-hypothesis (H0’) by randomly permuting the case- and control-labels within the replication-arm (while respecting proximity in covariate-space). In this manner we compare the original replication A U C A′(⋅) (as a function of i) to the distribution of A′(⋅) obtained under H0’.

Later on below (e.g., Fig 3) we calculate the average of A′(⋅) over a range of iterations, and then compare to the distribution of obtained under this label-shuffled null-hypothesis. We define the range of iterations by taking an interval which is significant for both the trace and the A U C A(i) defined using only the training-arm. For example, in Fig 3 we consider the range of iterations i ∈ [175, 350].

Polygenic-Risk-Scores (PRSs)

We calculate PRSs using the general strategy from [20], and further described in page 60 of the S1 Text within that paper. To briefly summarize: We use the genotype-level data from [20], which was imputed using the 1KG reference-panel. We then run a GWAS on this genotype-level data. This GWAS produces summary-statistics defined by contrasting cases and controls from the training-arm, while correcting for ancestry-related covariates. Once we have the summary-statistics defined by the GWAS, we run Plink’s ‘clump’ function to account for LD. We perform this clumping step using the same parameters as in [20] (e.g., info-score threshold of 0.9, R²-threshold of 0.1, genomic window of 500Kb, and minor-allele-frequency threshold of 0.05.) As a technical note: our ultimate goal is to analyze these PRS scores in the context of our heterogeneity analysis, which can be influenced by subtle relationships between SNPs. Consequently, we wanted to use the most accurate available information regarding LD. After the initial data-sets described in [20] were published, the Haplotype Reference Consortium European Reference Panel (HRC EUR panel) became available through the Wellcome Trust Sanger Institute [69]. This HRC EUR panel dramatically increased the amount of information available for approximating LD, and we use this panel when clumping our summary statistics. Finally, after clumping, we use the assigned weights for each SNP to form a PRS. We test the performance of this PRS on our replication-arms.

For any subject j′ within a particular replication-arm, we denote by PRS_wide(j′) the ‘population-wide’ PRS defined by contrasting all the cases in the training-arm with the controls in the training-arm (when generating the summary-statistics). We further denote by the population-wide PRS constructed after restricting the SNP-weight-vector to include only those SNPs with individual GWAS p-values that are more significant than the threshold (when forming the PRS).

We also define a ‘bicluster-informed’ PRS, denoted by PRS_bicl(j′;i), by contrasting only the cases in D(i) with the controls from the training-arm (when generating the summary-statistics). We further denote by the bicluster-informed PRS constructed after restricting the SNP-weight-vector to include only those SNPs with individual GWAS p-values that are more significant than the threshold (when forming the PRS). With this notation PRS_wide(j′) and are equivalent to PRS_bicl(j′;1) and , respectively. However, we will typically consider PRS_bicl for iterations i ∈ [175, 350]; in this range and will differ.

We measure the performance of the population-wide PRS_wide(j′) by calculating the AUC_wide between the case-values and the control-values , once again correcting for the ancestry-related covariates. Similarly, we measure the performance of , PRS_bicl(j′;i) and by calculating the associated AUCs, denoted by , AUC_bicl(i) and , respectively.

Gene-enrichment analysis

We perform a simple over-representation analysis using the go_bp ontology from Seek [70]. We restrict our attention to the 132 neuronally-related pathways (i.e., those referencing neurons, synapses or axons). For any given iteration i we consider the remaining allele-combinations within , retaining those genes which have more than half their originally associated alleles remaining. These retained genes form a gene-set which we then overlap with each pathway to obtain the intersection . From this intersection we obtain the gene-count for pathway l at iteration i.

We assess the significance of the gene-counts by considering the same null-hypothesis H0 used when biclustering. We compare each of the κ(i, l) to the distribution of κ(i, l) obtained under the label-shuffled null-hypothesis. Later on below we calculate the average z-score of the κ(i, l) over a range of iterations and all the neuronally-related pathways, and then compare that to the distribution of obtained under H0.

Results

We apply the half-loop-counting algorithm (see Methods) to the ‘BDRN’ cohort used as the training arm. The trace associated with the original data is shown in red in Fig 2. Were the signal homogeneous, we would expect to see a trace that starts out high and gradually decreases in magnitude. Instead, we see a trace that behaves non-monotonically, and is statistically insignificant for a range of iterations. The trace from the original data (in red) attains values that are significantly higher than the majority of the traces one would expect under the null-hypothesis (black) near iteration i ∼ 175. This is an indicator that the data is heterogeneous, and that a bicluster has been detected near iteration i ∼ 175; the identity of the bicluster can be approximated by one of the submatrices D(i) where the training-p-value is large. We can calculate the empirical p-value associated with the entire trace by comparing the red curve (across all iterations) to the black curves, estimating an overall p-value of p ≲ 1/64.

Download:

Fig 2. In this figure we show the output of the half-loop biclustering algorithm applied to the BDRN cohort in arm-1 (limited to those SNPs with maf ≥0.25).

As described in the main text, the algorithm proceeds iteratively, eliminating rows and columns from the case-subject-array D until all have been removed. At each iteration i, the remaining submatrix D(i) comprises case-subjects and allele-combinations . At each iteration we record the ‘row-trace’ , which is the covariate-corrected average level of differential-expression between D(i) and the control-subjects X. In the top row of subplots we show the row-trace for the data (red) as well as for 128 label-shuffled trials (black). Each of the row-traces has been transformed into an iteration-dependent z-score (estimated using the distribution of label-shuffled trials at that iteration). In the bottom row we show the corresponding empirical p-value, as estimated for each iteration using the label-shuffled trials. The dashed black-line corresponds to the 95th percentile (i.e., a significance value of 0.05 if each iteration were considered independently). If the signal were homogeneous we would expect to see the red trace begin at a high value and decay relatively monotonically. By contrast, we see strong evidence for heterogeneity; the red trace is far from monotonic. The overall p-value for the data (red-trace), estimated using the strategy in [64], is p ≲ 1/64. Note that the trace is significant over a range of iterations, including i ∈ [175, 350].

https://doi.org/10.1371/journal.pone.0314288.g002

In idealized scenarios where the ‘true’ bicluster is sharply defined, the trace typically has a sharp peak near the D(i) that most closely corresponds to the bicluster [63, 64]. However, in this case while the trace has a peak at around i ∼ 175, this peak is not particularly sharp, and the trace is nearly as significant across a range of iterations i ∈ [175, 350]. The largest of these submatrices (i.e., D(175)) corresponds to ∼ 47% of the case-subjects and ∼ 31% of the allele-combinations. The smallest of these submatrices (i.e., D(350)) corresponds to ∼ 21% of the case-subjects and ∼ 9% of the allele-combinations.

This ‘plateau’ of significance indicates that the true signal is not a perfectly crisp and well-delineated bicluster. Instead, this plateau suggests that, while there are certain ‘core’ case-subjects that exhibit a strong similarity across certain allele-combinations, there are additional case-subjects that are ‘adjacent’ to those in the core. These adjacent subjects exhibit a slightly weaker similarity involving a slightly expanded set of allele-combinations. Consequently, we expect iterations in the interval i ∈ [175, 350] to provide a range of approximations to the true ‘core’ signal (which is still unknown). One could certainly select the iteration with the highest training-p-value to approximate the bicluster, but as nearby iterations have nearly the same training-p-value, we expect them to also provide reasonable estimates of the true signal.

Given our approximation to the signal described above from the training-data-set, we test for replication in each of the replication-arms 2, 3 and 4. We are interested in how strongly our approximate signal replicates, as well as whether our approximation has been compromised by overfitting. Because the signal spans a range of iterations in arm-1, we assess the extent of replication across the plateau i ∈ [175, 350]. This interval corresponds to significant values of the trace as well as the AUC A(i) defined only using the training-data.

The results of this replication study for arm-2 are shown in Fig 3. The top subplot illustrates the AUC A(i) (red) and A′(i) (green) as a function of i. The bottom subplot shows the associated p-value for each i (under a label-shuffled null-hypothesis). Note that the training-AUC A(i) is high over the range of iterations i ∈ [175, 350] for which the training-p value is significant. Note also that the peak of A(i) occurs within a few iterations of the peak of the training p-value. This correspondence corroborates the claims made above: we believe we have detected a disease-related signal within the training-data-set that involves only a subset of subjects and alleles. While the magnitude of the replication-AUC A′(i) is lower than the training-AUC A(i), the value of A′(i) is also statistically significant over the range of iterations i ∈ [175, 350], with a peak at roughly the same point.

Download:

Fig 3. In this figure we illustrate the replication of the bicluster in arm-2.

Note that the SNP-overlap between arm-1 and arm-2 is ∼ 85%. On the top we show A(i) in red and A′(i) in green. On the bottom we show the associated p-values for A(i) and A′(i), calculated with respect to H0 and H0′ for each iteration individually. Standard significance-levels 0.05 and 0.01 are shown in dashed- and dotted-lines, respectively. The interval i ∈ [175, 350] is highlighted in white. Note that both A(i) and A′(i) have peaks within the range that the trace was significant (c.f. Fig 2). The overall replication for arm-2 within the interval i ∈ [175, 350] is estimated at p ≲ 10^-12.

https://doi.org/10.1371/journal.pone.0314288.g003

Similar results for arm-3 and arm-4 are shown in Figs 4 and 5. Note that the SNP-overlap between these arms and the training-data-set is quite a bit lower than that for arm-2. Recall that arm-2 has a overlap of ∼ 85% with the SNPs in arm-1, while arm-3 and arm-4 have overlaps of ∼ 50% and ∼ 30%, respectively.

Download:

Fig 4. This figure is similar to Fig 3, except that we use arm-3 instead of arm-2.

The overall replication for arm-3 within the interval i ∈ [175, 350] is estimated at p ≲ 10³. Note that the SNP-overlap between arm-1 and arm-3 is only ∼ 50%.

https://doi.org/10.1371/journal.pone.0314288.g004

Download:

Fig 5. This figure is similar to Fig 3, except that we use arm-4 instead of arm-2.

The overall replication for arm-3 within the interval i ∈ [175, 350] is estimated at p ≲ 10³. Note that the SNP-overlap between arm-1 and arm-4 is only ∼ 30%.

https://doi.org/10.1371/journal.pone.0314288.g005

We believe that this reduction in SNP-overlap is partially responsible for the reduction in the magnitude of replication-AUCs observed in these arms. To test this hypothesis, we randomly eliminate SNPs from arm-2 until the SNP-overlap between the training-data-set and arm-2 is equal to the SNP-overlap between the training-data-set and arm-3. The results of this replication-study are shown in Fig 13 in S1 Text: note that the amplitude of A′(i) has degraded in comparison to the values shown in Fig 3. We then randomly eliminate even more SNPs, until the SNP-overlap between the training-data-set and arm-1 is equal to the SNP-overlap between the training-data-set and arm-4 (see Fig 14 in S1 Text), and the amplitude A′(i) degrades even further. More generally, by reducing the number of SNPs we include in the replication-arm, we can cause the values of A′(i) to drop; depending on the subset of SNPs retained, the values of A′(i) for arm-2 can be reduced to values similar to those observed in arm-3 and arm-4.

In summary, the AUC associated with the genotype-based bicluster score discovered in the training-data-set replicates to varying degrees across all 3 replication arms. In each case the average A′(i) calculated over the interval i ∈ [175, 350] was significantly larger than what one would expect were the case- and control-labels in the replication-arm randomly permuted (p ≲ 1/1000). Consequently, we are fairly certain that—while our approximation of the bicluster is far from perfect—we have indeed identified a robust disease-related signal which generalizes across a variety of different BD studies.

Discussion

Interaction with covariates

Given the observations above, it is natural to ask what might be driving the signal associated with this bicluster. We first checked to see if the bicluster was driven by the ancestry-related covariates in our data-set. As shown in Figs 15 and 16 in S1 Text, the subjects in the bicluster have a distribution of ancestries similar to the remainder of arm-1 (recall that we corrected for ancestry as a covariate). By considering the subjects remaining in D(i), we also determined that the bicluster does not seem to be associated with sex (see Fig 17 in S1 Text).

Interaction with BD subtype

We then checked to see if the bicluster was associated with bipolar subtype. We measured the fraction of subjects classified as bipolar-type-1 versus bipolar-type-2 as our algorithm proceeded. Specifically, we measured the fraction of case-subjects in that were classified as BDI and BDII. If the bicluster were driven by BDII subjects, then we would expect the proportion of remaining BDII case-subjects to increase with the iteration-index i. Conversely, if the bicluster were driven by BDI subjects, then we would expect the proportion of remaining BDI case-subjects to increase with iteration-index. As shown in Fig 6, we found that this latter scenario holds; the bicluster was significantly enriched for BDI relative to BDII. This enrichment for BDI also impacts our risk-prediction results (see below). Note that, when determining this enrichment, we compare the proportion of BDI and BDII case-subjects at each iteration to the proportion at iteration i = 1 (i.e., across all case-subjects in arm-1). In this manner our enrichment is defined relative to the starting proportion of BDI and BDII subjects in our training-arm, and is not influenced by the recruitment rates for BDI and BDII (which can differ across studies).

While significant, this BDI-enrichment was not completely overwhelming: the initial fraction of BDII participants in arm-1 was ∼ 31%, which dropped to ∼ 26% at iteration i = 240. Thus, while the majority of the case-subjects in the bicluster are classified as BDI, those classified with BDII do still contribute to the overall signal. It is possible that this BDI-enrichment is due to a true difference between the BD-subtypes at the genetic level. However, it is also possible that this enrichment is partially driven by inaccuracies associated with classification [14].

Download:

Fig 6.

This figure plots the ratio of BDI to BDII subjects within (light-green, left y-axis) as a function of the iteration i (left) and the number of removed case-subjects (right). The dark-green line corresponds to the negative-log-probability (right y-axis) of observing a ratio at least as large by chance. The dashed and dotted horizontal lines indicate 0.05 and 0.01 significance values, respectively. Note that the BDI population is over-represented across a range of iterations including i ∈ [175, 350], implying that the bicluster we observe is significantly enriched for BDI subjects.

https://doi.org/10.1371/journal.pone.0314288.g006

Bicluster-informed PRS performance

As described in the Methods section, we calculated the population-wide and the bicluster-informed across a variety of iterations i and -thresholds. We compared the bicluster-informed performance to the one generated by the population-wide across a variety of -thresholds. Results for arm-2 are shown in Fig 7. Results for arm-3 and arm-4 are shown alongside arm-2 in Fig 8, and individually in Figs 23 and 24 in S1 Text.

Note that, when constructing , we restrict ourselves to a subset of case-subjects within the training-arm determined by . In this case, when i ∈ [175, 350] the case-subset retains only ∼ 50% − 20% of the original case-subjects in arm-1. Typically, one might expect a reduction in the number of case-subjects to yield a corresponding reduction in power, giving rise to a reduced discriminability in the testing-arms 2,3 and 4. However, as we see in Fig 8, the discriminability for is typically higher than when i ∈ [175, 350]. This suggests that the case-subjects in identified by the bicluster correspond to a stronger genetic signal, likely arising from the increased homogeneity within .

Note that PRS_bicl and PRS_wide are not capturing identical signals (see the Nagelkerke R² analysis in the S1 Text). It is useful to compare the performance of PRS_bicl with PRS_wide as there are features of PRS_bicl which indicate that it is more robust than PRS_wide. As one example, we point out that is markedly higher than when the number of SNPs used (denoted by N_SNP) is fewer; one begins to see the effect between 1K and 10K. This suggests that the bicluster-informed is not only outperforming the population-wide , but also correctly attributing the largest PRS-weights to those SNPs that truly carry the signal (and which are most important for replication). As one illustration, by comparing the values of AUC_bicl to AUC_wide in Fig 8, we can directly see that the bicluster-informed PRS would replicate across arms 2,3 and 4 for values of i = 225 and N_SNP ∈ [10³, 10⁴], while the population-wide PRS would not.

Download:

Fig 7. In each subplot we show in yellow the

(vertical) for arm-2 as a function of the number of SNPs corresponding to each

-threshold (horizontal, log-scale).

Additionally, we show for a particular iteration i (with i varying across subplots). The color-code used for ranges from blue to pink, corresponding to the iteration index i. Note that, by using the bicluster to inform the PRS, the performance typically improves. This improvement in performance becomes marked when the number of SNPs is limited to a relatively small fraction of the total (e.g., ∼ 1% of the total, corresponding to a log₁₀(#) of ∼ 3).

https://doi.org/10.1371/journal.pone.0314288.g007

Download:

Fig 8. This figure uses circles to displays the same information as Fig 7 (corresponding to replication arm-2).

In this figure we use an algebraic-scale for the horizontal-axis (rather than a log-scale) in order to better emphasize the interval where the number of SNPs used is between 1K and 10K. The results for replication arm-3 and arm-4 are shown using squares and triangles, respectively.

https://doi.org/10.1371/journal.pone.0314288.g008

Motivated by the significant BDI-enrichment seen within the training-arm (see Fig 6), we repeated these assessments for the BDI- and BDII-populations within the testing-arms. More specifically, recall that, for any particular testing-arm, the and values shown in Figs 7 and 8 are defined using the values of across all case- and control-subjects j′ for that testing-arm. We can now use the same values of and , but only compare the BDI-case-subjects to the control-subjects in the testing-arm. This produces ‘restricted’ AUC-values, which we denote by and , respectively. In a similar fashion we can restrict the case-subjects in the testing-arm to the BDII-case-subjects, and calculate and .

The results are shown in Figs 9 and 10, respectively. Note that the improvement to risk-prediction persists for the BDI-population, but is not as robust for the BDII-population. The performance of is particularly poor for the BDII-population in arm-3, for which there were only M = 435 BDII-subjects (i.e., the fewest out of all the arms). It is possible that the variation in the performance of for the BDII-population across the replication-arms has to do with these differences in power. It is also possible that there are other systematic issues affecting the BDII-population, including variation in the life history of the subjects or the metrics used for their clinical diagnosis [14].

Download:

Fig 9. This figure is similar to Fig 8, except that we limit ourselves only to those case-subjects in the replication-arms which are classified as BDI.

This subset corresponded to 66% (M = 3834), 84% (M = 2995) and 75% (M = 5107) of the case-population for arms 2, 3 and 4, respectively. The corresponding AUC-values are denoted by and in the main text. For reference the training-arm had M = 1645 BDI case-subjects, corresponding to 65% of the case-population in arm-1.

https://doi.org/10.1371/journal.pone.0314288.g009

Download:

Fig 10. This figure is similar to Fig 8, except that we limit ourselves only to those case-subjects in the replication-arms which are classified as BDII.

This subset corresponded to 19% (M = 1082), 12% (M = 435) and 16% (M = 1060) of the case-population for arms 2, 3 and 4, respectively. The corresponding AUC-values are denoted by and in the main text. For reference the training-arm had M = 788 BDII case-subjects, corresponding to 31% of the case-population in arm-1.

https://doi.org/10.1371/journal.pone.0314288.g010

To summarize the overall relationship between BD-subtype, the bicluster-informed PRS and the population-wide PRS, we pool the subjects across the replication-arms and convert the combined AUC-values into R²-values on a liability-scale [71] using prevalences of 2% for BD, and 1% for BDI and BDII [72]. Using notation analogous to the AUC-values, we denote these liability-scores as , and , as well as , and , respectively. The resulting liability-scores are shown in Fig 11.

We believe that Fig 11 hints at the potential our methodology offers to researchers of complex disease. By limiting our definition of a case to those with a more genetically homogeneous BD signature, we were able to generate a PRS_bicl which outperforms PRS_wide in the following ways:

The maximum is 20–40% higher than the maximum , depending on the iteration-index i.
This increase in liability-score occurs despite the fact that the PRS_bicl is generated using ∼ 50% to 80% fewer cases than the PRS_wide. For example, we considered only between 1191–526 cases in arm-1 to generate the values for i = [175, 350], whereas 2524 cases were used to generate the values.
The p-values assigned to the SNPs via the bicluster-informed GWAS were less noisy than those p-values assigned using the population-wide GWAS. For example, Fig 11 indicates that the first 10K SNPs of highest significance from the population-wide GWAS contained almost no disease-related information. By contrast, the first 10K SNPs of highest significance from the bicluster-informed GWAS typically contain most of the available disease-related information. The bicluster-informed GWAS produces a with only ∼ 5K to 10K SNPs that surpasses the maximum of (e.g., within the i = 225 subplot the value of at 5K SNPs is comparable to the value of at ∼ 150K SNPs).
Furthermore, the values of typically plateau somewhere between 10K–35K SNPs (corresponding to ). Meanwhile, the values of continue to increase until (including all ∼ 150K SNPs).

Download:

Fig 11. This figure is similar to Fig 8, and uses the data from Figs 8, 9 and 10.

This time we combine the information across all three replication-arms, and calculate replication AUC-values for this combined data-set. We then convert these AUC-values into liability-scores (see [71]). The results for all the cases ( and ) are shown with an asterisk ‘*’, whereas the results for only the BD1-cases ( and ) are shown with an ‘×’, and the results for only the BD2-cases ( and ) are shown with a diamond. In each case the yellow curves correspond to the liability-scores derived from the population-wide PRS, whereas the cyan-magenta curves correspond to the liability-scores derived from the bicluster-informed PRS. Note that our overall results are closely matched by the BD1-cases, but not by the BD2-cases.

https://doi.org/10.1371/journal.pone.0314288.g011

To summarize: The PRS_bicl outperforms PRS_wide overall, and when restricted to either BDI and BDII, achieving a higher maximum for each subtype. The PRS_bicl also achieves its peak performance with far fewer SNPs, consistent with a far less noisy signal. Put another way, the values of , and all plateau earlier than , and , indicating that the SNPs which are most relevant to the bicluster-informed PRS performance have indeed been identified by the bicluster-informed GWAS as having low individual p-values.

Additionally, we note that there is a close relationship between and , but a discrepancy between and . The values of indicate that some subset of BDII cases share the bicluster signature, but the maximum for is only 50% of the maximum for . This could imply that the bicluster has focused on a signature that is associated with BDI, perhaps serving as a risk factor for manic episodes in the presence of the necessary epigenetic or environmental influences.

Gene-enrichment

We also perform a simple over-representation analysis, measuring the overlap κ(i, l) between the bicluster D(i) at iteration i and the various neuronally-related pathways from the go_bp ontology (see Methods). The average z-score for the enrichment-values κ(i, l), averaged over the interval i ∈ [175, 350] and all neuronally-related pathways, is quite significant, with p ≲ 1e − 4 (as determined by a permutation-test). Examples of some of the more significantly over-represented pathways are shown in Table 1.

Download:

Table 1. Here we list some of the pathways from the go_bp ontology.

Shown here are only the 32 most significant pathways as determined by κ(175, l). Each pathway is listed alongside approximations to its individual over-representation p-value (estimated using the hypergeometric-distribution). The −log₁₀(p)-values are listed for iterations 175–350 (see top row). Those annotations with an individual over-representation p-value smaller than 0.05 are in bold.

https://doi.org/10.1371/journal.pone.0314288.t001

Secondary bicluster

After discovering and analyzing the primary bicluster within arm-1 (described above), we searched for a secondary bicluster. We first eliminated the structure associated with the primary bicluster by scrambling the entries of the submatrix D(175) (see [64] for details). We then reran our half-loop algorithm on this scrambled version of arm-1. While we did find a secondary trace that was indicative of heterogeneity, the overall level of differential-expression was far lower than for the first bicluster (see Fig 25 in S1 Text). Moreover, the structure associated with this secondary trace did not significantly replicate (see Figs 26–28 in S1 Text). It is possible that a secondary bicluster exists, but that we could not pinpoint it due to a lack of power in our training-arm. It is also possible that the scrambled version of arm-1 is heterogeneous, but not in a way that can be described by a bicluster (see [64] for examples along these lines). In either case, a larger sample size will be required to further probe this residual heterogeneity.

Control biclusters

Up to this point we have only considered biclusters within the case-population; i.e., subsets of case-subjects which exhibit a genetic-signature that is not shared by the control-subjects. It is natural to ask if there are also biclusters that exist within the control-population (i.e., whether or not the control-population is homogeneous). Such ‘control-biclusters’ might be induced by batch effects or issues associated with recruitment; e.g., many of the BD controls may be drawn from another disease study (such as cancer), thus being more likely to share certain genetic features. It might also be the case that some of the control-biclusters are biologically significant, corresponding to mechanisms which protect against the disease. In either scenario, a better understanding of the heterogeneity within the control-population can assist in designing homogeneous populations of controls for future studies.

We can easily carry out this analysis simply by reversing the labels within our biclustering algorithm (i.e., swapping D and X). This reversed search will find biclusters that are driven by genetic-signatures which are more prevalent within the controls than within the cases. As mentioned above, we find that the control-population within arm-1 is quite homogeneous: the trace decays monotonically with no distinguished peaks (see Fig 29 in S1 Text). This homogeneity can be viewed as a validation of our initial choice of arm-1 as a training- or discovery-arm.

On the other hand, we find strong evidence for heterogeneity within the control-populations of arms 2, 3 and 4 (see Figs 30—32 in S1 Text). In each case the trace has a significant distinguished maximum involving only a fraction of the control-subjects (i.e,. 13%, 28% and 15% of the controls, respectively).

The heterogeneity observed in the control-populations of arms 2, 3 and 4 might be expected; each of these arms comprises multiple smaller studies. Notably however, the ‘control-biclusters’ within these arms cannot all be easily dismissed as batch-effects. Indeed, each of the dominant control-biclusters is also quite significant, while also usually well balanced across the ancestry-related covariates and individual cohorts within each arm. Each of these dominant control-biclusters also replicates across the majority of other arms.

Thus, while a portion of these control-biclusters might be driven by batch-effects or other idiosyncrasies in the control-population, it is possible that some of these signals have biological relevance, perhaps involving mechanisms which protect against BD (as the control-biclusters were identified specifically because they involved genetic patterns not as prevalent across the cases). Consequently, we would recommend considering this heterogeneity when performing other kinds of analysis. For example, one should not necessarily assume that the controls are homogeneous, as small subgroups of controls can likely exhibit genetic-signatures that are distinct from the rest.

Conclusion

In this paper we have taken a ‘genotype-driven’ approach to investigating genotypic-heterogeneity. That is to say, first we used only basic phenotypic classification to divide subjects into cases (BD) and controls (not BD). We then applied a biclustering analysis to identify genetic subgroups within the case-population. Analyzing the BDI and BDII cases together as a whole allowed us to identify a genetic subgroup (i.e., the bicluster described above). This bicluster involved a genetically homogeneous subset of the BD-cases within the training-arm, which we then used to inform a more robust PRS with better replication across studies.

Our results suggest two hypotheses for future work. Most directly, our replication- and PRS-analyses indicate that the bicluster we found within the training-arm indeed represents a genetic subgroup of BD which generalizes across data-sets. More generally, our results provide a proof-of-principle for our overall methodology: a data-driven approach to identifying genetically homogeneous subsets of case-subjects can help construct more robust PRSs, with the potential of improving SNP-replication in BD GWAS and, ultimately, a better understanding of the etiology of Bipolar Disorder.

In some respects our approach can be termed ‘unsupervised’, as we did not use BD-subtype (BDI vs. BDII) or subphenotype information to guide our primary analysis. This unsupervised approach allows us to circumvent many of the challenges associated with phenotype classification, such as missingness and variation in assessment and collection process (e.g., expert-led vs. self-report). It also allows us to identify genetic patterns which straddle traditional classifications provided the signature is not present in the control group. E.g., though our bicluster was enriched for BDI, it was by no means limited to BDI and included many BDII cases.

Along these lines, we believe that a similar unsupervised approach could be used to search for interactions between the signals we have found and other diseases, as well as for cross-psychiatric-disorder signals not present in the control group. There are many examples of genetic interactions along these lines: the SNPs driving BD have a strong correlation with those driving schizophrenia, and also share overlap with the SNPs driving MDD, OCD, anorexia nervosa, ADHD, ASD and substance-abuse [34, 73, 74]. Many SNPs have also been associated with other disorders [17, 75–77]. More generally speaking, BD shows substantial overlap with other disorders; e.g., more than 90% of BD subjects exhibit lifetime comorbidity [3] with at least one other psychiatric disorder [58, 78, 79], or non-psychiatric disorder [80–82]. This high rate of comorbidity implies that BD is one of multiple disorders which perturb several important regulatory systems [83, 84]. Given these relationships, it is possible that the bicluster-score and/or the bicluster-limited PRSs may also correlate with some of the signals of these other disorders. It is possible that we could discover interesting biclusters which cross psychiatric disorders or are present in the control groups and predict resistance to psychiatric illness more generally; we defer an investigation of these interactions to future work.

The biclustering algorithm we use also offers a ‘supervised’ option which uses additional information (e.g., BD-subtype or other clinical data) to subdivide the case-population while searching for heterogeneity. Sex might be one important variable to include in such a supervised BD analysis. For example, while most studies do not indicate large difference in BD prevalence between men and women (indeed, the bicluster we identified was not significantly enriched for sex), there is some evidence of a sex disparity in the prevalence of BDII, rapid-cycling and mixed-episodes [85, 86]. Age may also be an important role-player, as an earlier age of onset may be associated with higher severity and a poorer long-term prognosis (possibly due to mis-diagnoses at an early stage) [57, 87].

One limitation of our current study is that it is restricted to common variants (i.e., SNPs with a high minor-allele-frequency). While it is encouraging that the common variants alone can be used to find replicable and robust signals, it is also likely that the rare variants also play a role in the heterogeneity of BD. Analyzing the rare variants brings new challenges, as rare variants often require more statistical power to detected and/or validate [88–93].

Another more serious limitation is that our training-arm is quite restricted in terms of ancestry. More generally, almost all the individuals in our data-set are of European descent. We expect that this lack of diversity will limit our ability to pinpoint the most biologically relevant signals, as many previous GWAS analyses have not generalized well to cohorts of different ancestry [29, 94–98]. An important future direction will be to investigate the interactions between genotypic heterogeneity and ancestry.

We do not expect a full analysis of genetic-heterogeneity to be entirely trivial. For example, appropriately correcting for ancestry is not always easy, even when searching for homogeneous signals. When searching for heterogeneity such a correction becomes more complicated and, necessarily, involves more parameters. Larger (and more diverse) sample sizes will likely be necessary to clarify (i) the disease-specific genetic-subgroups (i.e., biclusters) within BD, as well as (ii) the phenotypic subtypes of BD, and perhaps most importantly: (iii) the interaction between these subgroups and subtypes and other covariates such as ancestry. We suspect that a careful treatment of the associated statistical issues will pose a significant challenge. Nevertheless, these advancements will likely further improve our understanding of the etiology of BD.

Supporting information

S1 Text. Contains a detailed description of our methods, including an outline of the steps involved and the considerations we made along the way.

Also contains the supporting figures referenced in the main text.

https://doi.org/10.1371/journal.pone.0314288.s001

(PDF)

Acknowledgments

The chair of the Bipolar Disorder Working Group of the Psychiatric Genomics Consortium is currently Ole A Andreassen (contact email: o.a.andreassen@medisin.uio.no). The contributors to the Bipolar Disorder Working Group are listed below.

Bipolar Disorder Working Group of the Psychiatric Genomics Consortium

Eli A Stahl^1,2,3, Gerome Breen^4,5, Andreas J Forstner^6,7,8,9,10, Andrew McQuillin¹¹, Stephan Ripke^12,13,14, Vassily Trubetskoy¹³, Manuel Mattheisen^{15,16,17,18,19}, Yunpeng Wang^20,21, Jonathan R I Coleman^4,5, Héléna A Gaspar^4,5, Christiaan A de Leeuw²², Stacy Steinberg²³, Jennifer M Whitehead Pavlides²⁴, Maciej Trzaskowski²⁵, Enda M Byrne²⁵, Tune H Pers^3,26, Peter A Holmans²⁷, Alexander L Richards²⁷, Liam Abbott¹², Esben Agerbo^19,28,29, Huda Akil³⁰, Diego Albani³¹, Ney Alliey-Rodriguez³², Thomas D Als^15,16,19, Adebayo Anjorin³³, Verneri Antilla¹⁴, Swapnil Awasthi¹³, Judith A Badner³⁴, Marie Bækvad-Hansen^19,35, Jack D Barchas³⁶, Nicholas Bass¹¹, Michael Bauer³⁷, Richard Belliveau¹², Sarah E Bergen³⁸, Carsten Bøcker Pedersen^19,28,29, Erlend Bøen³⁹, Marco P. Boks⁴⁰, James Boocock⁴¹, Monika Budde⁴², William Bunney⁴³, Margit Burmeister⁴⁴, Jonas Bybjerg-Grauholm^19,35, William Byerley⁴⁵, Miquel Casas^46,47,48,49, Felecia Cerrato¹², Pablo Cervantes⁵⁰, Kimberly Chambert¹², Alexander W Charney², Danfeng Chen¹², Claire Churchhouse^12,14, Toni-Kim Clarke⁵¹, William Coryell⁵², David W Craig⁵³, Cristiana Cruceanu^50,54, David Curtis^55,56, Piotr M Czerski⁵⁷, Anders M Dale^58,59,60,61, Simone de Jong^4,5, Franziska Degenhardt⁸, Jurgen Del-Favero⁶², J Raymond DePaulo⁶³, Srdjan Djurovic^64,65, Amanda L Dobbyn^1,2, Ashley Dumont¹², Torbjørn Elvsåshagen^66,67, Valentina Escott-Price²⁷, Chun Chieh Fan⁶¹, Sascha B Fischer^6,10, Matthew Flickinger⁶⁸, Tatiana M Foroud⁶⁹, Liz Forty²⁷, Josef Frank⁷⁰, Christine Fraser²⁷, Nelson B Freimer⁷¹, Louise Frisén^72,73,74, Katrin Gade^42,75, Diane Gage¹², Julie Garnham⁷⁶, Claudia Giambartolomei²⁰⁶, Marianne Giørtz Pedersen^19,28,29, Jaqueline Goldstein¹², Scott D Gordon⁷⁷, Katherine Gordon-Smith⁷⁸, Elaine K Green⁷⁹, Melissa J Green^80,133, Tiffany A Greenwood⁶⁰, Jakob Grove^15,16,19,81, Weihua Guan⁸², José Guzman-Parra⁸³, Marian L Hamshere²⁷, Martin Hautzinger⁸⁴, Urs Heilbronner⁴², Stefan Herms^6,8,10, Maria Hipolito⁸⁵, Per Hoffmann^6,8,10, Dominic Holland^58,86, Laura Huckins^1,2, Stéphane Jamain^87,88, Jessica S Johnson^1,2, Radhika Kandaswamy⁴, Robert Karlsson³⁸, James L Kennedy^89,90,91,92, Sarah Kittel-Schneider⁹³, James A Knowles^94,95, Manolis Kogevinas⁹⁶, Anna C Koller⁸, Ralph Kupka^97,98,99, Catharina Lavebratt⁷², Jacob Lawrence¹⁰⁰, William B Lawson⁸⁵, Markus Leber¹⁰¹, Phil H Lee^12,14,102, Shawn E Levy¹⁰³, Jun Z Li¹⁰⁴, Chunyu Liu¹⁰⁵, Susanne Lucae¹⁰⁶, Anna Maaser⁸, Donald J MacIntyre^107,108, Pamela B Mahon^63,109, Wolfgang Maier¹¹⁰, Lina Martinsson⁷³, Steve McCarroll^12,111, Peter McGuffin⁴, Melvin G McInnis¹¹², James D McKay¹¹³, Helena Medeiros⁹⁵, Sarah E Medland⁷⁷, Fan Meng^30,112, Lili Milani¹¹⁴, Grant W Montgomery²⁵, Derek W Morris^115,116, Thomas W Mühleisen^6,117, Niamh Mullins⁴, Hoang Nguyen^1,2, Caroline M Nievergelt^60,118, Annelie Nordin Adolfsson¹¹⁹, Evaristus A Nwulia⁸⁵, Claire O’Donovan⁷⁶, Loes M Olde Loohuis⁷¹, Anil P S Ori⁷¹, Lilijana Oruc¹²⁰, Urban Ösby¹²¹, Roy H Perlis^122,123, Amy Perry⁷⁸, Andrea Pfennig³⁷, James B Potash⁶³, Shaun M Purcell^2,109, Eline J Regeer¹²⁴, Andreas Reif⁹³, Céline S Reinbold^6,10, John P Rice¹²⁵, Fabio Rivas⁸³, Margarita Rivera^4,126, Panos Roussos^1,2,127, Douglas M Ruderfer¹²⁸, Euijung Ryu¹²⁹, Cristina Sánchez-Mora^46,47,49, Alan F Schatzberg¹³⁰, William A Scheftner¹³¹, Nicholas J Schork¹³², Cynthia Shannon Weickert^80,133, Tatyana Shehktman⁶⁰, Paul D Shilling⁶⁰, Engilbert Sigurdsson¹³⁴, Claire Slaney⁷⁶, Olav B Smeland^135,136, Janet L Sobell¹³⁷, Christine Søholm Hansen^19,35, Anne T Spijker¹³⁸, David St Clair¹³⁹, Michael Steffens¹⁴⁰, John S Strauss^91,141, Fabian Streit⁷⁰, Jana Strohmaier⁷⁰, Szabolcs Szelinger¹⁴², Robert C Thompson¹¹², Thorgeir E Thorgeirsson²³, Jens Treutlein⁷⁰, Helmut Vedder¹⁴³, Weiqing Wang^1,2, Stanley J Watson¹¹², Thomas W Weickert^80,133, Stephanie H Witt⁷⁰, Simon Xi¹⁴⁴, Wei Xu^145,146, Allan H Young¹⁴⁷, Peter Zandi¹⁴⁸, Peng Zhang¹⁴⁹, Sebastian Zöllner¹¹², eQTLGen Consortium, BIOS Consortium, Rolf Adolfsson¹¹⁹, Ingrid Agartz^17,39,150, Martin Alda^76,151, Lena Backlund⁷³, Bernhard T Baune^152,158, Frank Bellivier^{153,154,155,156}, Wade H Berrettini¹⁵⁷, Joanna M Biernacka¹²⁹, Douglas H R Blackwood⁵¹, Michael Boehnke⁶⁸, Anders D Børglum^15,16,19, Aiden Corvin¹¹⁶, Nicholas Craddock²⁷, Mark J Daly^12,14, Udo Dannlowski¹⁵⁸, Tõnu Esko^{3,111,114,159}, Bruno Etain^{153,155,156,160}, Mark Frye¹⁶¹, Janice M Fullerton^133,162, Elliot S Gershon^32,163, Michael Gill¹¹⁶, Fernando Goes⁶³, Maria Grigoroiu-Serbanescu¹⁶⁴, Joanna Hauser⁵⁷, David M Hougaard^19,35, Christina M Hultman³⁸, Ian Jones²⁷, Lisa A Jones⁷⁸, René S Kahn^2,40, George Kirov²⁷, Mikael Landén^38,165, Marion Leboyer^88,153,166, Cathryn M Lewis^4,5,167, Qingqin S Li¹⁶⁸, Jolanta Lissowska¹⁶⁹, Nicholas G Martin^77,170, Fermin Mayoral⁸³, Susan L McElroy¹⁷¹, Andrew M McIntosh^51,172, Francis J McMahon¹⁷³, Ingrid Melle^174,175, Andres Metspalu^114,176, Philip B Mitchell⁸⁰, Gunnar Morken^177,178, Ole Mors^19,179, Preben Bo Mortensen^15,19,28,29, Bertram Müller-Myhsok^54,180,181, Richard M Myers¹⁰³, Benjamin M Neale^3,12,14, Vishwajit Nimgaonkar¹⁸², Merete Nordentoft^19,183, Markus M Nöthen⁸, Michael C O’Donovan²⁷, Ketil J Oedegaard^184,185, Michael J Owen²⁷, Sara A Paciga¹⁸⁶, Carlos Pato^95,187, Michele T Pato⁹⁵, Danielle Posthuma^22,188, Josep Antoni Ramos-Quiroga^46,47,48,49, Marta Ribasés^46,47,49, Marcella Rietschel⁷⁰, Guy A Rouleau^189,190, Martin Schalling⁷², Peter R Schofield^133,162, Thomas G Schulze^{42,63,70,75,173}, Alessandro Serretti¹⁹¹, Jordan W Smoller^12,192,193, Hreinn Stefansson²³, Kari Stefansson^23,194, Eystein Stordal^195,196, Patrick F Sullivan^38,197,198, Gustavo Turecki¹⁹⁹, Arne E Vaaler²⁰⁰, Eduard Vieta²⁰¹, John B Vincent¹⁴¹, Thomas Werge^19,202,203, John I Nurnberger²⁰⁴, Naomi R Wray^24,25, Arianna Di Florio^27,198, Howard J Edenberg²⁰⁵, Sven Cichon^6,8,10,117, Roel A Ophoff^40,41,71, Laura J Scott⁶⁸, Ole A Andreassen^135,136, John Kelsoe⁶⁰, Pamela Sklar^1,2,†

¹Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, US. ²Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, US. ³Medical and Population Genetics, Broad Institute, Cambridge, MA, US. ⁴MRC Social, Genetic and Developmental Psychiatry Centre, King’s College London, London, GB. ⁵NIHR BRC for Mental Health, King’s College London, London, GB. ⁶Department of Biomedicine, University of Basel, Basel, CH. ⁷Department of Psychiatry (UPK), University of Basel, Basel, CH. ⁸Institute of Human Genetics, University of Bonn, School of Medicine & University Hospital Bonn, Bonn, DE. ⁹Centre for Human Genetics, University of Marburg, Marburg, DE. ¹⁰Institute of Medical Genetics and Pathology, University Hospital Basel, Basel, CH. ¹¹Division of Psychiatry, University College London, London, GB. ¹²Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA, US. ¹³Department of Psychiatry and Psychotherapy, Charité—Universitätsmedizin, Berlin, DE. ¹⁴Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, US. ¹⁵iSEQ, Center for Integrative Sequencing, Aarhus University, Aarhus, DK. ¹⁶Department of Biomedicine—Human Genetics, Aarhus University, Aarhus, DK. ¹⁷Department of Clinical Neuroscience, Centre for Psychiatry Research, Karolinska Institutet, Stockholm, SE. ¹⁸Department of Psychiatry, Psychosomatics and Psychotherapy, Center of Mental Health, University Hospital Würzburg, Würzburg, DE. ¹⁹iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, DK. ²⁰Institute of Biological Psychiatry, Mental Health Centre Sct. Hans, Copenhagen, DK. ²¹Institute of Clinical Medicine, University of Oslo, Oslo, NO. ²²Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research, Amsterdam Neuroscience, Vrije Universiteit Amsterdam, Amsterdam, NL. ²³deCODE Genetics / Amgen, Reykjavik, IS. ²⁴Queensland Brain Institute, The University of Queensland, Brisbane, QLD, AU. ²⁵Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, AU. ²⁶Division of Endocrinology and Center for Basic and Translational Obesity Research, Boston Children’s Hospital, Boston, MA, US. ²⁷Medical Research Council Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff, GB. ²⁸National Centre for Register-Based Research, Aarhus University, Aarhus, DK. ²⁹Centre for Integrated Register-based Research, Aarhus University, Aarhus, DK. ³⁰Molecular & Behavioral Neuroscience Institute, University of Michigan, Ann Arbor, MI, US. ³¹Department of Neuroscience, IRCCS—Istituto Di Ricerche Farmacologiche Mario Negri, Milan, IT. ³²Department of Psychiatry and Behavioral Neuroscience, University of Chicago, Chicago, IL, US. ³³Psychiatry, Berkshire Healthcare NHS Foundation Trust, Bracknell, GB. ³⁴Psychiatry, Rush University Medical Center, Chicago, IL, US. ³⁵Center for Neonatal Screening, Department for Congenital Disorders, Statens Serum Institut, Copenhagen, DK. ³⁶Department of Psychiatry, Weill Cornell Medical College, New York, NY, US. ³⁷Department of Psychiatry and Psychotherapy, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, DE. ³⁸Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, SE. ³⁹Department of Psychiatric Research, Diakonhjemmet Hospital, Oslo, NO. ⁴⁰Psychiatry, UMC Utrecht Brain Center Rudolf Magnus, Utrecht, NL. ⁴¹Human Genetics, University of California Los Angeles, Los Angeles, CA, US. ⁴²Institute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, DE. ⁴³Department of Psychiatry and Human Behavior, University of California, Irvine, Irvine, CA, US. ⁴⁴Molecular & Behavioral Neuroscience Institute and Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI, US. ⁴⁵Psychiatry, University of California San Francisco, San Francisco, CA, US. ⁴⁶Instituto de Salud Carlos III, Biomedical Network Research Centre on Mental Health (CIBERSAM), Madrid, ES. ⁴⁷Department of Psychiatry, Hospital Universitari Vall d´Hebron, Barcelona, ES. ⁴⁸Department of Psychiatry and Forensic Medicine, Universitat Autònoma de Barcelona, Barcelona, ES. ⁴⁹Psychiatric Genetics Unit, Group of Psychiatry Mental Health and Addictions, Vall d´Hebron Research Institut (VHIR), Universitat Autònoma de Barcelona, Barcelona, ES. ⁵⁰Department of Psychiatry, Mood Disorders Program, McGill University Health Center, Montreal, QC, CA. ⁵¹Division of Psychiatry, University of Edinburgh, Edinburgh, GB. ⁵²University of Iowa Hospitals and Clinics, Iowa City, IA, US. ⁵³Translational Genomics, USC, Phoenix, AZ, US. ⁵⁴Department of Translational Research in Psychiatry, Max Planck Institute of Psychiatry, Munich, DE. ⁵⁵Centre for Psychiatry, Queen Mary University of London, London, GB. ⁵⁶UCL Genetics Institute, University College London, London, GB. ⁵⁷Department of Psychiatry, Laboratory of Psychiatric Genetics, Poznan University of Medical Sciences, Poznan, PL. ⁵⁸Department of Neurosciences, University of California San Diego, La Jolla, CA, US. ⁵⁹Department of Radiology, University of California San Diego, La Jolla, CA, US. ⁶⁰Department of Psychiatry, University of California San Diego, La Jolla, CA, US. ⁶¹Department of Cognitive Science, University of California San Diego, La Jolla, CA, US. ⁶²Applied Molecular Genomics Unit, VIB Department of Molecular Genetics, University of Antwerp, Antwerp, Belgium. ⁶³Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD, US. ⁶⁴Department of Medical Genetics, Oslo University Hospital Ullevål, Oslo, NO. ⁶⁵NORMENT, KG Jebsen Centre for Psychosis Research, Department of Clinical Science, University of Bergen, Bergen, NO. ⁶⁶Department of Neurology, Oslo University Hospital, Oslo, NO. ⁶⁷NORMENT, KG Jebsen Centre for Psychosis Research, Oslo University Hospital, Oslo, NO. ⁶⁸Center for Statistical Genetics and Department of Biostatistics, University of Michigan, Ann Arbor, MI, US. ⁶⁹Department of Medical & Molecular Genetics, Indiana University, Indianapolis, IN, US. ⁷⁰Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, Mannheim, DE. ⁷¹Center for Neurobehavioral Genetics, University of California Los Angeles, Los Angeles, CA, US. ⁷²Department of Molecular Medicine and Surgery, Karolinska Institutet and Center for Molecular Medicine, Karolinska University Hospital, Stockholm, SE. ⁷³Department of Clinical Neuroscience, Karolinska Institutet and Center for Molecular Medicine, Karolinska University Hospital, Stockholm, SE. ⁷⁴Child and Adolescent Psychiatry Research Center, Stockholm, SE. ⁷⁵Department of Psychiatry and Psychotherapy, University Medical Center Göttingen, Göttingen, DE. ⁷⁶Department of Psychiatry, Dalhousie University, Halifax, NS, CA. ⁷⁷Genetics and Computational Biology, QIMR Berghofer Medical Research Institute, Brisbane, QLD, AU. ⁷⁸Department of Psychological Medicine, University of Worcester, Worcester, GB. ⁷⁹School of Biomedical Sciences, Plymouth University Peninsula Schools of Medicine and Dentistry, University of Plymouth, Plymouth, GB. ⁸⁰School of Psychiatry, University of New South Wales, Sydney, NSW, AU. ⁸¹Bioinformatics Research Centre, Aarhus University, Aarhus, DK. ⁸²Biostatistics, University of Minnesota System, Minneapolis, MN, US. ⁸³Mental Health Department, University Regional Hospital, Biomedicine Institute (IBIMA), Málaga, ES. ⁸⁴Department of Psychology, Eberhard Karls Universität Tübingen, Tubingen, DE. ⁸⁵Department of Psychiatry and Behavioral Sciences, Howard University Hospital, Washington, DC, US. ⁸⁶Center for Multimodal Imaging and Genetics, University of California San Diego, La Jolla, CA, US. ⁸⁷Psychiatrie Translationnelle, Inserm U955, Créteil, FR. ⁸⁸Faculté de Médecine, Université Paris Est, Créteil, FR. ⁸⁹Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, ON, CA. ⁹⁰Neurogenetics Section, Centre for Addiction and Mental Health, Toronto, ON, CA. ⁹¹Department of Psychiatry, University of Toronto, Toronto, ON, CA. ⁹²Institute of Medical Sciences, University of Toronto, Toronto, ON, CA. ⁹³Department of Psychiatry, Psychosomatic Medicine and Psychotherapy, University Hospital Frankfurt, Frankfurt am Main, DE. ⁹⁴Cell Biology, SUNY Downstate Medical Center College of Medicine, Brooklyn, NY, US. ⁹⁵Institute for Genomic Health, SUNY Downstate Medical Center College of Medicine, Brooklyn, NY, US. ⁹⁶ISGlobal, Barcelona, ES. ⁹⁷Psychiatry, Altrecht, Utrecht, NL. ⁹⁸Psychiatry, GGZ inGeest, Amsterdam, NL. ⁹⁹Psychiatry, VU medisch centrum, Amsterdam, NL. ¹⁰⁰Psychiatry, North East London NHS Foundation Trust, Ilford, GB. ¹⁰¹Clinic for Psychiatry and Psychotherapy, University Hospital Cologne, Cologne, DE. ¹⁰²Psychiatric and Neurodevelopmental Genetics Unit, Massachusetts General Hospital, Boston, MA, US. ¹⁰³HudsonAlpha Institute for Biotechnology, Huntsville, AL, US. ¹⁰⁴Department of Human Genetics, University of Michigan, Ann Arbor, MI, US. ¹⁰⁵Psychiatry, University of Illinois at Chicago College of Medicine, Chicago, IL, US. ¹⁰⁶Max Planck Institute of Psychiatry, Munich, DE. ¹⁰⁷Mental Health, NHS 24, Glasgow, GB. ¹⁰⁸Division of Psychiatry, Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, GB. ¹⁰⁹Psychiatry, Brigham and Women’s Hospital, Boston, MA, US. ¹¹⁰Department of Psychiatry and Psychotherapy, University of Bonn, Bonn, DE. ¹¹¹Department of Genetics, Harvard Medical School, Boston, MA, US. ¹¹²Department of Psychiatry, University of Michigan, Ann Arbor, MI, US. ¹¹³Genetic Cancer Susceptibility Group, International Agency for Research on Cancer, Lyon, FR. ¹¹⁴Estonian Genome Center, University of Tartu, Tartu, EE. ¹¹⁵Discipline of Biochemistry, Neuroimaging and Cognitive Genomics (NICOG) Centre, National University of Ireland, Galway, Galway, IE. ¹¹⁶Neuropsychiatric Genetics Research Group, Dept of Psychiatry and Trinity Translational Medicine Institute, Trinity College Dublin, Dublin, IE. ¹¹⁷Institute of Neuroscience and Medicine (INM-1), Research Centre Jülich, Jülich, DE. ¹¹⁸Research/Psychiatry, Veterans Affairs San Diego Healthcare System, San Diego, CA, US. ¹¹⁹Department of Clinical Sciences, Psychiatry, Umeå University Medical Faculty, Umeå, SE. ¹²⁰Department of Clinical Psychiatry, Psychiatry Clinic, Clinical Center University of Sarajevo, Sarajevo, BA. ¹²¹Department of Neurobiology, Care sciences, and Society, Karolinska Institutet and Center for Molecular Medicine, Karolinska University Hospital, Stockholm, SE. ¹²²Psychiatry, Harvard Medical School, Boston, MA, US. ¹²³Division of Clinical Research, Massachusetts General Hospital, Boston, MA, US. ¹²⁴Outpatient Clinic for Bipolar Disorder, Altrecht, Utrecht, NL. ¹²⁵Department of Psychiatry, Washington University in Saint Louis, Saint Louis, MO, US. ¹²⁶Department of Biochemistry and Molecular Biology II, Institute of Neurosciences, Center for Biomedical Research, University of Granada, Granada, ES. ¹²⁷Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, US. ¹²⁸Medicine, Psychiatry, Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, US. ¹²⁹Department of Health Sciences Research, Mayo Clinic, Rochester, MN, US. ¹³⁰Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA, US. ¹³¹Rush University Medical Center, Chicago, IL, US. ¹³²Scripps Translational Science Institute, La Jolla, CA, US. ¹³³Neuroscience Research Australia, Sydney, NSW, AU. ¹³⁴Faculty of Medicine, Department of Psychiatry, School of Health Sciences, University of Iceland, Reykjavik, IS. ¹³⁵Division of Mental Health and Addiction, Oslo University Hospital, Oslo, NO. ¹³⁶NORMENT, University of Oslo, Oslo, NO. ¹³⁷Psychiatry and the Behavioral Sciences, University of Southern California, Los Angeles, CA, US. ¹³⁸Mood Disorders, PsyQ, Rotterdam, NL. ¹³⁹Institute for Medical Sciences, University of Aberdeen, Aberdeen, UK. ¹⁴⁰Research Division, Federal Institute for Drugs and Medical Devices (BfArM), Bonn, DE. ¹⁴¹Centre for Addiction and Mental Health, Toronto, ON, CA. ¹⁴²Neurogenomics, TGen, Los Angeles, AZ, US. ¹⁴³Psychiatry, Psychiatrisches Zentrum Nordbaden, Wiesloch, DE. ¹⁴⁴Computational Sciences Center of Emphasis, Pfizer Global Research and Development, Cambridge, MA, US. ¹⁴⁵Department of Biostatistics, Princess Margaret Cancer Centre, Toronto, ON, CA. ¹⁴⁶Dalla Lana School of Public Health, University of Toronto, Toronto, ON, CA. ¹⁴⁷Psychological Medicine, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, GB. ¹⁴⁸Department of Mental Health, Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD, US. ¹⁴⁹Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, US. ¹⁵⁰NORMENT, KG Jebsen Centre for Psychosis Research, Division of Mental Health and Addiction, Institute of Clinical Medicine and Diakonhjemmet Hospital, University of Oslo, Oslo, NO. ¹⁵¹National Institute of Mental Health, Klecany, CZ. ¹⁵²Department of Psychiatry, University of Melbourne, Melbourne, Victoria, AU. ¹⁵³Department of Psychiatry and Addiction Medicine, Assistance Publique—Hôpitaux de Paris, Paris, FR. ¹⁵⁴Paris Bipolar and TRD Expert Centres, FondaMental Foundation, Paris, FR. ¹⁵⁵UMR-S1144 Team 1: Biomarkers of relapse and therapeutic response in addiction and mood disorders, INSERM, Paris, FR. ¹⁵⁶Psychiatry, Université Paris Diderot, Paris, FR. ¹⁵⁷Psychiatry, University of Pennsylvania, Philadelphia, PA, US. ¹⁵⁸Department of Psychiatry, University of Münster, Münster, DE. ¹⁵⁹Division of Endocrinology, Children’s Hospital Boston, Boston, MA, US. ¹⁶⁰Centre for Affective Disorders, Institute of Psychiatry, Psychology and Neuroscience, London, GB. ¹⁶¹Department of Psychiatry & Psychology, Mayo Clinic, Rochester, MN, US. ¹⁶²School of Medical Sciences, University of New South Wales, Sydney, NSW, AU. ¹⁶³Department of Human Genetics, University of Chicago, Chicago, IL, US. ¹⁶⁴Biometric Psychiatric Genetics Research Unit, Alexandru Obregia Clinical Psychiatric Hospital, Bucharest, RO. ¹⁶⁵Institute of Neuroscience and Physiology, University of Gothenburg, Gothenburg, SE. ¹⁶⁶INSERM, Paris, FR. ¹⁶⁷Department of Medical & Molecular Genetics, King’s College London, London, GB. ¹⁶⁸Neuroscience Therapeutic Area, Janssen Research and Development, LLC, Titusville, NJ, US. ¹⁶⁹Cancer Epidemiology and Prevention, M. Sklodowska-Curie Cancer Center and Institute of Oncology, Warsaw, PL. ¹⁷⁰School of Psychology, The University of Queensland, Brisbane, QLD, AU. ¹⁷¹Research Institute, Lindner Center of HOPE, Mason, OH, US. ¹⁷²Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, GB. ¹⁷³Human Genetics Branch, Intramural Research Program, National Institute of Mental Health, Bethesda, MD, US. ¹⁷⁴Division of Mental Health and Addiction, Oslo University Hospital, Oslo, NO. ¹⁷⁵Division of Mental Health and Addiction, University of Oslo, Institute of Clinical Medicine, Oslo, NO. ¹⁷⁶Institute of Molecular and Cell Biology, University of Tartu, Tartu, EE. ¹⁷⁷Mental Health, Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology—NTNU, Trondheim, NO. ¹⁷⁸Psychiatry, St Olavs University Hospital, Trondheim, NO. ¹⁷⁹Psychosis Research Unit, Aarhus University Hospital, Risskov, DK. ¹⁸⁰Munich Cluster for Systems Neurology (SyNergy), Munich, DE. ¹⁸¹University of Liverpool, Liverpool, GB. ¹⁸²Psychiatry and Human Genetics, University of Pittsburgh, Pittsburgh, PA, US. ¹⁸³Mental Health Services in the Capital Region of Denmark, Mental Health Center Copenhagen, University of Copenhagen, Copenhagen, DK. ¹⁸⁴Division of Psychiatry, Haukeland Universitetssjukehus, Bergen, NO. ¹⁸⁵Faculty of Medicine and Dentistry, University of Bergen, Bergen, NO. ¹⁸⁶Human Genetics and Computational Biomedicine, Pfizer Global Research and Development, Groton, CT, US. ¹⁸⁷College of Medicine Institute for Genomic Health, SUNY Downstate Medical Center College of Medicine, Brooklyn, NY, US. ¹⁸⁸Department of Clinical Genetics, Amsterdam Neuroscience, Vrije Universiteit Medical Center, Amsterdam, NL. ¹⁸⁹Department of Neurology and Neurosurgery, McGill University, Faculty of Medicine, Montreal, QC, CA. ¹⁹⁰Montreal Neurological Institute and Hospital, Montreal, QC, CA. ¹⁹¹Department of Biomedical and NeuroMotor Sciences, University of Bologna, Bologna, IT. ¹⁹²Department of Psychiatry, Massachusetts General Hospital, Boston, MA, US. ¹⁹³Psychiatric and Neurodevelopmental Genetics Unit (PNGU), Massachusetts General Hospital, Boston, MA, US. ¹⁹⁴Faculty of Medicine, University of Iceland, Reykjavik, IS. ¹⁹⁵Department of Psychiatry, Hospital Namsos, Namsos, NO. ¹⁹⁶Department of Neuroscience, Norges Teknisk Naturvitenskapelige Universitet Fakultet for naturvitenskap og teknologi, Trondheim, NO. ¹⁹⁷Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, US. ¹⁹⁸Department of Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, NC, US. ¹⁹⁹Department of Psychiatry, McGill University, Montreal, QC, CA. ²⁰⁰Dept of Psychiatry, Sankt Olavs Hospital Universitetssykehuset i Trondheim, Trondheim, NO. ²⁰¹Clinical Institute of Neuroscience, Hospital Clinic, University of Barcelona, IDIBAPS, CIBERSAM, Barcelona, ES. ²⁰²Institute of Biological Psychiatry, MHC Sct. Hans, Mental Health Services Copenhagen, Roskilde, DK. ²⁰³Department of Clinical Medicine, University of Copenhagen, Copenhagen, DK. ²⁰⁴Psychiatry, Indiana University School of Medicine, Indianapolis, IN, US. ²⁰⁵Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN, US. ²⁰⁶Department of Pathology and Laboratory Medicine, University of California Los Angeles, Los Angeles, CA, US. ^† deceased.

References

1. Association AP. Diagnostic and statistical manual of mental disorders [5th edition]. Washington. 2013;DC 20024:USA: American Psychiatric Association Publishing. Available from: https://books.google.com/books/about/Diagnostic_and_Statistical_Manual_of_Men.html?hl=&id=-JivBAAAQBAJ.
2. Angst J. The emerging epidemiology of hypomania and bipolar II disorder. Journal of Affective Disorders. 1998;50:143–151. Available from: pmid:9858074
- View Article
- PubMed/NCBI
- Google Scholar
3. Merikangas KR, Akiskal HS, Angst J, Greenberg PE, Hirschfeld RMA, Petukhova M, et al. Lifetime and 12-month prevalence of bipolar spectrum disorder in the National Comorbidity Survey replication. Archives of General Psychiatry. 2007;64:543–552. Available from: pmid:17485606
- View Article
- PubMed/NCBI
- Google Scholar
4. Merikangas KR, Jin R, He JP, Kessler RC, Lee S, Sampson NA, et al. Prevalence and correlates of bipolar spectrum disorder in the world mental health survey initiative. Archives of General Psychiatry. 2011;68:241–251. Available from: pmid:21383262
- View Article
- PubMed/NCBI
- Google Scholar
5. Disease G, Incidence I, Collaborators P. Global, regional, and national incidence, prevalence, and years lived with disability for 328 diseases and injuries for 195 countries, 1990-2016: A systematic analysis for the Global Burden of Disease Study 2016. The Lancet. 2017;390:1211–1259. Available from:
- View Article
- Google Scholar
6. Bienvenu OJ, Davydow DS, Kendler KS. Psychiatric ‘diseases’ versus behavioral disorders and degree of genetic influence. Psychological Medicine. 2011;41:33–40. Available from: pmid:20459884
- View Article
- PubMed/NCBI
- Google Scholar
7. Craddock N, Sklar P. Genetics of bipolar disorder. The Lancet. 2013;381:1654–1662. Available from:
- View Article
- Google Scholar
8. Merikangas K, Yu K. Genetic epidemiology of bipolar disorder. Clinical Neuroscience Research. 2002;2:127–141. Available from:
- View Article
- Google Scholar
9. Smoller JW, Finn CT. Family, twin, and adoption studies of bipolar disorder. American Journal of Medical Genetics Part C. 2003. Available from: pmid:14601036
- View Article
- PubMed/NCBI
- Google Scholar
10. Song J, Bergen SE, Kuja-Halkola R, Larsson H, Landén M, Lichtenstein P. Bipolar disorder and its relation to major psychiatric disorders: A family-based study in the Swedish population. Bipolar Disorders. 2015;17:184–193. Available from: pmid:25118125
- View Article
- PubMed/NCBI
- Google Scholar
11. Kendler KS, Ohlsson H, Sundquist J, Sundquist K. An extended Swedish national adoption study of bipolar disorder illness and cross-generational familial association with schizophrenia and major depression. JAMA Psychiatry. 2020;77:814–822. pmid:32186664
- View Article
- PubMed/NCBI
- Google Scholar
12. World Health Organization WS. The ICD-10 classification of mental and behavioural disorders: Clinical descriptions and diagnostic guidelines. Geneva. 1992. Available from: Switzerland: World Health Organization. https://books.google.com/books/about/The_ICD_10_Classification_of_Mental_and.html?hl=&id=DFM0DgAAQBAJ.
13. Grande I, Berk M, Birmaher B, Vieta E. Bipolar disorder. The Lancet. 2016;387:1561–1572. Available from:
- View Article
- Google Scholar
14. Charney AW, Ruderfer DM, Stahl EA, Moran JL, Chambert K, Belliveau RA, et al. Evidence for genetic heterogeneity between clinical subtypes of bipolar disorder. Translational Psychiatry. 2017;7:e993. Available from: pmid:28072414
- View Article
- PubMed/NCBI
- Google Scholar
15. Allardyce J, Leonenko G, Hamshere M, Pardiñas AF, Forty L, Knott S, et al. Association between schizophrenia-related polygenic liability and the occurrence and level of mood-incongruent psychotic symptoms in bipolar disorder. JAMA Psychiatry. 2018;75:28–35. Available from: pmid:29167880
- View Article
- PubMed/NCBI
- Google Scholar
16. Markota M, Coombes BJ, Larrabee BR, McElroy SL, Bond DJ, Veldic M, et al. Association of schizophrenia polygenic risk score with manic and depressive psychosis in bipolar disorder. Translational Psychiatry. 2018;8:188. Available from: pmid:30201969
- View Article
- PubMed/NCBI
- Google Scholar
17. Disorder B, of the Psychiatric Genomics Consortium SWG. Genomic dissection of bipolar disorder and schizophrenia, including 28 subphenotypes. Cell. 2018;173:1705–1715. Available from: pmid:29906448
- View Article
- PubMed/NCBI
- Google Scholar
18. Lewis KJS, Richards A, Karlsson R, Leonenko G, Jones SE, Jones HJ, et al. Comparison of Genetic Liability for Sleep Traits Among Individuals With Bipolar Disorder I or II and Control Participants. JAMA Psychiatry. 2020 03;77(3):303–10. Available from: https://doi.org/10.1001/jamapsychiatry.2019.4079 pmid:31751445
- View Article
- PubMed/NCBI
- Google Scholar
19. Charney AW, Stahl EA, Green EK, Chen CY, Moran JL, Chambert K, et al. Contribution of rare copy number variants to bipolar disorder risk is limited to schizoaffective cases. Biological Psychiatry. 2019;86:110–119. Available from: pmid:30686506
- View Article
- PubMed/NCBI
- Google Scholar
20. Stahl EA, Breen G, Forstner AJ, McQuillin A, Ripke S, et al. Genome-wide association study identifies 30 loci associated with bipolar disorder. Nature Genetics. 2019;51:793–803. Available from: pmid:31043756
- View Article
- PubMed/NCBI
- Google Scholar
21. Coombes B, Markota M, Mann J, Colby C, Stahl E, Talati A, et al. Dissecting clinical heterogeneity of bipolar disorder using multiple polygenic risk scores. medRxiv. 2020. Available from: pmid:32948743
- View Article
- PubMed/NCBI
- Google Scholar
22. Carvalho AF, Firth J, Vieta E. Bipolar disorder. The New England Journal of Medicine. 2020;383:58–66. Available from: pmid:32609982
- View Article
- PubMed/NCBI
- Google Scholar
23. O’Connel KS, Coombes BJ. Genetic contributions to bipolar disorder: current status and future directions. Psychologial Medicine. 2021;51(13):2156–67.
- View Article
- Google Scholar
24. International Consortium on Lithium Genetics [ConLi + Gen] A, A A T, H KO, H L, C S R, …, et al. Association of polygenic score for schizophrenia and HLA antigen and inflammation genes with response to lithium in bipolar affective disorder: A genome-wide association study. JAMA Psychiatry. 2018;75:65–74.
- View Article
- Google Scholar
25. Amare AT, Schubert KO, Hou L, Clark SR, Papiol S, Cearns M, et al. Association of polygenic score for major depression with response to lithium in patients with bipolar disorder. Nature, Molecular Psychiatry. 2021;26:2457–70. Available from:
- View Article
- Google Scholar
26. Nunes A, Trappenberg T, Alda M. Asymmetrical reliability of the Alda score favours a dichotomous representation of lithium responsiveness. PLoS ONE. 2020;15:e0225353. Available from: pmid:31986152
- View Article
- PubMed/NCBI
- Google Scholar
27. Gordovez FJA, McMahon FJ. The genetics of bipolar disorder. Molecular Psychiatry. 2020;25:544–559. Available from: pmid:31907381
- View Article
- PubMed/NCBI
- Google Scholar
28. Ho AMC, Coombes BJ, Nguyen TTL, Liu D, McElroy SL, Singh B, et al. Mood-stabilizing antiepileptic treatment response in bipolar disorder: A genome-wide association study. Clinical Pharmacology and Therapeutics. 2020;108:1233–1242. Available from: pmid:32627186
- View Article
- PubMed/NCBI
- Google Scholar
29. Mullins N, Forstner AJ, O’Connell KS, Coombes B, Coleman JRI, Qiao Z, et al. Genome-wide association study of over 40000 bipolar disorder cases provides novel biological insights. medRxiv. 2020. Available from:
- View Article
- Google Scholar
30. Gatz M, Reynolds CA, Fratiglioni L, Johansson B, Mortimer JA, Berg S, et al. Role of genes and environments for explaining Alzheimer disease. Archives of General Psychiatry. 2006;63:168–174. Available from: pmid:16461860
- View Article
- PubMed/NCBI
- Google Scholar
31. Browne HA, Gair SL, Scharf JM, Grice DE. Genetics of obsessive-compulsive disorder and related disorders. The Psychiatric Clinics of North America. 2014;37:319–335. Available from: pmid:25150565
- View Article
- PubMed/NCBI
- Google Scholar
32. Zilhão NR, Olthof MC, Smit DJA, Cath DC, Ligthart L, Mathews CA, et al. Heritability of tic disorders: A twin-family study. Psychological Medicine. 2017;47:1085–1096. Available from: pmid:27974054
- View Article
- PubMed/NCBI
- Google Scholar
33. Walters RK, Polimanti R, Johnson EC, McClintick JN, Adams MJ, Adkins AE, et al. Transancestral GWAS of alcohol dependence reveals common genetic underpinnings with psychiatric disorders. Nature Neuroscience. 2018;21:1656–1669. Available from: pmid:30482948
- View Article
- PubMed/NCBI
- Google Scholar
34. of the Psychiatric Genomics Consortium CDG. Genomic relationships, novel loci, and pleiotropic mechanisms across eight psychiatric disorders. Cell. 2019;179:1469–1482. Available from:
- View Article
- Google Scholar
35. Demontis D, Rajagopal VM, Thorgeirsson TE, Als TD, Grove J, Leppälä K, et al. Genome-wide association study implicates CHRNA2 in cannabis use disorder. Nature Neuroscience. 2019;22:1066–1074. Available from: pmid:31209380
- View Article
- PubMed/NCBI
- Google Scholar
36. Faraone SV, Larsson H. Genetics of attention deficit hyperactivity disorder. Molecular Psychiatry. 2019;24:562–575. Available from: pmid:29892054
- View Article
- PubMed/NCBI
- Google Scholar
37. Jansen IE, Savage JE, Watanabe K, Bryois J, Williams DM, Steinberg S, et al. Genome-wide meta-analysis identifies new loci and functional pathways influencing Alzheimer’s disease risk. Nature Genetics. 2019;51:404–413. Available from: pmid:30617256
- View Article
- PubMed/NCBI
- Google Scholar
38. Purves KL, Coleman JRI, Meier SM, Rayner C, Davis KAS, Cheesman R, et al. A major role for common genetic variation in anxiety disorders. Molecular Psychiatry. 2020;25:3292–3303. Available from: pmid:31748690
- View Article
- PubMed/NCBI
- Google Scholar
39. Djurovic S, Gustafsson O, Mattingsdal M, Athanasiu L, Bjella T, Tesli M, et al. A genome-wide association study of bipolar disorder in Norwegian individuals, followed by replication in Icelandic sample. Journal of Affective Disorders. 2010;126:312–316. Available from: pmid:20451256
- View Article
- PubMed/NCBI
- Google Scholar
40. Smith EN, Koller DL, Panganiban C, Szelinger S, Zhang P, Badner JA, et al. Genome-wide association of bipolar disorder suggests an enrichment of replicable associations in regions near genes. PLoS Genetics. 2011;7:e1002134. Available from: pmid:21738484
- View Article
- PubMed/NCBI
- Google Scholar
41. Winham SJ, Cuellar-Barboza AB, Oliveros A, McElroy SL, Crow S, Colby C, et al. Genome-wide association study of bipolar disorder accounting for effect of body mass index identifies a new risk allele in TCF7L2. Molecular Psychiatry. 2014;19:1010–1016. Available from: pmid:24322204
- View Article
- PubMed/NCBI
- Google Scholar
42. Aas M, Haukvik UK, Djurovic S, Tesli M, Athanasiu L, Bjella T, et al. Interplay between childhood trauma and BDNF val66met variants on blood BDNF mRNA levels and on hippocampus subfields volumes in schizophrenia spectrum and bipolar disorders. Journal of Psychiatric Research. 2014;59:14–21. Available from: pmid:25246365
- View Article
- PubMed/NCBI
- Google Scholar
43. Oliveira J, Kazma R, Le Floch E, Bennabi M, Hamdani N, Bengoufa D, et al. Toxoplasma gondii exposure may modulate the influence of TLR2 genetic variation on bipolar disorder: A gene–environment interaction study. International Journal of Bipolar Disorders. 2016. Available from: pmid:27207565
- View Article
- PubMed/NCBI
- Google Scholar
44. Hosang GM, Fisher HL, Cohen-Woods S, McGuffin P, Farmer AE. Stressful life events and catechol-O-methyl-transferase (COMT) gene in bipolar disorder. Depression and Anxiety. 2017;34:419–426. Available from: pmid:28102561
- View Article
- PubMed/NCBI
- Google Scholar
45. Aas M, Bellivier F, Bettella F, Henry C, Gard S, Kahn JP, et al. Childhood maltreatment and polygenic risk in bipolar disorders. Bipolar Disorders. 2020;22:174–181. Available from: pmid:31628696
- View Article
- PubMed/NCBI
- Google Scholar
46. International Schizophrenia Consortium P, W S M, S N R, V J L, O P M, …, et al. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature. 2009;460:748–752. Available from:
- View Article
- Google Scholar
47. Wilcox HC, Fullerton JM, Glowinski AL, Benke K, Kamali M, Hulvershorn LA, et al. Traumatic stress interacts with bipolar disorder genetic risk to increase risk for suicide attempts. Journal of the American Academy of Child and Adolescent Psychiatry. 2017;56:1073–1080. Available from: pmid:29173741
- View Article
- PubMed/NCBI
- Google Scholar
48. Mistry S, Harrison JR, Smith DJ, Escott-Price V, Zammit S. The use of polygenic risk scores to identify phenotypes associated with genetic risk of bipolar disorder and depression: A systematic review. Journal of Affective Disorders. 2018;234:148–155. Available from: pmid:29529547
- View Article
- PubMed/NCBI
- Google Scholar
49. Reginsson GW, Ingason A, Euesden J, Bjornsdottir G, Olafsson S, Sigurdsson E, et al. Polygenic risk scores for schizophrenia and bipolar disorder associate with addiction. Addiction Biology. 2018;23:485–492. Available from: pmid:28231610
- View Article
- PubMed/NCBI
- Google Scholar
50. Mistry S, Escott-Price V, Florio AD, Smith DJ, Zammit S. Genetic risk for bipolar disorder and psychopathology from childhood to early adulthood. Journal of Affective Disorders. 2019;246:633–639. pmid:30611060
- View Article
- PubMed/NCBI
- Google Scholar
51. Mistry S, Escott-Price V, Florio AD, Smith DJ, Zammit S. Investigating associations between genetic risk for bipolar disorder and cognitive functioning in childhood. Journal of Affective Disorders. 2019;259:112–120. pmid:31445336
- View Article
- PubMed/NCBI
- Google Scholar
52. Musliner KL, Mortensen PB, McGrath JJ, Suppli NP, Hougaard DM, …, et al. Association of polygenic liabilities for major depression, bipolar disorder, and schizophrenia with risk for depression in the Danish population. JAMA Psychiatry. 2019;76:516–525. pmid:30698613
- View Article
- PubMed/NCBI
- Google Scholar
53. Musliner KL, Krebs MD, Albiñana C, Vilhjalmsson B, Agerbo E, Zandi PP, et al. Polygenic risk and progression to bipolar or psychotic disorders among individuals diagnosed with unipolar depression in early life. American Journal of Psychiatry. 2020;177:936–943. pmid:32660297
- View Article
- PubMed/NCBI
- Google Scholar
54. Mullins N, Bigdeli TB, Børglum AD, Coleman JRI, Demontis D, Mehta D, et al. GWAS Of suicide attempt in psychiatric disorders and association with major depression polygenic risk scores. The American Journal of Psychiatry. 2019;176:651–660. pmid:31164008
- View Article
- PubMed/NCBI
- Google Scholar
55. Grigoroiu-Serbanescu M, Giaroli G, Thygesen JH, Shenyan O, Bigdeli TB, Bass NJ, et al. Predictive power of the ADHD GWAS 2019 polygenic risk scores in independent samples of bipolar patients with childhood ADHD. Journal of Affective Disorders. 2020;265:651–659. pmid:31791676
- View Article
- PubMed/NCBI
- Google Scholar
56. Chen DT, Jiang X, Akula N, Shugart YY, Wendland JR, Steele CJM, et al. Genome-wide association study meta-analysis of European and Asian-ancestry samples identifies three novel loci associated with bipolar disorder. Molecular Psychiatry. 2013;18:195–205. pmid:22182935
- View Article
- PubMed/NCBI
- Google Scholar
57. Joslyn C, Hawes DJ, Hunt C, Mitchell PB. Is age of onset associated with severity, prognosis, and clinical features in bipolar disorder? A meta-analytic review. Bipolar Disorders. 2016;18:389–403. pmid:27530107
- View Article
- PubMed/NCBI
- Google Scholar
58. Eser HY, Kacar AS, Kilciksiz CM, Yalçinay-Inan M, Ongur D. Prevalence and associated features of anxiety disorder comorbidity in bipolar disorder: A meta-analysis and meta-regression study. Frontiers in Psychiatry. 2018;9:229. Available from:
- View Article
- Google Scholar
59. 23andMe. Research Consent Document; 2023. [Online; accessed 19-Dec-2023]. https://www.23andme.com/about/consent/.
60. McGrouther C. Confounding factors in association studies of neuropsychiatric disease: A case study of Bipolar Affective Disorder [Ph.D. Thesis]. San Diego, CA: University of California, San Diego; 2014.
61. Xie J, Ma A, Fennell A, Ma Q, Zhao J. It is time to apply biclustering: a comprehensive review of biclustering applications in biological and biomedical data. Brief Bioinform. 2019;20(4):1449–64. pmid:29490019
- View Article
- PubMed/NCBI
- Google Scholar
62. Dahl A, Zaitlen N. Genetic Influences on Disease Subtypes. Annu Rev Genomics Hum Genet. 2020;21:413–35. pmid:32873077
- View Article
- PubMed/NCBI
- Google Scholar
63. Rangan AV. A simple filter for detecting low-rank submatrices. Journal of Computational Physics. 2012 Apr;231(7):2682–90.
- View Article
- Google Scholar
64. Rangan AV, McGrouther CC, Kelsoe J, Schork N, Stahl E, Zhu Q, et al. A loop-counting method for covariate-corrected low-rank biclustering of gene-expression and genome-wide association study data. PLOS Computational Biology. 2018 05;14(5):1–29. Available from: pmid:29758032
- View Article
- PubMed/NCBI
- Google Scholar
65. Consortium WTCC. Genome-wide association study of 14 000 cases of seven common diseases and 3000 shared controls. Nature. 2007;447:661–678.
- View Article
- Google Scholar
66. Jones L, Metcalf A, Gordon-Smith K, Forty L, Perry A, Lloyd J, et al. Gambling problems in bipolar disorder in the UK: Prevalence and distribution. British Journal of Psychiatry. 2015;207(4):328–333. Available from:
- View Article
- Google Scholar
67. Alon N, Krivelevich M, Sudakov B. Finding a large hidden clique in a random graph. Random Structures & Algorithms. 1998;13(3-4):457–66. https://onlinelibrary.wiley.com/doi/abs/10.1002/%28SICI%291098-2418%28199810/12%2913%3A3/4%3C457%3A%3AAID-RSA14%3E3.0.CO%3B2-W.
- View Article
- Google Scholar
68. Deshpande Y, Montanari A. Improved Sum-of-Squares Lower Bounds for Hidden Clique and Hidden Submatrix Problems; 2015.
69. McCarthy Sea. A reference panel of 64,976 haplotypes for genotype imputation. Nature genetics. 2016;48(10):1279–83. pmid:27548312
- View Article
- PubMed/NCBI
- Google Scholar
70. Zhu Q, Wong AK, Krishnan A, Aure MR, Tadych A, Zhang R, et al. Targeted exploration and analysis of large cross-platform human transcriptomic compendia. Nature Methods. 2015;12(3):211–4. pmid:25581801
- View Article
- PubMed/NCBI
- Google Scholar
71. Lee SH, Goddard ME, Wray NR, Visscher PM. A Better Coefficient of Determination for Genetic Profile Analysis. Genetic Epidemiology. 2012;36:214–24. pmid:22714935
- View Article
- PubMed/NCBI
- Google Scholar
72. O’Connell KS, Koromina M, van der Veen T, Boltz T, David FS, Kay Yang JM, et al. Genomics yields biological and phenotypic insights into bipolar disorder. medRxiv. 2024. Available from: https://www.medrxiv.org/content/early/2024/08/28/2023.10.07.23296687.
- View Article
- Google Scholar
73. Kranzler HR, Zhou H, Kember RL, Vickers Smith R, Justice AC, Damrauer S, et al. Genome-wide association study of alcohol consumption and use disorder in 274 424 individuals from multiple populations. Nature Communications. 2019;10:1499. pmid:30940813
- View Article
- PubMed/NCBI
- Google Scholar
74. Jang SK, Saunders G, Liu M, Jiang Y, Liu DJ, Vrieze S. Genetic correlation, pleiotropy, and causal associations between substance use and psychiatric disorder. Psychological Medicine. 2020. Available from: pmid:32762793
- View Article
- PubMed/NCBI
- Google Scholar
75. van Hulzen KJE, Scholz CJ, Franke B, Ripke S, Klein M, McQuillin A, et al. Genetic overlap between attention-deficit/hyperactivity disorder and bipolar disorder: Evidence from genome-wide association study meta-analysis. Biological Psychiatry. 2017;82:634–641. Available from: pmid:27890468
- View Article
- PubMed/NCBI
- Google Scholar
76. O'Connell KS, McGregor NW, Lochner C, Emsley R, Warnich L. The genetic architecture of schizophrenia, bipolar disorder, obsessive-compulsive disorder and autism spectrum disorder. Molecular and Cellular Neurosciences. 2018;88:300–307. Available from: pmid:29505902
- View Article
- PubMed/NCBI
- Google Scholar
77. Coleman JRI, Gaspar HA, Bryois J, Bipolar Disorder Working Group of the Psychiatric Genomics Consortium MDDWGotPGC, Breen G. The genetics of the mood disorder Spectrum: Genome-wide association analyses of more than 185 000 cases and 439 000 controls. Biological Psychiatry. 2020;88:169–184. Available from: pmid:31926635
- View Article
- PubMed/NCBI
- Google Scholar
78. Fr#x00ED;as A, Baltasar I, Birmaher B. Comorbidity between bipolar disorder and borderline personality disorder: Prevalence, explanatory theories, and clinical impact. Journal of Affective Disorders. 2016;202:210–219. Available from:
- View Article
- Google Scholar
79. Salloum IM, Brown ES. Management of comorbid bipolar disorder and substance use disorders. The American Journal of Drug and Alcohol Abuse. 2017;43:366–376. Available from: pmid:28301219
- View Article
- PubMed/NCBI
- Google Scholar
80. Bortolato B, Berk M, Maes M, McIntyre RS, Carvalho AF. Fibromyalgia and bipolar disorder: Emerging epidemiological associations and shared pathophysiology. Current Molecular Medicine. 2016;16:119–136. Available from: pmid:26812920
- View Article
- PubMed/NCBI
- Google Scholar
81. Correll CU, Solmi M, Veronese N, Bortolato B, Rosson S, Santonastaso P, et al. Prevalence, incidence and mortality from cardiovascular disease in patients with pooled and specific severe mental illness: A large-scale meta-analysis of 3 211 768 patients and 113 383 368 controls. World Psychiatry: Official Journal of the World Psychiatric Association. 2017;16:163–180. Available from: pmid:28498599
- View Article
- PubMed/NCBI
- Google Scholar
82. Vancampfort D, Correll CU, Galling B, Probst M, De Hert M, Ward PB, et al. Diabetes mellitus in people with schizophrenia, bipolar disorder and major depressive disorder: A systematic review and large scale meta-analysis. World Psychiatry: Official Journal of the World Psychiatric Association. 2016;15:166–174. Available from: pmid:27265707
- View Article
- PubMed/NCBI
- Google Scholar
83. Roshanaei-Moghaddam B, Katon W. Premature mortality from general medical illnesses among persons with bipolar disorder: A review. Psychiatric Services. 2009;60:147–156. Available from: pmid:19176408
- View Article
- PubMed/NCBI
- Google Scholar
84. Kessing LV, Vradi E, McIntyre RS, Andersen PK. Causes of decreased life expectancy over the life span in bipolar disorder. Journal of Affective Disorders. 2015;180:142–147. Available from: pmid:25909752
- View Article
- PubMed/NCBI
- Google Scholar
85. Diflorio A, Jones I. Is sex important? Gender differences in bipolar disorder. International Review of Psychiatry. 2010;22:437–452. Available from: pmid:21047158
- View Article
- PubMed/NCBI
- Google Scholar
86. Nivoli AMA, Pacchiarotti I, Rosa AR, Popovic D, Murru A, Valenti M, et al. Gender differences in a cohort study of 604 bipolar patients: The role of predominant polarity. Journal of Affective Disorders. 2011;133:443–449. Available from: pmid:21620480
- View Article
- PubMed/NCBI
- Google Scholar
87. Zimmerman M, Ruggero CJ, Chelminski I, Young D. Is bipolar disorder overdiagnosed? The Journal of Clinical Psychiatry. 2008;69:935–40. Available from: pmid:18466044
- View Article
- PubMed/NCBI
- Google Scholar
88. Goes FS, Pirooznia M, Parla JS, Kramer M, Ghiban E, Mavruk S, et al. Exome sequencing of familial bipolar disorder. JAMA Psychiatry. 2016;73:590–597. Available from: pmid:27120077
- View Article
- PubMed/NCBI
- Google Scholar
89. Maaser A, Forstner AJ, Strohmaier J, Hecker J, Ludwig KU, …, et al. Exome sequencing in large, multiplex bipolar disorder families from Cuba. PLoS ONE. 2018;13:e0205895. Available from: pmid:30379966
- View Article
- PubMed/NCBI
- Google Scholar
90. Toma C, Shaw AD, Allcock RJN, Heath A, Pierce KD, Mitchell PB, et al. An examination of multiple classes of rare variants in extended families with bipolar disorder. Translational Psychiatry. 2018;8:65. Available from: pmid:29531218
- View Article
- PubMed/NCBI
- Google Scholar
91. Goes FS, Pirooznia M, Tehan M, Zandi PP, McGrath J, Wolyniec P, et al. De novo variation in bipolar disorder. Molecular Psychiatry. 2019;387:1561. Available from: pmid:31776463
- View Article
- PubMed/NCBI
- Google Scholar
92. Forstner AJ, Fischer SB, Schenk LM, Strohmaier J, Maaser-Hecker A, Reinbold CS, et al. Whole-exome sequencing of 81 individuals from 27 multiply affected bipolar disorder families. Translational Psychiatry. 2020;10:57. Available from: pmid:32066727
- View Article
- PubMed/NCBI
- Google Scholar
93. Sul JH, Service SK, Huang AY, Ramensky V, Hwang SG, Teshiba TM, et al. Contribution of common and rare variants to bipolar disorder susceptibility in extended pedigrees from population isolates. Translational Psychiatry. 2020;10:74. Available from: pmid:32094344
- View Article
- PubMed/NCBI
- Google Scholar
94. Akinhanmi MO, Biernacka JM, Strakowski SM, McElroy SL, Balls Berry JE, Merikangas KR, et al. Racial disparities in bipolar disorder treatment and research: A call to action. Bipolar Disorders. 2018;20:506–514. Available from: pmid:29527766
- View Article
- PubMed/NCBI
- Google Scholar
95. Martin AR, Kanai M, Kamatani Y, Okada Y, Neale BM, Daly MJ. Clinical use of current polygenic risk scores may exacerbate health disparities. Nature Genetics. 2019;51:584–591. Available from: pmid:30926966
- View Article
- PubMed/NCBI
- Google Scholar
96. Peterson RE, Kuchenbaecker K, Walters RK, Chen CY, Popejoy AB, Periyasamy S, et al. Genome-wide association studies in ancestrally diverse populations: Opportunities, methods, pitfalls, and recommendations. Cell. 2019;179:589–603. Available from: pmid:31607513
- View Article
- PubMed/NCBI
- Google Scholar
97. Sirugo G, Williams SM, Tishkoff SA. The missing diversity in human genetic studies. Cell. 2019;177:1080. Available from: pmid:31051100
- View Article
- PubMed/NCBI
- Google Scholar
98. Duncan L, Shen H, Gelaye B, Meijsen J, Ressler K, Feldman M, et al. Analysis of polygenic risk score usage and performance in diverse human populations. Nature Communications 10(1). 2019. Available from: pmid:31346163
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Association AP. Diagnostic and statistical manual of mental disorders [5th edition]. Washington. 2013;DC 20024:USA: American Psychiatric Association Publishing. Available from: https://books.google.com/books/about/Diagnostic_and_Statistical_Manual_of_Men.html?hl=&id=-JivBAAAQBAJ.

[ref2] 2. Angst J. The emerging epidemiology of hypomania and bipolar II disorder. Journal of Affective Disorders. 1998;50:143–151. Available from: pmid:9858074
View Article
PubMed/NCBI
Google Scholar

[3] View Article

[4] PubMed/NCBI

[5] Google Scholar

[ref3] 3. Merikangas KR, Akiskal HS, Angst J, Greenberg PE, Hirschfeld RMA, Petukhova M, et al. Lifetime and 12-month prevalence of bipolar spectrum disorder in the National Comorbidity Survey replication. Archives of General Psychiatry. 2007;64:543–552. Available from: pmid:17485606
View Article
PubMed/NCBI
Google Scholar

[7] View Article

[8] PubMed/NCBI

[9] Google Scholar

[ref4] 4. Merikangas KR, Jin R, He JP, Kessler RC, Lee S, Sampson NA, et al. Prevalence and correlates of bipolar spectrum disorder in the world mental health survey initiative. Archives of General Psychiatry. 2011;68:241–251. Available from: pmid:21383262
View Article
PubMed/NCBI
Google Scholar

[11] View Article

[12] PubMed/NCBI

[13] Google Scholar

[ref5] 5. Disease G, Incidence I, Collaborators P. Global, regional, and national incidence, prevalence, and years lived with disability for 328 diseases and injuries for 195 countries, 1990-2016: A systematic analysis for the Global Burden of Disease Study 2016. The Lancet. 2017;390:1211–1259. Available from:
View Article
Google Scholar

[15] View Article

[16] Google Scholar

[ref6] 6. Bienvenu OJ, Davydow DS, Kendler KS. Psychiatric ‘diseases’ versus behavioral disorders and degree of genetic influence. Psychological Medicine. 2011;41:33–40. Available from: pmid:20459884
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref7] 7. Craddock N, Sklar P. Genetics of bipolar disorder. The Lancet. 2013;381:1654–1662. Available from:
View Article
Google Scholar

[22] View Article

[23] Google Scholar

[ref8] 8. Merikangas K, Yu K. Genetic epidemiology of bipolar disorder. Clinical Neuroscience Research. 2002;2:127–141. Available from:
View Article
Google Scholar

[25] View Article

[26] Google Scholar

[ref9] 9. Smoller JW, Finn CT. Family, twin, and adoption studies of bipolar disorder. American Journal of Medical Genetics Part C. 2003. Available from: pmid:14601036
View Article
PubMed/NCBI
Google Scholar

[28] View Article

[29] PubMed/NCBI

[30] Google Scholar

[ref10] 10. Song J, Bergen SE, Kuja-Halkola R, Larsson H, Landén M, Lichtenstein P. Bipolar disorder and its relation to major psychiatric disorders: A family-based study in the Swedish population. Bipolar Disorders. 2015;17:184–193. Available from: pmid:25118125
View Article
PubMed/NCBI
Google Scholar

[32] View Article

[33] PubMed/NCBI

[34] Google Scholar

[ref11] 11. Kendler KS, Ohlsson H, Sundquist J, Sundquist K. An extended Swedish national adoption study of bipolar disorder illness and cross-generational familial association with schizophrenia and major depression. JAMA Psychiatry. 2020;77:814–822. pmid:32186664
View Article
PubMed/NCBI
Google Scholar

[36] View Article

[37] PubMed/NCBI

[38] Google Scholar

[ref12] 12. World Health Organization WS. The ICD-10 classification of mental and behavioural disorders: Clinical descriptions and diagnostic guidelines. Geneva. 1992. Available from: Switzerland: World Health Organization. https://books.google.com/books/about/The_ICD_10_Classification_of_Mental_and.html?hl=&id=DFM0DgAAQBAJ.

[ref13] 13. Grande I, Berk M, Birmaher B, Vieta E. Bipolar disorder. The Lancet. 2016;387:1561–1572. Available from:
View Article
Google Scholar

[41] View Article

[42] Google Scholar

[ref14] 14. Charney AW, Ruderfer DM, Stahl EA, Moran JL, Chambert K, Belliveau RA, et al. Evidence for genetic heterogeneity between clinical subtypes of bipolar disorder. Translational Psychiatry. 2017;7:e993. Available from: pmid:28072414
View Article
PubMed/NCBI
Google Scholar

[44] View Article

[45] PubMed/NCBI

[46] Google Scholar

[ref15] 15. Allardyce J, Leonenko G, Hamshere M, Pardiñas AF, Forty L, Knott S, et al. Association between schizophrenia-related polygenic liability and the occurrence and level of mood-incongruent psychotic symptoms in bipolar disorder. JAMA Psychiatry. 2018;75:28–35. Available from: pmid:29167880
View Article
PubMed/NCBI
Google Scholar

[48] View Article

[49] PubMed/NCBI

[50] Google Scholar

[ref16] 16. Markota M, Coombes BJ, Larrabee BR, McElroy SL, Bond DJ, Veldic M, et al. Association of schizophrenia polygenic risk score with manic and depressive psychosis in bipolar disorder. Translational Psychiatry. 2018;8:188. Available from: pmid:30201969
View Article
PubMed/NCBI
Google Scholar

[52] View Article

[53] PubMed/NCBI

[54] Google Scholar

[ref17] 17. Disorder B, of the Psychiatric Genomics Consortium SWG. Genomic dissection of bipolar disorder and schizophrenia, including 28 subphenotypes. Cell. 2018;173:1705–1715. Available from: pmid:29906448
View Article
PubMed/NCBI
Google Scholar

[56] View Article

[57] PubMed/NCBI

[58] Google Scholar

[ref18] 18. Lewis KJS, Richards A, Karlsson R, Leonenko G, Jones SE, Jones HJ, et al. Comparison of Genetic Liability for Sleep Traits Among Individuals With Bipolar Disorder I or II and Control Participants. JAMA Psychiatry. 2020 03;77(3):303–10. Available from: https://doi.org/10.1001/jamapsychiatry.2019.4079 pmid:31751445
View Article
PubMed/NCBI
Google Scholar

[60] View Article

[61] PubMed/NCBI

[62] Google Scholar

[ref19] 19. Charney AW, Stahl EA, Green EK, Chen CY, Moran JL, Chambert K, et al. Contribution of rare copy number variants to bipolar disorder risk is limited to schizoaffective cases. Biological Psychiatry. 2019;86:110–119. Available from: pmid:30686506
View Article
PubMed/NCBI
Google Scholar

[64] View Article

[65] PubMed/NCBI

[66] Google Scholar

[ref20] 20. Stahl EA, Breen G, Forstner AJ, McQuillin A, Ripke S, et al. Genome-wide association study identifies 30 loci associated with bipolar disorder. Nature Genetics. 2019;51:793–803. Available from: pmid:31043756
View Article
PubMed/NCBI
Google Scholar

[68] View Article

[69] PubMed/NCBI

[70] Google Scholar

[ref21] 21. Coombes B, Markota M, Mann J, Colby C, Stahl E, Talati A, et al. Dissecting clinical heterogeneity of bipolar disorder using multiple polygenic risk scores. medRxiv. 2020. Available from: pmid:32948743
View Article
PubMed/NCBI
Google Scholar

[72] View Article

[73] PubMed/NCBI

[74] Google Scholar

[ref22] 22. Carvalho AF, Firth J, Vieta E. Bipolar disorder. The New England Journal of Medicine. 2020;383:58–66. Available from: pmid:32609982
View Article
PubMed/NCBI
Google Scholar

[76] View Article

[77] PubMed/NCBI

[78] Google Scholar

[ref23] 23. O’Connel KS, Coombes BJ. Genetic contributions to bipolar disorder: current status and future directions. Psychologial Medicine. 2021;51(13):2156–67.
View Article
Google Scholar

[80] View Article

[81] Google Scholar

[ref24] 24. International Consortium on Lithium Genetics [ConLi + Gen] A, A A T, H KO, H L, C S R, …, et al. Association of polygenic score for schizophrenia and HLA antigen and inflammation genes with response to lithium in bipolar affective disorder: A genome-wide association study. JAMA Psychiatry. 2018;75:65–74.
View Article
Google Scholar

[83] View Article

[84] Google Scholar

[ref25] 25. Amare AT, Schubert KO, Hou L, Clark SR, Papiol S, Cearns M, et al. Association of polygenic score for major depression with response to lithium in patients with bipolar disorder. Nature, Molecular Psychiatry. 2021;26:2457–70. Available from:
View Article
Google Scholar

[86] View Article

[87] Google Scholar

[ref26] 26. Nunes A, Trappenberg T, Alda M. Asymmetrical reliability of the Alda score favours a dichotomous representation of lithium responsiveness. PLoS ONE. 2020;15:e0225353. Available from: pmid:31986152
View Article
PubMed/NCBI
Google Scholar

[89] View Article

[90] PubMed/NCBI

[91] Google Scholar

[ref27] 27. Gordovez FJA, McMahon FJ. The genetics of bipolar disorder. Molecular Psychiatry. 2020;25:544–559. Available from: pmid:31907381
View Article
PubMed/NCBI
Google Scholar

[93] View Article

[94] PubMed/NCBI

[95] Google Scholar

[ref28] 28. Ho AMC, Coombes BJ, Nguyen TTL, Liu D, McElroy SL, Singh B, et al. Mood-stabilizing antiepileptic treatment response in bipolar disorder: A genome-wide association study. Clinical Pharmacology and Therapeutics. 2020;108:1233–1242. Available from: pmid:32627186
View Article
PubMed/NCBI
Google Scholar

[97] View Article

[98] PubMed/NCBI

[99] Google Scholar

[ref29] 29. Mullins N, Forstner AJ, O’Connell KS, Coombes B, Coleman JRI, Qiao Z, et al. Genome-wide association study of over 40000 bipolar disorder cases provides novel biological insights. medRxiv. 2020. Available from:
View Article
Google Scholar

[101] View Article

[102] Google Scholar

[ref30] 30. Gatz M, Reynolds CA, Fratiglioni L, Johansson B, Mortimer JA, Berg S, et al. Role of genes and environments for explaining Alzheimer disease. Archives of General Psychiatry. 2006;63:168–174. Available from: pmid:16461860
View Article
PubMed/NCBI
Google Scholar

[104] View Article

[105] PubMed/NCBI

[106] Google Scholar

[ref31] 31. Browne HA, Gair SL, Scharf JM, Grice DE. Genetics of obsessive-compulsive disorder and related disorders. The Psychiatric Clinics of North America. 2014;37:319–335. Available from: pmid:25150565
View Article
PubMed/NCBI
Google Scholar

[108] View Article

[109] PubMed/NCBI

[110] Google Scholar

[ref32] 32. Zilhão NR, Olthof MC, Smit DJA, Cath DC, Ligthart L, Mathews CA, et al. Heritability of tic disorders: A twin-family study. Psychological Medicine. 2017;47:1085–1096. Available from: pmid:27974054
View Article
PubMed/NCBI
Google Scholar

[112] View Article

[113] PubMed/NCBI

[114] Google Scholar

[ref33] 33. Walters RK, Polimanti R, Johnson EC, McClintick JN, Adams MJ, Adkins AE, et al. Transancestral GWAS of alcohol dependence reveals common genetic underpinnings with psychiatric disorders. Nature Neuroscience. 2018;21:1656–1669. Available from: pmid:30482948
View Article
PubMed/NCBI
Google Scholar

[116] View Article

[117] PubMed/NCBI

[118] Google Scholar

[ref34] 34. of the Psychiatric Genomics Consortium CDG. Genomic relationships, novel loci, and pleiotropic mechanisms across eight psychiatric disorders. Cell. 2019;179:1469–1482. Available from:
View Article
Google Scholar

[120] View Article

[121] Google Scholar

[ref35] 35. Demontis D, Rajagopal VM, Thorgeirsson TE, Als TD, Grove J, Leppälä K, et al. Genome-wide association study implicates CHRNA2 in cannabis use disorder. Nature Neuroscience. 2019;22:1066–1074. Available from: pmid:31209380
View Article
PubMed/NCBI
Google Scholar

[123] View Article

[124] PubMed/NCBI

[125] Google Scholar

[ref36] 36. Faraone SV, Larsson H. Genetics of attention deficit hyperactivity disorder. Molecular Psychiatry. 2019;24:562–575. Available from: pmid:29892054
View Article
PubMed/NCBI
Google Scholar

[127] View Article

[128] PubMed/NCBI

[129] Google Scholar

[ref37] 37. Jansen IE, Savage JE, Watanabe K, Bryois J, Williams DM, Steinberg S, et al. Genome-wide meta-analysis identifies new loci and functional pathways influencing Alzheimer’s disease risk. Nature Genetics. 2019;51:404–413. Available from: pmid:30617256
View Article
PubMed/NCBI
Google Scholar

[131] View Article

[132] PubMed/NCBI

[133] Google Scholar

[ref38] 38. Purves KL, Coleman JRI, Meier SM, Rayner C, Davis KAS, Cheesman R, et al. A major role for common genetic variation in anxiety disorders. Molecular Psychiatry. 2020;25:3292–3303. Available from: pmid:31748690
View Article
PubMed/NCBI
Google Scholar

[135] View Article

[136] PubMed/NCBI

[137] Google Scholar

[ref39] 39. Djurovic S, Gustafsson O, Mattingsdal M, Athanasiu L, Bjella T, Tesli M, et al. A genome-wide association study of bipolar disorder in Norwegian individuals, followed by replication in Icelandic sample. Journal of Affective Disorders. 2010;126:312–316. Available from: pmid:20451256
View Article
PubMed/NCBI
Google Scholar

[139] View Article

[140] PubMed/NCBI

[141] Google Scholar

[ref40] 40. Smith EN, Koller DL, Panganiban C, Szelinger S, Zhang P, Badner JA, et al. Genome-wide association of bipolar disorder suggests an enrichment of replicable associations in regions near genes. PLoS Genetics. 2011;7:e1002134. Available from: pmid:21738484
View Article
PubMed/NCBI
Google Scholar

[143] View Article

[144] PubMed/NCBI

[145] Google Scholar

[ref41] 41. Winham SJ, Cuellar-Barboza AB, Oliveros A, McElroy SL, Crow S, Colby C, et al. Genome-wide association study of bipolar disorder accounting for effect of body mass index identifies a new risk allele in TCF7L2. Molecular Psychiatry. 2014;19:1010–1016. Available from: pmid:24322204
View Article
PubMed/NCBI
Google Scholar

[147] View Article

[148] PubMed/NCBI

[149] Google Scholar

[ref42] 42. Aas M, Haukvik UK, Djurovic S, Tesli M, Athanasiu L, Bjella T, et al. Interplay between childhood trauma and BDNF val66met variants on blood BDNF mRNA levels and on hippocampus subfields volumes in schizophrenia spectrum and bipolar disorders. Journal of Psychiatric Research. 2014;59:14–21. Available from: pmid:25246365
View Article
PubMed/NCBI
Google Scholar

[151] View Article

[152] PubMed/NCBI

[153] Google Scholar

[ref43] 43. Oliveira J, Kazma R, Le Floch E, Bennabi M, Hamdani N, Bengoufa D, et al. Toxoplasma gondii exposure may modulate the influence of TLR2 genetic variation on bipolar disorder: A gene–environment interaction study. International Journal of Bipolar Disorders. 2016. Available from: pmid:27207565
View Article
PubMed/NCBI
Google Scholar

[155] View Article

[156] PubMed/NCBI

[157] Google Scholar

[ref44] 44. Hosang GM, Fisher HL, Cohen-Woods S, McGuffin P, Farmer AE. Stressful life events and catechol-O-methyl-transferase (COMT) gene in bipolar disorder. Depression and Anxiety. 2017;34:419–426. Available from: pmid:28102561
View Article
PubMed/NCBI
Google Scholar

[159] View Article

[160] PubMed/NCBI

[161] Google Scholar

[ref45] 45. Aas M, Bellivier F, Bettella F, Henry C, Gard S, Kahn JP, et al. Childhood maltreatment and polygenic risk in bipolar disorders. Bipolar Disorders. 2020;22:174–181. Available from: pmid:31628696
View Article
PubMed/NCBI
Google Scholar

[163] View Article

[164] PubMed/NCBI

[165] Google Scholar

[ref46] 46. International Schizophrenia Consortium P, W S M, S N R, V J L, O P M, …, et al. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature. 2009;460:748–752. Available from:
View Article
Google Scholar

[167] View Article

[168] Google Scholar

[ref47] 47. Wilcox HC, Fullerton JM, Glowinski AL, Benke K, Kamali M, Hulvershorn LA, et al. Traumatic stress interacts with bipolar disorder genetic risk to increase risk for suicide attempts. Journal of the American Academy of Child and Adolescent Psychiatry. 2017;56:1073–1080. Available from: pmid:29173741
View Article
PubMed/NCBI
Google Scholar

[170] View Article

[171] PubMed/NCBI

[172] Google Scholar

[ref48] 48. Mistry S, Harrison JR, Smith DJ, Escott-Price V, Zammit S. The use of polygenic risk scores to identify phenotypes associated with genetic risk of bipolar disorder and depression: A systematic review. Journal of Affective Disorders. 2018;234:148–155. Available from: pmid:29529547
View Article
PubMed/NCBI
Google Scholar

[174] View Article

[175] PubMed/NCBI

[176] Google Scholar

[ref49] 49. Reginsson GW, Ingason A, Euesden J, Bjornsdottir G, Olafsson S, Sigurdsson E, et al. Polygenic risk scores for schizophrenia and bipolar disorder associate with addiction. Addiction Biology. 2018;23:485–492. Available from: pmid:28231610
View Article
PubMed/NCBI
Google Scholar

[178] View Article

[179] PubMed/NCBI

[180] Google Scholar

[ref50] 50. Mistry S, Escott-Price V, Florio AD, Smith DJ, Zammit S. Genetic risk for bipolar disorder and psychopathology from childhood to early adulthood. Journal of Affective Disorders. 2019;246:633–639. pmid:30611060
View Article
PubMed/NCBI
Google Scholar

[182] View Article

[183] PubMed/NCBI

[184] Google Scholar

[ref51] 51. Mistry S, Escott-Price V, Florio AD, Smith DJ, Zammit S. Investigating associations between genetic risk for bipolar disorder and cognitive functioning in childhood. Journal of Affective Disorders. 2019;259:112–120. pmid:31445336
View Article
PubMed/NCBI
Google Scholar

[186] View Article

[187] PubMed/NCBI

[188] Google Scholar

[ref52] 52. Musliner KL, Mortensen PB, McGrath JJ, Suppli NP, Hougaard DM, …, et al. Association of polygenic liabilities for major depression, bipolar disorder, and schizophrenia with risk for depression in the Danish population. JAMA Psychiatry. 2019;76:516–525. pmid:30698613
View Article
PubMed/NCBI
Google Scholar

[190] View Article

[191] PubMed/NCBI

[192] Google Scholar

[ref53] 53. Musliner KL, Krebs MD, Albiñana C, Vilhjalmsson B, Agerbo E, Zandi PP, et al. Polygenic risk and progression to bipolar or psychotic disorders among individuals diagnosed with unipolar depression in early life. American Journal of Psychiatry. 2020;177:936–943. pmid:32660297
View Article
PubMed/NCBI
Google Scholar

[194] View Article

[195] PubMed/NCBI

[196] Google Scholar

[ref54] 54. Mullins N, Bigdeli TB, Børglum AD, Coleman JRI, Demontis D, Mehta D, et al. GWAS Of suicide attempt in psychiatric disorders and association with major depression polygenic risk scores. The American Journal of Psychiatry. 2019;176:651–660. pmid:31164008
View Article
PubMed/NCBI
Google Scholar

[198] View Article

[199] PubMed/NCBI

[200] Google Scholar

[ref55] 55. Grigoroiu-Serbanescu M, Giaroli G, Thygesen JH, Shenyan O, Bigdeli TB, Bass NJ, et al. Predictive power of the ADHD GWAS 2019 polygenic risk scores in independent samples of bipolar patients with childhood ADHD. Journal of Affective Disorders. 2020;265:651–659. pmid:31791676
View Article
PubMed/NCBI
Google Scholar

[202] View Article

[203] PubMed/NCBI

[204] Google Scholar

[ref56] 56. Chen DT, Jiang X, Akula N, Shugart YY, Wendland JR, Steele CJM, et al. Genome-wide association study meta-analysis of European and Asian-ancestry samples identifies three novel loci associated with bipolar disorder. Molecular Psychiatry. 2013;18:195–205. pmid:22182935
View Article
PubMed/NCBI
Google Scholar

[206] View Article

[207] PubMed/NCBI

[208] Google Scholar

[ref57] 57. Joslyn C, Hawes DJ, Hunt C, Mitchell PB. Is age of onset associated with severity, prognosis, and clinical features in bipolar disorder? A meta-analytic review. Bipolar Disorders. 2016;18:389–403. pmid:27530107
View Article
PubMed/NCBI
Google Scholar

[210] View Article

[211] PubMed/NCBI

[212] Google Scholar

[ref58] 58. Eser HY, Kacar AS, Kilciksiz CM, Yalçinay-Inan M, Ongur D. Prevalence and associated features of anxiety disorder comorbidity in bipolar disorder: A meta-analysis and meta-regression study. Frontiers in Psychiatry. 2018;9:229. Available from:
View Article
Google Scholar

[214] View Article

[215] Google Scholar

[ref59] 59. 23andMe. Research Consent Document; 2023. [Online; accessed 19-Dec-2023]. https://www.23andme.com/about/consent/.

[ref60] 60. McGrouther C. Confounding factors in association studies of neuropsychiatric disease: A case study of Bipolar Affective Disorder [Ph.D. Thesis]. San Diego, CA: University of California, San Diego; 2014.

[ref61] 61. Xie J, Ma A, Fennell A, Ma Q, Zhao J. It is time to apply biclustering: a comprehensive review of biclustering applications in biological and biomedical data. Brief Bioinform. 2019;20(4):1449–64. pmid:29490019
View Article
PubMed/NCBI
Google Scholar

[219] View Article

[220] PubMed/NCBI

[221] Google Scholar

[ref62] 62. Dahl A, Zaitlen N. Genetic Influences on Disease Subtypes. Annu Rev Genomics Hum Genet. 2020;21:413–35. pmid:32873077
View Article
PubMed/NCBI
Google Scholar

[223] View Article

[224] PubMed/NCBI

[225] Google Scholar

[ref63] 63. Rangan AV. A simple filter for detecting low-rank submatrices. Journal of Computational Physics. 2012 Apr;231(7):2682–90.
View Article
Google Scholar

[227] View Article

[228] Google Scholar

[ref64] 64. Rangan AV, McGrouther CC, Kelsoe J, Schork N, Stahl E, Zhu Q, et al. A loop-counting method for covariate-corrected low-rank biclustering of gene-expression and genome-wide association study data. PLOS Computational Biology. 2018 05;14(5):1–29. Available from: pmid:29758032
View Article
PubMed/NCBI
Google Scholar

[230] View Article

[231] PubMed/NCBI

[232] Google Scholar

[ref65] 65. Consortium WTCC. Genome-wide association study of 14 000 cases of seven common diseases and 3000 shared controls. Nature. 2007;447:661–678.
View Article
Google Scholar

[234] View Article

[235] Google Scholar

[ref66] 66. Jones L, Metcalf A, Gordon-Smith K, Forty L, Perry A, Lloyd J, et al. Gambling problems in bipolar disorder in the UK: Prevalence and distribution. British Journal of Psychiatry. 2015;207(4):328–333. Available from:
View Article
Google Scholar

[237] View Article

[238] Google Scholar

[ref67] 67. Alon N, Krivelevich M, Sudakov B. Finding a large hidden clique in a random graph. Random Structures & Algorithms. 1998;13(3-4):457–66. https://onlinelibrary.wiley.com/doi/abs/10.1002/%28SICI%291098-2418%28199810/12%2913%3A3/4%3C457%3A%3AAID-RSA14%3E3.0.CO%3B2-W.
View Article
Google Scholar

[240] View Article

[241] Google Scholar

[ref68] 68. Deshpande Y, Montanari A. Improved Sum-of-Squares Lower Bounds for Hidden Clique and Hidden Submatrix Problems; 2015.

[ref69] 69. McCarthy Sea. A reference panel of 64,976 haplotypes for genotype imputation. Nature genetics. 2016;48(10):1279–83. pmid:27548312
View Article
PubMed/NCBI
Google Scholar

[244] View Article

[245] PubMed/NCBI

[246] Google Scholar

[ref70] 70. Zhu Q, Wong AK, Krishnan A, Aure MR, Tadych A, Zhang R, et al. Targeted exploration and analysis of large cross-platform human transcriptomic compendia. Nature Methods. 2015;12(3):211–4. pmid:25581801
View Article
PubMed/NCBI
Google Scholar

[248] View Article

[249] PubMed/NCBI

[250] Google Scholar

[ref71] 71. Lee SH, Goddard ME, Wray NR, Visscher PM. A Better Coefficient of Determination for Genetic Profile Analysis. Genetic Epidemiology. 2012;36:214–24. pmid:22714935
View Article
PubMed/NCBI
Google Scholar

[252] View Article

[253] PubMed/NCBI

[254] Google Scholar

[ref72] 72. O’Connell KS, Koromina M, van der Veen T, Boltz T, David FS, Kay Yang JM, et al. Genomics yields biological and phenotypic insights into bipolar disorder. medRxiv. 2024. Available from: https://www.medrxiv.org/content/early/2024/08/28/2023.10.07.23296687.
View Article
Google Scholar

[256] View Article

[257] Google Scholar

[ref73] 73. Kranzler HR, Zhou H, Kember RL, Vickers Smith R, Justice AC, Damrauer S, et al. Genome-wide association study of alcohol consumption and use disorder in 274 424 individuals from multiple populations. Nature Communications. 2019;10:1499. pmid:30940813
View Article
PubMed/NCBI
Google Scholar

[259] View Article

[260] PubMed/NCBI

[261] Google Scholar

[ref74] 74. Jang SK, Saunders G, Liu M, Jiang Y, Liu DJ, Vrieze S. Genetic correlation, pleiotropy, and causal associations between substance use and psychiatric disorder. Psychological Medicine. 2020. Available from: pmid:32762793
View Article
PubMed/NCBI
Google Scholar

[263] View Article

[264] PubMed/NCBI

[265] Google Scholar

[ref75] 75. van Hulzen KJE, Scholz CJ, Franke B, Ripke S, Klein M, McQuillin A, et al. Genetic overlap between attention-deficit/hyperactivity disorder and bipolar disorder: Evidence from genome-wide association study meta-analysis. Biological Psychiatry. 2017;82:634–641. Available from: pmid:27890468
View Article
PubMed/NCBI
Google Scholar

[267] View Article

[268] PubMed/NCBI

[269] Google Scholar

[ref76] 76. O'Connell KS, McGregor NW, Lochner C, Emsley R, Warnich L. The genetic architecture of schizophrenia, bipolar disorder, obsessive-compulsive disorder and autism spectrum disorder. Molecular and Cellular Neurosciences. 2018;88:300–307. Available from: pmid:29505902
View Article
PubMed/NCBI
Google Scholar

[271] View Article

[272] PubMed/NCBI

[273] Google Scholar

[ref77] 77. Coleman JRI, Gaspar HA, Bryois J, Bipolar Disorder Working Group of the Psychiatric Genomics Consortium MDDWGotPGC, Breen G. The genetics of the mood disorder Spectrum: Genome-wide association analyses of more than 185 000 cases and 439 000 controls. Biological Psychiatry. 2020;88:169–184. Available from: pmid:31926635
View Article
PubMed/NCBI
Google Scholar

[275] View Article

[276] PubMed/NCBI

[277] Google Scholar

[ref78] 78. Fr#x00ED;as A, Baltasar I, Birmaher B. Comorbidity between bipolar disorder and borderline personality disorder: Prevalence, explanatory theories, and clinical impact. Journal of Affective Disorders. 2016;202:210–219. Available from:
View Article
Google Scholar

[279] View Article

[280] Google Scholar

[ref79] 79. Salloum IM, Brown ES. Management of comorbid bipolar disorder and substance use disorders. The American Journal of Drug and Alcohol Abuse. 2017;43:366–376. Available from: pmid:28301219
View Article
PubMed/NCBI
Google Scholar

[282] View Article

[283] PubMed/NCBI

[284] Google Scholar

[ref80] 80. Bortolato B, Berk M, Maes M, McIntyre RS, Carvalho AF. Fibromyalgia and bipolar disorder: Emerging epidemiological associations and shared pathophysiology. Current Molecular Medicine. 2016;16:119–136. Available from: pmid:26812920
View Article
PubMed/NCBI
Google Scholar

[286] View Article

[287] PubMed/NCBI

[288] Google Scholar

[ref81] 81. Correll CU, Solmi M, Veronese N, Bortolato B, Rosson S, Santonastaso P, et al. Prevalence, incidence and mortality from cardiovascular disease in patients with pooled and specific severe mental illness: A large-scale meta-analysis of 3 211 768 patients and 113 383 368 controls. World Psychiatry: Official Journal of the World Psychiatric Association. 2017;16:163–180. Available from: pmid:28498599
View Article
PubMed/NCBI
Google Scholar

[290] View Article

[291] PubMed/NCBI

[292] Google Scholar

[ref82] 82. Vancampfort D, Correll CU, Galling B, Probst M, De Hert M, Ward PB, et al. Diabetes mellitus in people with schizophrenia, bipolar disorder and major depressive disorder: A systematic review and large scale meta-analysis. World Psychiatry: Official Journal of the World Psychiatric Association. 2016;15:166–174. Available from: pmid:27265707
View Article
PubMed/NCBI
Google Scholar

[294] View Article

[295] PubMed/NCBI

[296] Google Scholar

[ref83] 83. Roshanaei-Moghaddam B, Katon W. Premature mortality from general medical illnesses among persons with bipolar disorder: A review. Psychiatric Services. 2009;60:147–156. Available from: pmid:19176408
View Article
PubMed/NCBI
Google Scholar

[298] View Article

[299] PubMed/NCBI

[300] Google Scholar

[ref84] 84. Kessing LV, Vradi E, McIntyre RS, Andersen PK. Causes of decreased life expectancy over the life span in bipolar disorder. Journal of Affective Disorders. 2015;180:142–147. Available from: pmid:25909752
View Article
PubMed/NCBI
Google Scholar

[302] View Article

[303] PubMed/NCBI

[304] Google Scholar

[ref85] 85. Diflorio A, Jones I. Is sex important? Gender differences in bipolar disorder. International Review of Psychiatry. 2010;22:437–452. Available from: pmid:21047158
View Article
PubMed/NCBI
Google Scholar

[306] View Article

[307] PubMed/NCBI

[308] Google Scholar

[ref86] 86. Nivoli AMA, Pacchiarotti I, Rosa AR, Popovic D, Murru A, Valenti M, et al. Gender differences in a cohort study of 604 bipolar patients: The role of predominant polarity. Journal of Affective Disorders. 2011;133:443–449. Available from: pmid:21620480
View Article
PubMed/NCBI
Google Scholar

[310] View Article

[311] PubMed/NCBI

[312] Google Scholar

[ref87] 87. Zimmerman M, Ruggero CJ, Chelminski I, Young D. Is bipolar disorder overdiagnosed? The Journal of Clinical Psychiatry. 2008;69:935–40. Available from: pmid:18466044
View Article
PubMed/NCBI
Google Scholar

[314] View Article

[315] PubMed/NCBI

[316] Google Scholar

[ref88] 88. Goes FS, Pirooznia M, Parla JS, Kramer M, Ghiban E, Mavruk S, et al. Exome sequencing of familial bipolar disorder. JAMA Psychiatry. 2016;73:590–597. Available from: pmid:27120077
View Article
PubMed/NCBI
Google Scholar

[318] View Article

[319] PubMed/NCBI

[320] Google Scholar

[ref89] 89. Maaser A, Forstner AJ, Strohmaier J, Hecker J, Ludwig KU, …, et al. Exome sequencing in large, multiplex bipolar disorder families from Cuba. PLoS ONE. 2018;13:e0205895. Available from: pmid:30379966
View Article
PubMed/NCBI
Google Scholar

[322] View Article

[323] PubMed/NCBI

[324] Google Scholar

[ref90] 90. Toma C, Shaw AD, Allcock RJN, Heath A, Pierce KD, Mitchell PB, et al. An examination of multiple classes of rare variants in extended families with bipolar disorder. Translational Psychiatry. 2018;8:65. Available from: pmid:29531218
View Article
PubMed/NCBI
Google Scholar

[326] View Article

[327] PubMed/NCBI

[328] Google Scholar

[ref91] 91. Goes FS, Pirooznia M, Tehan M, Zandi PP, McGrath J, Wolyniec P, et al. De novo variation in bipolar disorder. Molecular Psychiatry. 2019;387:1561. Available from: pmid:31776463
View Article
PubMed/NCBI
Google Scholar

[330] View Article

[331] PubMed/NCBI

[332] Google Scholar

[ref92] 92. Forstner AJ, Fischer SB, Schenk LM, Strohmaier J, Maaser-Hecker A, Reinbold CS, et al. Whole-exome sequencing of 81 individuals from 27 multiply affected bipolar disorder families. Translational Psychiatry. 2020;10:57. Available from: pmid:32066727
View Article
PubMed/NCBI
Google Scholar

[334] View Article

[335] PubMed/NCBI

[336] Google Scholar

[ref93] 93. Sul JH, Service SK, Huang AY, Ramensky V, Hwang SG, Teshiba TM, et al. Contribution of common and rare variants to bipolar disorder susceptibility in extended pedigrees from population isolates. Translational Psychiatry. 2020;10:74. Available from: pmid:32094344
View Article
PubMed/NCBI
Google Scholar

[338] View Article

[339] PubMed/NCBI

[340] Google Scholar

[ref94] 94. Akinhanmi MO, Biernacka JM, Strakowski SM, McElroy SL, Balls Berry JE, Merikangas KR, et al. Racial disparities in bipolar disorder treatment and research: A call to action. Bipolar Disorders. 2018;20:506–514. Available from: pmid:29527766
View Article
PubMed/NCBI
Google Scholar

[342] View Article

[343] PubMed/NCBI

[344] Google Scholar

[ref95] 95. Martin AR, Kanai M, Kamatani Y, Okada Y, Neale BM, Daly MJ. Clinical use of current polygenic risk scores may exacerbate health disparities. Nature Genetics. 2019;51:584–591. Available from: pmid:30926966
View Article
PubMed/NCBI
Google Scholar

[346] View Article

[347] PubMed/NCBI

[348] Google Scholar

[ref96] 96. Peterson RE, Kuchenbaecker K, Walters RK, Chen CY, Popejoy AB, Periyasamy S, et al. Genome-wide association studies in ancestrally diverse populations: Opportunities, methods, pitfalls, and recommendations. Cell. 2019;179:589–603. Available from: pmid:31607513
View Article
PubMed/NCBI
Google Scholar

[350] View Article

[351] PubMed/NCBI

[352] Google Scholar

[ref97] 97. Sirugo G, Williams SM, Tishkoff SA. The missing diversity in human genetic studies. Cell. 2019;177:1080. Available from: pmid:31051100
View Article
PubMed/NCBI
Google Scholar

[354] View Article

[355] PubMed/NCBI

[356] Google Scholar

[ref98] 98. Duncan L, Shen H, Gelaye B, Meijsen J, Ressler K, Feldman M, et al. Analysis of polygenic risk score usage and performance in diverse human populations. Nature Communications 10(1). 2019. Available from: pmid:31346163
View Article
PubMed/NCBI
Google Scholar

[358] View Article

[359] PubMed/NCBI

[360] Google Scholar

Figures

Abstract

Background

Results

Conclusions

Background

Overview

Contribution

Methods

Data

Ethics statement.

Correcting for ancestry

Biclustering

Replication

Polygenic-Risk-Scores (PRSs)

Gene-enrichment analysis

Results

Discussion

Interaction with covariates

Interaction with BD subtype

Bicluster-informed PRS performance

Gene-enrichment

Secondary bicluster

Control biclusters

Conclusion

Supporting information

S1 Text. Contains a detailed description of our methods, including an outline of the steps involved and the considerations we made along the way.

Acknowledgments

Bipolar Disorder Working Group of the Psychiatric Genomics Consortium

References