A knowledge-based multivariate statistical method for examining gene-brain-behavioral/cognitive relationships: Imaging genetics generalized structured component analysis

Heungsun Hwang; Gyeongcheol Cho; Min Jin Jin; Ji Hoon Ryoo; Younyoung Choi; Seung Hwan Lee

doi:10.1371/journal.pone.0247592

Abstract

With advances in neuroimaging and genetics, imaging genetics is a naturally emerging field that combines genetic and neuroimaging data with behavioral or cognitive outcomes to examine genetic influence on altered brain functions associated with behavioral or cognitive variation. We propose a statistical approach, termed imaging genetics generalized structured component analysis (IG-GSCA), which allows researchers to investigate such gene-brain-behavior/cognitive associations, taking into account well-documented biological characteristics (e.g., genetic pathways, gene-environment interactions, etc.) and methodological complexities (e.g., multicollinearity) in imaging genetic studies. We begin by describing the conceptual and technical underpinnings of IG-GSCA. We then apply the approach for investigating how nine depression-related genes and their interactions with an environmental variable (experience of potentially traumatic events) influence the thickness variations of 53 brain regions, which in turn affect depression severity in a sample of Korean participants. Our analysis shows that a dopamine receptor gene and an interaction between a serotonin transporter gene and the environment variable have statistically significant effects on a few brain regions’ variations that have statistically significant negative impacts on depression severity. These relationships are largely supported by previous studies. We also conduct a simulation study to safeguard whether IG-GSCA can recover parameters as expected in a similar situation.

Citation: Hwang H, Cho G, Jin MJ, Ryoo JH, Choi Y, Lee SH (2021) A knowledge-based multivariate statistical method for examining gene-brain-behavioral/cognitive relationships: Imaging genetics generalized structured component analysis. PLoS ONE 16(3): e0247592. https://doi.org/10.1371/journal.pone.0247592

Editor: Giuseppe Biagini, University of Modena and Reggio Emilia, ITALY

Received: November 13, 2020; Accepted: February 10, 2021; Published: March 10, 2021

Copyright: © 2021 Hwang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Since our data contain potentially sensitive personal information, including brain structures and genetic information, it is forbidden to share these data with a third party without obtaining additional written form of informed consent for information sharing, according to the bioethics law and personal information protection act in South Korea. We did not obtain the additional written consent for information sharing and sharing the data would violate the law and the ethical policy. South Korea’s Ministry of Justice imposes the ethical and legal restrictions on using, opening, and transferring of personal information, even though the data are de-identified. You may contact the Ministry of Justice, South Korea, for data requests: Ministry of Justice, Building #1, Government Complex-Gwacheon, 47, Gwanmun-ro, Gwacheon-si, Gyeonggi-do, Republic of Korea, 13809. Tel: +82-2-2110-3000. Web: https://www.moj.go.kr/moj_eng/1772/subview.do.

Funding: This work was partially supported by the Ministry of Education and the National Research Foundation of Korea (NRF-2019S1A5A2A03052192) to Heungsun Hwang and Younyoung Choi, and by the Brain Research Program through the National Research Foundation of Korea from the Ministry of Science, ICT & Future Planning (NRF-2015M3C7A1028252) and the Korea Medical Device Development Fund grant funded by the Korean government (the Ministry of Science and ICT, the Ministry of Trade, Industry and Energy, the Ministry of Health & Welfare, Republic of Korea, the Ministry of Food and Drug Safety) (Project Number: 202013B10) to Seung Hwan Lee.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Imaging genetics is a rapidly emerging field that integrates genetic and neuroimaging data with behavioral or cognitive outcomes to examine genetic influence on the variation of brain function, which is in turn associated with behavioral or cognitive variation [1]. This field has made remarkable progress in recent years [2], showing its potential for studying disease- or task-specific “gene-brain-behavior/cognition (G-B-B/C)” relationships. For example, variation in the Apolipoprotein E (APOE) gene was associated with altered activity in brain regions, such as the hippocampus, parietal and prefrontal cortex, during a memory task [3]. A functional variation in the catechol-O-methyltransferase (COMT) gene was related to differential brain activity in the dorsolateral prefrontal cortex, anterior cingulate cortex, and parietal cortex during cognitive control tasks [4].

Imaging genetic studies have increasingly involved a number of genotypes, such as single nucleotide polymorphisms (SNPs), and a number of brain-based phenotypes, such as voxel-level variations (shortly voxels hereafter), in accordance with accumulated evidence that multiple genotypes can be associated with a single phenotype and a single genotype can be associated with multiple phenotypes [5]. Thus, multivariate techniques have been main statistical tools for imaging genetic studies [6, 7], including canonical correlation analysis [8], partial least squares [9], reduced-rank regression [10], and independent component analysis [11, 12]. These techniques generally aim to obtain (low-dimensional) linear combinations or components of genetic and imaging data and examine the associations between the resultant genetic and imaging components.

Despite their usefulness, the scope and flexibility of the conventional multivariate techniques are limited in several ways. First, they do not explicitly account for various well-documented biological characteristics, such as genetic and molecular pathways (e.g., which SNPs occur in which genes), while extracting genetic and imaging components. As a result, the extracted components are often difficult to interpret, lacking direct biological meaning [6]. Second, they largely remain descriptive in nature, focusing on how genetic and imaging components are correlated to each other. This makes it difficult to statistically examine the influence of a component on another (e.g., which genetic components have effects on which imaging components and how the effects look like). Third, although in principle it is possible to extend the multivariate techniques to the analysis of more than two datasets, they have typically been applied to genetic and imaging datasets only to extract their components and associate them. Subsequently, the extracted genetic and/or imaging components are used to predict behavioral or cognitive outcomes through the adoption of regression analysis or machine learning algorithms [13]. This sequential or two-step approach does not guarantee that the extracted genetic and imaging components are optimal for predicting behavioral or cognitive phenotypes because they are obtained separately without considering prediction of such phenotypes. It would be more desirable to develop a unified framework for incorporating all genetic, imaging, and behavioral/cognitive phenotypes into analyses simultaneously, so that genetic and imaging components are extracted to be highly associated with each other as well as to predict behavioral/cognitive phenotypes well. To address these limitations, it would be necessary to develop a unified and path-analytic approach for specifying and testing more biologically plausible G-B-B/C relationships.

In this paper, we propose a general multivariate approach, termed imaging genetics generalized structured component analysis (IG-GSCA), for such unified analyses of path-analytic relationships among all the three sources of data (genetic, imaging, and behavioral/cognitive) in a more biologically meaningful manner. As will be discussed in more detail in Section 2, IG-GSCA allows researchers to specify and examine various biologically plausible G-B-B/C relationships based on knowledge accumulated from previous studies, for example, in genome-wide whole brain association [14] and connectivity analysis [15, 16].

As the name denotes, this approach is methodologically built on generalized structured component analysis (GSCA) [17, 18] that is a multivariate method for modeling and testing path-analytic relationships between observed variables and components thereof based on prior knowledge or theory. GSCA is well-suited to our data analytic purposes for several reasons. First, in imaging genetics, genetic and brain phenotypes, such as SNPs and voxels, represent observed measurements at specific locations in the genome and brain, indicating that a set of SNPs or voxels constitutes a gene or brain region. That is, in a statistical model, a gene or brain region may be regarded as a component of SNPs and voxels [19–21]. GSCA can be used to obtain such genetic and imaging components based on prior biological knowledge (e.g., which SNPs occur in which genes, or which voxels form which brain regions). Also, GSCA provides the unique individual scores of genetic and imaging components, which may represent gene- or brain-level scores of individuals associated with a specific behavioral or cognitive outcome. The provision of these individual scores may have important empirical implications. For example, clinicians may use the scores as proxies for individual gene- or brain-level vulnerabilities associated with risk for chronic diseases. Moreover, GSCA is less likely to suffer from non-convergence in small samples, complex models, and/or in the presence of multicollinearity [18], which are not uncommon in imaging genetic studies [22]. In addition, recent extensions of GSCA can be very useful for imaging genetic studies. For example, imaging genetic studies can often involve the specification of interaction terms (e.g., gene-gene interactions, gene-environment interactions, etc.). GSCA has been extended to incorporate various component interaction terms effectively [23]. Moreover, specifying a number of genes and/or brain regions as well as their potential interactions simultaneously is likely to lead to the issue of multicollinearity. GSCA has been combined with regularization to address the multicollinearity issue [18, 20, 24].

GSCA has been applied to examine the directional relationships between SNPs, genes, and behavioral phenotypes [20, 21]. It also has been used for examining directional relationships among brain regions [19, 25]. However, GSCA has never been employed for connecting genetic, imaging, and behavioral/cognitive phenotypes simultaneously. Furthermore, we integrate the aforementioned extensions of GSCA, such as testing interaction effects and regularization, for more efficient model specification and testing of the directional associations among the three data sources. Thus, IG-GSCA is a GSCA method tailored for the path-analytic analysis of imaging genetic data in a unified manner.

Owing to its generality and flexibility, structural equation modeling (SEM) [26, 27] can also be considered for such knowledge-based path-analytic analyses of imaging genetic data. Nonetheless, SEM may be less suitable for these analyses than IG-GSCA for several reasons. Most notably, SEM will specify a gene or brain region as a (common) factor that explains the covariation of SNPs or voxels only [28, 29], under the assumption that a gene or brain region exists independently of SNPs or voxels [30]. This indicates that assigning different SNPs or voxels to a gene or brain region should not change the gene’s or brain region’s meaning [31], which does not appear biologically plausible. As stated earlier, instead, it seems more reasonable to specify a gene or brain region as a weighted composite or biological cluster of SNPs or voxels, as postulated in IG-GSCA. However, this way of specifying a gene or brain region is not compatible with SEM, leading to identification problems in general [32]. Moreover, SEM cannot provide unique individual gene- or brain-level scores because of the factor score indeterminacy problem [33, 34]. Furthermore, it suffers from non-convergence particularly in small samples, complex models, and/or in the presence of multicollinearity [35, 36].

A few studies used SEM for associating genetic and imaging data with behavioral or cognitive variables [28, 37]. However, they applied a series of SEM or SEM and other statistical methods (e.g., regression) sequentially to examine the associations among these data, regardless of whether SEM is a suitable method for the studies. Conversely, as noted earlier, IG-GSCA is a unified statistical framework for researchers to be able to simultaneously associate genetic, imaging, and behavioral/cognitive data in a biologically plausible fashion.

The remainder of the paper proceeds as follows. We begin by discussing both conceptual and technical underpinnings of IG-GSCA, including its model specification and parameter estimation. We then apply IG-GSCA to real imaging genetics data collected from a sample of Korean participants in order to investigate the effects of gene-level variations on the thickness differences of brain regions, which in turn influence depression severity. We also conduct a simulation study to safeguard whether IG-GSCA performs as expected. We consider a model similar to the one specified in the real data analysis and examine IG-GSCA’s parameter recovery under different sample sizes. Lastly, we summarize the implications of IG-GSCA and discuss directions for future research.

Method

Model specification

It is crucial to build a model bridging all three main constituents of imaging genetics (i.e., genetic, brain, and behavioral/cognitive phenotypes) in a biologically plausible manner based on knowledge accumulated from previous literature or researchers’ hypotheses. In model specification, we should begin with the fundamental premise of imaging genetics that brain-based phenotypes serve as intermediate phenotypes between genotypes and behavioral/cognitive phenotypes, indicating that the influence of genetic variation on behaviour/cognition is mediated through brain phenotypes (i.e., indirect effects of genetic variation). Genome-wide whole brain association studies can be used to obtain information about genotypes that are associated with brain phenotypes relevant to specific behavioral or cognitive variation.

We can also consider various characteristics of each constituent. For example, it may be reasonable to assume from genetic studies that several SNPs often occur in a gene, rather than a single SNP per gene. A substantial amount of information on genetic pathways has already been gathered for researchers to specify which SNPs are linked to which genes in different diseases [14]. As discussed earlier, SNPs can be considered observed variables, whereas a gene can be a weighted sum of SNPs (i.e., a component). It is also known that multiple genes can be associated with a single phenotype (polygenicity); a single gene can be involved in multiple phenotypes (pleiotropy); and the effect of one gene can be modified by another gene, indicating gene-gene interactions (epistasis) [38].

In imaging studies, it is well recognized that a particular behavioral or cognitive task is associated with neural networks of multiple brain regions, rather than isolated brain regions [15]. Connectivity analysis can describe relationships between brain regions within a network [39]. There are two different approaches to connectivity analysis–functional vs. effective connectivity [16]. Functional connectivity analysis generally focuses on an inter-correlational pattern or inter-regional coupling between brain regions (e.g., activities in brain region A correlate with those in brain region B). It can offer insight into correlations between different brain regions but is limited in that it does not account for directionality between interacting regions. Conversely, effective connectivity analysis focuses on directional relationships between brain regions selected based on a hypothesis or prior knowledge about their importance in completing a task (e.g., activities in region A exert influence on those in region B). This approach can be used to better explain functional integration within a distributed neural system, allowing quantifications and stronger inferences of directed connections of different brain region activities [15]. Thus, it may be more desirable to explicitly incorporate directional neural network information given by previous effective connectivity studies. Furthermore, we can include more than one behavioral/cognitive phenotype at the same time to consider their potential correlations as well as the effects of genotypes and/or brain-based phenotypes on multiple behavioral/cognitive phenotypes.

In IG-GSCA, we incorporate such theoretical considerations into sets of mathematical equations, also called sub-models. Specifically, as in GSCA, it will involve three sub-models–weighted relation, measurement, and structural. The weighed relation model is used to explicitly define a component as a weighted sum of observed variables. The measurement model specifies the relationships between observed and components, whereas the structural model is to express the relationships between components.

For simplicity, hereafter, let us assume that all genetic observed variables indicate SNPs and imaging observed variables are called voxels, whereas all genetic and imaging components are genes and brain regions, respectively. Let z denote a J by 1 vector of all observed variables, including SNPs, voxels, and behavioral/cognitive phenotypes. Let γ denote a P by 1 vector of all components, including genes, brain regions, and behavioral/cognitive traits. We assume that all observed variables and components are standardized to have zero means and unit variances.

The weighted relation model is generally written as follows. (1) where W is a P by J matrix of weights assigned to J observed variables. This sub-model is unique to IG-GSCA (or GSCA), which is distinct from (factor-based) SEM.

The measurement model is generally written as (2) where C is a J by P matrix of loadings relating P components to J observed variables, and ε is a J by 1 vector of the residuals of all observed variables left unexplained by their components. The structural model is generally expressed as (3) where B is a P by P matrix of path coefficients relating P components among themselves, and ζ is a P by 1 vector of the residuals of all components left unexplained by their independent components. The combination of (1) and (2) can be seen as the constrained principal component analysis model [40] in that components of observed variables in (1) are obtained in such a way that they explain the maximum variances of the observed variables, signified by loadings in (2), as well as some elements of the W and C matrices are typically constrained to fixed values (e.g., zero) based on prior knowledge, as illustrated below.

To exemplify these sub-models, we contemplate a prototype model depicted in Fig 1. In the figure, a box indicates an observed variable and a hexagon represents a component. An arrow signifies that the variable at the base of an arrow affects the variable at the head of the arrow, whereas a straight line indicates a weight assigned to each observed variable. This model contains two genes (γ₁ and γ₂) and two brain regions (γ₃ and γ₄), each of which is a weighed sum of two observed variables (SNPs or voxels). It includes one observed behavioral outcome. The model shows that the two genes affect the two brain regions, one brain region influences the other brain region, and both brain regions influence the behavioral outcome.

Download:

Fig 1. A path diagram of a prototype IG-GSCA model.

https://doi.org/10.1371/journal.pone.0247592.g001

The weighted relation model for the prototype model can be expressed as (4)

The measurement model for the prototype model can be written as follows (5)

Finally, the structural model for the prototype can be expressed as (6)

This sub-model contains a series of regression models for all dependent components.

IG-GSCA combines the three sub-models into a single model, as follows. (7) where I is an identity matrix, V = , and A = , and e = . We call (7) the IG-GSCA model, which enables to accommodate a variety of hypothesized G-B-B/C relationships.

In the prototype model, for simplicity, we consider only main effects of each component. However, we can also consider interaction effects of components, for example, gene-gene or gene-environment interactions. For example, let γ₁₂ denote a gene-gene interaction term that is defined as the product of the two genes (i.e., γ₁₂ = γ₁γ₂). Let γ* = [γ; γ₁₂] denote a vector consisting of all components and the component interaction term. Then, the weighted relation model is given as (8) where W* = , and z* = . The measurement model is generally given as (9) where C* = [C, 0]. The structural model is generally expressed as follows. (10) where B* consists of additional path coefficients relating γ₁₂ to other variables. The model (7) can easily accommodate component interaction terms because the above sub-models are essentially of the same form as (1), (2), and (3).

Parameter estimation

The unknown parameters of IG-GSCA include weights in W, loadings in C, and path coefficients in B. As illustrated in the previous section, the W, C and B matrices include fixed values (e.g., zeros) to express hypothesized relationships between variables, making it difficult to estimate the parameters in closed form. Instead, they are to be estimated iteratively. Moreover, components (e.g., genes and brain regions) and their interaction terms tend to be highly correlated to one another, leading to multicollinearity.

Let z_i denote a vector of indicators measured on a single observation of a sample of N observations (i = 1, …, N). To estimate the parameters, we aim to minimize the following penalized least squares criterion (11) subject to , where 1 is a vector of ones of appropriate order, and λ is a non-negative tuning parameter for path coefficients. In (11), for any matrix X, | X | denotes the absolute value of X. When τ = 2, 1'|B^τ|1 become the ridge or L₂ penalty [41], whereas when τ = 1, it is equivalent to the lasso or L₁ penalty [42]. Ridge or L₂ regularization has been widely used to deal with multicollinearity, whereas lasso or L₁ regularization is used for variable selection [43]. We are typically interested in dealing with multicollinearity, while keeping our model specification intact. It is known that within a certain range of the tuning parameter, the ridge estimator always exhibits a smaller mean square error than the ordinary least squares estimator [41]. This tendency becomes salient in the presence of multicollinearity [44]. Nonetheless, if variable section is of concern, lasso regularization can be adopted to select subsets of components, facilitating the parsimony and interpretability of the model.

We apply an alternating regularized least squares algorithm [24] to minimize this criterion. This algorithm will repeat three steps until convergence. In each step, one set of the parameters will be updated with the other sets fixed. If an interaction term of components is included, the algorithm estimates weights, considering that the component interaction term shares the same weights as those for its interacting components because it is the product of these components, each of which is a weighted sum of observed variables [23]. For instance, if γ₁₂ is a gene-gene interaction between γ₁ and γ₂ in the prototype model, i.e., γ₁₂ = γ₁γ₂ = (z₁w₁ + z₂w₂)(z₃w₃ + z₄w₄), then γ₁₂ shares w₁ and w₂ with γ₁, and w₃ and w₄ with γ₂.

We employ K-fold cross validation [45] to determine the value of λ in an automatic manner. We use the bootstrap method [46] to estimate the standard errors or confidence intervals of the parameter estimates without resorting to a distributional assumption. The standard errors or confidence intervals can be used for testing the statistical significance of the parameter estimates. Upon convergence, IG-GSCA provides unique individual component scores as shown in (1).

Example: Gene-brain-depression data

Data overview

Participants.

In a sample of 231 Korean participants, healthy volunteers were 137 (59.3%), who were recruited from community advertisements, whereas post-traumatic stress disorder (PTSD) patients were 94 (40.7%), who were recruited from notices on the bulletin board in a university hospital in a suburban area of Seoul, South Korea. The PTSD patients were diagnosed based on the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) by a psychiatrist, and healthy participants were also evaluated using the DSM-5 by a psychiatrist. Participants were excluded if they were pregnant, intellectually disabled, drug-abusing, taking medications with potentially psychoactive effects, or at high risk for suicide. The sample consisted of 75 (32.5%) men and 156 (67.5%) women with a mean age of 46.10 years (SD = 13.49). All the participants signed a written form of informed consent, approved by the Institutional Review Board at Inje University Ilsan Paik Hospital prior to the start of the research (IRB no. 2015-07-025).

Measures.

Psychiatric and behavioral measures. To measure the severity of depression as the outcome variable, the Korean translation of the Hospital Anxiety Depression Scale (HADS) [47] was administered. The HADS is a self-report rating scale and comprised of a set of seven questions for anxiety (HADS-A) and a set of seven questions for depression (HADS-D). The total sum score of the seven items in the HADS-D was used in the study.

To measure the exposure to traumatic events as an independent variable, the Korean validated version of Life Events Checklist (LEC) was used to assess the experience of potentially traumatic events (PTEs) [48]. The LEC comprised of 17 items of PTEs concerning experiencing, witnessing, and learning about PTEs. We used the items of PTE experience since other responses could be confusing to some respondents.

To control for the effect of alcohol-related problems as a covariate, the Alcohol Use Disorders Identification Test (AUDIT) was used to assess alcohol consumption, drinking behaviours, and alcohol-related problems. The AUDIT is a 10-item screening tool developed by the World Health Organization (WHO), and well-validated in Korea [49]. The AUDIT is assessed with a 5-point Likert scale ranging from 0 (“never”) to 4 (“4 or more times a week”). Table 1 provides a summary of demographic, psychological, and behavioral characteristics of the participants.

Download:

Table 1. A summary of participants’ demographic, psychological, and behavioral characteristics.

https://doi.org/10.1371/journal.pone.0247592.t001

DNA genotyping. All participants had their blood sampled to extract DNA using NanoDrop® ND-1000 UV-Vis Spectrophotometer. Then, genomic DNA were diluted to 10 ng/㎕ concentration at 96 well PCR plates. TaqMan SNP Genotyping Assays were obtained from Applied Biosystems (Waltham, MA). The probes were labeled with FAM or VIC dye at the 5’ end and a minor-groove binder and non-fluorescent quencher at the 3’ end. 2 μL of DNA was added to each 5 μL PCR reaction at 384 well reaction plates. SNP genotyping reactions were performed on ABI PRISM 7900HT Real-time PCR system. After the PCR amplification, allelic discrimination is performed at the same machines (ABI 7900HT). The allelic discrimination is an end point plate read. The SDS v2.4 software calculates the fluorescence measurements made during the plate read and plots Rn values based on the signals from each well. A total of 18 SNPs from 9 different DNAs were obtained for the study. For all SNPs, the wild, hetero, and mutant genotypes were coded as 1, 2, and 3, respectively. Nine genes selected based on their relations with depression include: SLC6A4 [50], FKBP5 [51], ADCYAP1R1 [52], BDNF [53], COMT [54], HTR3A [55], DRD2 [56], NR3C1 [57], and OXTR [58]. Table 2 exhibits the names and frequencies of all the genes considered in the study.

Download:

Table 2. List of genes and SNPs included in the gene-brain-depression data.

https://doi.org/10.1371/journal.pone.0247592.t002

MRI acquisition and processing and voxel-based morphometry. MRI was performed using a 1.5 T scanner (Magneton Avanto, Siemens, Erlangen, Germany). Head motion was minimized with restraining foam pads provided by the manufacturer. High-resolution T1-weighted MRI images were acquired with the acquisition parameters of a 227 × 384 acquisition matrix, a 210 × 250 field-of-view, 0.9 × 0.7 × 1.2 voxel size, a total of 87,168 voxels, a TE of 3.42 ms, a TR of 1,900 ms, 1.2 mm slice thickness, and a flip angle of 15°.

All images were inspected visually for motion or other artifacts before and after preprocessing. The voxel-based volumetry (VBM) was conducted using CAT12 (http://dbm.neuro.uni-jena.de/cat/) implemented in SPM12 (Wellcome Department of Cognitive Neurology, London, UK). SPM12 tissue probability maps were used for the initial spatial registration. The structural T1 images were regularized with an ICBM East Asian template and normalized using the DARTEL algorithm [59]. The images were then segmented into gray matter, white matter, and cerebrospinal fluid [60]. Jacobian-transformed tissue probability maps were used to modulate images. The projection-based thickness method was applied to the SPM analysis to estimate the cortical thickness for the left and right hemispheres [61]. The cortical thickness was extracted using the Destrieux atlas, which is the default FreeSurfer atlas. The Destrieux atlas consists of 74 cortical areas in the left and right hemispheres, including both gyri and sulci. Segmentation was automatically conducted using probabilistic methods [62]. A total of 53 regions of interest (ROIs), which mostly represent the thickness of cortical gyri and limbic sulci, were selected from the atlas for the study. Table 3 shows the name, mean, and standard deviation of each ROI.

Download:

Table 3. List of regions of interest (ROIs) included in the gene-brain-depression data.

https://doi.org/10.1371/journal.pone.0247592.t003

Model specification

Depression symptoms are known to be linked to altered brain structures [63], which may be influenced by the number of exposure to stressful or traumatic life events [64, 65], genetic polymorphism [66], and the interaction of both—gene-environment interactions [67–69]. Also, these relations may be affected by covariates such as age [70, 71], sex [72, 73], and alcohol-related problems [74, 75]. Accordingly, we hypothesized that the PTE, genetic polymorphism, and their interactions directly influenced the cortical thickness of the ROIs, which in turn had direct effects on depression severity, while controlling for the effects of age, sex, and AUDIT on both cortical thickness and depression severity. We also assumed that the PTE influenced depression severity directly. Fig 2 displays the hypothesized structural model. As shown in the figure, the model consisted of nine genetic components (i.e., genes) and 53 imaging components (i.e., ROIs). Each gene was associated with its observed variables (i.e., SNPs). The number of SNPs per gene ranged from one to nine. Each ROI was associated with two observed variables that denoted its left and right sides of the brain. Nine gene-environment interactions between the genes and PTE were considered that also influenced the ROIs.

Download:

Fig 2. The structural model specified for the gene-brain-depression data.

All weights and residual terms are omitted to make the figure concise.

https://doi.org/10.1371/journal.pone.0247592.g002

Results

We applied IG-GSCA to fit the specified model to the data. We chose λ = 136 based on five-fold cross validation. We used 4000 bootstrap samples to estimate the standard errors and 95% confidence intervals of the parameter estimates. As shown in Table 4, all weight estimates were statistically significant, suggesting that all observed variables contributed to forming their corresponding components. In addition, all the loading estimates were statistically significant and large in magnitude (> .75). This indicates that all components were obtained to explain the variances of their observed variables well.

Download:

Table 4. The estimates of weights and loadings and their standard errors and 95% confidence intervals.

https://doi.org/10.1371/journal.pone.0247592.t004

The specified model included a total of 1,246 path coefficients. One hundred eighty-four of their estimates turned out to be statistically significant. To conserve space, we focus here on reporting and interpreting statistically significant path coefficient estimates that constituted the hypothesized G-B-B/C pathways linking the genes, ROIs, and depression severity, presented in Table 5. The full results of all the path coefficient estimates can be found in S1 Table.

Download:

Table 5. Statistically significant estimates of the path coefficients that constitute the linkages from genes to ROIs to depression severity, and their standard errors and 95% confidence intervals.

https://doi.org/10.1371/journal.pone.0247592.t005

In the specified gene and brain function relationships, the dopamine receptor D2 (DRD2) gene had positive influences on the middle-posterior part of the cingulate gyrus and sulcus (pMCC) (b = .07, SE = .03, 95% CI = [.01, .14]) and the triangular part of the inferior frontal gyrus (b = .11, SE = .03, 95% CI = [.04, .17]), suggesting that people with the mutant allele in DRD2 are likely to have a thicker triangular part of the inferior frontal gyrus and pMCC. This finding is consistent with previous research that people with the wild genotype of the DRD2 gene (GG) had reduced activity in the inferior frontal gyrus [76] and reduced connectivity in pMCC [77] relative to people with the hetero or mutant genotype. In the specified brain function and depression relationships, three ROIs, such as the triangular part of the inferior frontal gyrus (b = -.07, SE = .03, 95% CI = [-.12, -.00]), the anterior circular sulcus of the insula (b = -.11, SE = .03, 95% CI = [-.17, -.05]), and pMCC (b = -.08, SE = .03, 95% CI = [-.14, -.01]), turned out to be negatively associated with depression severity, indicating that the thinner these ROIs are, the higher level of depression on average. This is consistent with previous findings that revealed a significantly thinner or smaller triangular part of the inferior frontal gyrus [78], anterior insula [79], and pMCC [80] in depressive people than those in healthy people. Moreover, PTE had a negative effect on the triangular part of the inferior frontal gyrus (b = -.08, SE = .03, 95% CI = [-.14, -.01]), suggesting that traumatic stressful events may diminish the thickness of this part. This finding is supported by previous research that traumatic experiences were associated with a thinner or smaller triangular part of the inferior frontal gyrus [81, 82]. Lastly, PTE had a positive impact on depression severity (b = .14, SE = .03, 95% CI = [.07, .20]). This also indicates that the effect of PTE on depression severity was not fully mediated by the ROIs. The association between stressful or traumatic experiences and depression has been found in numerous studies [83–86].

The gene-environment interaction between PTE and the serotonin transporter gene (SLC6A4) had a significant effect on the triangular part of the inferior frontal gyrus (b = .06, SE = .03, 95% CI = [.00, .02]). We additionally investigated the conditional effects of PTE on the triangular part of the inferior frontal gyrus at different levels of the serotonin transporter gene. Specifically, the conditional effects were tested when rs25531 was AA (the wild genotype) and AG (the hetero genotype), since SLC6A4 had only one observed variable rs25531, whose values were AA and AG in the data. It turned out that PTE had a negative effect on the triangular part of the inferior frontal gyrus only when rs25531 was AA (b = -.11, SE = .04, 95% CI = [-.18, -.03]). Although there are few studies regarding the interaction between rs25531 and an environment on brain structures, many other studies revealed that the wild allele (A) of serotonin transporter genes could be considered a risk allele and be associated with thinner or smaller brain [87, 88].

In addition, DRD2 had an indirect effect on depression severity mediated through the triangular part of the inferior frontal gyrus (indirect effect = -.01, SE = .00, 95% CI = [-.02, -.00]). This indicates that mutation of the DRD2 gene may render the person less susceptible to depression through thickening the triangular part of the inferior frontal gyrus. This finding supports gene–brain–behaviour relationships of dopamine genes, which are known to be associated with neural changes in reward-related regions, which could play an essential role in the pathogenesis of depression [89]. Lastly, Table 6 shows how much variance of each dependent component is explained by its independent variables (average R² = .20).

Download:

Table 6. R² for all dependent variables that include ROIs and depression severity.

https://doi.org/10.1371/journal.pone.0247592.t006

Simulation study

We conducted a simulation study to examine whether IG-GSCA could perform as expected, particularly in terms of parameter recovery. In this study, we contemplated a model that was quite similar to, yet on a slightly larger scale, the one specified in the real data analysis. As displayed in Fig 3, the model included nine genes, which were associated with one, two, or four SNPs, and sixty brain ROIs, each of which was linked to two brain-level observed variables. It also included an independent observed variable representing an environmental variable. The genes and environmental variable were specified to affect the 60 ROIs, which in turn were to influence an outcome variable that represents a behavioral or cognitive variable of interest. The environmental variable also had a direct effect on the outcome variable. The model further included the interaction term of each gene and the environmental variable (i.e., a total of nine gene-environment interaction terms), which influenced each ROI. In the model, a zero path coefficient is denoted by a dashed arrow.

Download:

Fig 3. The structural model specified for the simulations study.

All weights and residual terms are omitted. A non-zero path coefficient is denoted by an arrow, whereas a zero path coefficient is by a dashed arrow.

https://doi.org/10.1371/journal.pone.0247592.g003

We considered four levels of sample size (N = 250, 500, 1000, and 2000), for each of which we drew 1000 samples randomly based on a data generating procedure, whose detailed description is provided in S1 Appendix. As in the real data analysis, we applied ridge-type regularization based on five-fold cross validation.

As parameter recovery measures, we calculated finite-sample properties, such as bias, standard deviation (SD), and root mean square error (RMSE), of the IG-GSCA parameter estimates. To conserve space, we focus here on reporting the average values of these properties for loading and path coefficient estimates per sample size. All the properties of individual parameter estimates are provided in S2 Table.

Table 7 presents the average bias, SD, and RMSE values of loading and path coefficient estimates per sample size. On average, the biases of the loading estimates for both sets of components (i.e., genes and ROIs) were virtually zero across all sample sizes, and their SD and RMSE values deceased and became close to zero as the sample size increased. On the other hand, in general, IG-GSCA’s estimates for non-zero path coefficients seemed to be slightly biased in smaller samples. This seems to be due to the adoption of ridge-type regularization, which tends to yield biased estimates particularly in small samples [44]. Nonetheless, their average bias decreased with the sample size and became close to zero when N = 2000. This tendency is also expected because multicollinearity can be of less concern in large samples. The average SD and RMSE values of the path coefficient estimates also decreased when the sample size increased. IG-GSCA’s estimates for zero path coefficients were unbiased regardless of the sample size and their SD and RMSE deceased when the sample size increased.

Download:

Table 7. Average biases, Standard Deviations (SD), and Root Mean Square Errors (RMSE) of loadings and path coefficients estimated from IG-GSCA over different sample sizes in the simulation study.

https://doi.org/10.1371/journal.pone.0247592.t007

Conclusions

We proposed a flexible statistical approach, named IG-GSCA, for examining the associations among genetic, imaging and behavioral/cognitive data in a unified manner. As demonstrated in Section 3, IG-GSCA was able to specify complex directional relationships among genes, ROIs, and depression severity in a more biologically plausible way based on previous knowledge, and to identify the influences of a gene (DRD2) and a gene-environment interaction (PTE x SLC6A4) on several brain regions, which in turn affected depression severity. In addition, our simulation study showed that IG-GSCA performed as expected in terms of parameter recovery under a model similar to the one specified for the real data analysis.

IG-GSCA can be a useful tool for researchers in imaging genetics to study the neurobiological basis of individual behavioral or cognitive differences, addressing various issues inherent to current multivariate methodologies (e.g., less biologically interpretable, descriptive, or sequential). It also has the potential to inform clinicians about specific genetic or brain-level vulnerabilities associated with risk for chronic diseases later in life, which are proxied by individuals’ genetic or imaging component scores in disease-specific imaging genetics models.

Despite its technical and empirical implications, IG-GSCA can be refined and extended in many ways to further enhance its generality and flexibility. For example, genetic and imaging data can often be hierarchically structured such that their individual-level cases are grouped within higher-level units. For example, individuals’ genetic variation and brain activity can be measured across different experimental groups or time points. In such hierarchical/multilevel data, the individual-level measures nested within the same higher-level unit are likely to be more similar than those in different units, thus leading to dependency among individual-level measures within the same unit. Ignoring this dependency in parameter estimation can yield biased results [90]. In its base form, IG-GSCA will estimate parameters under the assumption that all observations are independent, ignoring potential nested structures in genetic and imaging data. It can be extended to explicitly account for such nested structures by permitting parameters to vary across higher-level units.

In addition, IG-GSCA currently posits that a component is always associated with a set of observed variables (e.g., a gene with SNPs, or a brain region with voxels). This type of component is called a first-order component, which is directly linked to observed variables [18]. In genetic studies, it may also be reasonable to assume that multiple genes in turn constitute a biological pathway [20]. Then, such a pathway can be seen as a ‘second-order’ component, which is related only to their first-order components (genes). In neuroimaging studies, it becomes common to utilize multiple neuroimaging modalities (e.g., structured magnetic resonance imaging, electroencephalography, etc.) to measure activities of brain regions. In this case, we may consider higher-order components integrated over brain regions from each modality [91]. IG-GSCA can be extended to take into account such higher-order genetic or imaging components.

Furthermore, imaging data have increasingly been treated as smooth functions or curves that vary over a continuum (e.g., time and/or space), rather than conventional multivariate data (a collection of discrete observations). For example, functional magnetic resonance imaging records blood-oxygen level dependent signals per voxel continuously over a great number of time points (scans), indicating that these signals can be represented as bivariate functions of time (scans) and space (voxels) [92, 93]. Similarly, SNPs have been considered smooth functions of space (physical positions) [94]. IG-GSCA is geared only for the analysis of multivariate data. It can be generalized to the analysis of genetic and imaging data as functions in the measurement model, accounting for their distinctive characteristics (e.g., smoothness), in a way similar to functional GSCA [95].

IG-GSCA in this paper has not paid attention to the analysis of longitudinal data. For example, it does not take into account the dynamic nature of temporally (serially) correlated data that are prevalent particularly in brain connectivity studies [16]. IG-GSCA may be extended to incorporate autoregressive modeling to consider the dynamic relationships in time series data, as proposed in dynamic GSCA [19]. Moreover, IG-GSCA can be readily extended to accommodate growth curve models [96, 97], as GSCA can deal with the same models [18].

IG-GSCA currently estimates parameters by aggregating the data across observations under the implicit assumption that all observations come from a single homogenous population. In some cases, however, it may be more reasonable to assume that observations are drawn from (unknown) heterogeneous subgroups in the population, which exhibit different path-analytic relationships among observed variables and components [98–100]. Thus, future work is needed to simultaneously combine IG-GSCA with cluster analysis to capture such cluster-level heterogeneity, inspired by the development of fuzzy clusterwise GSCA [98].

In closing, IG-GSCA can be a useful tool for imaging genetic studies that aim to associate both genetic and imaging data with behavioral/cognitive outcomes simultaneously. It is more general than regression models, enabling to combine SNPs to genes and voxels to brain regions and examine various gene-brain-behavior/cognition relationships, in a biologically plausible manner. Also, this approach can be more beneficial for such complex path-analytic association studies of the three sources of data, as compared to (factor-based) SEM. Although we have discussed several limitations of IG-GSCA, we may address these technical issues in future research by adapting many prior developments in GSCA, contributing to making IG-GSCA applicable and useful for a greater variety of imaging genetic studies. In addition, we hope to develop a software program for IG-GSCA in a user-friendly format, such as an R package, in the near future. This will make the approach more accessible to researchers in imaging genetics, facilitating its applications to more diverse real-world problems and more thorough investigations of its empirical utility in the field.

Supporting information

S1 Table. The entire path coefficient estimates, their standard errors and 95% confidence intervals (direct effects only) in the empirical study.

https://doi.org/10.1371/journal.pone.0247592.s001

(DOCX)

S2 Table. Biases, Standard Deviations (SD), and Root Mean Square Errors (RMSE) of loadings and path coefficients estimated from IG-GSCA over different sample sizes in the simulation study.

https://doi.org/10.1371/journal.pone.0247592.s002

(DOCX)

S1 Appendix. The data generation procedure for the simulation study.

https://doi.org/10.1371/journal.pone.0247592.s003

(DOCX)

References

1. Hariri AR, Weinberger DR. Functional neuroimaging of genetic variation in serotonergic neurotransmission. Genes, Brain Behav. 2003;2: 341–349. pmid:14653306
- View Article
- PubMed/NCBI
- Google Scholar
2. Pezawas L, Meyer-Lindenberg A. Imaging genetics: Progressing by leaps and bounds. Neuroimage. 2010;53: 801–803. pmid:20816317
- View Article
- PubMed/NCBI
- Google Scholar
3. Bookheimer SY, Strojwas MH, Cohen MS, Saunders AM, Pericak-Vance MA, Mazziotta JC, et al. Patterns of brain activation in people at risk for Alzheimer’s disease. N Engl J Med. 2000;343: 450–456. pmid:10944562
- View Article
- PubMed/NCBI
- Google Scholar
4. Rasetti R, Weinberger DR. Intermediate phenotypes in psychiatric disorders. Curr Opin Genet Dev. 2011;21: 340–348. pmid:21376566
- View Article
- PubMed/NCBI
- Google Scholar
5. Purcell SM, Wray NR, Stone JL, Visscher PM, O’Donovan MC, Sullivan PF, et al. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature. 2009;460: 748–752. pmid:19571811
- View Article
- PubMed/NCBI
- Google Scholar
6. Liu J, Calhoun V. A review of multivariate analyses in imaging genetics. Front Neuroinform. 2014;8: 29. Available: https://www.frontiersin.org/article/10.3389/fninf.2014.00029 pmid:24723883
- View Article
- PubMed/NCBI
- Google Scholar
7. Meyer-Lindenberg A. The future of fMRI and genetics research. Neuroimage. 2012;62: 1286–1292. pmid:22051224
- View Article
- PubMed/NCBI
- Google Scholar
8. Sheng R, Kim H, Lee H, Xin Y, Chen Y, Tian W, et al. Cholesterol selectively activates canonical Wnt signalling over non-canonical Wnt signalling. Nat Commun. 2014;5: 4393. pmid:25024088
- View Article
- PubMed/NCBI
- Google Scholar
9. Le Floch E, Guillemot V, Frouin V, Pinel P, Lalanne C, Trinchera L, et al. Significant correlation between a set of genetic polymorphisms and a functional brain network revealed by feature selection and sparse Partial Least Squares. Neuroimage. 2012;63: 11–24. pmid:22781162
- View Article
- PubMed/NCBI
- Google Scholar
10. Vounou M, Nichols TE, Montana G. Discovering genetic associations with high-dimensional neuroimaging phenotypes: A sparse reduced-rank regression approach. Neuroimage. 2010;53: 1147–1159. pmid:20624472
- View Article
- PubMed/NCBI
- Google Scholar
11. Meda SA, Jagannathan K, Gelernter J, Calhoun VD, Liu J, Stevens MC, et al. A pilot multivariate parallel ICA study to investigate differential linkage between neural networks and genetic profiles in schizophrenia. Neuroimage. 2010;53: 1007–1015. pmid:19944766
- View Article
- PubMed/NCBI
- Google Scholar
12. Liu J, Pearlson G, Windemuth A, Ruano G, Perrone-Bizzozero NI, Calhoun V. Combining fMRI and SNP data to investigate connections between brain function and genetics using parallel ICA. Hum Brain Mapp. 2009;30: 241–255. pmid:18072279
- View Article
- PubMed/NCBI
- Google Scholar
13. Yang H, Liu J, Sui J, Pearlson G, Calhoun V. A hybrid machine learning method for fusing fMRI and genetic data: Combining both improves classification of schizophrenia. Front Hum Neurosci. 2010;4: 192. Available: https://www.frontiersin.org/article/10.3389/fnhum.2010.00192 pmid:21119772
- View Article
- PubMed/NCBI
- Google Scholar
14. Wang K, Li M, Hakonarson H. ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38: e164–e164. pmid:20601685
- View Article
- PubMed/NCBI
- Google Scholar
15. Birnbaum R, Weinberger DR. Functional neuroimaging and schizophrenia: A view towards effective connectivity modeling and polygenic risk. Dialogues Clin Neurosci. 2013;15: 279–289. Available: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3811100/ pmid:24174900
- View Article
- PubMed/NCBI
- Google Scholar
16. Friston KJ. Functional and effective connectivity in neuroimaging: A synthesis. Hum Brain Mapp. 1994;2: 56–78.
- View Article
- Google Scholar
17. Hwang H, Takane Y. Generalized structured component analysis. Psychometrika. 2004;69: 81–99.
- View Article
- Google Scholar
18. Hwang H, Takane Y. Generalized structured component analysis: A component-based approach to structural equation modeling. New York, NY: Chapman and Hall/CRC Press; 2014.
19. Jung K, Takane Y, Hwang H, Woodward TS. Dynamic GSCA (Generalized Structured Component Analysis) with applications to the analysis of effective connectivity in functional neuroimaging data. Psychometrika. 2012;77: 827–848.
- View Article
- Google Scholar
20. Lee S, Choi S, Kim YJ, Kim BJ, Hwang H, Park T. Pathway-based approach using hierarchical components of collapsed rare variants. Bioinformatics. 2016;32: i586–i594. pmid:27587678
- View Article
- PubMed/NCBI
- Google Scholar
21. Romdhani H, Hwang H, Paradis G, Roy-Gagnon MH, Labbe A. Pathway-based association study of multiple candidate genes and multiple traits using structural equation models. Genet Epidemiol. 2015;39: 101–113. pmid:25558046
- View Article
- PubMed/NCBI
- Google Scholar
22. Arslan A. Imaging genetics of schizophrenia in the post-GWAS era. Prog Neuropsychopharmacol Biol Psychiatry. 2018;80: 155–165. pmid:28645536
- View Article
- PubMed/NCBI
- Google Scholar
23. Hwang H, Ho M-HR, Lee J. Generalized Structured Component Analysis with Latent Interactions. Psychometrika. 2010;75: 228–242.
- View Article
- Google Scholar
24. Hwang H. Regularized generalized structured component analysis. Psychometrika. 2009;74: 517–530.
- View Article
- Google Scholar
25. Hwang H, Takane Y, Jung K. Generalized structured component analysis with uniqueness terms for accommodating measurement error. Front Psychol. 2017;8: 2137. pmid:29270146
- View Article
- PubMed/NCBI
- Google Scholar
26. Bollen KA. Structural equations with latent variables. New York, NY: Wiley; 1989. https://doi.org/10.1002/9781118619179
27. Jöreskog KG. A general method for estimating a linear structural equation system. In: Goldberger AS, Duncan OD, editors. Structural equation models in the social sciences. New York,: Seminar Press; 1973. pp. 255–284.
28. Huisman SMH, Mahfouz A, Batmanghelich NK, Lelieveldt BPF, Reinders MJT. A structural equation model for imaging genetics using spatial transcriptomics. Brain informatics. 2018;5: 13. pmid:30390165
- View Article
- PubMed/NCBI
- Google Scholar
29. Köhncke Y, Düzel S, Sander MC, Lindenberger U, Kühn S, Brandmaier AM. Hippocampal and parahippocampal grey matter structural integrity assessed by multimodal imaging is associated with episodic memory in old age. bioRxiv. 2020; 2020.02.07.936872.
- View Article
- Google Scholar
30. Borsboom D, Mellenbergh GJ, Van Heerden J. The concept of validity. Psychol Rev. 2004;111: 1061–1071. pmid:15482073
- View Article
- PubMed/NCBI
- Google Scholar
31. Bollen KA, Bauldry S. Three Cs in measurement models: Causal indicators, composite indicators, and covariates. Psychol Methods. 2011;16: 265–284. pmid:21767021
- View Article
- PubMed/NCBI
- Google Scholar
32. Kline RB. Principles and practice of structural equation modeling. 3rd ed. New York, NY, US: Guilford Press; 2011. Available: https://psycnet.apa.org/record/2010-18801-000
33. McDonald RP, Mulaik SA. Determinacy of common factors: A nontechnical review. Psychol Bull. 1979;86: 297–306.
- View Article
- Google Scholar
34. Steiger JH. Factor indeterminacy in the 1930’s and the 1970’s some interesting parallels. Psychometrika. 1979;44: 157–167.
- View Article
- Google Scholar
35. Nonconvergence Boomsma A., improper solutions, and starting values in lisrel maximum likelihood estimation. Psychometrika. 1985;50: 229–242.
- View Article
- Google Scholar
36. Chen F, Bollen KA, Paxton P, Curran PJ, Kirby JB. Improper solutions in structural equation models: Causes, consequences, and strategies. Sociol Methods Res. 2001;29: 468–508.
- View Article
- Google Scholar
37. Cox SR, Harris MA, Ritchie SJ, Buchanan CR, Hernández MV, Corley J, et al. Three major dimensions of human brain cortical ageing in relation to cognitive decline across the 8th decade of life. bioRxiv. 2020; 2020.01.19.911420.
- View Article
- Google Scholar
38. Tan H-Y, Chen Q, Sust S, Buckholtz JW, Meyers JD, Egan MF, et al. Epistasis between catechol-O-methyltransferase and type II metabotropic glutamate receptor 3 genes on working memory brain function. Proc Natl Acad Sci. 2007;104: 12536 LP– 12541. pmid:17636131
- View Article
- PubMed/NCBI
- Google Scholar
39. Green AE, Munafò MR, DeYoung CG, Fossella JA, Fan J, Gray JR. Using genetic data in cognitive neuroscience: from growing pains to genuine insights. Nat Rev Neurosci. 2008;9: 710–720. pmid:19143051
- View Article
- PubMed/NCBI
- Google Scholar
40. Kiers HAL, Takane Y, ten Berge JMF. The analysis of multitrait-multimethod matrices via constrained components analysis. Psychometrika. 1996;61: 601–628.
- View Article
- Google Scholar
41. Hoerl AE, Kennard RW. Ridge regression: Biased estimation for nonorthogonal problems. Technometrics. 1970;12: 55–67.
- View Article
- Google Scholar
42. Tibshirani R. Regression shrinkage and selection via the Lasso. J R Stat Soc Ser B. 1996;58: 267–288.
- View Article
- Google Scholar
43. Berk RA. Statistical Learning from a Regression Perspective. New York, NY: Springer; 2008. https://doi.org/10.1007/978-3-319-44048-4
44. Takane Y, Hwang H. Regularized linear and kernel redundancy analysis. Comput Stat Data Anal. 2007;52: 394–405. https://doi.org/10.1016/j.csda.2007.02.014
- View Article
- Google Scholar
45. Hastie T, Tibshirani R, Friedman J. The elements of statistical learning. New York: Springer; 2001. https://doi.org/10.1007/978-0-387-21606-5
46. Efron B. Bootstrap methods: Another look at the jackknife. Ann Stat. 1979;7: 1–26.
- View Article
- Google Scholar
47. Oh SM, Min KJ, Park DB. A study on the standardization of the hospital anxiety and depression scale for Koreans: A comparison of normal, depressed and anxious groups. J Korean Neuropsychiatr Assoc. 1999;38: 289–296. Available: https://koreamed.org/article/0055JKNA/1999.38.2.289
- View Article
- Google Scholar
48. Bae H, Kim D, Koh H, Kim Y, Park JS. Psychometric properties of the life events checklist-korean version. Psychiatry Investig. 2008;5: 163–167. pmid:20046360
- View Article
- PubMed/NCBI
- Google Scholar
49. Lee BO, Lee CH, Lee PG, Choi MJ, Namkoong K. Development of Korean version of alcohol use disorder identification test (AUDIT-K): Its reliability and validity. J Korean Acad Addict Psychiatry. 2000;4: 85–94. Available: https://ir.ymlib.yonsei.ac.kr/handle/22282913/172500
- View Article
- Google Scholar
50. Holmes AJ, Bogdan R, Pizzagalli DA. Serotonin transporter genotype and action monitoring dysfunction: A possible substrate underlying increased vulnerability to depression. Neuropsychopharmacology. 2010;35: 1186–1197. pmid:20090673
- View Article
- PubMed/NCBI
- Google Scholar
51. Zobel A, Schuhmacher A, Jessen F, Hfels S, Von Widdern O, Metten M, et al. DNA sequence variants of the FKBP5 gene are associated with unipolar depression. Int J Neuropsychopharmacol. 2010;13: 649–660. pmid:20047716
- View Article
- PubMed/NCBI
- Google Scholar
52. Lowe SR, Pothen J, Quinn JW, Rundle A, Bradley B, Galea S, et al. Gene-by-social-environment interaction (GxSE) between ADCYAP1R1 genotype and neighborhood crime predicts major depression symptoms in trauma-exposed women. J Affect Disord. 2015;187: 147–150. pmid:26334183
- View Article
- PubMed/NCBI
- Google Scholar
53. Sen S, Nesse RM, Stoltenberg SF, Li S, Gleiberman L, Chakravarti A, et al. A BDNF coding variant is associated with the NEO personality inventory domain neuroticism, a risk factor for depression. Neuropsychopharmacology. 2003;28: 397–401. pmid:12589394
- View Article
- PubMed/NCBI
- Google Scholar
54. Åberg E, Fandiño-Losada A, Sjöholm LK, Forsell Y, Lavebratt C. The functional Val158Met polymorphism in catechol-O- methyltransferase (COMT) is associated with depression and motivation in men from a Swedish population-based study. J Affect Disord. 2011;129: 158–166. pmid:20828831
- View Article
- PubMed/NCBI
- Google Scholar
55. Gatt JM, Williams LM, Schofield PR, Dobson-Stone C, Paul RH, Grieve SM, et al. Impact of the HTR3A gene with early life trauma on emotional brain networks and depressed mood. Depress Anxiety. 2010;27: 752–759. pmid:20694966
- View Article
- PubMed/NCBI
- Google Scholar
56. Vaske J, Makarios M, Boisvert D, Beaver KM, Wright JP. The interaction of DRD2 and violent victimization on depression: An analysis by gender and race. J Affect Disord. 2009;112: 120–125. pmid:18501970
- View Article
- PubMed/NCBI
- Google Scholar
57. Gałecka E, Szemraj J, Bieńkiewicz M, Majsterek I, Przybyłowska-Sygut K, Gałecki P, et al. Single nucleotide polymorphisms of NR3C1 gene and recurrent depressive disorder in population of Poland. Mol Biol Rep. 2013;40: 1693–1699. pmid:23073785
- View Article
- PubMed/NCBI
- Google Scholar
58. McQuaid RJ, McInnis OA, Stead JD, Matheson K, Anisman H. A paradoxical association of an oxytocin receptor gene polymorphism: Early-life adversity and vulnerability to depression. Front Neurosci. 2013;7: 128. pmid:23898235
- View Article
- PubMed/NCBI
- Google Scholar
59. Ashburner J. A fast diffeomorphic image registration algorithm. Neuroimage. 2007;38: 95–113. pmid:17761438
- View Article
- PubMed/NCBI
- Google Scholar
60. Ashburner J, Friston KJ. Unified segmentation. Neuroimage. 2005;26: 839–851. pmid:15955494
- View Article
- PubMed/NCBI
- Google Scholar
61. Dahnke R, Yotter RA, Gaser C. Cortical thickness and central surface estimation. Neuroimage. 2013;65: 336–348. pmid:23041529
- View Article
- PubMed/NCBI
- Google Scholar
62. Desikan RS, Ségonne F, Fischl B, Quinn BT, Dickerson BC, Blacker D, et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage. 2006;31: 968–980. pmid:16530430
- View Article
- PubMed/NCBI
- Google Scholar
63. Kaufman J, Charney D. Effects of early stress on brain structure and function: implications for understanding the relationship between child maltreatment and depression. Dev Psychopathol. 2001;13: 451–471. pmid:11523843
- View Article
- PubMed/NCBI
- Google Scholar
64. Corbo V, Salat DH, Amick MM, Leritz EC, Milberg WP, McGlinchey RE. Reduced cortical thickness in veterans exposed to early life trauma. Psychiatry Res Neuroimaging. 2014;223: 53–60. pmid:24862391
- View Article
- PubMed/NCBI
- Google Scholar
65. Papagni SA, Benetti S, Arulanantham S, McCrory E, McGuire P, Mechelli A. Effects of stressful life events on human brain structure: A longitudinal voxel-based morphometry study. Stress. 2011;14: 227–232. pmid:21034297
- View Article
- PubMed/NCBI
- Google Scholar
66. Peper JS, Brouwer RM, Boomsma DI, Kahn RS, Hulshoff Pol HE. Genetic influences on human brain structure: A review of brain imaging studies in twins. Hum Brain Mapp. 2007;28: 464–473. pmid:17415783
- View Article
- PubMed/NCBI
- Google Scholar
67. Caspi A, Sugden K, Moffitt TE, Taylor A, Craig IW, Harrington H, et al. Influence of Life Stress on Depression: Moderation by a Polymorphism in the 5-HTT Gene. Science (80-). 2003;301: 386 LP– 389. pmid:12869766
- View Article
- PubMed/NCBI
- Google Scholar
68. Charney DS, Manji HK. Life stress, genes, and depression: multiple pathways lead to increased risk and new opportunities for intervention. Sci Signal. 2004;2004: re5. pmid:15039492
- View Article
- PubMed/NCBI
- Google Scholar
69. Heim C, Binder EB. Current research trends in early life stress and depression: review of human studies on sensitive periods, gene-environment interactions, and epigenetics. Exp Neurol. 2012;233: 102–111. pmid:22101006
- View Article
- PubMed/NCBI
- Google Scholar
70. Kessler RC, Birnbaum H, Bromet E, Hwang I, Sampson N, Shahly V. Age differences in major depression: Results from the national comorbidity survey replication (NCS-R). Psychol Med. 2010;40: 225–237. pmid:19531277
- View Article
- PubMed/NCBI
- Google Scholar
71. Salat DH, Buckner RL, Snyder AZ, Greve DN, Desikan RSR, Busa E, et al. Thinning of the cerebral cortex in aging. Cereb Cortex. 2004;14: 721–730. pmid:15054051
- View Article
- PubMed/NCBI
- Google Scholar
72. Piccinelli M, Wilkinson G. Gender differences in depression: Critical review. Br J Psychiatry. 2000;177: 486–492. pmid:11102321
- View Article
- PubMed/NCBI
- Google Scholar
73. Sowell ER, Peterson BS, Kan E, Woods RP, Yoshii J, Bansal R, et al. Sex differences in cortical thickness mapped in 176 healthy individuals between 7 and 87 years of age. Cereb Cortex. 2007;17: 1550–1560. pmid:16945978
- View Article
- PubMed/NCBI
- Google Scholar
74. Boden JM, Fergusson DM. Alcohol and depression. Addiction. 2011;106: 906–914. pmid:21382111
- View Article
- PubMed/NCBI
- Google Scholar
75. Durazzo TC, Tosun D, Buckley S, Gazdzinski S, Mon A, Fryer SL, et al. Cortical thickness, surface area, and volume of the brain reward system in alcohol dependence: Relationships to relapse and extended abstinence. Alcohol Clin Exp Res. 2011;35: 1187–1200. pmid:21410483
- View Article
- PubMed/NCBI
- Google Scholar
76. Bertolino A, Fazio L, Di Giorgio A, Blasi G, Romano R, Taurisano P, et al. Genetically determined interaction between the dopamine transporter and the D2 receptor on prefronto-striatal activity and volume in humans. J Neurosci. 2009;29: 1224 LP– 1234. pmid:19176830
- View Article
- PubMed/NCBI
- Google Scholar
77. Sambataro F, Fazio L, Taurisano P, Gelao B, Porcelli A, Mancini M, et al. DRD2 genotype-based variation of default mode network activity and of its relationship with striatal DAT binding. Schizophr Bull. 2013;39: 206–216. pmid:21976709
- View Article
- PubMed/NCBI
- Google Scholar
78. Grieve SM, Korgaonkar MS, Koslow SH, Gordon E, Williams LM. Widespread reductions in gray matter volume in depression. NeuroImage Clin. 2013;3: 332–339. pmid:24273717
- View Article
- PubMed/NCBI
- Google Scholar
79. Takahashi T, Yücel M, Lorenzetti V, Tanino R, Whittle S, Suzuki M, et al. Volumetric MRI study of the insular cortex in individuals with current and past major depression. J Affect Disord. 2010;121: 231–238. pmid:19540599
- View Article
- PubMed/NCBI
- Google Scholar
80. van Tol M-J, Li M, Metzger CD, Hailla N, Horn DI, Li W, et al. Local cortical thinning links to resting-state disconnectivity in major depressive disorder. Psychol Med. 2013/11/01 2014;44: 2053–2065. pmid:24156689
- View Article
- PubMed/NCBI
- Google Scholar
81. Bing X, Ming-Guo Q, Ye Z, Jing-Na Z, Min L, Han C, et al. Alterations in the cortical thickness and the amplitude of low-frequency fluctuation in patients with post-traumatic stress disorder. Brain Res. 2013;1490: 225–232. pmid:23122880
- View Article
- PubMed/NCBI
- Google Scholar
82. Lim L, Radua J, Rubia K. Gray matter abnormalities in childhood maltreatment: A voxel-wise meta-analysis. Am J Psychiatry. 2014;171: 854–863. pmid:24781447
- View Article
- PubMed/NCBI
- Google Scholar
83. Chou KL, Chi I. Stressful life events and depressive symptoms: social support and sense of control as mediators or moderators? Int J Aging Hum Dev. 2001;52: 155–171. pmid:11352200
- View Article
- PubMed/NCBI
- Google Scholar
84. Hammen C. Stress and depression. Annu Rev Clin Psychol. 2004;1: 293–319. pmid:17716090
- View Article
- PubMed/NCBI
- Google Scholar
85. Kessler RC. The effects of stressful life events on depression. Annu Rev Psychol. 1997;48: 191–214. pmid:9046559
- View Article
- PubMed/NCBI
- Google Scholar
86. You S, Conner KR. Stressful life events and depressive symptoms: influences of gender, event severity, and depression history. J Nerv Ment Dis. 2009;197: 829–833. pmid:19996721
- View Article
- PubMed/NCBI
- Google Scholar
87. Frodl T, Koutsouleris N, Bottlender R, Born C, Jäger M, Mörgenthaler M, et al. Reduced gray matter brain volumes are associated with variants of the serotonin transporter gene in major depression. Mol Psychiatry. 2008;13: 1093–1101. pmid:19008895
- View Article
- PubMed/NCBI
- Google Scholar
88. Jaworska N, MacMaster FP, Foster J, Ramasubbu R. The influence of 5-HTTLPR and Val66Met polymorphisms on cortical thickness and volume in limbic and paralimbic regions in depression: a preliminary study. BMC Psychiatry. 2016;16: 61. pmid:26976307
- View Article
- PubMed/NCBI
- Google Scholar
89. Gene Northoff G., brains, and environment-genetic neuroimaging of depression. Curr Opin Neurobiol. 2013;23: 133–142. pmid:22995550
- View Article
- PubMed/NCBI
- Google Scholar
90. Snijders TAB, Bosker RJ (Roel J. Multilevel analysis: An introduction to basic and advanced multilevel modeling. London, England: Sage; 1999.
91. Zhou L, Takane Y, Hwang H. Dynamic GSCANO (Generalized Structured Canonical Correlation Analysis) with applications to the analysis of effective connectivity in functional neuroimaging data. Comput Stat Data Anal. 2016;101: 93–109. https://doi.org/10.1016/j.csda.2016.03.001
- View Article
- Google Scholar
92. Ramsay JO, Silverman BW. Principal components analysis for functional data. In: Ramsay JO, Silverman BW, editors. Functional Data Analysis. New York, NY: Springer; 2005. pp. 147–172. https://doi.org/10.1007/0-387-22751-2_8
93. Tian P, Teng IC, May LD, Kurz R, Lu K, Scadeng M, et al. Cortical depth-specific microvascular dilation underlies laminar differences in blood oxygenation level-dependent functional MRI signal. Proc Natl Acad Sci. 2010;107: 15246–15251. pmid:20696904
- View Article
- PubMed/NCBI
- Google Scholar
94. Luo L, Zhu Y, Xiong M. A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data. J Comput Biol. 2012;19: 731–744. pmid:22651812
- View Article
- PubMed/NCBI
- Google Scholar
95. Suk HW, Hwang H. Functional generalized structured component analysis. Psychometrika. 2016;81: 940–968. pmid:27714543
- View Article
- PubMed/NCBI
- Google Scholar
96. Duncan TE, Duncan SC, Strycker LA. An introduction to latent variable growth curve modeling: Concepts, issues, and applications. 2nd ed. Mahwah, NJ: Erlbaum; 2006.
97. Meredith W, Tisak J. Latent curve analysis. Psychometrika. 1990;55: 107–122.
- View Article
- Google Scholar
98. Hwang H, Desarbo WS, Takane Y. Fuzzy clusterwise generalized structured component analysis. Psychometrika. 2007;72: 181–198.
- View Article
- Google Scholar
99. Ryoo JH, Park S, Kim S, Ryoo HS. Efficiency of cluster validity indexes in fuzzy clusterwise generalized structured component analysis. Symmetry (Basel). 2020;12.
- View Article
- Google Scholar
100. Park S, Kim S, Ryoo JH. Latent class regression utilizing fuzzy clusterwise generalized structured component analysis. Mathematics. 2020;8.
- View Article
- Google Scholar

[ref1] 1. Hariri AR, Weinberger DR. Functional neuroimaging of genetic variation in serotonergic neurotransmission. Genes, Brain Behav. 2003;2: 341–349. pmid:14653306
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Pezawas L, Meyer-Lindenberg A. Imaging genetics: Progressing by leaps and bounds. Neuroimage. 2010;53: 801–803. pmid:20816317
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Bookheimer SY, Strojwas MH, Cohen MS, Saunders AM, Pericak-Vance MA, Mazziotta JC, et al. Patterns of brain activation in people at risk for Alzheimer’s disease. N Engl J Med. 2000;343: 450–456. pmid:10944562
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Rasetti R, Weinberger DR. Intermediate phenotypes in psychiatric disorders. Curr Opin Genet Dev. 2011;21: 340–348. pmid:21376566
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Purcell SM, Wray NR, Stone JL, Visscher PM, O’Donovan MC, Sullivan PF, et al. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature. 2009;460: 748–752. pmid:19571811
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref6] 6. Liu J, Calhoun V. A review of multivariate analyses in imaging genetics. Front Neuroinform. 2014;8: 29. Available: https://www.frontiersin.org/article/10.3389/fninf.2014.00029 pmid:24723883
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref7] 7. Meyer-Lindenberg A. The future of fMRI and genetics research. Neuroimage. 2012;62: 1286–1292. pmid:22051224
View Article
PubMed/NCBI
Google Scholar

[26] View Article

[27] PubMed/NCBI

[28] Google Scholar

[ref8] 8. Sheng R, Kim H, Lee H, Xin Y, Chen Y, Tian W, et al. Cholesterol selectively activates canonical Wnt signalling over non-canonical Wnt signalling. Nat Commun. 2014;5: 4393. pmid:25024088
View Article
PubMed/NCBI
Google Scholar

[30] View Article

[31] PubMed/NCBI

[32] Google Scholar

[ref9] 9. Le Floch E, Guillemot V, Frouin V, Pinel P, Lalanne C, Trinchera L, et al. Significant correlation between a set of genetic polymorphisms and a functional brain network revealed by feature selection and sparse Partial Least Squares. Neuroimage. 2012;63: 11–24. pmid:22781162
View Article
PubMed/NCBI
Google Scholar

[34] View Article

[35] PubMed/NCBI

[36] Google Scholar

[ref10] 10. Vounou M, Nichols TE, Montana G. Discovering genetic associations with high-dimensional neuroimaging phenotypes: A sparse reduced-rank regression approach. Neuroimage. 2010;53: 1147–1159. pmid:20624472
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref11] 11. Meda SA, Jagannathan K, Gelernter J, Calhoun VD, Liu J, Stevens MC, et al. A pilot multivariate parallel ICA study to investigate differential linkage between neural networks and genetic profiles in schizophrenia. Neuroimage. 2010;53: 1007–1015. pmid:19944766
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref12] 12. Liu J, Pearlson G, Windemuth A, Ruano G, Perrone-Bizzozero NI, Calhoun V. Combining fMRI and SNP data to investigate connections between brain function and genetics using parallel ICA. Hum Brain Mapp. 2009;30: 241–255. pmid:18072279
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref13] 13. Yang H, Liu J, Sui J, Pearlson G, Calhoun V. A hybrid machine learning method for fusing fMRI and genetic data: Combining both improves classification of schizophrenia. Front Hum Neurosci. 2010;4: 192. Available: https://www.frontiersin.org/article/10.3389/fnhum.2010.00192 pmid:21119772
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref14] 14. Wang K, Li M, Hakonarson H. ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38: e164–e164. pmid:20601685
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref15] 15. Birnbaum R, Weinberger DR. Functional neuroimaging and schizophrenia: A view towards effective connectivity modeling and polygenic risk. Dialogues Clin Neurosci. 2013;15: 279–289. Available: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3811100/ pmid:24174900
View Article
PubMed/NCBI
Google Scholar

[58] View Article

[59] PubMed/NCBI

[60] Google Scholar

[ref16] 16. Friston KJ. Functional and effective connectivity in neuroimaging: A synthesis. Hum Brain Mapp. 1994;2: 56–78.
View Article
Google Scholar

[62] View Article

[63] Google Scholar

[ref17] 17. Hwang H, Takane Y. Generalized structured component analysis. Psychometrika. 2004;69: 81–99.
View Article
Google Scholar

[65] View Article

[66] Google Scholar

[ref18] 18. Hwang H, Takane Y. Generalized structured component analysis: A component-based approach to structural equation modeling. New York, NY: Chapman and Hall/CRC Press; 2014.

[ref19] 19. Jung K, Takane Y, Hwang H, Woodward TS. Dynamic GSCA (Generalized Structured Component Analysis) with applications to the analysis of effective connectivity in functional neuroimaging data. Psychometrika. 2012;77: 827–848.
View Article
Google Scholar

[69] View Article

[70] Google Scholar

[ref20] 20. Lee S, Choi S, Kim YJ, Kim BJ, Hwang H, Park T. Pathway-based approach using hierarchical components of collapsed rare variants. Bioinformatics. 2016;32: i586–i594. pmid:27587678
View Article
PubMed/NCBI
Google Scholar

[72] View Article

[73] PubMed/NCBI

[74] Google Scholar

[ref21] 21. Romdhani H, Hwang H, Paradis G, Roy-Gagnon MH, Labbe A. Pathway-based association study of multiple candidate genes and multiple traits using structural equation models. Genet Epidemiol. 2015;39: 101–113. pmid:25558046
View Article
PubMed/NCBI
Google Scholar

[76] View Article

[77] PubMed/NCBI

[78] Google Scholar

[ref22] 22. Arslan A. Imaging genetics of schizophrenia in the post-GWAS era. Prog Neuropsychopharmacol Biol Psychiatry. 2018;80: 155–165. pmid:28645536
View Article
PubMed/NCBI
Google Scholar

[80] View Article

[81] PubMed/NCBI

[82] Google Scholar

[ref23] 23. Hwang H, Ho M-HR, Lee J. Generalized Structured Component Analysis with Latent Interactions. Psychometrika. 2010;75: 228–242.
View Article
Google Scholar

[84] View Article

[85] Google Scholar

[ref24] 24. Hwang H. Regularized generalized structured component analysis. Psychometrika. 2009;74: 517–530.
View Article
Google Scholar

[87] View Article

[88] Google Scholar

[ref25] 25. Hwang H, Takane Y, Jung K. Generalized structured component analysis with uniqueness terms for accommodating measurement error. Front Psychol. 2017;8: 2137. pmid:29270146
View Article
PubMed/NCBI
Google Scholar

[90] View Article

[91] PubMed/NCBI

[92] Google Scholar

[ref26] 26. Bollen KA. Structural equations with latent variables. New York, NY: Wiley; 1989. https://doi.org/10.1002/9781118619179

[ref27] 27. Jöreskog KG. A general method for estimating a linear structural equation system. In: Goldberger AS, Duncan OD, editors. Structural equation models in the social sciences. New York,: Seminar Press; 1973. pp. 255–284.

[ref28] 28. Huisman SMH, Mahfouz A, Batmanghelich NK, Lelieveldt BPF, Reinders MJT. A structural equation model for imaging genetics using spatial transcriptomics. Brain informatics. 2018;5: 13. pmid:30390165
View Article
PubMed/NCBI
Google Scholar

[96] View Article

[97] PubMed/NCBI

[98] Google Scholar

[ref29] 29. Köhncke Y, Düzel S, Sander MC, Lindenberger U, Kühn S, Brandmaier AM. Hippocampal and parahippocampal grey matter structural integrity assessed by multimodal imaging is associated with episodic memory in old age. bioRxiv. 2020; 2020.02.07.936872.
View Article
Google Scholar

[100] View Article

[101] Google Scholar

[ref30] 30. Borsboom D, Mellenbergh GJ, Van Heerden J. The concept of validity. Psychol Rev. 2004;111: 1061–1071. pmid:15482073
View Article
PubMed/NCBI
Google Scholar

[103] View Article

[104] PubMed/NCBI

[105] Google Scholar

[ref31] 31. Bollen KA, Bauldry S. Three Cs in measurement models: Causal indicators, composite indicators, and covariates. Psychol Methods. 2011;16: 265–284. pmid:21767021
View Article
PubMed/NCBI
Google Scholar

[107] View Article

[108] PubMed/NCBI

[109] Google Scholar

[ref32] 32. Kline RB. Principles and practice of structural equation modeling. 3rd ed. New York, NY, US: Guilford Press; 2011. Available: https://psycnet.apa.org/record/2010-18801-000

[ref33] 33. McDonald RP, Mulaik SA. Determinacy of common factors: A nontechnical review. Psychol Bull. 1979;86: 297–306.
View Article
Google Scholar

[112] View Article

[113] Google Scholar

[ref34] 34. Steiger JH. Factor indeterminacy in the 1930’s and the 1970’s some interesting parallels. Psychometrika. 1979;44: 157–167.
View Article
Google Scholar

[115] View Article

[116] Google Scholar

[ref35] 35. Nonconvergence Boomsma A., improper solutions, and starting values in lisrel maximum likelihood estimation. Psychometrika. 1985;50: 229–242.
View Article
Google Scholar

[118] View Article

[119] Google Scholar

[ref36] 36. Chen F, Bollen KA, Paxton P, Curran PJ, Kirby JB. Improper solutions in structural equation models: Causes, consequences, and strategies. Sociol Methods Res. 2001;29: 468–508.
View Article
Google Scholar

[121] View Article

[122] Google Scholar

[ref37] 37. Cox SR, Harris MA, Ritchie SJ, Buchanan CR, Hernández MV, Corley J, et al. Three major dimensions of human brain cortical ageing in relation to cognitive decline across the 8th decade of life. bioRxiv. 2020; 2020.01.19.911420.
View Article
Google Scholar

[124] View Article

[125] Google Scholar

[ref38] 38. Tan H-Y, Chen Q, Sust S, Buckholtz JW, Meyers JD, Egan MF, et al. Epistasis between catechol-O-methyltransferase and type II metabotropic glutamate receptor 3 genes on working memory brain function. Proc Natl Acad Sci. 2007;104: 12536 LP– 12541. pmid:17636131
View Article
PubMed/NCBI
Google Scholar

[127] View Article

[128] PubMed/NCBI

[129] Google Scholar

[ref39] 39. Green AE, Munafò MR, DeYoung CG, Fossella JA, Fan J, Gray JR. Using genetic data in cognitive neuroscience: from growing pains to genuine insights. Nat Rev Neurosci. 2008;9: 710–720. pmid:19143051
View Article
PubMed/NCBI
Google Scholar

[131] View Article

[132] PubMed/NCBI

[133] Google Scholar

[ref40] 40. Kiers HAL, Takane Y, ten Berge JMF. The analysis of multitrait-multimethod matrices via constrained components analysis. Psychometrika. 1996;61: 601–628.
View Article
Google Scholar

[135] View Article

[136] Google Scholar

[ref41] 41. Hoerl AE, Kennard RW. Ridge regression: Biased estimation for nonorthogonal problems. Technometrics. 1970;12: 55–67.
View Article
Google Scholar

[138] View Article

[139] Google Scholar

[ref42] 42. Tibshirani R. Regression shrinkage and selection via the Lasso. J R Stat Soc Ser B. 1996;58: 267–288.
View Article
Google Scholar

[141] View Article

[142] Google Scholar

[ref43] 43. Berk RA. Statistical Learning from a Regression Perspective. New York, NY: Springer; 2008. https://doi.org/10.1007/978-3-319-44048-4

[ref44] 44. Takane Y, Hwang H. Regularized linear and kernel redundancy analysis. Comput Stat Data Anal. 2007;52: 394–405. https://doi.org/10.1016/j.csda.2007.02.014
View Article
Google Scholar

[145] View Article

[146] Google Scholar

[ref45] 45. Hastie T, Tibshirani R, Friedman J. The elements of statistical learning. New York: Springer; 2001. https://doi.org/10.1007/978-0-387-21606-5

[ref46] 46. Efron B. Bootstrap methods: Another look at the jackknife. Ann Stat. 1979;7: 1–26.
View Article
Google Scholar

[149] View Article

[150] Google Scholar

[ref47] 47. Oh SM, Min KJ, Park DB. A study on the standardization of the hospital anxiety and depression scale for Koreans: A comparison of normal, depressed and anxious groups. J Korean Neuropsychiatr Assoc. 1999;38: 289–296. Available: https://koreamed.org/article/0055JKNA/1999.38.2.289
View Article
Google Scholar

[152] View Article

[153] Google Scholar

[ref48] 48. Bae H, Kim D, Koh H, Kim Y, Park JS. Psychometric properties of the life events checklist-korean version. Psychiatry Investig. 2008;5: 163–167. pmid:20046360
View Article
PubMed/NCBI
Google Scholar

[155] View Article

[156] PubMed/NCBI

[157] Google Scholar

[ref49] 49. Lee BO, Lee CH, Lee PG, Choi MJ, Namkoong K. Development of Korean version of alcohol use disorder identification test (AUDIT-K): Its reliability and validity. J Korean Acad Addict Psychiatry. 2000;4: 85–94. Available: https://ir.ymlib.yonsei.ac.kr/handle/22282913/172500
View Article
Google Scholar

[159] View Article

[160] Google Scholar

[ref50] 50. Holmes AJ, Bogdan R, Pizzagalli DA. Serotonin transporter genotype and action monitoring dysfunction: A possible substrate underlying increased vulnerability to depression. Neuropsychopharmacology. 2010;35: 1186–1197. pmid:20090673
View Article
PubMed/NCBI
Google Scholar

[162] View Article

[163] PubMed/NCBI

[164] Google Scholar

[ref51] 51. Zobel A, Schuhmacher A, Jessen F, Hfels S, Von Widdern O, Metten M, et al. DNA sequence variants of the FKBP5 gene are associated with unipolar depression. Int J Neuropsychopharmacol. 2010;13: 649–660. pmid:20047716
View Article
PubMed/NCBI
Google Scholar

[166] View Article

[167] PubMed/NCBI

[168] Google Scholar

[ref52] 52. Lowe SR, Pothen J, Quinn JW, Rundle A, Bradley B, Galea S, et al. Gene-by-social-environment interaction (GxSE) between ADCYAP1R1 genotype and neighborhood crime predicts major depression symptoms in trauma-exposed women. J Affect Disord. 2015;187: 147–150. pmid:26334183
View Article
PubMed/NCBI
Google Scholar

[170] View Article

[171] PubMed/NCBI

[172] Google Scholar

[ref53] 53. Sen S, Nesse RM, Stoltenberg SF, Li S, Gleiberman L, Chakravarti A, et al. A BDNF coding variant is associated with the NEO personality inventory domain neuroticism, a risk factor for depression. Neuropsychopharmacology. 2003;28: 397–401. pmid:12589394
View Article
PubMed/NCBI
Google Scholar

[174] View Article

[175] PubMed/NCBI

[176] Google Scholar

[ref54] 54. Åberg E, Fandiño-Losada A, Sjöholm LK, Forsell Y, Lavebratt C. The functional Val158Met polymorphism in catechol-O- methyltransferase (COMT) is associated with depression and motivation in men from a Swedish population-based study. J Affect Disord. 2011;129: 158–166. pmid:20828831
View Article
PubMed/NCBI
Google Scholar

[178] View Article

[179] PubMed/NCBI

[180] Google Scholar

[ref55] 55. Gatt JM, Williams LM, Schofield PR, Dobson-Stone C, Paul RH, Grieve SM, et al. Impact of the HTR3A gene with early life trauma on emotional brain networks and depressed mood. Depress Anxiety. 2010;27: 752–759. pmid:20694966
View Article
PubMed/NCBI
Google Scholar

[182] View Article

[183] PubMed/NCBI

[184] Google Scholar

[ref56] 56. Vaske J, Makarios M, Boisvert D, Beaver KM, Wright JP. The interaction of DRD2 and violent victimization on depression: An analysis by gender and race. J Affect Disord. 2009;112: 120–125. pmid:18501970
View Article
PubMed/NCBI
Google Scholar

[186] View Article

[187] PubMed/NCBI

[188] Google Scholar

[ref57] 57. Gałecka E, Szemraj J, Bieńkiewicz M, Majsterek I, Przybyłowska-Sygut K, Gałecki P, et al. Single nucleotide polymorphisms of NR3C1 gene and recurrent depressive disorder in population of Poland. Mol Biol Rep. 2013;40: 1693–1699. pmid:23073785
View Article
PubMed/NCBI
Google Scholar

[190] View Article

[191] PubMed/NCBI

[192] Google Scholar

[ref58] 58. McQuaid RJ, McInnis OA, Stead JD, Matheson K, Anisman H. A paradoxical association of an oxytocin receptor gene polymorphism: Early-life adversity and vulnerability to depression. Front Neurosci. 2013;7: 128. pmid:23898235
View Article
PubMed/NCBI
Google Scholar

[194] View Article

[195] PubMed/NCBI

[196] Google Scholar

[ref59] 59. Ashburner J. A fast diffeomorphic image registration algorithm. Neuroimage. 2007;38: 95–113. pmid:17761438
View Article
PubMed/NCBI
Google Scholar

[198] View Article

[199] PubMed/NCBI

[200] Google Scholar

[ref60] 60. Ashburner J, Friston KJ. Unified segmentation. Neuroimage. 2005;26: 839–851. pmid:15955494
View Article
PubMed/NCBI
Google Scholar

[202] View Article

[203] PubMed/NCBI

[204] Google Scholar

[ref61] 61. Dahnke R, Yotter RA, Gaser C. Cortical thickness and central surface estimation. Neuroimage. 2013;65: 336–348. pmid:23041529
View Article
PubMed/NCBI
Google Scholar

[206] View Article

[207] PubMed/NCBI

[208] Google Scholar

[ref62] 62. Desikan RS, Ségonne F, Fischl B, Quinn BT, Dickerson BC, Blacker D, et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage. 2006;31: 968–980. pmid:16530430
View Article
PubMed/NCBI
Google Scholar

[210] View Article

[211] PubMed/NCBI

[212] Google Scholar

[ref63] 63. Kaufman J, Charney D. Effects of early stress on brain structure and function: implications for understanding the relationship between child maltreatment and depression. Dev Psychopathol. 2001;13: 451–471. pmid:11523843
View Article
PubMed/NCBI
Google Scholar

[214] View Article

[215] PubMed/NCBI

[216] Google Scholar

[ref64] 64. Corbo V, Salat DH, Amick MM, Leritz EC, Milberg WP, McGlinchey RE. Reduced cortical thickness in veterans exposed to early life trauma. Psychiatry Res Neuroimaging. 2014;223: 53–60. pmid:24862391
View Article
PubMed/NCBI
Google Scholar

[218] View Article

[219] PubMed/NCBI

[220] Google Scholar

[ref65] 65. Papagni SA, Benetti S, Arulanantham S, McCrory E, McGuire P, Mechelli A. Effects of stressful life events on human brain structure: A longitudinal voxel-based morphometry study. Stress. 2011;14: 227–232. pmid:21034297
View Article
PubMed/NCBI
Google Scholar

[222] View Article

[223] PubMed/NCBI

[224] Google Scholar

[ref66] 66. Peper JS, Brouwer RM, Boomsma DI, Kahn RS, Hulshoff Pol HE. Genetic influences on human brain structure: A review of brain imaging studies in twins. Hum Brain Mapp. 2007;28: 464–473. pmid:17415783
View Article
PubMed/NCBI
Google Scholar

[226] View Article

[227] PubMed/NCBI

[228] Google Scholar

[ref67] 67. Caspi A, Sugden K, Moffitt TE, Taylor A, Craig IW, Harrington H, et al. Influence of Life Stress on Depression: Moderation by a Polymorphism in the 5-HTT Gene. Science (80-). 2003;301: 386 LP– 389. pmid:12869766
View Article
PubMed/NCBI
Google Scholar

[230] View Article

[231] PubMed/NCBI

[232] Google Scholar

[ref68] 68. Charney DS, Manji HK. Life stress, genes, and depression: multiple pathways lead to increased risk and new opportunities for intervention. Sci Signal. 2004;2004: re5. pmid:15039492
View Article
PubMed/NCBI
Google Scholar

[234] View Article

[235] PubMed/NCBI

[236] Google Scholar

[ref69] 69. Heim C, Binder EB. Current research trends in early life stress and depression: review of human studies on sensitive periods, gene-environment interactions, and epigenetics. Exp Neurol. 2012;233: 102–111. pmid:22101006
View Article
PubMed/NCBI
Google Scholar

[238] View Article

[239] PubMed/NCBI

[240] Google Scholar

[ref70] 70. Kessler RC, Birnbaum H, Bromet E, Hwang I, Sampson N, Shahly V. Age differences in major depression: Results from the national comorbidity survey replication (NCS-R). Psychol Med. 2010;40: 225–237. pmid:19531277
View Article
PubMed/NCBI
Google Scholar

[242] View Article

[243] PubMed/NCBI

[244] Google Scholar

[ref71] 71. Salat DH, Buckner RL, Snyder AZ, Greve DN, Desikan RSR, Busa E, et al. Thinning of the cerebral cortex in aging. Cereb Cortex. 2004;14: 721–730. pmid:15054051
View Article
PubMed/NCBI
Google Scholar

[246] View Article

[247] PubMed/NCBI

[248] Google Scholar

[ref72] 72. Piccinelli M, Wilkinson G. Gender differences in depression: Critical review. Br J Psychiatry. 2000;177: 486–492. pmid:11102321
View Article
PubMed/NCBI
Google Scholar

[250] View Article

[251] PubMed/NCBI

[252] Google Scholar

[ref73] 73. Sowell ER, Peterson BS, Kan E, Woods RP, Yoshii J, Bansal R, et al. Sex differences in cortical thickness mapped in 176 healthy individuals between 7 and 87 years of age. Cereb Cortex. 2007;17: 1550–1560. pmid:16945978
View Article
PubMed/NCBI
Google Scholar

[254] View Article

[255] PubMed/NCBI

[256] Google Scholar

[ref74] 74. Boden JM, Fergusson DM. Alcohol and depression. Addiction. 2011;106: 906–914. pmid:21382111
View Article
PubMed/NCBI
Google Scholar

[258] View Article

[259] PubMed/NCBI

[260] Google Scholar

[ref75] 75. Durazzo TC, Tosun D, Buckley S, Gazdzinski S, Mon A, Fryer SL, et al. Cortical thickness, surface area, and volume of the brain reward system in alcohol dependence: Relationships to relapse and extended abstinence. Alcohol Clin Exp Res. 2011;35: 1187–1200. pmid:21410483
View Article
PubMed/NCBI
Google Scholar

[262] View Article

[263] PubMed/NCBI

[264] Google Scholar

[ref76] 76. Bertolino A, Fazio L, Di Giorgio A, Blasi G, Romano R, Taurisano P, et al. Genetically determined interaction between the dopamine transporter and the D2 receptor on prefronto-striatal activity and volume in humans. J Neurosci. 2009;29: 1224 LP– 1234. pmid:19176830
View Article
PubMed/NCBI
Google Scholar

[266] View Article

[267] PubMed/NCBI

[268] Google Scholar

[ref77] 77. Sambataro F, Fazio L, Taurisano P, Gelao B, Porcelli A, Mancini M, et al. DRD2 genotype-based variation of default mode network activity and of its relationship with striatal DAT binding. Schizophr Bull. 2013;39: 206–216. pmid:21976709
View Article
PubMed/NCBI
Google Scholar

[270] View Article

[271] PubMed/NCBI

[272] Google Scholar

[ref78] 78. Grieve SM, Korgaonkar MS, Koslow SH, Gordon E, Williams LM. Widespread reductions in gray matter volume in depression. NeuroImage Clin. 2013;3: 332–339. pmid:24273717
View Article
PubMed/NCBI
Google Scholar

[274] View Article

[275] PubMed/NCBI

[276] Google Scholar

[ref79] 79. Takahashi T, Yücel M, Lorenzetti V, Tanino R, Whittle S, Suzuki M, et al. Volumetric MRI study of the insular cortex in individuals with current and past major depression. J Affect Disord. 2010;121: 231–238. pmid:19540599
View Article
PubMed/NCBI
Google Scholar

[278] View Article

[279] PubMed/NCBI

[280] Google Scholar

[ref80] 80. van Tol M-J, Li M, Metzger CD, Hailla N, Horn DI, Li W, et al. Local cortical thinning links to resting-state disconnectivity in major depressive disorder. Psychol Med. 2013/11/01 2014;44: 2053–2065. pmid:24156689
View Article
PubMed/NCBI
Google Scholar

[282] View Article

[283] PubMed/NCBI

[284] Google Scholar

[ref81] 81. Bing X, Ming-Guo Q, Ye Z, Jing-Na Z, Min L, Han C, et al. Alterations in the cortical thickness and the amplitude of low-frequency fluctuation in patients with post-traumatic stress disorder. Brain Res. 2013;1490: 225–232. pmid:23122880
View Article
PubMed/NCBI
Google Scholar

[286] View Article

[287] PubMed/NCBI

[288] Google Scholar

[ref82] 82. Lim L, Radua J, Rubia K. Gray matter abnormalities in childhood maltreatment: A voxel-wise meta-analysis. Am J Psychiatry. 2014;171: 854–863. pmid:24781447
View Article
PubMed/NCBI
Google Scholar

[290] View Article

[291] PubMed/NCBI

[292] Google Scholar

[ref83] 83. Chou KL, Chi I. Stressful life events and depressive symptoms: social support and sense of control as mediators or moderators? Int J Aging Hum Dev. 2001;52: 155–171. pmid:11352200
View Article
PubMed/NCBI
Google Scholar

[294] View Article

[295] PubMed/NCBI

[296] Google Scholar

[ref84] 84. Hammen C. Stress and depression. Annu Rev Clin Psychol. 2004;1: 293–319. pmid:17716090
View Article
PubMed/NCBI
Google Scholar

[298] View Article

[299] PubMed/NCBI

[300] Google Scholar

[ref85] 85. Kessler RC. The effects of stressful life events on depression. Annu Rev Psychol. 1997;48: 191–214. pmid:9046559
View Article
PubMed/NCBI
Google Scholar

[302] View Article

[303] PubMed/NCBI

[304] Google Scholar

[ref86] 86. You S, Conner KR. Stressful life events and depressive symptoms: influences of gender, event severity, and depression history. J Nerv Ment Dis. 2009;197: 829–833. pmid:19996721
View Article
PubMed/NCBI
Google Scholar

[306] View Article

[307] PubMed/NCBI

[308] Google Scholar

[ref87] 87. Frodl T, Koutsouleris N, Bottlender R, Born C, Jäger M, Mörgenthaler M, et al. Reduced gray matter brain volumes are associated with variants of the serotonin transporter gene in major depression. Mol Psychiatry. 2008;13: 1093–1101. pmid:19008895
View Article
PubMed/NCBI
Google Scholar

[310] View Article

[311] PubMed/NCBI

[312] Google Scholar

[ref88] 88. Jaworska N, MacMaster FP, Foster J, Ramasubbu R. The influence of 5-HTTLPR and Val66Met polymorphisms on cortical thickness and volume in limbic and paralimbic regions in depression: a preliminary study. BMC Psychiatry. 2016;16: 61. pmid:26976307
View Article
PubMed/NCBI
Google Scholar

[314] View Article

[315] PubMed/NCBI

[316] Google Scholar

[ref89] 89. Gene Northoff G., brains, and environment-genetic neuroimaging of depression. Curr Opin Neurobiol. 2013;23: 133–142. pmid:22995550
View Article
PubMed/NCBI
Google Scholar

[318] View Article

[319] PubMed/NCBI

[320] Google Scholar

[ref90] 90. Snijders TAB, Bosker RJ (Roel J. Multilevel analysis: An introduction to basic and advanced multilevel modeling. London, England: Sage; 1999.

[ref91] 91. Zhou L, Takane Y, Hwang H. Dynamic GSCANO (Generalized Structured Canonical Correlation Analysis) with applications to the analysis of effective connectivity in functional neuroimaging data. Comput Stat Data Anal. 2016;101: 93–109. https://doi.org/10.1016/j.csda.2016.03.001
View Article
Google Scholar

[323] View Article

[324] Google Scholar

[ref92] 92. Ramsay JO, Silverman BW. Principal components analysis for functional data. In: Ramsay JO, Silverman BW, editors. Functional Data Analysis. New York, NY: Springer; 2005. pp. 147–172. https://doi.org/10.1007/0-387-22751-2_8

[ref93] 93. Tian P, Teng IC, May LD, Kurz R, Lu K, Scadeng M, et al. Cortical depth-specific microvascular dilation underlies laminar differences in blood oxygenation level-dependent functional MRI signal. Proc Natl Acad Sci. 2010;107: 15246–15251. pmid:20696904
View Article
PubMed/NCBI
Google Scholar

[327] View Article

[328] PubMed/NCBI

[329] Google Scholar

[ref94] 94. Luo L, Zhu Y, Xiong M. A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data. J Comput Biol. 2012;19: 731–744. pmid:22651812
View Article
PubMed/NCBI
Google Scholar

[331] View Article

[332] PubMed/NCBI

[333] Google Scholar

[ref95] 95. Suk HW, Hwang H. Functional generalized structured component analysis. Psychometrika. 2016;81: 940–968. pmid:27714543
View Article
PubMed/NCBI
Google Scholar

[335] View Article

[336] PubMed/NCBI

[337] Google Scholar

[ref96] 96. Duncan TE, Duncan SC, Strycker LA. An introduction to latent variable growth curve modeling: Concepts, issues, and applications. 2nd ed. Mahwah, NJ: Erlbaum; 2006.

[ref97] 97. Meredith W, Tisak J. Latent curve analysis. Psychometrika. 1990;55: 107–122.
View Article
Google Scholar

[340] View Article

[341] Google Scholar

[ref98] 98. Hwang H, Desarbo WS, Takane Y. Fuzzy clusterwise generalized structured component analysis. Psychometrika. 2007;72: 181–198.
View Article
Google Scholar

[343] View Article

[344] Google Scholar

[ref99] 99. Ryoo JH, Park S, Kim S, Ryoo HS. Efficiency of cluster validity indexes in fuzzy clusterwise generalized structured component analysis. Symmetry (Basel). 2020;12.
View Article
Google Scholar

[346] View Article

[347] Google Scholar

[ref100] 100. Park S, Kim S, Ryoo JH. Latent class regression utilizing fuzzy clusterwise generalized structured component analysis. Mathematics. 2020;8.
View Article
Google Scholar

[349] View Article

[350] Google Scholar

Figures

Abstract

Introduction

Method

Model specification

Parameter estimation

Example: Gene-brain-depression data

Data overview

Participants.

Measures.

Model specification

Results

Simulation study

Conclusions

Supporting information

S1 Table. The entire path coefficient estimates, their standard errors and 95% confidence intervals (direct effects only) in the empirical study.

S2 Table. Biases, Standard Deviations (SD), and Root Mean Square Errors (RMSE) of loadings and path coefficients estimated from IG-GSCA over different sample sizes in the simulation study.

S1 Appendix. The data generation procedure for the simulation study.

References