Skip to main content
Advertisement
  • Loading metrics

Psychological distress among Japanese high school students during the COVID-19 pandemic: An energy landscape analysis

  • Daiki Tatematsu ,

    Contributed equally to this work with: Daiki Tatematsu, Naotoshi Nakamura, Masato S. Abe, Tetsuo Ishikawa, Takahiro Ezaki, Lin Cai, Shinsuke Koike, Shingo Iwami

    Roles Formal analysis, Investigation, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Interdisciplinary Biology Laboratory (iBLab), Division of Natural Science, Graduate School of Science, Nagoya University, Nagoya, Japan

  • Naotoshi Nakamura ,

    Contributed equally to this work with: Daiki Tatematsu, Naotoshi Nakamura, Masato S. Abe, Tetsuo Ishikawa, Takahiro Ezaki, Lin Cai, Shinsuke Koike, Shingo Iwami

    Roles Conceptualization, Investigation, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliations Interdisciplinary Biology Laboratory (iBLab), Division of Natural Science, Graduate School of Science, Nagoya University, Nagoya, Japan, Department of Data Science, Yokohama City University, Kanagawa, Japan

  • Masato S. Abe ,

    Contributed equally to this work with: Daiki Tatematsu, Naotoshi Nakamura, Masato S. Abe, Tetsuo Ishikawa, Takahiro Ezaki, Lin Cai, Shinsuke Koike, Shingo Iwami

    Roles Methodology, Resources, Writing – review & editing

    Affiliations Faculty of Culture and Information Science, Doshisha University, Kyoto, Japan,, Center for Advanced Intelligence Project, RIKEN, Chuo-ku, Tokyo, Japan, CBS-TOYOTA Collaboration Center, RIKEN, Wako, Saitama, Japan

  • Tetsuo Ishikawa ,

    Contributed equally to this work with: Daiki Tatematsu, Naotoshi Nakamura, Masato S. Abe, Tetsuo Ishikawa, Takahiro Ezaki, Lin Cai, Shinsuke Koike, Shingo Iwami

    Roles Methodology, Resources, Writing – review & editing

    Affiliations Division of Applied Mathematical Science, Center for Interdisciplinary Theoretical and Mathematical Sciences (iTHEMS), RIKEN, Yokohama, Japan, Predictive Medicine Special Project (PMSP), RIKEN Center for Integrative Medical Sciences (IMS), RIKEN, Yokohama, Japan, Department of Artificial Intelligence Medicine, Graduate School of Medicine, Chiba University, Chiba, Japan, Department of Extended Intelligence for Medicine, The Ishii-Ishibashi Laboratory, Keio University School of Medicine, Tokyo, Japan, Collective Intelligence Research Laboratory, Graduate School of Arts and Sciences, The University of Tokyo, Tokyo, Japan

  • Takahiro Ezaki ,

    Contributed equally to this work with: Daiki Tatematsu, Naotoshi Nakamura, Masato S. Abe, Tetsuo Ishikawa, Takahiro Ezaki, Lin Cai, Shinsuke Koike, Shingo Iwami

    Roles Methodology, Resources, Software, Writing – review & editing

    Affiliation Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, Japan

  • Lin Cai ,

    Contributed equally to this work with: Daiki Tatematsu, Naotoshi Nakamura, Masato S. Abe, Tetsuo Ishikawa, Takahiro Ezaki, Lin Cai, Shinsuke Koike, Shingo Iwami

    Roles Data curation, Investigation, Methodology, Resources, Writing – review & editing

    Affiliations Division of Information Science, Nara Institute of Science and Technology, Nara, Japan, Center for Evolutionary Cognitive Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Tokyo, Japan

  • Eiryo Kawakami,

    Roles Writing – review & editing, Methodology

    Affiliations Predictive Medicine Special Project (PMSP), RIKEN Center for Integrative Medical Sciences (IMS), RIKEN, Yokohama, Japan, Department of Artificial Intelligence Medicine, Graduate School of Medicine, Chiba University, Chiba, Japan, Institute for Advanced Academic Research (IAAR), Chiba University, Chiba, Japan, Interdisciplinary Theoretical and Mathematical Sciences Program (iTHEMS), RIKEN, Saitama, Japan

  • Kazuyuki Aihara,

    Roles Funding acquisition, Project administration, Writing – review & editing

    Affiliation The International Research Center for Neurointelligence (WPI-IRCN), Institutes for Advanced Study, University of Tokyo, Tokyo, Japan

  • Atsushi Nishida,

    Roles Data curation, Project administration, Resources, Validation, Writing – review & editing

    Affiliation Unit for Mental Health Promotion, Research Center for Social Science and Medicine, Tokyo Metropolitan Institute of Medical Science, Tokyo, Japan

  • Naohiro Okada,

    Roles Data curation, Resources, Validation, Writing – review & editing

    Affiliations The International Research Center for Neurointelligence (WPI-IRCN), Institutes for Advanced Study, University of Tokyo, Tokyo, Japan, Department of Neuropsychiatry, Graduate School of Medicine, University of Tokyo, Tokyo, Japan

  • Naoki Masuda,

    Roles Methodology, Writing – review & editing

    Affiliation Gilbert S. Omenn Department of Computational Medicine and Bioinformatics and Department of Mathematics, University of Michigan, Ann Arbor, Michigan, United States of America

  • Kiyoto Kasai,

    Roles Data curation, Funding acquisition, Methodology, Project administration, Resources, Writing – review & editing

    Affiliations The International Research Center for Neurointelligence (WPI-IRCN), Institutes for Advanced Study, University of Tokyo, Tokyo, Japan, Department of Neuropsychiatry, Graduate School of Medicine, University of Tokyo, Tokyo, Japan

  • Shinsuke Koike ,

    Contributed equally to this work with: Daiki Tatematsu, Naotoshi Nakamura, Masato S. Abe, Tetsuo Ishikawa, Takahiro Ezaki, Lin Cai, Shinsuke Koike, Shingo Iwami

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    c-koike@g.ecc.u-tokyo.ac.jp (SK); iwami.iblab@bio.nagoya-u.ac.jp (SI)

    Affiliations Center for Evolutionary Cognitive Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Tokyo, Japan, The International Research Center for Neurointelligence (WPI-IRCN), Institutes for Advanced Study, University of Tokyo, Tokyo, Japan, Department of Child Neuropsychiatry, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan

  • Shingo Iwami

    Contributed equally to this work with: Daiki Tatematsu, Naotoshi Nakamura, Masato S. Abe, Tetsuo Ishikawa, Takahiro Ezaki, Lin Cai, Shinsuke Koike, Shingo Iwami

    Roles Conceptualization, Funding acquisition, Investigation, Project administration, Supervision, Writing – original draft, Writing – review & editing

    c-koike@g.ecc.u-tokyo.ac.jp (SK); iwami.iblab@bio.nagoya-u.ac.jp (SI)

    Affiliations Interdisciplinary Biology Laboratory (iBLab), Division of Natural Science, Graduate School of Science, Nagoya University, Nagoya, Japan, Interdisciplinary Theoretical and Mathematical Sciences Program (iTHEMS), RIKEN, Saitama, Japan, The International Research Center for Neurointelligence (WPI-IRCN), Institutes for Advanced Study, University of Tokyo, Tokyo, Japan, Institute of Mathematics for Industry, Kyushu University, Fukuoka, Japan, Institute for the Advanced Study of Human Biology (ASHBi), Kyoto University, Kyoto, Japan, NEXT-Ganken Program, Japanese Foundation for Cancer Research (JFCR), Tokyo, Japan, Science Groove Inc., Fukuoka, Japan

Abstract

Background

The stay-at-home orders, lockdowns, and states of emergency of the Coronavirus Infectious Disease emerged in 2019 (COVID-19) pandemic have affected the mental health of school-aged children. Previous reports of psychological distress in adolescents during the pandemic have been mixed, however, with some reports showing increases in psychological distress and others suggesting decreases. To accurately assess the impact of the pandemic, we need to be able to compare psychological assessments longitudinally, both before and during the pandemic. However, current statistical methods have limitations for reconstructing the complex trajectory of psychological states as captured by short-item questionnaires.

Methods and findings

In this study, we analyzed monthly Kessler 6-item Psychological Distress Scale (K6) questionnaire responses collected from 16- to 18-year-old high school students participating in the population-neuroscience Tokyo TEEN Cohort (pn-TTC) in Japan (1,278 responses from 84 participants). Participants included 42 males and 42 females. The pn-TTC is a population-based longitudinal study conducted in Tokyo, Japan that follows children to investigate their developmental and mental health trajectories. In addition to conventional statistical approaches that summarize multiple questionnaire items into a composite score, we applied “energy landscape analysis,” a method derived from statistical physics that models multivariate psychological states as a dynamic system of interactions among K6 questionnaire items, to visualize longitudinal changes in psychological distress before and during the COVID-19 pandemic (July 2019 to September 2021). Here, we define the depressive and healthy states as configurations in which all six K6 items are above or below each participant’s individual mean, respectively. Before the pandemic, the healthy state occurred 11.0 times as frequently as the depressive state. In contrast, during the pandemic, the relative frequency of the healthy state increased to 18.2, 18.5, and 15.0 times that of the depressive state, respectively. The evolving energy landscape revealed an association between the pandemic period and a lower likelihood of being in a depressive state. We also identified two groups of students with different K6 dynamics and energy landscapes. The first group consisted of 61 participants whose total K6 score was relatively low (less than 5) and stable over time, and the second group consisted of 23 participants whose total K6 score was higher (with most being higher than 5) and less stable. The latter group showed a greater change in cortical thickness in the caudal part of the middle frontal gyrus (cMFG) (t-statistic = −2.36, p-value = 0.019, q-value = 0.048) and the temporal pole (TP) (t = 3.08, p = 0.0023, q = 0.012), as measured by magnetic resonance imaging, in the direction of accelerated adolescent brain development. Because all participants lived in Tokyo, generalizability remains limited, and as the association between psychological states and brain development is descriptive, future studies in diverse cohorts are needed to examine causality.

Conclusions

By revealing associations between the COVID-19 pandemic and lower levels of psychological distress and healthier mental health states, our work demonstrates the potential of using dynamical systems theory, such as the energy landscape analysis, to interpret health and disease metrics in psychology and psychiatry. This approach may improve mental health surveillance for the next pandemic.

Author summary

Why was this study done?

  • The mental health of school-aged children has been affected by pandemic-related policies such as the stay-at-home orders, lockdowns, and states of emergency.
  • Longitudinal comparisons of psychological assessments before and during the pandemic are essential to accurately evaluate its impact on adolescent mental health.
  • Conventional statistical methods have limitations in capturing the complex dynamics of psychological states based on questionnaires, highlighting the need for new analytical approaches.
  • Energy landscape analysis, a method derived from statistical physics, has the potential to capture the complex dynamics of psychological states.

What did the researchers do and find?

  • We applied the energy landscape analysis to monthly Kessler 6-item Psychological Distress Scale (K6) questionnaire responses collected from July 2019 to September 2021, spanning both the pre-pandemic and pandemic periods.
  • Analysis revealed that during the COVID-19 pandemic, adolescents were less likely to experience a depressive state and more likely to transition to a healthier psychological state.
  • We identified two groups of students with different K6 dynamics and energy landscapes. The group whose total K6 score was higher and less stable showed greater brain developmental change in cortical thickness in the caudal part of the middle frontal gyrus (cMFG) and temporal pole (TP).

What do these findings mean?

  • The COVID-19 pandemic was associated with a reduction in psychological distress and healthier mental health states among adolescents in this study.
  • Dynamical systems approaches such as the energy landscape analysis may improve mental health monitoring during future pandemics.
  • Main limitations include the fact that the participants in this cohort lived in Tokyo only, and the generalizability of our findings to adolescents living in other local areas and countries needs to be confirmed. Additionally, the association we found between psychological states and brain development remains descriptive, and further experimental or interventional studies are needed to validate causality.

Introduction

The Coronavirus Infectious Disease emerged in 2019 (COVID-19) pandemic, a global emergency unprecedented in decades, has had a profound and widespread impact on people of all ages, including adults, adolescents, and children. Most notably, many countries implemented movement restrictions, such as stay-at-home orders and lockdowns, during the early stages of the pandemic. In Japan, states of emergency were declared that closed schools and other institutions. While these policies were generally effective in reducing the spread of COVID-19, they came at the cost of mental health outcomes [1,2]. To learn from the current pandemic and apply these lessons to the next one, we need to be able to assess the extent of psychological distress due to the pandemic and how mental health changed over time.

Recent studies have discussed the impact of social distancing due to the COVID-19 pandemic and infection on psychological outcomes such as depression and anxiety symptoms, psychological well-being, suicidal ideation and behavior, and loneliness [36]. Children and adolescents were particularly affected, with school closure significantly decreasing physical activity and social interaction [7]. Most reports showed increased psychological distress during the pandemic [8,9], but some conversely showed better mental health outcomes, suggesting relief from routine psychological stress experienced during school and social activities [1012]. These findings suggest that adolescents may show heterogeneous patterns of response to a dramatic environmental change. However, because most of the surveys were conducted after the COVID-19 pandemic had already begun, we do not know how adolescents’ psychological distress varied over time from the onset of the pandemic. Findings from cohort studies that began before the pandemic can be used to illustrate the trajectory of psychological distress. For example, one cohort study that assessed psychological distress in adults before and repeatedly during the pandemic showed four distinct trajectories of distress: continuously low, temporarily elevated, repeatedly elevated, and continuously elevated [13].

Self-report questionnaires are a common method for assessing the psychological status of human participants, and repeated measures can provide insight into the dynamic fluctuations in psychological distress during the COVID-19 pandemic. However, this method has limitations. Increasing the number of survey points may decrease the quality of the responses and result in a higher attrition rate. To address this limitation, shorter self-report questionnaires are needed, but this may decrease sensitivity and specificity. For example, the General Health Questionnaire (GHQ) [14] originally consisted of 60 items, and shorter versions are now available to increase response rates. The Kessler 6-item Psychological Distress Scale (K6) [15], which assesses psychological distress in six items only, is now commonly used; however, it assesses only one factor of psychological status and may have reduced sensitivity and specificity compared with the 28-item version of the GHQ (GHQ-28) [16], which assesses four factors (somatic symptoms, anxiety and insomnia, social dysfunction, and severe depression). These limitations are difficult to address simultaneously using current research methods in psychology; therefore, a new method is needed that can accurately reconstruct the complex trajectory of psychological distress from repeated responses to a short-item questionnaire.

In this study, we focus on high-school-aged adolescents and the impact of school closure [17] in the context of states of emergency, using the population-neuroscience Tokyo TEEN Cohort (pn-TTC) in Japan [18]. We analyze data collected through monthly mental health surveys of high school students conducted for 2 years before and during the pandemic. In addition to conventional statistical analysis, which aggregates multiple questionnaire items into a single score, we provide an in-depth analysis of the combination of each item using a method called “energy landscape analysis.” The energy landscape analysis method allows us to intuitively visualize and interpret multidimensional categorical time-series data [19]. Rather than treating psychological distress as a static score that is simply “high” or “low,” the energy landscape analysis captures the dynamic nature of mental states over time. This perspective focuses on the stability and variability of psychological states, providing an intuitive way to understand how mental health fluctuates and how individuals move between healthier and more distressed conditions. The analysis method explains the data by assigning an “energy” to each combination of variables (in this case, questionnaire items). Here, “energy” is a physical interpretation of probability, based on the Ising model in physics. We hypothesize that the energy landscape analysis can characterize how the pattern and stability of adolescents’ psychological states and mental health shifted before and during the pandemic, particularly in response to social restrictions such as school closure. Therefore, this study aims to examine whether the energy landscape analysis appropriately represents the underlying dynamics of adolescent mental health data and to explore how visualizing the distribution of psychological energy can provide insights into the impact of the pandemic on adolescent health. We also explore whether variations in the mental health survey and energy landscapes are linked to changes in adolescent brain development observed in brain magnetic resonance imaging (MRI) scans taken approximately 2 years apart.

Methods

Population-neuroscience Tokyo TEEN Cohort (pn-TTC)

Study participants.

The pn-TTC [18] is an extension of the TTC study [20] and focuses on adolescent brain development and biomedical measurements. The TTC study is being conducted on 3,171 children randomly recruited from the resident registry in three municipalities of Tokyo, and the 479 pn-TTC participants were recruited from the TTC members (S1 Methods). Wave 1 (age 11 years) of the pn-TTC recruitment began in September 2013 and ended in February 2016. For the following Waves 2–4 (ages 13, 15, and 18, respectively), participant recruitment was conducted biennially. A total of 1,271 scans from 479 participants were performed between October 2013 and March 2023, and 1,211 scans from 471 participants were used after quality controls were implemented during image processing (S1 Fig)

In pn-TTC Wave 3 (conducted from May 2018 to September 2020), all participants were recruited for the monthly web-based questionnaire survey using their smartphones. A total of 109 participants provided written informed consent, and 1,278 responses from 84 participants who provided valid responses were used for the energy landscape analysis.

Monthly web-based questionnaire survey (including K6).

The monthly web-based survey included the Japanese version of the K6 scale [21,22] and background questions. The K6 is a widely used scale of subjective psychological distress over the past 30 days. Each response is scored from 0 (never) to 4 (always). In common use, the total score of 6 items (range, 0–24) is considered. For this study, we used the background questions on the number of school days per week. Each participant was asked to select one of the following options for school days in the past 30 days: not commuting to school, 1 day per week, 2 days per week, 3 days per week, 4 days per week, 5 days per week, 6 days per week, or 7 days per week. In accordance with the School Education Act Enforcement Order in Japan [23], summer vacation was assumed to be in August and winter vacation was assumed to be in January.

Image acquisition and processing

For the four waves of data collection (from September 2013 to June 2023), two 3-T scanners and three acquisition procedures were used (S1 Fig). Only T1-weighted images were acquired for procedures 1 and 2, whereas T1-weighted and T2-weighted images were acquired for procedure 3 (S2 Methods and S1 Table). When only T1-weighted images were available, the legacy style of the Human Connectome Project pipeline was used for image preprocessing. In cases in which both T1-weighted and T2-weighted images were available, the standard style of the Human Connectome Project pipeline was used for image preprocessing [24]. Then, we extracted the 75 features of cortical thickness, cortical surface area, and subcortical volume using the Desikan–Killiany atlas. Traveling subject harmonization was performed to diminish the procedural difference for each brain feature (S2 Methods) [10,25,26]. Quality controls were performed in each preprocessing step in a standardized manner (S2 Methods) [10,25].

The first COVID-19 case, the declarations of a state of emergency, and the definition of the four periods

Wuhan Municipal Health Commission, China, reported a cluster of pneumonia cases in Wuhan, Hubei Province, China, to the World Health Organization (WHO) on December 31, 2019 [27]. The first COVID-19 case in Japan was reported to WHO on January 16, 2020 [28]. The first declaration of a state of emergency in Tokyo, Japan, was from April 7, 2020, to May 25, 2020 [29]. The second was from January 8, 2021, to March 21, 2021; the third was from April 25, 2021, to June 20, 2021; and the fourth was from July 12, 2021, to September 30, 2021.

In this study, the four periods were defined as follows: (1) Period 1: July 2019 to December 2019. This is the period before the COVID-19 pandemic. (2) Period 2: January 2020 to May 2020. This period includes the first COVID-19 case in Japan and the first declaration of a state of emergency. (3) Period 3: June 2020 to December 2020. This is the period during which no state of emergency was declared. (4) Period 4: January 2021 to September 2021. This period includes the second, third, and fourth declarations of a state of emergency.

Preprocessing of time-series K6 scores

First, the acquisition time of the K6 score data was rounded to the nearest day. Second, because the questionnaire asked about participants’ status in the past 30 days, we extended the same data to the previous 30 days, assuming that the same status continued during this period. If the extension resulted in more than one data point on the same day, the data were averaged. Third, using gaussian process regression, the data were further interpolated to monthly data. The kernel function was defined using a combination of the sklearn.gaussian_process.kernels class in the python scikit-learn (https://scikit-learn.org), RBF() + ConstantKernel() + WhiteKernel(). Fourth, the data were binarized to perform the energy landscape analysis. The average was computed for each item per participant over the entire period. Using this average, the time series was binarized and assigned a value of 1 if above average and 0 otherwise. Values were set to 0 for items for which all values were equal. We express an individual response as an N-dimensional vector (hereafter referred to as a “response vector”) whose components consist of binarized responses to N questionnaire items. This preprocessing workflow is illustrated in S2 Fig.

Stratifying participants using time-series K6 scores

Participants were stratified using k-means clustering based on preprocessed time-series K6 scores (Python scikit-learn). We also computed the within-cluster sum of squared errors (SSE) to assess the accuracy of the clustering. To see the difference in depressive symptoms at the times of the MRI scans at Waves 3 and 4 between the clustering groups, we used a logistic regression model (S2).

Energy landscape analysis

The programming code we used to calculate the energy landscape is a Python version of the MATLAB code available from [30] along with a tutorial. The Python code is available on GitHub (https://github.com/ttttmmttddiikk/Energy_Landscape_Analysis) and Zenodo (https://doi.org/10.5281/zenodo.17585043).

Pairwise maximum entropy model.

We fit the pairwise maximum entropy model to the pooled individual response data in the same manner as in the previous studies [31]. Let be the set of all the possible “states” of -dimensional response vectors, where is the number of questionnaire items ( in the case of K6). has elements in total. For all , we assign the probability such that

(1)

The Shannon entropy is given by

(2)

where log is a natural logarithm. We maximize the entropy (2) under equation (1) and the following constraints (3) and (4):

(3)(4)

where , , , . Here, denotes the mean, and } denotes the th component of the th response vector in the pooled response data consisting of T individual responses. Using the method of Lagrange multipliers, we obtain the following equation about the model probability:

(5)

where

(6)

is the energy, and and are the parameters of the model ( is symmetric: ). Here, interactions higher than second order are ignored, hence the name pairwise maximum entropy model. Equations (5) and (6) match those of the Boltzmann distribution and the Ising model, respectively. The maximum-likelihood solution is expressed as

where is the likelihood given by

and is the th response vector in the pooled response data. We maximize the likelihood by a gradient ascent scheme given by

and

where the superscripts new and old represent the values after and before a single updating step, respectively, and is the learning rate. Updating was stopped when

became less than the permissible error (the first term in the numerator is the square of the 2-norm; the second term is the square of the Frobenius norm). We set and . For the monthly data, we set to reduce computation time. Initial values of and were set to 0.

The model variables and parameters , , , and are particularly important throughout the study. They are listed with explanations in S3 Table.

Accuracy indices.

To quantify the accuracy of the pairwise maximum entropy model fitted to the data, we used the following two accuracy indices [19,32,33]: The first accuracy index is defined by

where is the Shannon entropy of the maximum entropy model incorporating correlations up to the th order. The probability distribution represents the independent maximum entropy model in which we ignore any interactions between items by forcing for all . Distribution represents the pairwise maximum entropy model. Distribution is equivalent to the distribution of the given data, . The second accuracy index is defined by

where is the Kullback–Leibler divergence between two distributions and .

Assigning an energy to each state.

We calculated the energies for all states. For two states and , if , this means that , and under the condition of detailed balance, a state transition is more likely to occur from to than from to .

Stable states.

A stable state is a state (-dimensional vector) whose energy is lower than that of all adjacent states (i.e., states that differ by 1 bit) and is a local minimum. For all states, we examined whether each state was a stable state by comparing the energies of all adjacent states.

Basin and basin size.

For all states except stable states, we determined which stable state was reached by repeatedly moving to the state with the lowest energy among adjacent states. Thus, each state belonged to the basin for the reached stable state. We defined the basin size as the number of states belonging to the basin.

Disconnectivity graph.

A disconnectivity graph is a visualization method based on a dendrogram to represent the (dis)connectivity between stable states. In the disconnectivity graph, stable states are plotted as leaves and are located at a height on the y-axis corresponding to their energy values. The branching structure of the disconnectivity graph represents the energy barriers between any pair of stable states. Short branches were removed following the protocol shown in [34].

Probabilities of being in a basin and of transitioning between basins.

Using the basins from the network, we determined which basins the data belonged to. The probability of being in a basin is calculated as the frequency with which the data fall into that basin. The transition probability from basin A to basin B is the conditional probability that if an individual is in basin A in a given month, the same individual will be in basin B the following month. It is calculated as [frequency of individuals who are in basin A in month n and in basin B the following month] divided by [frequency of individuals in basin A in month n].

Numerical simulations.

We performed numerical simulations to generate hypothetical transition sequences constrained by the obtained landscape [35]. We employed Gibbs sampling as follows. First, we set one of the stable states (000000 or 111111) as the initial state. Then, at each time step, a transition is attempted from the current state to one of its adjacent states , selected with probability 1/6. The transition to the selected state occurs with probability .

Statistical analysis of brain image features

The analysis of adolescent brain trajectory and its relationship with the clustering was performed using R software version 4.3.2. To explore the adolescent developmental trajectory of each brain feature, we built a GAMM implemented with the mgcv package (version 1.9-1) [36], including the main effect of sex as a linear term, the main effect of age as a smooth term using a penalized cubic regression spline and a basis function of 4, a tensor interaction term between age and sex, intracranial volume as a linear covariate, and a random intercept per participant (S2 Methods). Because the use of a subset of brain images (266 scans) from the 84 participants included in the energy landscape analyses may cause sampling bias in relation to the whole cohort (1,211 scans from 471 participants), we extracted the deviation from the nonlinear model for each scan and feature [37]. We then tested the difference between the groups from the clustering using GLMMs implemented with the lme4 package (version 1.1-35.1), including the main effect of group (G1 or G2), age × group interaction, child IQ, socioeconomic status, and handedness as covariates, and a random intercept per participant. Because we conducted 75 repeated tests, FDR correction was applied. To see whether the GHQ-28 scores at Wave 3 and/or the difference between waves was also associated with adolescent brain features, we further tested the same GLMMs. For the features with significant associations, we tested whether the group (G1 or G2) and/or the GHQ-28 scores could explain the brain difference and trajectory by performing the GLMMs adding the main effects of both variables and the interactions by age.

Ethics statement

This study was approved by the ethics committee of the Faculty of Medicine, The University of Tokyo (No. 10069), and the research ethics committee of The University of Tokyo (No. 22-396). Written informed consent was obtained from participants in the monthly web-based survey. After written informed consent was obtained, the participants’ parents were informed of their participation and had the opportunity to withdraw consent. For brain MRI scans, written informed consent was obtained from participants’ primary caregivers and participants 15 years of age or older prior to participation in this study.

STROBE guideline and TRIPOD statement

This study is reported as per the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guideline (S1 Checklist) and the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis Or Diagnosis (TRIPOD) statement (S2 Checklist).

Results

Collection of K6 questionnaire responses from adolescents before and during the COVID-19 pandemic

To analyze changes in adolescents’ emotional states during the COVID-19 pandemic, we obtained monthly responses to the K6 questionnaire [15], which measures nonspecific psychological distress, completed by 16- to 18-year-old high school students in Tokyo who participated in the pn-TTC from July 2019 to October 2021, starting 8 months before the first state of emergency declaration (Figs 1A and S1). Participants included 42 males and 42 females. The questionnaire contains 6 questions about the participants’ psychological distress over the past 30 days, and each item is rated on a 5-point scale (Fig 1B). The results showed Pearson correlation coefficients ranging from 0.53 to 0.81 between items (Fig 1C), suggesting that the items of the K6 questionnaire, whose total score indicates the level of psychological distress, were moderately to strongly correlated, as shown in previous validation studies [3840].

thumbnail
Fig 1. Collecting K6 scores from high school students in Japan.

(A) The timeline of K6 responses is described for each participant (n = 84 participants and 1,278 total responses). Black dots indicate the date on which the response was received. (B) Questionnaire items and scales. (C) The correlation matrix between the item-level scores. Pearson’s correlation coefficients are shown in pseudo-colors ranging from orange to purple. Each item is arranged based on hierarchical clustering. (D) The time series of the item-level scores after data cleaning (see Methods, S2 and S3 Figs). All items from K6-1 to K6-6 are plotted with time on the x-axis and score on the y-axis for each participant. The thick line represents the mean. Error bars indicate standard deviation (SD).

https://doi.org/10.1371/journal.pmed.1004884.g001

Conventional statistical analyses of K6 responses have mainly focused on the total score and have not been able to fully exploit the multidimensional structure and dynamic features of the questionnaire. In order to analyze the dynamics of K6 responses in an integrated manner, encompassing six dimensions, we employed a method referred to as an “energy landscape analysis”. In this analysis, each questionnaire item is represented by a spin that takes the value of either 0 or 1 (in statistical physics, a spin usually takes the value of +1 or −1, but the mathematical model is the same except for differences in constants [41]). We assign an “energy” to each spin () and to the interaction of two spins (), and the total energy represents the probability of a specific combination of K6 responses (lower energy means higher probability; see Methods and S1 Note (Glossary) for further details).

Thus, the collected and preprocessed questionnaire responses (Figs 1D, S2, and S4) were binarized according to the procedures described in the Methods (illustrated in S3A Fig), and the binarized K6 scores were used in subsequent analyses. Here, binarization cutoffs were set to the individual mean and were therefore allowed to vary between individuals and between questionnaire items. See S2 Note and S5 Fig for a comparison of different binarization methods.

Energy landscapes for K6 scores during the COVID-19 pandemic

We first pooled the binarized K6 scores of all participants and constructed the energy landscape for the entire period, which we visualized as a “disconnectivity graph” (Methods and S3B Fig). We provide an explanation of a disconnectivity graph in Figs 2A and S1 Note. The accuracy of the model fitting is evaluated in S6A Fig.

thumbnail
Fig 2. Energy landscape analysis of time-series K6 questionnaire responses for all participants.

(A) An example of a disconnectivity graph (left) and the equivalent ball-and-cup diagram (right) are shown with explanations. Both display stable states and the energy barrier between them. Additional details on disconnectivity graphs can be found in S1 Note (Glossary). (B) The disconnectivity graph of the energy landscape for the entire period is shown. Each stable state is represented by a six-bit number consisting of the binarized values of K6-1, K6-2, K6-3, K6-4, K6-5, and K6-6 from left to right. The cutoff value of binarization was set to the individual mean; thus, 0 is below or equal to the individual mean and 1 is above the individual mean. (C) The graph of the basin of attraction obtained from the energy landscape analysis for the entire period. The stable states are plotted as squares. Each state is connected to the direction of attraction by an edge and each basin of attraction is shown in a different color. The basin 000000 is shown in red; the basin 111111 is shown in green. The density of the color corresponds to the magnitude of the energy. The darker the color, the lower the energy, and the lighter the color, the higher the energy. The six-bit number under each state shows the binarized values of K6-1, K6-2, K6-3, K6-4, K6-5, and K6-6 from left to right. (D) The definition of the four periods utilized in the analysis is explained, along with the number of school days per week for each participant. The thick line represents the mean. Error bars indicate standard deviation (SD). SoE refers to a state of emergency. (E) The mean of the total K6 scores for the 4 periods is shown. Error bars indicate standard deviation (SD). (F) The disconnectivity graphs of the energy landscape for the 4 periods are shown. (G, H) The simulated times from 000000 to 111111 or from 111111 to 000000 are compared between the 4 periods. 000000 (binarized responses of 0 to all six items) can be considered “healthy,” whereas 111111 (binarized responses of 1 to all six items) represents the “depressive” state. Box plots show the median (center line), interquartile range (box), and range (whiskers). The upper outliers are hidden to improve visibility. Yellow indicates increase compared with Period 1.

https://doi.org/10.1371/journal.pmed.1004884.g002

The disconnectivity graph was drawn with the stable states shown in Fig 2B. Here, each state is represented by a six-bit number, each bit of which represents K6-1, K6-2, K6-3, K6-4, K6-5, and K6-6 from left to right. The stable states, defined as local minima in the energy landscape (i.e., state(s) with lower energy than all adjacent states), were 000000 (i.e., binarized responses to all 6 items were 0) and 111111 (i.e., binarized responses to all 6 items were 1), corresponding to the “healthy” and the “depressive” states (states of psychological distress). The fact that the two stable states had the same digits across all items is consistent with the relatively strong correlation of the K6 questionnaire items. Indeed, all the estimated coefficients of interaction () were positive (S4 Table), which means that states become more stable when different digits take the same values. Furthermore, to quantify the uncertainty of the estimated parameters, we computed confidence intervals for both and and confirmed that these intervals were notably narrow, indicating stable estimates (S7 Fig). We also confirmed stable parameter estimation based on the convergence of estimation errors and the change in parameter estimates across iterations (S8 Fig). The energy difference between 000000 and 111111 was , meaning that the probability of 000000 was times greater than that of 111111 (i.e., lower energy means higher probability).

To see how all states are related in the energy landscape, we also plotted a forest structure (called the graph of basins of attraction) in which each state is connected to an adjacent state with the lowest energy (Figs 2C and S3B). Here, each state is represented by a six-bit number; an adjacent state of state A differs from A by only one bit. The structure consisted of two disjoint trees (called basins of attraction, hereafter “basins”) with root 000000 or 111111. Each basin consists of all states that lead to its root (stable state) by the steepest descent method, i.e., by moving downhill in the energy landscape. In other words, a basin is a set of states that are expected to be attracted to the same stable state. Therefore, the 000000 basin can be interpreted as the relatively healthier states (attracted to the healthy stable state 000000), and the 111111 basin as the relatively more depressive states (attracted to the depressive stable state 111111). The basin sizes of the 000000 and the 111111 basins were 40 and 24, respectively.

To explore the connection between students’ depressive states and their social context, we examined differences across the four periods outlined by the declarations of states of emergency (Fig 2D), considering that social interactions at high schools primarily occur on school days. The first state of emergency was declared in Period 2 (January to May 2020), whereas the second, third, and fourth declarations took place in Period 4 (January to September 2021). Notably, the number of school days during the states of emergency, especially in the first one, was significantly small. When we compared the mean of the total K6 score across the four periods, there was a significant trend indicating a decrease in the K6 score in Period 2 and then a slight increase in Periods 3 and 4 (Jonckheere–Terpstra trend test, ) (Fig 2E; see also S9 Fig). We speculate that an increase in time spent at home during the pandemic, especially in the first state of emergency, led to a reduction in social interactions. This reduction in social interactions may have helped to alleviate psychological distress for high school students, as previously mentioned in [42]. In addition, as demonstrated by our different questionnaire data presented in S10 Fig, we revealed a correlation between fewer school days per week and lower K6 scores.

However, the total K6 score alone does not reveal which combination of items contributed to the healthy and depressive states, nor how these states changed over time. To see this, we looked at individual K6 items and performed an energy landscape analysis on the pooled data for each of the 4 periods; the disconnectivity graphs were drawn similarly (Fig 2F; the accuracy is evaluated in S6B Fig). The stable states included 000000 and 111111 for all periods. There were also other stable states with higher energy (lower probability), with an energy difference of >3 from 000000 (i.e., more than 20 times lower probability). The energy of 000000 is 0 from the definition of the model. The energy of 111111 was lower in Period 1 (2.40) than in the other periods (2.90, 2.92, and 2.71), meaning the probability of 111111 was higher before the pandemic and became lower after the onset of the pandemic. Note that, when the K6 scores were analyzed item by item, use of traditional statistical methods such as the Jonckheere–Terpstra trend test, Tukey’s test, or Mood’s median test did not reveal significant differences among the different periods (S5 Table), suggesting that these methods were not sensitive enough to detect item-level differences. To further validate the findings from the energy landscape analysis, we performed a latent class analysis based on total K6 scores, which independently reproduced the same temporal pattern (see S3 Note and S11 Fig).

To examine how participants moved between different states over time, we simulated state transitions constrained by the energy landscape using Gibbs sampling (Methods), based on the simplifying assumption of detailed balance [43]. The logarithm of the average dwell time of each state was a linear function of its energy with a negative slope, consistent with the theoretical prediction (the probability of states follows a Boltzmann distribution in the model) (S12 Fig). Notably, when we started the simulation at 000000 and continued until the state reached 111111, the mean time from 000000 to 111111 became significantly longer in Periods 2–4 than in Period 1 (Fig 2G). In contrast, when we started the simulation at 111111 and continued until the state reached 000000, the mean time from 111111 to 000000 was not significantly different among the four periods (Fig 2H). These results suggest that it became more difficult to move from 000000 to 111111 after the pandemic started.

The above trend in the simulations was confirmed by a statistical analysis of the two basins (000000 and 111111) (S4 Note and S13AB Fig). The probability of staying in the healthy 000000 basin increased in Periods 2–4 compared to Period 1 (S13A Fig), and the monthly transition probability from the depressive 111111 basin to the healthy 000000 basin increased in Periods 2–4 compared to Period 1 (S13B Fig), suggesting that students were more likely to be in the healthy 000000 basin after the pandemic.

We further constructed energy landscapes on the pooled data for each month (from July 2019 to September 2021, S14A Fig), although we note that the accuracy of landscape construction may have been lower due to a small sample size (see also S15 Fig for confidence intervals of energy estimates). Nevertheless, the energy difference between 000000 (healthy state) and 111111 (depressive state) was negatively correlated with the mean number of school days per week (r = −0.35, p = 0.071) (S14BC Fig), again suggesting that students were less depressed when school days per week were fewer.

Energy landscapes for diverse psychological distress to the COVID-19 pandemic

The patterns of psychological responses to crises, including to the COVID-19 pandemic, vary greatly among individuals [44]. To gain deeper insights into such diverse responses, we next stratified participants using k-means clustering, with a focus on the time series of total K6 scores (Figs 3A and S16). We identified two distinct groups: G1 consisted of 61 participants whose total K6 score was relatively low (less than 5) and stable over time, and G2 consisted of 23 participants whose total K6 score was higher (with most being higher than 5) and less stable (Figs 3B and S17) (Welch t test with Benjamini–Hochberg correction; , , , for Periods 1–4, respectively). Some participants in G2 maintained a total K6 score above 5 across all periods, while others had a total K6 score below 5 in at least one period (S9 Fig). The number of participants in the four periods, categorized by the clinically relevant cutoffs (5, 8, 13) of the total K6 score (the cutoff values were chosen based on [22]), is shown in S6 Table. G1 included 36 males and 25 females, while G2 included 6 males and 17 females. Similar to the findings for the K6, G2 showed higher scores on the 28-item version of the GHQ (GHQ-28) [16] (S1 Methods and S7 Table), which assesses more subjective affective symptoms than does the K6. G2 also showed a greater increase than G1 in the GHQ-28 score between the two brain MRI scans. No significant differences were observed between G1 and G2 in IQ or parental socioeconomic status (S7 Table).

thumbnail
Fig 3. Energy landscape analysis in the stratified groups.

(A) Clustering of time-series total K6 scores. Total K6 scores are plotted with time for each participant. The thick line represents the mean. Error bars indicate standard deviation (SD). The colors correspond to the stratified groups G1 and G2. G1 (plotted in pink) tended to have lower and more stable total K6 scores, whereas G2 (plotted in blue) tended to have higher and more variable scores. (B) The mean of the total K6 score for G1 and G2 is shown over the 4 periods. Periods 1–4, defined in Fig 2D, are as follows: Period 1 (July–December 2019), before the COVID-19 pandemic; Period 2 (January–May 2020), including the first COVID-19 case in Japan and the first state of emergency; Period 3 (June–December 2020), when no state of emergency was declared; and Period 4 (January–September 2021), including the second to fourth state of emergency declarations. Error bars indicate standard deviation (SD). (C, D) The graphs of the basin of attraction are drawn separately for G1 and G2 participants. (E, F) The disconnectivity graphs of the energy landscape for G1 and G2 are shown for the entire period. (G, H) The disconnectivity graphs of the energy landscape for G1 and G2 are shown for the 4 periods. (I–L) The simulated times from 000000 to 111111 or from 111111 to 000000 for G1 and G2 are compared between the 4 periods, respectively. 000000 (binarized responses of 0 to all six items) can be considered “healthy,” whereas 111111 (binarized responses of 1 to all six items) represents the “depressive” state. Box plots show the median (center line), interquartile range (box), and range (whiskers). The upper outliers are hidden to improve visibility. Yellow and blue indicate increase and decrease, respectively, compared with Period 1.

https://doi.org/10.1371/journal.pmed.1004884.g003

We constructed separate energy landscapes for G1 and G2 and compared the basins derived from the energy landscapes (Fig 3C and 3D). The stable states for both groups were 000000 (i.e., healthy state) and 111110/111111 (i.e., depressive(-like) state). The sizes for the 000000 and 111110/111111 basins were 47 and 17 for G1, and 33 and 31 for G2, respectively.

We compared the disconnectivity graphs of G1 and G2 drawn from the energy landscape (Fig 3E and 3F: the accuracy indices are shown in S6C–S6E). The energy of 111111 or 111110 (depressive(-like) state) was higher in G1 (G1: 4.10, G2: 0.54), consistent with G1 participants being less likely to be in the depressive state. When energy landscapes were constructed for the 4 periods, the stable states included 000000 and 010101/010110/011110/111110 (depressive-like states) for G1 participants and 000000, 010101, and 111111 for G2 participants (Figs 3G, 3H, and S6D–S6F). Note that G1 participants had a very small probability of being in state 111111 (fully depressive), and thus 111111 was not a stable state; either K6-1 or K6-6 was 0 in the stable state (i.e., their emotional state lacked nervousness or feelings of worthlessness). The energy of 000000 is 0 as defined in the model. The energies of the depressive-like states in G1 peaked in Period 2 and then decreased (3.57, 4.57, 3.60, and 3.76), but in G2 were at about the same level as 000000 (−0.39, 0.70, 0.50, and 0.72), indicating that G1 participants were least likely to exhibit depressive symptoms in Period 2, whereas G2 participants were equally likely to be healthy or depressed over the 4 periods.

To examine how G1 and G2 participants moved between different states over time, we performed simulations of state transitions constrained by the energy landscape of G1 or G2. Two simulations showed an interesting difference: In the G1 landscape, the mean time from 000000 to 111111 increased more than 2-fold in Periods 2–4 compared with Period 1 (Fig 3I), but the mean time from 111111 to 000000 did not differ significantly across the four periods (Fig 3J). In contrast, in the G2 landscape, the mean time from 000000 to 111111 did not significantly differ across the four periods (Fig 3K), whereas the mean time from 111111 to 000000 decreased more than 2-fold in Periods 2–4 compared with Period 1 (Fig 3L). These results suggest that the pandemic states of emergency prevented the G1 participants from becoming depressed, and at the same time triggered the G2 participants to move into a healthy state.

Again, this trend in the simulations was also confirmed by a statistical analysis of the two basins (the healthy 000000 basin and the depressive 111110(G1)/111111(G2) basin) (S13C-F Fig). The probability of staying in the healthy basin increased in Periods 2–4 compared to Period 1 in both G1 and G2 (S13C-D Fig), and the monthly transition probability from the healthy basin to the depressive basin decreased in G1 (S13E Fig), whereas the monthly transition probability from the depressive basin to the healthy basin increased in G2 (S13F Fig).

Biological characterization of diverse psychological responses to the COVID-19 pandemic

We further tested whether brain structural features and adolescent developmental trajectories differed between the G1 and G2 groups. In the pn-TTC, a total of 479 participants underwent brain MRI scans, of which a subset (84 participants) participated in a monthly web-based questionnaire survey (Figs 4A and S1 and S2 Methods). After modeling nonlinear developmental trajectories of each brain feature using GAMMs with participant-level random effects and extracting individual deviations from these models, GLMM analyses revealed a decreasing trend in the cortical thickness of the caudal part of the middle frontal gyrus (cMFG) during adolescence, and the degree of decrease was greater in G2 than in G1 after false discovery rate (FDR) correction (; Fig 4C). On the other hand, the increase in GHQ-28 scores between Waves 3 and 4 was associated with an increase in cortical thickness in the temporal pole (TP) and entorhinal cortex (; (FDR correction); Fig 4H) and a decrease in surface area in the same regions (; (FDR correction)). There was no significant association in the GHQ-28 score at Wave 3 (Fig 4E and 4F). For the three psychological features (energy-landscape-related G1/G2 clustering based on the K6 items, GHQ-28 scores at Wave 3, and difference in the GHQ-28 scores between Waves 3 and 4), there was no significant model for the main effect for any brain features.

thumbnail
Fig 4. Developmental trajectories of brain features in stratified groups.

(A) Summary of data collection schedule (see also Methods, Figs 2D and S1 and S1 Methods). IQ, intelligence quotient; SES, socioeconomic status; SoE, state of emergency. (B) cMFG and TP in the brain cortex are visualized along with t-values in the GLMM analysis (S8 and S9 Tables). cMFG, caudal part of the middle frontal gyrus; TP, temporal pole. (C–H) Developmental trajectories of brain features for each group are plotted with age at MRI (y) on the x-axis and cortical thickness on the y-axis for each participant. The thick line represents the estimated value by general linear mixed models (GLMMs). (C) A greater decrease in the cortical thickness of cMFG is observed in G2 than in G1. G1 and G2 represent the clustering groups identified in Fig 3; G1 showed lower and more stable total K6 scores, whereas G2 showed higher and more variable scores. (D) A greater increase in the cortical thickness of TP is observed in G2 than in G1. (E, F) Cortical thickness of cMFG and TP in 479 participants, colored by GHQ-28 score (low/high) at Wave 3. (G, H) Cortical thickness of cMFG and TP in 479 participants, colored by the difference in GHQ-28 score (decrease/increase) between Waves 3 and 4. The illustration of the human brain in Fig 4 was drawn using BrainPainter (https://github.com/razvanmarinescu/brain-coloring).

https://doi.org/10.1371/journal.pmed.1004884.g004

We next tested whether the G1 or G2 group or the difference in GHQ-28 scores was more significantly associated with change in five brain features (cortical thickness of cMFG, cortical thickness and surface area of entorhinal cortex, and cortical thickness and surface area of TP) by simultaneously adding the psychological features to the GLMMs. The models showed significant age × group interactions, but not age × GHQ-28 score difference interactions, on the cortical thickness of cMFG (Figs 4B and S18 and S8 Table and S5 Note; (FDR correction); ) and TP (Figs 4B and S18 and S9 Table and S5 Note; (FDR correction); ), indicating the direction of accelerated brain maturation in G2 compared with G1.

Taken together, the energy-landscape-related group was associated with the change in the cortical thickness in the cMFG and TP, with G2 (the group with higher and unstable K6 scores) showing a greater change. This suggests that morphological changes in these brain regions may be associated with increased sensitivity to psychological fluctuations.

Discussion

In this study, we examined monthly questionnaire responses by adolescents in Japan about their psychological distress and analyzed the effect of states of emergency during the COVID-19 pandemic. Specifically, our longitudinal study included periods both before and during the pandemic (including the declarations of states of emergency), allowing us to directly assess the effects of the states of emergency. By dividing the study dataset into four time periods and applying energy landscape analysis to each, we examined the temporal aspect of psychological distress. Comparison of disconnectivity graphs before and after the pandemic revealed an association between the pandemic period and a reduced likelihood of psychological distress. Numerical simulations of the energy landscape also suggested that the pandemic states of emergency were associated with a reduced probability of transitions from healthy states to depressive states.

Our findings are consistent with reports from the United Kingdom [45], China [11,46], and South Korea [47,48], which indicate an improvement in the mental health of adolescents in response to the COVID-19 public health emergency. However, our results differ from reports from the United States [49] and Europe [50,51], which suggested a general decline in adolescents’ mental health during the pandemic. These seemingly contradictory findings point to two different aspects of high school. In one aspect, schools are a fun place that provide face-to-face interaction with peers; the pandemic deprived students of such opportunities. In another aspect, schools are a nuisance, forcing students to participate in mandatory activities and study in preparation for college entrance exams (especially in East Asia, including Japan); the pandemic and school closures freed students from this, if only temporarily. These results also highlight the variability in the resilience of adolescents in the face of the pandemic [52]. Alternatively, external factors, such as family socioeconomic status, parental stress, and access to the internet including social media, may have played a role [53,54]. In fact, our participants tended to come from relatively affluent families who were cooperative with our repeated surveys; therefore, it is possible that the adolescents were content to stay home rather than go to school.

Our analysis successfully identified two groups of students with different psychological characteristics, with G1 having lower and more stable total K6 scores and G2 having higher and more volatile total K6 scores. In particular, both numerical simulations and statistical analyses of basins suggested that during the pandemic, G1 participants were less likely to switch from healthy to depressive states, whereas G2 participants were more likely to switch from depressive to healthy states. Notably, brain MRI analysis showed that G2 participants had greater changes in the cortical thickness of the cMFG and TP than did G1 participants, and the effect was greater than the difference in depressive symptoms assessed on the dates of the MRI scans. The changes were in the direction of accelerated adolescent brain development, suggesting that morphological changes in these regions, which are responsible for higher-order cognitive functions, may be associated with increasing sensitivity to psychological fluctuations.

Our observation that the state of emergency shifted G2 participants from a depressive state to a healthy state suggests that these participants may have returned to the depressive state when the state of emergency was lifted. In fact, in our analysis, the energy of 111,111 was slightly lower in Period 4 than in Periods 2 and 3 (Fig 2F), suggesting that the probability of being in the depressive state increased again in Period 4 (see also S13A Fig). This is reminiscent of reports from Japan [55,56] that the later phase of the pandemic, when schools reopened, saw an increase in the incidence of suicide.

What can we learn from these findings to prepare for the next pandemic? Our results revealed an unexpected degree of variability not only in the frequency of psychological states, but also in their transition dynamics. In this sense, our work can be placed in the lineage of recent studies that view health and disease in the context of dynamical systems [5760]. Dynamical systems theory may help not only to understand disease states, but also to intervene and prevent them by detecting early warning signals [61,62]. Disasters such as pandemics are particularly suited to this type of analysis because the external cause is clearly defined and, although variability exists, individuals tend to have similar responses. For example, examining inter-item associations of K6 identified by the energy landscape analysis, beyond the total K6 score, could provide deeper insights into how specific symptoms, such as concentration difficulties, fatigue, or feelings of hopelessness, interact and evolve over time. These item-level patterns may highlight distinct psychological profiles that are overlooked by aggregate scoring, helping educators and school health professionals to identify at-risk students and tailor preventive support more effectively during future public health crises. Further characterization of individual variability will require more data, but it is clear that we need to be vigilant about the possible adverse effects of the next pandemic on adolescents and provide psychological support to those who need it. Furthermore, in addition to characterizing psychological dynamics, the energy landscape analysis could be integrated into economic and predictive modeling frameworks to quantify the potential value of early mental health interventions. For instance, by linking energy landscape-derived stability metrics with longitudinal outcomes such as absenteeism, academic performance, or clinical referrals, future studies could estimate the cost-effectiveness of targeted school-based prevention under different resource or pandemic scenarios. This integration may ultimately inform policy decisions on allocating resources for youth mental health programs.

We acknowledge some limitations of this study. First, the participants in this cohort lived in Tokyo only, and the generalizability of these findings to adolescents living in other local areas and countries needs to be confirmed. Future studies that include energy landscape analysis for participants of different demographic backgrounds, such as other age groups, would provide a useful perspective for earlier prediction of depressive symptoms. Second, recruitment of participants to the monthly questionnaire survey took place on the date of MRI imaging, and the number of participants gradually increased from before the pandemic to the state of emergency. Although this prevented exact individual comparisons between different periods, our method enabled analysis by extracting dynamics from the pooled population data. Careful design in future cohorts will allow for more precise assessments through individual comparisons. In addition, a larger sample size and longer time period may provide a fluctuation pattern independent of specific events. Smartphone apps and wearable devices could be further utilized to incorporate Ecological Momentary Assessment to assess participants’ psychological states in detail [63]. Third, the current energy landscape analysis required the data to be binarized, in part due to the combinatorial explosion of the number of possible states. In our analysis, this simplified and emphasized the contrast between healthy and depressive states, but it is possible that removing the binarization may reveal other ”sych’logical states. Furthermore, while the current energy landscape analysis relies on the Markovian assumption, future methodological extensions could relax this constraint to better capture temporal dependencies inherent in longitudinal psychological processes. For example, adopting a kinetic Ising-based formulation or incorporating lagged variables may enable the model to represent non-Markovian dynamics and memory effects more realistically. Fourth, the present study was not designed to perform predictive modeling or cross-validation analyses. Accordingly, the robustness of the identified energy landscape structures in terms of out-of-sample prediction was not formally evaluated and should be examined in future studies, ideally using independent cohorts. Finally, the association we found between psychological states and brain development remains descriptive, and further experimental or interventional studies are needed to validate causality.

Energy landscape analysis has been successfully applied to chemical physics [64], neuroscience [19,6567], and ecology [35,68]. Energy-based explanations of multistable behavior help to extract temporal information, even from short-item questionnaires such as the K6, and are a useful complement to conventional statistical analyses. This work further extends the repertoire of energy landscape analysis to the fields of psychology and psychiatry, where multidimensional questionnaire-based surveys abound and the concept of dynamical systems is effective for explaining disease states [57,62]. Future studies should test the model’s generalizability by applying the energy landscape analysis to other cohorts, thereby evaluating its predictive validity and practical relevance across diverse populations. Such efforts will surely lead to unraveling the evolving landscape of many more psychological and psychiatric situations.

Supporting information

S2 Fig. Preprocessing of each participant’s questionnaire responses.

https://doi.org/10.1371/journal.pmed.1004884.s002

(DOCX)

S3 Fig. Schematic illustration of energy landscape analysis on the K6 questionnaire.

https://doi.org/10.1371/journal.pmed.1004884.s003

(DOCX)

S5 Fig. Different binarization methods.

https://doi.org/10.1371/journal.pmed.1004884.s005

(DOCX)

S7 Fig. Confidence intervals of parameter estimate.

https://doi.org/10.1371/journal.pmed.1004884.s007

(DOCX)

S8 Fig. Convergence of estimation errors and changes in parameter estimates across iterations.

https://doi.org/10.1371/journal.pmed.1004884.s008

(DOCX)

S9 Fig. Changes in total K6 scores across different periods.

https://doi.org/10.1371/journal.pmed.1004884.s009

(DOCX)

S10 Fig. Relationship between school days per week and K6 questionnaire responses.

https://doi.org/10.1371/journal.pmed.1004884.s010

(DOCX)

S11 Fig. Latent class analysis of time-series K6 questionnaire responses for all participants.

https://doi.org/10.1371/journal.pmed.1004884.s011

(DOCX)

S12 Fig. The average dwell time of each state in the simulation.

https://doi.org/10.1371/journal.pmed.1004884.s012

(DOCX)

S13 Fig. Probabilities of staying in the basin and of transitioning between basins.

https://doi.org/10.1371/journal.pmed.1004884.s013

(DOCX)

S14 Fig. Energy landscape analysis of time-series K6 questionnaire responses by month.

https://doi.org/10.1371/journal.pmed.1004884.s014

(DOCX)

S15 Fig. Confidence intervals of energy estimates.

https://doi.org/10.1371/journal.pmed.1004884.s015

(DOCX)

S16 Fig. Clustering of participants by dynamic time warping.

https://doi.org/10.1371/journal.pmed.1004884.s016

(DOCX)

S18 Fig. Estimated power at a range of sample sizes.

https://doi.org/10.1371/journal.pmed.1004884.s018

(DOCX)

S1 Note. Glossary of energy landscape analysis terms.

https://doi.org/10.1371/journal.pmed.1004884.s019

(DOCX)

S2 Note. Comparison of different binarization and smoothing methods.

https://doi.org/10.1371/journal.pmed.1004884.s020

(DOCX)

S3 Note. Validation of energy landscape analysis using latent class analysis.

https://doi.org/10.1371/journal.pmed.1004884.s021

(DOCX)

S4 Note. Further statistical analysis on probabilities of staying in the basin and of transitioning between basins.

https://doi.org/10.1371/journal.pmed.1004884.s022

(DOCX)

S1 Methods. Data acquisition of population-neuroscience Tokyo TEEN Cohort.

https://doi.org/10.1371/journal.pmed.1004884.s024

(DOCX)

S2 Methods. Acquisition and preprocessing methods for the brain imaging procedures.

https://doi.org/10.1371/journal.pmed.1004884.s025

(DOCX)

S1 Table. Image acquisition parameters and preprocessing methods.

https://doi.org/10.1371/journal.pmed.1004884.s026

(DOCX)

S2 Table. Difference in depressive symptom scores between energy-landscape-related G1 and G2 groups.

https://doi.org/10.1371/journal.pmed.1004884.s027

(DOCX)

S3 Table. List of the model variables and parameters.

https://doi.org/10.1371/journal.pmed.1004884.s028

(DOCX)

S4 Table. Estimated coefficients of the Ising model for the entire period.

https://doi.org/10.1371/journal.pmed.1004884.s029

(DOCX)

S6 Table. Number of participants, categorized by the mean total K6 scores, in each Period.

https://doi.org/10.1371/journal.pmed.1004884.s031

(DOCX)

S7 Table. Demographic characteristics in energy-landscape-related G1 and G2 groups.

https://doi.org/10.1371/journal.pmed.1004884.s032

(DOCX)

S8 Table. Relationship of three psychological features with the cortical thickness in the caudal part of the middle frontal gyrus (cMFG).

https://doi.org/10.1371/journal.pmed.1004884.s033

(DOCX)

S9 Table. Relationship of three psychological features with the cortical thickness in the temporal pole (TP).

https://doi.org/10.1371/journal.pmed.1004884.s034

(DOCX)

S1 Checklist. STROBE Checklist.

STrengthening the Reporting of OBservational studies in Epidemiology (STROBE) Statement—checklist of items that should be included in reports of observational studies, available at https://www.strobe-statement.org/, licenced under CC BY 4.0.

https://doi.org/10.1371/journal.pmed.1004884.s035

(DOCX)

References

  1. 1. The Lancet Psychiatry. COVID-19 and mental health. Lancet Psychiatry. 2021;8(2):87. pmid:33485416
  2. 2. Pfefferbaum B, North CS. Mental health and the Covid-19 pandemic. N Engl J Med. 2020;383(6):510–2. pmid:32283003
  3. 3. Robinson E, Sutin AR, Daly M, Jones A. A systematic review and meta-analysis of longitudinal cohort studies comparing mental health before versus during the COVID-19 pandemic in 2020. J Affect Disord. 2022;296:567–76. pmid:34600966
  4. 4. Prati G, Mancini AD. The psychological impact of COVID-19 pandemic lockdowns: a review and meta-analysis of longitudinal studies and natural experiments. Psychol Med. 2021;51(2):201–11. pmid:33436130
  5. 5. Patel K, Robertson E, Kwong ASF, Griffith GJ, Willan K, Green MJ, et al. Psychological distress before and during the COVID-19 pandemic among adults in the United Kingdom based on coordinated analyses of 11 longitudinal studies. JAMA Netw Open. 2022;5(4):e227629. pmid:35452109
  6. 6. Ernst M, Niederer D, Werner AM, Czaja SJ, Mikton C, Ong AD, et al. Loneliness before and during the COVID-19 pandemic: a systematic review with meta-analysis. Am Psychol. 2022;77(5):660–77. pmid:35533109
  7. 7. de Figueiredo CS, Sandre PC, Portugal LCL, Mázala-de-Oliveira T, da Silva Chagas L, Raony Í, et al. COVID-19 pandemic impact on children and adolescents’ mental health: biological, environmental, and social factors. Prog Neuropsychopharmacol Biol Psychiatry. 2021;106:110171. pmid:33186638
  8. 8. Theberath M, Bauer D, Chen W, Salinas M, Mohabbat AB, Yang J, et al. Effects of COVID-19 pandemic on mental health of children and adolescents: a systematic review of survey studies. SAGE Open Med. 2022;10:20503121221086712. pmid:35371484
  9. 9. Samji H, Wu J, Ladak A, Vossen C, Stewart E, Dove N, et al. Review: mental health impacts of the COVID-19 pandemic on children and youth—a systematic review. Child Adolesc Ment Health. 2022;27(2):173–89. pmid:34455683
  10. 10. Cai L, Maikusa N, Zhu Y, Nishida A, Ando S, Okada N, et al. Hippocampal structures among Japanese adolescents before and after the COVID-19 pandemic. JAMA Netw Open. 2024;7(2):e2355292. pmid:38329755
  11. 11. Qu M, Yang K, Cao Y, Wang X, Tan S, Xiu M, et al. Symptoms of anxiety and depression among adolescents before vs during COVID-19-related school closures in China. JAMA Netw Open. 2022;5(11):e2241752. pmid:36374505
  12. 12. Xiang M, Yamamoto S, Mizoue T. Depressive symptoms in students during school closure due to COVID-19 in Shanghai. Psychiatry Clin Neurosci. 2020;74(12):664–6. pmid:33000515
  13. 13. Ellwardt L, Präg P. Heterogeneous mental health development during the COVID-19 pandemic in the United Kingdom. Sci Rep. 2021;11(1):15958. pmid:34354201
  14. 14. Goldberg DP. The detection of psychiatric illness by questionnaire. Oxford U Press; 1972.
  15. 15. Kessler RC, Barker PR, Colpe LJ, Epstein JF, Gfroerer JC, Hiripi E, et al. Screening for serious mental illness in the general population. Arch Gen Psychiatry. 2003;60(2):184–9. pmid:12578436
  16. 16. Goldberg DP, Hillier VF. A scaled version of the General Health Questionnaire. Psychol Med. 1979;9(1):139–45. pmid:424481
  17. 17. Jack R, Oster E. COVID-19, school closures, and outcomes. J Econ Perspect. 2023;37(4):51–70.
  18. 18. Okada N, Ando S, Sanada M, Hirata-Mogi S, Iijima Y, Sugiyama H, et al. Population-neuroscience study of the Tokyo TEEN Cohort (pn-TTC): cohort longitudinal study to explore the neurobiological substrates of adolescent psychological and behavioral development. Psychiatry Clin Neurosci. 2019;73(5):231–42. pmid:30588712
  19. 19. Ezaki T, Watanabe T, Ohzeki M, Masuda N. Energy landscape analysis of neuroimaging data. Philos Trans A Math Phys Eng Sci. 2017;375(2096):20160287. pmid:28507232
  20. 20. Ando S, Nishida A, Yamasaki S, Koike S, Morimoto Y, Hoshino A, et al. Cohort profile: the Tokyo Teen Cohort study (TTC). Int J Epidemiol. 2019;48(5):1414–1414g. pmid:30879075
  21. 21. Kessler RC, Green JG, Gruber MJ, Sampson NA, Bromet E, Cuitan M, et al. Screening for serious mental illness in the general population with the K6 screening scale: results from the WHO World Mental Health (WMH) survey initiative. Int J Methods Psychiatr Res. 2010;19 Suppl 1(Suppl 1):4–22. pmid:20527002
  22. 22. Furukawa TA, Kawakami N, Saitoh M, Ono Y, Nakane Y, Nakamura Y, et al. The performance of the Japanese version of the K6 and K10 in the World Mental Health Survey Japan. Int J Methods Psychiatr Res. 2008;17(3):152–8. pmid:18763695
  23. 23. Cabinet of Japan. [cited 2025 May 1]. Available from: https://www.japaneselawtranslation.go.jp/ja/laws/view/4574
  24. 24. Glasser MF, Sotiropoulos SN, Wilson JA, Coalson TS, Fischl B, Andersson JL, et al. The minimal preprocessing pipelines for the Human Connectome Project. Neuroimage. 2013;80:105–24. pmid:23668970
  25. 25. Koike S, Tanaka SC, Okada T, Aso T, Yamashita A, Yamashita O, et al. Brain/MINDS beyond human brain MRI project: a protocol for multi-level harmonization across brain disorders throughout the lifespan. Neuroimage Clin. 2021;30:102600. pmid:33741307
  26. 26. Maikusa N, Zhu Y, Uematsu A, Yamashita A, Saotome K, Okada N, et al. Comparison of traveling-subject and ComBat harmonization methods for assessing structural brain characteristics. Hum Brain Mapp. 2021;42(16):5278–87. pmid:34402132
  27. 27. World Health Organization. [cited 2025 May 1]. Available from: https://www.who.int/emergencies/disease-outbreak-news/item/2020-DON229
  28. 28. Ministry of Health, Labour and Welfare. [cited 2025 May 1]. Available from: https://www.mhlw.go.jp/stf/newpage_08906.html
  29. 29. Cabinet Infectious Disease Crisis Management Agency. [cited 2025 May 1]. Available from: https://www.caicm.go.jp/information/citizen/corona/emergency.html
  30. 30. Ezaki T. Energy Landscape Analysis Toolbox (ELAT) for MATLAB [cited 2025 May 1]. Available from: https://github.com/tkEzaki/energy-landscape-analysis
  31. 31. Tang A, Jackson D, Hobbs J, Chen W, Smith JL, Patel H, et al. A maximum entropy model applied to spatial and temporal correlations from cortical networks in vitro. J Neurosci. 2008;28(2):505–18. pmid:18184793
  32. 32. Schneidman E, Berry MJ 2nd, Segev R, Bialek W. Weak pairwise correlations imply strongly correlated network states in a neural population. Nature. 2006;440(7087):1007–12. pmid:16625187
  33. 33. Schneidman E, Still S, Berry MJ 2nd, Bialek W. Network information and connected correlations. Phys Rev Lett. 2003;91(23):238701. pmid:14683220
  34. 34. Khanra P, Nakuci J, Muldoon S, Watanabe T, Masuda N. Reliability of energy landscape analysis of resting-state functional MRI data. Eur J Neurosci. 2024;60(3):4265–90. pmid:38837814
  35. 35. Suzuki K, Nakaoka S, Fukuda S, Masuya H. Energy landscape analysis elucidates the multistability of ecological communities across environmental gradients. Ecol Monogr. 2021;91(3).
  36. 36. Wood SN. Generalized additive models: an introduction with R. 2nd ed. Chapman and Hall/CRC; 2017.
  37. 37. Zhu Y, Maikusa N, Radua J, Sämann PG, Fusar-Poli P, Agartz I, et al. Using brain structural neuroimaging measures to predict psychosis onset for individuals at clinical high-risk. Mol Psychiatry. 2024;29(5):1465–77. pmid:38332374
  38. 38. Sakurai K, Nishi A, Kondo K, Yanagida K, Kawakami N. Screening performance of K6/K10 and other screening instruments for mood and anxiety disorders in Japan. Psychiatry Clin Neurosci. 2011;65(5):434–41. pmid:21851452
  39. 39. Nishi A, Noguchi H, Hashimoto H, Tamiya N. Scale development of health status for secondary data analysis using a nationally representative survey. Environ Health Prev Med. 2012;17(3):252–7. pmid:21918867
  40. 40. Uchida H, Kuroiwa C, Ohki S, Takahashi K, Tsuchiya K, Kikuchi S, et al. Assessing the smallest detectable change of the Kessler psychological distress scale score in an adult population in Japan. Psychol Res Behav Manag. 2023;16:2647–54. pmid:37465046
  41. 41. Haslbeck JMB, Epskamp S, Marsman M, Waldorp LJ. Interpreting the Ising model: the input matters. Multivariate Behav Res. 2021;56(2):303–13. pmid:32162537
  42. 42. Lee SS, Shim Y, Choi J, Choi I. Paradoxical impacts of social relationship on well-being during the COVID-19 pandemic. J Happiness Stud. 2023;24(2):745–67. pmid:36686601
  43. 43. Masuda N, Islam S, Thu Aung S, Watanabe T. Energy landscape analysis based on the Ising model: tutorial review. PLOS Complex Syst. 2025;2(5):e0000039.
  44. 44. Makwana N. Disaster and its impact on mental health: a narrative review. J Family Med Prim Care. 2019;8(10):3090–5. pmid:31742125
  45. 45. Ford T, John A, Gunnell D. Mental health of children and young people during pandemic. BMJ. 2021;372:n614. pmid:33692087
  46. 46. Yang X, Lawrence AJ, Harrison P, Liu Y, Chen L, Wang C, et al. Positive effects of the COVID-19 pandemic on depression and anxiety in Chinese adolescents. Eur Child Adolesc Psychiatry. 2024;33(5):1551–61. pmid:37486386
  47. 47. Lee J, Ko Y-H, Chi S, Lee M-S, Yoon H-K. Impact of the COVID-19 pandemic on Korean adolescents’ mental health and lifestyle factors. J Adolesc Health. 2022;71(3):270–6. pmid:35715347
  48. 48. Kim SY, Kim H-R, Park B, Choi HG. Comparison of stress and suicide-related behaviors among Korean youths before and during the COVID-19 pandemic. JAMA Netw Open. 2021;4(12):e2136137. pmid:34902040
  49. 49. Zima BT, Edgcomb JB, Rodean J, Cochran SD, Harle CA, Pathak J, et al. Use of acute mental health care in U.S. children’s hospitals before and after statewide COVID-19 school closure orders. Psychiatr Serv. 2022;73(11):1202–9. pmid:35611510
  50. 50. Ludwig-Walz H, Dannheim I, Pfadenhauer LM, Fegert JM, Bujard M. Increase of depression among children and adolescents after the onset of the COVID-19 pandemic in Europe: a systematic review and meta-analysis. Child Adolesc Psychiatry Ment Health. 2022;16(1):109. pmid:36587221
  51. 51. Di Fazio N, Morena D, Delogu G, Volonnino G, Manetti F, Padovano M, et al. Mental health consequences of COVID-19 pandemic period in the European population: an institutional challenge. Int J Environ Res Public Health. 2022;19(15):9347. pmid:35954706
  52. 52. Shi W, Zhao L, Liu M, Hong B, Jiang L, Jia P. Resilience and mental health: a longitudinal cohort study of Chinese adolescents before and during COVID-19. Front Psychiatry. 2022;13:948036. pmid:36061276
  53. 53. Felfe C, Saurer J, Schneider P, Vornberger J, Erhart M, Kaman A, et al. The youth mental health crisis: Quasi-experimental evidence on the role of school closures. Sci Adv. 2023;9(33):eadh4030. pmid:37595042
  54. 54. Monnier M, Moulin F, Thierry X, Vandentorren S, Côté S, Barbosa S, et al. Children’s mental and behavioral health, schooling, and socioeconomic characteristics during school closure in France due to COVID-19: the SAPRIS project. Sci Rep. 2021;11(1):22373. pmid:34789783
  55. 55. Tanaka T, Okamoto S. Increase in suicide following an initial decline during the COVID-19 pandemic in Japan. Nat Hum Behav. 2021;5(2):229–38. pmid:33452498
  56. 56. Horita N, Moriguchi S. COVID-19, young people, and suicidal behaviour. Lancet Psychiatry. 2023;10(7):484–5. pmid:37353259
  57. 57. Scheffer M, Bockting CL, Borsboom D, Cools R, Delecroix C, Hartmann JA, et al. A dynamical systems view of psychiatric disorders-theory: a review. JAMA Psychiatry. 2024;81(6):618–23. pmid:38568615
  58. 58. Salvi JD, Rauch SL, Baker JT. Behavior as physiology: how dynamical-systems theory could advance psychiatry. Am J Psychiatry. 2021;178(9):791–2. pmid:34516231
  59. 59. Rolls ET. Attractor cortical neurodynamics, schizophrenia, and depression. Transl Psychiatry. 2021;11(1):215. pmid:33846293
  60. 60. Globus GG, Arpaia JP. Psychiatry and the new dynamics. Biol Psychiatry. 1994;35(5):352–64. pmid:8011803
  61. 61. Scheffer M, Bascompte J, Brock WA, Brovkin V, Carpenter SR, Dakos V, et al. Early-warning signals for critical transitions. Nature. 2009;461(7260):53–9. pmid:19727193
  62. 62. van de Leemput IA, Wichers M, Cramer AOJ, Borsboom D, Tuerlinckx F, Kuppens P, et al. Critical slowing down as early warning for the onset and termination of depression. Proc Natl Acad Sci U S A. 2014;111(1):87–92. pmid:24324144
  63. 63. Stone AA, Schneider S, Smyth JM. Evaluation of pressing issues in ecological momentary assessment. Annu Rev Clin Psychol. 2023;19:107–31. pmid:36475718
  64. 64. Wales DJ. Exploring energy landscapes. Annu Rev Phys Chem. 2018;69:401–25. pmid:29677468
  65. 65. Ezaki T, Fonseca Dos Reis E, Watanabe T, Sakaki M, Masuda N. Closer to critical resting-state neural dynamics in individuals with higher fluid intelligence. Commun Biol. 2020;3(1):52. pmid:32015402
  66. 66. Watanabe T, Masuda N, Megumi F, Kanai R, Rees G. Energy landscape and dynamics of brain activity during human bistable perception. Nat Commun. 2014;5:4765. pmid:25163855
  67. 67. Watanabe T, Hirose S, Wada H, Imai Y, Machida T, Shirouzu I, et al. Energy landscapes of resting-state brain networks. Front Neuroinform. 2014;8:12. pmid:24611044
  68. 68. Fujita H, Ushio M, Suzuki K, Abe MS, Yamamichi M, Iwayama K, et al. Alternative stable states, nonlinear behavior, and predictability of microbiome dynamics. Microbiome. 2023;11(1):63. pmid:36978146