Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Evaluation of the genetic risk for COVID-19 outcomes in COPD and differences among worldwide populations

  • Rui Marçalo ,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review & editing

    ruifilipemarcalo@ua.pt

    Affiliations Department of Medical Sciences, Genome Medicine Laboratory, Institute of Biomedicine—iBiMED, University of Aveiro, Aveiro, Portugal, Lab3R-Respiratory Research and Rehabilitation, School for Health Sciences (ESSUA) and Institute of Biomedicine (iBiMED), University of Aveiro, Aveiro, Portugal

  • Sonya Neto,

    Roles Methodology, Writing – review & editing

    Affiliation Department of Medical Sciences, Genome Medicine Laboratory, Institute of Biomedicine—iBiMED, University of Aveiro, Aveiro, Portugal

  • Miguel Pinheiro,

    Roles Methodology, Writing – review & editing

    Affiliation Department of Medical Sciences, Genome Medicine Laboratory, Institute of Biomedicine—iBiMED, University of Aveiro, Aveiro, Portugal

  • Ana J. Rodrigues,

    Roles Funding acquisition, Writing – review & editing

    Affiliation Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho–Braga, Portugal

  • Nuno Sousa,

    Roles Funding acquisition, Project administration, Resources, Writing – review & editing

    Affiliation Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho–Braga, Portugal

  • Manuel A. S. Santos,

    Roles Funding acquisition, Project administration, Resources, Writing – review & editing

    Affiliation Department of Medical Sciences, Genome Medicine Laboratory, Institute of Biomedicine—iBiMED, University of Aveiro, Aveiro, Portugal

  • Paula Simão,

    Roles Resources, Writing – review & editing

    Affiliation Pulmonology Department, Unidade Local de Saúde de Matosinhos—Porto, Porto, Portugal

  • Carla Valente,

    Roles Resources, Writing – review & editing

    Affiliation Pulmonology Department, Centro Hospitalar do Baixo Vouga–Aveiro, Aveiro, Portugal

  • Lília Andrade,

    Roles Resources, Writing – review & editing

    Affiliation Pulmonology Department, Centro Hospitalar do Baixo Vouga–Aveiro, Aveiro, Portugal

  • Alda Marques,

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Resources, Supervision, Validation, Visualization, Writing – review & editing

    Affiliation Lab3R-Respiratory Research and Rehabilitation, School for Health Sciences (ESSUA) and Institute of Biomedicine (iBiMED), University of Aveiro, Aveiro, Portugal

  • Gabriela R. Moura

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Resources, Supervision, Validation, Visualization, Writing – review & editing

    Affiliation Department of Medical Sciences, Genome Medicine Laboratory, Institute of Biomedicine—iBiMED, University of Aveiro, Aveiro, Portugal

Abstract

Background

Populations seem to respond differently to the global pandemic of severe acute respiratory syndrome coronavirus 2. Recent studies show individual variability in both susceptibility and clinical response to COVID-19 infection. People with chronic obstructive pulmonary disease (COPD) constitute one of COVID-19 risk groups, being already associated with a poor prognosis upon infection. This study aims contributing to unveil the underlying reasons for such prognosis in people with COPD and the variability in the response observed across worldwide populations, by looking at the genetic background as a possible answer to COVID-19 infection response heterogeneity.

Methods

SNPs already associated with susceptibility to COVID-19 infection (rs286914 and rs12329760) and severe COVID-19 with respiratory failure (rs657152 and rs11385942) were assessed and their allelic frequencies used to calculate the probability of having multiple risk alleles. This was performed on a Portuguese case-control COPD cohort, previously clinically characterized and genotyped from saliva samples, and also on worldwide populations (European, Spanish, Italian, African, American and Asian), using publicly available frequencies data. A polygenic risk analysis was also conducted on the Portuguese COPD cohort for the two mentioned phenotypes, and also for hospitalization and survival to COVID-19 infection.

Findings

No differences in genetic risk for COVID-19 susceptibility, hospitalization, severity or survival were found between people with COPD and the control group (all p-values > 0.01), either considering risk alleles individually, allelic combinations or polygenic risk scores. All populations, even those with European ancestry (Portuguese, Spanish and Italian), showed significant differences from the European population in genetic risk for both COVID-19 susceptibility and severity (all p-values < 0.0001).

Conclusion

Our results indicate a low genetic contribution for COVID-19 infection predisposition or worse outcomes observed in people with COPD. Also, our study unveiled a high genetic heterogeneity across major world populations for the same alleles, even within European sub-populations, demonstrating the need to build a higher resolution European genetic map, so that differences in the distribution of relevant alleles can be easily accessed and used to better manage diseases, ultimately, safeguarding populations with higher genetic predisposition to such diseases.

Introduction

There is increasing evidence about both individual and populational variability in the susceptibility and disease behaviour of patients with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection, ranging from asymptomatic to severe respiratory failure and need for prolonged mechanical ventilation [1]. The underlying causes of this spectrum of clinical outcomes remain elusive. Determining the genetic and environmental risk factors capable of originating such discrepancies is important to identify those at most risk, enabling to protect them from infection, reduce hospitalizations and decrease mortality.

Recently, the human genetics community came together and launched the COVID-19 Host Genetics Initiative [2], an unprecedent worldwide collaboration whose mission is to generate, share and identify genetic variants associated with COVID-19 susceptibility, severity and other clinically relevant outcomes. It is hoped that this international effort may help identify groups of people at high or low risk, as well as to generate hypotheses for therapies, and contribute to an advance in the global knowledge of COVID-19 physiopathology.

Two large-patient cohorts from Italy and Spain, epicentres of the COVID-19 first wave of pandemic in Europe, have been analysed in a genome-wide association study (GWAS) that looked for genetic variants associated with the occurrence of severe COVID-19 with respiratory failure [3]. This study identified two genetic variants significantly associated to this phenotype, namely: rs11385942 at locus 3p21.31 and rs657152 at locus 9q34.2. Additionally, three independent studies, aimed at finding if patients’ genetic background could be responsible for the differences in susceptibility to COVID-19 infection [46], identified 3 genetic variants associated with COVID-19 positive testing: rs286914 at locus 11p13, rs12329760 at locus 21q22.3 and rs41303171 at locus Xp22.2.

Chronic obstructive pulmonary disease (COPD) is the third leading cause of mortality worldwide [7]. The current evidence suggests that COVID-19 incidence is not higher in people with COPD, however, chronic inflammation of the small airways, higher rates of hospitalization and admission to intensive care unit (ICU), as well as invasive mechanical ventilation [8], show that the COPD sub-population may be particularly vulnerable to severe COVID-19 [9,10]. In order to determine what is causing people with COPD to respond more severely, both genetical and environmental factors need to be tackled, further developing our knowledge on the pathophysiology of COPD itself and the reasons behind a worse evolution of COVID-19 on people with COPD [8]. Therefore, we aimed to explore 1) the genetic risk of people with COPD for increased COVID-19 incidence and severity and 2) how the same risk alleles are distributed across the European population, its sub-populations (Portuguese, Spanish and Italian); and the other major worldwide populations (African, American and Asian), estimating the global distribution of people with an increased genetic risk for COVID-19 infection and severe COVID-19.

Materials and methods

Recruitment of participants

In this study, two cohorts were used. The Baixo Vouga cohort [i.e. healthy controls and individuals with COPD] was used for the case-control study, whereas the Minho cohort [i.e. community-dwelling individuals] was used for validation.

The Baixo Vouga cohort recruitment was performed by physicians, from hospitals and primary healthcare centres of the Centre Region of Portugal. A set of inclusion criteria was established for people with COPD, i.e., having a diagnosis of COPD according to Global Initiative for Chronic Obstructive Lung Disease GOLD criteria [11]; being in a stable state (i.e., no acute exacerbations in the previous month), and able of giving informed consent. Control group individuals (sex- and age- matched to people with COPD) were recruited from routine appointments in primary health care centres and senior universities. They were considered eligible if they did not have a diagnosed respiratory disease and were able to give informed consent. Exclusion criteria for both groups were set for the presence of severe cardiac, musculoskeletal, or neuromuscular diseases, cognitive impairment or a history of neoplasia or immune disease that would interfere with patients’ collaboration in data collection or interpretation. Therefore, people with the most prevalent age-related conditions, e.g., controlled hypertension, hypercholesterolaemia were included in the study since according to the World Health Organization definition, Health is a state of complete physical, mental and social well-being and not merely the absence of disease or infirmity [12]. Sociodemographic (age, sex), anthropometric (weight and height to compute body mass index), clinical data and saliva samples were collected with a structured protocol. Information on lung function was retrieved from clinical records or collected via spirometry.

The Minho cohort is part of a larger cohort randomly selected from the north of Portugal [13,14]. Primary exclusion criteria included participants diagnosed with dementia, who had a stroke, renal failure or overt thyroid pathology. The cohorts were established according to the ethical principles of the Declaration of Helsinki and experiments were approved by the Portuguese ethical committee (Comissão Nacional de Proteção de Dados) and local ethic review boards (Administração Regional de Saúde do Centro; Centro Hospitalar do Baixo Vouga; Unidade Local de Saúde de Matosinhos; Centro Hospitalar do Médio Ave; Hospital Distrital da Figueira da Foz; Hospital Escola Braga, Braga; Centro Hospitalar do Alto Ave, Guimarães; and Unidade Local de Saúde do Alto Minho, Viana do Castelo/Ponte de Lima). All participants provided voluntary and informed written consent.

Individual genotyping

DNA for genotyping was extracted from saliva samples (Baixo Vouga cohort) or peripheral blood mononuclear cells (Minho cohort) using QIAamp DNA Mini Kit (QIAGEN) and following the kit’s protocol with slight modifications. Samples from the Baixo Vouga cohort were genotyped using the Infinitum Global Screening Array-24 v1.0 while the Minho cohort was genotyped using the Neuro Consortium Array (Infinium Core-24+ v1.2, Illumina), following the Illumina Infinitum HTS Assay protocols. Quality control (QC), of both cohorts, was performed using PLINK 1.9 [15] software following standard protocols [16,17]. Briefly, samples having a call rate lower than 95%; gender discrepancies; divergent ancestry; heterozygosity rate higher than three standard deviations away from the mean rate; and third-degree related individuals were excluded. Additionally, SNPs with missing rate higher than 5%; showing deviations from Hardy-Weinberg equilibrium; with a minor allele frequency lower than 5%; and displaying distinct missing rates when comparing cases with controls were also excluded [16]. Subsequently genotypes were imputed using the Michigan Imputation Server [18] and submitted to the same QC filters [1921]. A total of 498 samples and 6.838.946 SNPs in the Baixo Vouga cohort and 380 samples and 5.389.594 SNPs in the Minho cohort passed QC and were used in this work.

Genetic risk assessment

SNPs significantly associated with susceptibility to COVID-19 infection [46] (rs286914 and rs12329760) and severe COVID-19 with respiratory failure [3] (rs657152 and rs11385942) were selected from the literature (S1 Table). Allelic frequencies were used to calculate the probability of having multiple risk alleles in each population (European, Portuguese, Spanish, Italian, African, American and Asian) (S2 Table). For each SNP, genotypes where coded as 0,1,2 depending on the number of risk alleles, with the genetic risk category representing the total count of alleles for each pair of SNPs (score range 0–4). Data for the Portuguese(n = 623), Spanish (n = 9761) and Italian (6363) populations correspond to observed values extrapolated to 1 million, whereas data for remaining populations correspond to estimations (also to 1 million) based on the published effect allele frequencies (rs286914, rs12329760 and rs11385942 data were obtained from gnomAD-Genome project [22], while rs657152 data was obtained from the ALFA project] [23]. Prior to estimations, Hardy-Weinberg equilibrium was assessed and validated (susceptibility SNPs: p-value = 0.80; severity SNPs: p-value = 0.29).

Polygenic risk scores (PRS) offer a quantifiable metric of an individual’s inherited risk based on the cumulative impact of several common polymorphisms and can be used to test the genetic propensity of an individual to a wide range of diseases. PRS for severe COVID-19 with respiratory failure [3], susceptibility to COVID-19 infection [24] and COVID-19 related hospitalization [24] and survivability [24] were calculated using publicly available summary statistics. All summary statistics used resulted from studies conducted among participants of European ancestry. PRSice software [25] was used for the calculation of individual PRS within our cohorts, following a standard protocol [26,27]. Analyses were adjusted for the same covariables as the original GWAS [2,3] (gender, age, and the first 10 principal components).

Statistical analysis

Statistical analyses were performed using IBM SPSS Statistics, version 25 (IBM Corporation, Armonk, NY, USA) and plots created using GraphPad Prism, version 6 (GraphPad, San Diego, CA, USA). Differences between groups’ effect allele frequencies (both COPD vs control and European vs remaining world populations) were assessed by means of a Chi-Square test. Chi-Square test was also used to assess differences in proportions between populations (European vs every other population) for the cumulative risk of having two SNPs, for either COVID-19 infection susceptibility or severe COVID-19 with respiratory failure. Given the magnitude of the differences observed in the analyses of European vs the remaining populations, an adjusted residues analysis [28] was performed to specify which population and/or risk category (0 to 4) was being the most impactful on the statistical significance.

The normality of polygenic risk scores data was assessed using the Shapiro–Wilk test. Polygenic risk scores of control and COPD groups were compared by means of an unpaired student’s t test (susceptibility, hospitalization and survivability), whereas for severity, a Mann-Whitney test was used. Statistical significance level for every analysis was set at 0.01.

Results

Cohort characterization

The Baixo Vouga cohort consisted of 498 people (243 controls and 255 individuals with COPD). Recruitment was performed so that groups were sex- and age- matched (male: 72.4% vs 79.6%, p-value = 0.06; age: 67 [60, 72] vs 68 [61, 74] years old, p-value = 0.044; control and COPD, respectively). All other measures were significantly different between both groups (body mass index: 27.32 [24.91, 29.75] vs 25.97 [23.44, 29.73] Kg/m2, p-value < 0.01; FEV1: 2.58 [2.11, 3.06] vs 1.32 [0.94, 1.81] litters, p-value < 0.01; FEV1/FVC: 83.90 [78.01, 89.02] vs 53.02 [41.24, 61.94], p-value < 0.01; control and COPD, respectively). Detail characterization of the samples is shown in Table 1.

thumbnail
Table 1. Sociodemographic, anthropometric and clinical characteristics of participants.

https://doi.org/10.1371/journal.pone.0264009.t001

The Minho cohort was composed of 380 community-dwelling individuals, with a median age of 66 [53, 72] years, 48.2% (n = 183) male individuals and a median body mass index of 27.65 [25.1, 30.45] Kg/m2 (all other metrics of Table 1 were not available for this cohort). Details characterization of the Minho cohort is shown in (S3 Table).

A new pseudo-cohort was created, composed of 623 individuals, and from now on designated as Portuguese cohort by merging both cohorts (control individuals from the Baixo Vouga cohort and all individuals from the Minho cohort) since no significant differences between original groups were found (S1 Fig). Comparison for SNP rs11385942 was not possible, as it was not genotyped or imputed in the Minho cohort. People with COPD, from the Baixo Vouga cohort, were not included given the disease-specific nature of the group. The Portuguese cohort median age was 66 [58, 72] years old, with 57.62% (n = 359) male individuals and a median BMI of 27.53 [25.08, 30.22] Kg/m2. Detail characterization is shown in Table 1.

Genetic risk assessment for COVID-19 associated phenotypes in people with COPD

Single loci risk assessment.

People with COPD (n = 255) did not display an increased frequency of risk alleles for COVID-19 infection susceptibility (rs286914 and rs12329760) or severe response to COVID-19 infection SNPs (rs657152 and rs11385942), when compared to the control group (n = 243) (p-value = 0.6; p-value = 0.55; p-value = 0.85; p-value = 0.72; respectively) (Fig 1). These results were validated on an external cohort from the Minho region of Portugal (n = 380), with no major differences emerging from the analyses (i.e. data from Minho was not different from the COPD nor the control group, data not shown).

thumbnail
Fig 1.

Allele frequencies for significant SNPs, for a) susceptibility (rs286914 and rs12329760) and b) severe response (rs657152 and rs11385942) to COVID-19 infection. No significant differences were found between COPD and control groups for any of the tested SNPs; rs286914: p-value = 0.60; rs12329760: p-value = 0.55; rs657152: p-value = 0.85; rs11385942: p-value = 0.72.

https://doi.org/10.1371/journal.pone.0264009.g001

Bi-allelic risk assessment.

The bi-allelic risk in our cohort was then calculated to determine the number of people that could be at a high risk for COVID-19 infection and a severe COVID-19 with respiratory failure, since each phenotype was previously associated with two SNPs. Homogeneity between the COPD and control groups was once again observed, with no differences in the number of people at low (0 risk alleles), moderate-low (1 risk allele), moderate (2 risk alleles), moderate-high (3 risk alleles) or high risk (4 risk alleles) for either the outcomes, (Fig 2). Most people were at low to moderate risk for COVID-19 infection, 244 for the COPD group (96%) and 233 for the control group (96%). Both groups had 10 people (4%) at a moderate-high risk. There was only 1 person, belonging to the COPD group, that had the highest risk for COVID-19 infection, being double-homozygous for the risk alleles (Fig 2A). Regarding the response to COVID-19 infection, 248 (97%) people with COPD and 239 (98%) people from the control group were at a low to moderate risk. There were 7 (3%) and 4 (2%) people, COPD and control groups respectively, with a moderate-high risk, but no one was a double-homozygous for the risk alleles, in either group (Fig 2B).

thumbnail
Fig 2.

Number of people with a cumulative number of risk alleles for a) susceptibility for COVID-19 infection and b) severe COVID-19 with respiratory failure. 0 to 4 represents the sum of effect alleles for each COVID-19 associated phenotype. No significant differences were found, between COPD and control groups, for the distribution of people in the risk groups. a) p-value = 0.71 and b) p-value = 0.80.

https://doi.org/10.1371/journal.pone.0264009.g002

Polygenic risk assessment.

A complementary genetic analysis was conducted by calculating polygenic risk scores (PRSs) for four different phenotypes to gain further insight on the overall genetic risk for the onset of COVID-19 relevant phenotypes in people with COPD. Summary statistics from the COVID-19 Host Genetics Initiative were used to calculate PRS for: i) COVID-19 infection susceptibility, ii) COVID-19-related hospitalization, iii) severe COVID-19 with respiratory failure, and iv) survival after COVID-19 infection. Again, people with COPD did not display significant differences in PRS for any of the outcomes when compared to the control group, (Fig 3A–3D). These results were also validated by comparing both groups with the Minho cohort, with no significative differences (data not shown).

thumbnail
Fig 3.

Polygenic risk score for susceptibility to COVID-19 infection (a), hospitalization due to COVID-19 infection (b), severe COVID-19 with respiratory failure (c) and survivability to COVID-19 infection (d). Results shown as scatter dot plots with median represented. No significant differences were found between the polygenic risk scores of COPD and control groups for any of the phenotypes tested; a) p-value = 0.05; b) p-value = 0.03; c) p-value = 0.38; d) p-value = 0.09.

https://doi.org/10.1371/journal.pone.0264009.g003

Worldwide COVID-19 genetic risk scenario

Single loci risk assessment.

We conducted a similar risk assessment, but focusing on the major human populations, as characterized in the National Center for Biotechnology Information (NCBI) [22,23], to determine how stable the COVID-19 associated genetic variants were in the human population. Overall, we found many significant differences across human populations. The effect allele frequency of SNP rs286914 in the European population (26.81%) was significantly lower than for the Portuguese (32.10%), African (38.01%) and American (35.90%) populations, but significantly higher than the Asian population (13.14%) (all p-values < 0.0001), (Fig 4A). The European effect allele frequency of SNP rs12329760 (27.23%) was significantly higher than in the Portuguese (19.90%) and American (17.2%) populations, but significantly lower than the African (28.84%) and the Asian (41.72%) populations (all p-values < 0.0001) (Fig 4B). There was no information available for SNPs rs286914 and rs12329760 in the Spanish and Italian populations, therefore, precluding the analyses. SNP rs657152 appears to have a decreased frequency in the European population (37.09%) when compared with the Portuguese (42.70%; p-value < 0.0001), Italian (38.43%; p-value = 0.0025), African (45.08%; p-value < 0.0001) and Asian (72%; p-value < 0.0001) populations. On the other hand, it has an increased frequency compared to the American (22.65%; p-value < 0.0001) population. No differences were found between European and Spanish (37.41%; p-value = 0.383) populations, (Fig 4C). For the last SNP of interest, rs11385942, the effect allele frequency on the European population (7.87%) was significantly higher than the Spanish (6%; p-value < 0.0001), African (5.76%; p-value < 0.0001), American (5.8%; p-value = 0.0018) and Asian (0.06%; p-value < 0.0001) populations, but significantly lower than the Italian population (10.33%; p-value < 0.0001). There were no differences between the European and Portuguese (5.56%; p-value = 0.06) populations, (Fig 4D). All data is detailed in S4 Table.

thumbnail
Fig 4.

Effect allele proportions for COVID-19 susceptibility (a and b) and severe response to COVID-19 infection (c and d) SNPs in different world populations. Bars represent the proportion of each allele in the respective population. Allele frequencies for rs286914, rs12329760 and rs11385942 were obtained from gnomAD-Genome project, while rs657152 allele frequencies were obtained from the ALFA project. *: p-value<0.01; **: p-value<0.001; ***: p-value<0.0001; ✝ - population used as reference for statistical analyses. NA—No data available.

https://doi.org/10.1371/journal.pone.0264009.g004

Bi-allelic risk assessment.

A bi-allelic genetic analysis was performed to assess the number of people from each population that might have an aggravated genetic risk for COVID-19 infection or for severe COVID-19 with respiratory failure. Since this analysis multiplies the individual allelic frequencies that were already so diverse, the bi-allelic risk scenario is also highly heterogeneous among human populations. The European population was significantly different from all other populations in both COVID-19 susceptibility and severe COVID-19 with respiratory failure (p-value<0.0001 for all comparisons), (Fig 5A and 5B respectively).There was no information available for SNPs rs286914 and rs12329760 in the Spanish and Italian populations, therefore, precluding the estimation. The residues analysis highlights the African population as having the highest genetic risk for COVID-19 infection susceptibility (Fig 6A), with 12017 people per million estimated to be double-homozygous for the risk SNPs (total of 4 risk alleles), at the expense of the number of people with 0 risk alleles, which was only 194588 per million (S5 Table).

thumbnail
Fig 5.

Estimation on the number of people with a cumulative number of risk alleles in the world major populations for a) susceptibility for COVID-19 infection (rs286914 + rs12329760) and b) severe COVID-19 with respiratory failure (rs657152 + rs11385942). 0 to 4 represents the sum of effect alleles for each COVID-19 associated phenotype. Data for the Portuguese(n = 623), Spanish (n = 9761) and Italian (6363) populations correspond to observed values extrapolated to 1 million, whereas data for major world populations correspond to estimations (also to 1 million) based on the published effect allele frequencies, after Hardy-Weinberg equilibrium validation. Allele frequencies for rs286914, rs12329760 and rs11385942 were obtained from gnomAD-Genome project, while rs657152 allele frequencies were obtained from the ALFA project. All populations’ distributions were compared with the European distribution. ✝ - population used as reference for statistical analyses. ***: p-value<0.0001. NA—No data available.

https://doi.org/10.1371/journal.pone.0264009.g005

thumbnail
Fig 6.

Adjusted residues heatmap for a) bi-allelic risk of COVID-19 infection susceptibility and b) bi-allelic risk of severe COVID-19 with respiratory failure. Red colour means that the target population/risk level is overrepresented, comparatively to the European distribution; Purple colour means that the target population/risk level is underrepresented, comparatively to the European distribution. NA—No data available.

https://doi.org/10.1371/journal.pone.0264009.g006

The remaining populations displayed a higher resemblance in their risk distribution. As for the severe COVID-19 with respiratory failure, the Italian population had the highest estimated number of people, 1576 per million, being double-homozygous for the risk SNPs (total of 4 risk alleles) (Fig 6B). Alternatively, the American population had an enrichment of people with 0 risk alleles, 530912 per million. In addition, the Asian population had a substantial increase in the number of people with a total of 2 risk alleles, 518262 per million, when compared to the remaining populations, (S6 Table).

Discussion

This study demonstrated that, genetics seem not to contribute to the increased risk of poor outcomes due to COVID-19 in people with COPD. High heterogeneity for COVID-19 associated genetic variants across world populations was also demonstrated. Comparative analyses on genetic predisposition to COVID-19 outcomes has been previously explored [2931]. This study brings novelty by being the first comprehensive study (single-loci, bi-allelic and polygenic risk assessment) on the genetic predisposition of people with COPD to COVID-19 phenotypes; and by assessing the overall genetical risk distribution for COVID-19 in a global scenario.

People with COPD do not have increased genetic risk for COVID-19 associated phenotypes

Significant differences for COVID-19 relevant SNPs between people with COPD and the control group were not found. This was observed consistently by looking at individual effect alleles coming from previously published genetic studies [36], additive effect of risk variants and by polygenic risk scores, for each phenotype. Furthermore, people with COPD could not be differentiated from controls when considering the genetic basis for COVID-19 susceptibility, neither for associated hospitalization, respiratory failure or death. These findings, together with recent reports showing that people with COPD have a poorer prognosis [8], support the hypothesis that the aggravated risk for poor COVID-19 clinical outcomes is independent of individual genetic backgrounds and is probably due to the characteristic debilitated state of these people’s respiratory system [32], their chronic inflammatory state [32,33] and their increased number of respiratory infections [34], such as pneumonia [35].

The world population shows high heterogeneity in COVID-19 genetic variants

Portuguese, Spanish and Italian populations belong to what is regarded as the European population, and therefore we would expect them to behave similarly regarding the COVID-19 associated SNPs, however, our results show otherwise. Differences were further emphasised when comparing the European population with the remaining world major populations, as observed in the bi-allelic risk for severe COVID-19 with respiratory failure, specifically in the American and Asian populations. The Italian population, which was the European COVID-19 first wave epicentre, showed the highest number of people double-homozygous for severe COVID-19 SNPs, highlighting a possible explanation for the severe epidemiological scenario observed in Italy. Overall, these results are in line with what has been reported in other studies, with the risk for COVID-19 outcomes (cardiac, neurological, etc.) varying substantially between ethnicities [36].

Study limitations

Our study was performed with a relatively small sample size (COPD: n = 255; control: n = 243; Minho: n = 380), although, in proportion to the whole Portuguese population, these numbers (0.006%) were close to those previously used for the Spanish and Italian populations study (0.021% and 0.011%, respectively) [3]. Rs41303171 may play a very important role in COVID-19 infection, knowing that this SNP is located in the gene encoding the most important SARS-CoV-2 receptor protein (ACE2) [37], however it could not be found in either of our cohorts, inhibiting its analysis. The COPD cohort was highly enriched on heavy smokers, whereas in the control group only a small proportion of people were smokers, which could have skewed the results. SNP rs11385942 was neither genotyped or imputed in our validation cohort (cohort from Minho), and therefore external validation of the results shown in Fig 2 was precluded. The PRS analysis was based on the summary statistics of external GWAS, meaning that any limitation of their studies would be translated into ours. SNP rs12329760 has only been associated with COVID-19 infection susceptibility indirectly, through the known role of its gene (TMPRSS2) in SARS-CoV-2 infection and not by a GWAS, therefore, lacking a genetic validation for such association. The populational risks (e.g. Fig 5) were estimated for a population of one million people, which led the Chi-Square test value to increase artificially. To downsize this limitation, we have used a very stringent significance threshold (p-value<0.01) and performed a residual analysis to highlight major effects within the dataset.

Conclusion

This study showed that genetic background does not play a significant role in predisposition to severe COVID-19 among people with COPD based on the explored genetic variants. We also demonstrated the need to build a higher resolution European genetic map, so that differences in the distribution of relevant alleles, as those detected by us, can be easily accessed and used to better manage the diseases, ultimately, safeguarding populations with higher genetic predisposition to disease.

Supporting information

S1 Fig.

Allelic frequencies for significant SNPs, for A—susceptibility (rs286914 and rs12329760) and B—severe response (rs657152 and rs11385942) to COVID-19 infection. No significant differences were found between Control group of Baixo Vouga cohort and Minho cohort for any of the tested SNPs; rs286914: p-value = 0.54; rs12329760: p-value = 0.06; rs657152: p-value = 0.44. rs11385942 was not evaluated in Minho Cohort.

https://doi.org/10.1371/journal.pone.0264009.s001

(PDF)

S1 Table. COVID-19 associated SNPs description.

Ref. allele—reference allele. Alt. allele—alternative allele. A—adenine. C—cytosine. G—guanine. T—thymine. dupA—duplication of an adenine. V—valine. M—Methionine.

https://doi.org/10.1371/journal.pone.0264009.s002

(PDF)

S2 Table. Probability formula of having multiple risk alleles, assuming the Hardy-Weinberg’s law.

SNP X and SNP Y are representative. p2—major allele in homozygosity probability; 2pq—heterozygosity probability; q2—minor allele in homozygosity probability.

https://doi.org/10.1371/journal.pone.0264009.s003

(PDF)

S3 Table. Sociodemographic, anthropometric and clinical characteristics of the Minho cohort.

N (%)—number of individuals and corresponding percentage; remaining data is presented as medians with interquartile range in square brackets. FEV1—Forced Expiratory Volume in 1-sec in litres; FVC—Forced Vital Capacity in litres; n.d.—no data available.

https://doi.org/10.1371/journal.pone.0264009.s004

(PDF)

S4 Table. Single-loci frequency comparison between European and the other world populations.

SNP—single nucleotide polymorphism; CHR:POS—genomic coordinates; N -total number of individuals enrolled in the study; A1—effect allele; A2—reference allele; A1F - Average effect allele frequency; A2F - Average reference allele frequency. Data source: rs286914, rs12329760 and rs11385942—gnomAD-Genome project7; rs657152—ALFA project6.

https://doi.org/10.1371/journal.pone.0264009.s005

(PDF)

S5 Table. Estimation on the number of people with a cumulative number of risk alleles in the world major populations, for susceptibility to COVID-19 infection (rs286914 + rs12329760).

0 to 4 represent the sum of effect alleles. Data for the Portuguese population correspond to observed values (n = 623) extrapolated to 1 million, whereas data for major world populations correspond to estimations (also to 1 million) based on the published effect allele frequencies, after Hardy-Weinberg equilibrium validation. Allele frequencies were obtained from gnomAD-Genome project7.

https://doi.org/10.1371/journal.pone.0264009.s006

(PDF)

S6 Table. Estimation on the number of people with a cumulative number of risk alleles in the world major populations for severe COVID-19 with respiratory failure (rs657152 + rs11385942).

0 to 4 represent the sum of effect alleles. Data for the Portuguese(n = 623), Spanish (n = 9761) and Italian (6363) populations correspond to observed values extrapolated to 1 million, whereas data for major world populations correspond to estimations (also to 1 million) based on the published effect allele frequencies, after Hardy-Weinberg equilibrium validation. Allele frequencies for rs11385942 were obtained from gnomAD-Genome project7, while rs657152 allele frequencies were obtained from the ALFA project6.

https://doi.org/10.1371/journal.pone.0264009.s007

(PDF)

References

  1. 1. Stawicki S, Jeanmonod R, Miller A, Paladino L, Gaieski D, Yaffee A, et al. The 2019–2020 novel coronavirus (severe acute respiratory syndrome coronavirus 2) pandemic: A joint american college of academic international medicine-world academic council of emergency medicine multidisciplinary COVID-19 working group consensus paper. Journal of Global Infectious Diseases. 2020.
  2. 2. The COVID-19 Host Genetics Initiative, a global initiative to elucidate the role of host genetic factors in susceptibility and severity of the SARS-CoV-2 virus pandemic. Eur J Hum Genet. 2020;28(6):715–8. pmid:32404885
  3. 3. Ellinghaus D, Degenhardt F, Bujanda L, Buti M, Albillos A, Invernizzi P, et al. Genomewide Association Study of Severe Covid-19 with Respiratory Failure. N Engl J Med. 2020. pmid:32558485
  4. 4. Kachuri L, Francis SS, Morrison M, Bossé Y, Cavazos TB, Rashkin SR, et al. The landscape of host genetic factors involved in infection to common viruses and SARS-CoV-2. medRxiv Prepr Serv Heal Sci. 2020.
  5. 5. Hou Y, Zhao J, Martin W, Kallianpur A, Chung MK, Jehi L, et al. New insights into genetic susceptibility of COVID-19: An ACE2 and TMPRSS2 polymorphism analysis. BMC Med. 2020. pmid:32664879
  6. 6. Pereira NL, Ahmad F, Byku M, Cummins NW, Morris AA, Owens A, et al. COVID-19: Understanding Inter-Individual Variability and Implications for Precision Medicine. Vol. 96, Mayo Clinic Proceedings. Elsevier Ltd; 2021. p. 446–63. https://doi.org/10.1016/j.mayocp.2020.11.024 pmid:33549263
  7. 7. World health statistics 2018: monitoring health for the SDGs, sustainable development goals. Geneva: World Health Organization; 2018.
  8. 8. Attaway AA, Zein J, Hatipoğlu US. SARS-CoV-2 infection in the COPD population is associated with increased healthcare utilization: An analysis of Cleveland clinic’s COVID-19 registry. EClinicalMedicine. 2020;26. pmid:32869011
  9. 9. Leung JM, Niikura M, Yang CWT, Sin DD. COVID-19 and COPD. Vol. 56, European Respiratory Journal. European Respiratory Society; 2020. p. 2002108.
  10. 10. Centers for Disease Control and Prevention (CDC). People with Certain Medical Conditions [cited 2020 Oct 13]. Available from: https://www.cdc.gov/coronavirus/2019-ncov/need-extra-precautions/people-with-medical-conditions.html.
  11. 11. Global Initiative for Chronic Obstructive Lung Disease. Global strategy for the diagnosis management and prevention of chronic obstructive pulmonary disease (2019 Report). 2019.
  12. 12. World Health Organization—WHO. Preamble to the Constitution of the World Health Organization as adopted by the International Health Conference, New York, 19–22 June, 1946. Geneva, Switzerland; 1948.
  13. 13. Santos NC, Costa PS, Cunha P, Cotter J, Sampaio A, Zihl J, et al. Mood is a key determinant of cognitive performance in community-dwelling older adults: a cross-sectional analysis. Age (Omaha). 2013;35(5):1983–93. pmid:23054829
  14. 14. Santos NC, Costa PS, Cunha P, Portugal-Nunes C, Amorim L, Cotter J, et al. Clinical, physical and lifestyle variables and relationship with cognition and mood in aging: A cross-sectional analysis of distinct educational groups. Front Aging Neurosci. 2014;6(FEB). pmid:24605100
  15. 15. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. pmid:17701901
  16. 16. Marees AT, de Kluiver H, Stringer S, Vorspan F, Curis E, Marie-Claire C, et al. A tutorial on conducting genome-wide association studies: Quality control and statistical analysis. Int J Methods Psychiatr Res. 2018 Jun 1;27(2). pmid:29484742
  17. 17. Anderson CA, Pettersson FH, Clarke GM, Cardon LR, Morris AP, Zondervan KT. Data quality control in genetic case-control association studies. Nat Protoc. 2010 Aug 26;5(9):1564–73. pmid:21085122
  18. 18. Das S, Forer L, Schönherr S, Sidore C, Locke AE, Kwong A, et al. Next-generation genotype imputation service and methods. Nat Genet. 2016;48(10):1284–7. pmid:27571263
  19. 19. Marchini J, Howie B. Genotype imputation for genome-wide association studies. Vol. 11, Nature Reviews Genetics. 2010. p. 499–511. pmid:20517342
  20. 20. Naj AC. Genotype Imputation in Genome‐Wide Association Studies. Curr Protoc Hum Genet [Internet]. 2019 Jun 13 [cited 2019 Nov 5];102(1). Available from: pmid:31216114
  21. 21. Li Y, Willer CJ, Ding J, Scheet P, Abecasis GR. MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes. Genet Epidemiol. 2010;34(8):816–34. pmid:21058334
  22. 22. Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581(7809):434–43. pmid:32461654
  23. 23. Phan L., Jin Y., Zhang H., Qiang W., Shekhtman E., Shao D., et al. ALFA: Allele Frequency Aggregator. U.S. National Library of Medicine. 2010.
  24. 24. NATIONAL HEART, LUNG ABI. GRASP: Genome-Wide Repository of Associations Between SNPs and Phenotypes.
  25. 25. Euesden J, Lewis CM, O’Reilly PF. PRSice: Polygenic Risk Score software. Bioinformatics. 2015;31(9):1466–8. pmid:25550326
  26. 26. Choi SW, Mak TSH, O’Reilly PF. Tutorial: a guide to performing polygenic risk score analyses. Nature Protocols. 2020. pmid:32709988
  27. 27. Choi SW, Heng Mak TS, O’Reilly P. A guide to performing Polygenic Risk Score analyses. bioRxiv. 2018.
  28. 28. Sharpe D. Chi-Square Test is Statistically Significant: Now What? Pract Assessment, Res Eval Pract Assess. 2015;20.
  29. 29. Mohammadpour S, Torshizi Esfahani A, Halaji M, Lak M, Ranjbar R. An updated review of the association of host genetic factors with susceptibility and resistance to COVID-19. Vol. 236, Journal of Cellular Physiology. Wiley-Liss Inc.; 2021. p. 49–54. https://doi.org/10.1002/jcp.29868 pmid:32542735
  30. 30. Wang R, Chen J, Hozumi Y, Yin C, Wei GW. Decoding Asymptomatic COVID-19 Infection and Transmission. J Phys Chem Lett. 2020;11(23):10007–15. pmid:33179934
  31. 31. Lawaczeck R. COVID-19 and Body Iron: A Survey on Phenomenological and Genetic Correlations. Vol. 11, ACS Chemical Neuroscience. American Chemical Society; 2020. p. 3996–4000. https://doi.org/10.1021/acschemneuro.0c00572 pmid:33196159
  32. 32. Barnes PJ. Cellular and molecular mechanisms of asthma and COPD. Vol. 131, Clinical Science. Portland Press Ltd; 2017. p. 1541–58. https://doi.org/10.1042/CS20160487 pmid:28659395
  33. 33. Racanelli AC, Kikkers SA, Choi AMK, Cloonan SM. Autophagy and inflammation in chronic respiratory disease. Vol. 14, Autophagy. Taylor and Francis Inc.; 2018. p. 221–32. https://doi.org/10.1080/15548627.2017.1389823 pmid:29130366
  34. 34. Cui Y, Luo L, Li C, Chen P, Chen Y. Long-term macrolide treatment for the prevention of acute exacerbations in COPD: A systematic review and meta-analysis. Vol. 13, International Journal of COPD. Dove Medical Press Ltd.; 2018. p. 3813–29. https://doi.org/10.2147/COPD.S181246 pmid:30538443
  35. 35. Jiang F, Deng L, Zhang L, Cai Y, Cheung CW, Xia Z. Review of the Clinical Characteristics of Coronavirus Disease 2019 (COVID-19). Vol. 35, Journal of General Internal Medicine. Springer; 2020. p. 1545–9. https://doi.org/10.1007/s11606-020-05762-w pmid:32133578
  36. 36. Chu JY, Kaliwal Y, Koh M, Chen R, Chow CM, Ko DT, et al. COVID-19 and its Cardiac and Neurological Complications among Ontario Visible Minorities. Can J Neurol Sci. 2021;1–10. pmid:34162448
  37. 37. Tan HW, Xu YM, Lau ATY. Angiotensin-converting enzyme 2: The old door for new severe acute respiratory syndrome coronavirus 2 infection. Rev Med Virol. 2020;30(5):e2122. pmid:32602627