Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Genetic correlation analysis between sepsis and hematological traits: Identifying shared genomic regions

  • Jianmin Zhang,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Traditional Chinese Medicine, Children’s Hospital of Soochow University, Suzhou, Jiangsu, China

  • Miao Hou,

    Roles Data curation, Formal analysis, Investigation, Software

    Affiliation Department of Cardiology, Children’s Hospital of Soochow University, Suzhou, Jiangsu, China

  • Zhenjiang Bai,

    Roles Data curation, Formal analysis, Investigation, Methodology, Software

    Affiliation Department of Critical Care Medicine, Children’s Hospital of Soochow University, Suzhou, Jiangsu, China

  • Jian Wang,

    Roles Conceptualization, Methodology, Supervision

    Affiliation Department of Surgery, Children’s Hospital of Soochow University, Suzhou, Jiangsu, China

  • Jie Huang

    Roles Conceptualization, Funding acquisition, Methodology, Project administration, Supervision

    J.shuang@163.com

    Affiliation Department of Cardiology, Children’s Hospital of Soochow University, Suzhou, Jiangsu, China

Abstract

Background

Sepsis, dysregulated cascades of inflammatory response to infection, remains a critical clinical condition, leading to morbidity and mortality. Better understanding of the genetic basis underlying sepsis has the potential to better prevent and treat this potentially life-threatening condition. Recent studies have identified genetic variants for sepsis and accumulated evidence for the relevance of hematological traits. However, genetic correlation analysis, testing the hypothesis of shared genetic underpinning of sepsis and hematological traits, as well as identifying shared specific genetic regions, is lacking.

Method

In this study, we systematically evaluated the extent and statistical significance of global genetic correlation between sepsis and a comprehensive battery of 29 hematological traits, as well as performed local genetic correlation analysis to pinpoint the shared genomic regions.

Results

Our analyses revealed significant global genetic correlation between sepsis and five red blood cell indices. Local genetic correlation analyses identified 6–21 genomic regions with lengths ranging from 1.4kb to 5.6Mb shared between sepsis and seven hematological traits.

Conclusion

Our findings improve our general understanding of the shared genetics between sepsis and different categories of blood cell traits, and have the potential to advance personalized medicine for sepsis.

Introduction

Sepsis is a life-threatening organ dysfunction triggered by a cascade of dysregulated and overwhelming host responses to an infection. Sepsis is often associated with widespread inflammation, tissue damage, organ failure, and even death [13]. Sepsis unfortunately affects people of any age group: for example, pediatric sepsis afflicting children under one year of age; while among adults literature has reported increased risk for people above 65 years old. Immune-compromised individuals and people with pre-existing and/or chronic medical conditions are also at elevated risk of developing sepsis [4]. According to the 2024 Centers for Disease Control and Prevention estimates [5], each year at least 1.7 million US adults become septic; at least 350,000 die because of it.

The incidence of sepsis has been increasing, and the disease has been frequently associated with chronic physical, psychological, and cognitive impairments, as well as much reduced quality of life [6]. The mortality rate for sepsis was reported to be as high as 35–45% and is still at 18.4% [7]. Investigators have been making efforts to decipher the mechanisms behind this disease for better diagnosis, prevention and treatment. The etiopathology underlying sepsis has been recognized as complicated and obscure, both in terms of disease development and progress to its severe forms.

Immune response to infection in general and sepsis have both been reported to be at least partially genetically determined [8,9]. Genetic association studies have identified multiple loci associated with sepsis [811] and the polygenic risk scores constructed have been reported to show promising discriminating power between people affected with sepsis and controls [8].

Despite the advancement of genetic variants identified for sepsis and its close connection to immune response, genetic correlation between sepsis and immune biomarkers has been under-explored. More importantly, published studies investigating genetic correlation between sepsis and other related diseases and traits [12,13] focus on global genetic correlation, failing neither to reflect the complexity of shared genetic architecture between sepsis and other phenotypes nor to identify genetic regions contributing to the global genetic correlation. In this study, we attempt to fill the gaps by both comprehensively estimating the global genetic correlation between sepsis and the battery of hematological biomarkers from routine complete blood cell count (CBC) tests and pinpointing genetic regions where the global genetic correlations can be attributed to.

Methods

Overview

Our main objective is to identify local genetic regions that account for the shared genetic underpinning of sepsis and hematological traits. In order to detect these genetic regions, we performed local genetic correlation analysis based on findings from genome-wide association studies (GWAS) for sepsis and each of the relevant hematological traits. There is a comprehensive battery of blood cell traits, including white blood cell indices, red blood cell traits, and platelet indices, for which genetic association studies have been performed in large samples of individuals from multiple population groups [1416]. To select the relevant blood cell traits, we adopted a two-component approach (Fig 1) with one component based on prior knowledge from literature, and the other component driven by data. For the second component, specifically, we carry out global genetic correlation, using LDSC [17], to select hematological traits that are genetically correlated with sepsis.

thumbnail
Fig 1. Overall study design.

We selected blood cell traits that are potentially relevant to sepsis through two sources: based on prior knowledge where monocyte and neutrophil counts are selected (top left box with yellow color background); and based on data-driven global correlation analysis using LD score regression (LDSC) based on sepsis and blood cell traits GWAS results, which leads to five additional traits including MCH, MCV, MRV, MSCV and RBC (top right box with peach color background). Combined together, the resulting seven blood cell traits are further subject to local genetic correlation analysis using LOGODetect with sepsis. MCH: mean corpuscular hemoglobin; MCV: mean corpuscular volume; MRV: mean reticulocyte volume; MSCV: mean sphered corpuscular volume; RBC: red blood cell count.

https://doi.org/10.1371/journal.pone.0333675.g001

GWAS summary statistics

There are multiple published GWAS studies for sepsis [10,11,18], also including survival outcome [19], pediatric sepsis [20]. Similarly there are many published GWAS for blood cell indices [1416,21,22]. Considering multiple factors including sample size, genome coverage of genetic variants interrogated, sample ascertainment (particularly for sepsis where most of the large-sample size studies are based on biobank data with sepsis cases defined with suboptimal ICD9 or ICD10 codes), comprehensiveness of hematological traits analyzed, and availability of genome-wide summary statistics, we chose results from Vuckovic et al. [16] for blood cell traits and summary statistics from Butler-Laporte et al. [11] for subsequent analyses.

Global genetic correlation analysis

Global genetic correlation analysis was performed using the LD Score Regression (LDSC) method [17,23]. GWAS summary statistics from Butler-Laporte et al. [11] and Vuckovic et al. [16] were used to estimate global genetic correlation as well as to test the statistical significance of the genetic correlation between sepsis and each of the following 29 blood cell phenotypes: four platelet related traits including platelet count (PLT), mean platelet volume (MPV), platelet distribution width (PDW), and plateletcrit (PCT); 8 indices pertinent to mature red blood cells (including red blood cell count (RBC), mean corpuscular volume (MCV), hematocrit (HCT), mean corpuscular hemoglobin (MCH), mean sphered corpuscular volume (MSCV), mean corpuscular hemoglobin concentration (MCHC), hemoglobin concentration (HGB), red cell distribution width (RDW)), 6 indices pertinent to immature red blood cells including reticulocyte count (RET), reticulocyte fraction of red cells (RET%), immature fraction of reticulocytes (IRF), high light scatter reticulocyte count (HLR), High light scatter reticulocyte percentage of red cells (HLR%), and mean reticulocyte volume (MRV); four traits for myeloid white blood cell counts including monocyte count (MONO), neutrophil count (NEUT), eosinophil count (ESO), basophil count (BASO); lymphocyte count (LYMPH); total white blood cell count (WBC); and five percentages of white blood cell subtypes including monocyte percentage (MONO%), neutrophil percentage (NEUT%), eosinophil percentage (ESO%), basophil percentage (BASO%), and lymphocyte percentage (LYMPH%).

Local genetic correlation analysis

To pinpoint genomic regions harboring shared genetic underpinnings behind sepsis and hematological indices, we carried out local genetic correlation analysis using GWAS summary statistics for sepsis [11] and for each of the 29 blood cell phenotypes [16]. Specifically, we employed the LOGODetect method [24] to perform local genetic correlation analysis between sepsis and the 29 blood cell indices detailed in the “Global genetic correlation analysis” section above.

Results

GWAS summary statistics

GWAS summary statistics from Vuckovic et al. [16] for a battery of 29 blood cell indices were derived from 408,112 European ancestry individuals. The number of variants examined for association analysis is around 41 million, ranging from 41,266,573 for reticulocyte fraction of red cells to 41266573 for lymphocyte count. The number of significant variants exceeding genome-wide significance threshold of 5E-8 ranges from 17,358 for basophil percentage of white cells to 174,249 for mean sphered corpuscular volume. In contrast, there were no genome-wide significant variants for sepsis, despite the large sample size of 18,931 cases and 663,531 controls.

LDSC analysis reveals five red blood cell indices genetically correlated with sepsis

Our global genetic correlation analysis using LDSC (summarized in Table 1) identified nominally significant (LDSC P value < 0.05) correlation between sepsis and multiple blood cell phenotypes, including mean corpuscular hemoglobin (MCH) (genetic correlation rg = 0.1286, P value = 0.0259), mean corpuscular volume (MCV) (rg = 0.142, P value = 0.0094), mean reticulocyte volume (MRV) (rg = 0.1181, P value = 0.0127), mean sphered corpuscular volume (MSCV) (rg = 0.1242, P value = 0.0055), and red blood cell count (RBC) (rg = −0.1084, P value = 0.0256). Interestingly, all of these significant hematological traits are pertinent to red blood cells, both with mature red blood cells (MCH, MCV, MSCV, and RBD) or with immature red cells (MRV). No significant genetic correlation was detected between sepsis and white blood cell indices including neutrophils or monocytes that have been linked to sepsis in the literature. The results can be partially driven by the higher heritability of the red blood cell phenotypes. For example, MSCV showing the most significant genetic correlation with sepsis (LDSC P value = 0.0055) has the highest heritability estimate among the five red blood cell indices that exhibit significant genetic correlation. As a matter of fact, heritability estimates for all of the five indices rank among the top 8, ranging from 0.2307 to 0.2881 (Table 1). In comparison, neutrophil and monocyte indices, including counts and percentages, all have lower heritability estimates, with the highest of 0.2252 for monocyte count, and all the other three much lower ranging from 0.1534 to 0.1819.

thumbnail
Table 1. Global genetic correlation results using LDSCLDSC.

https://doi.org/10.1371/journal.pone.0333675.t001

LOGODetect narrows down genomic regions shared behind sepsis and blood cell traits

As illustrated in Fig 1, we performed local genetic correlation analysis using the LOGODetect method for the five blood cell phenotypes that show significant global genetic correlation with sepsis (namely MCH, MCV, MSCV, MRV, and RBD), as well as for the monocyte and neutrophil counts whose relevance to sepsis has been reported by previous studies.

LOGODetect identified 6−21 genomic regions, for the seven tested blood cell phenotypes, with lengths ranging from 1.4kb to 5.6Mb that are locally correlated with sepsis genetics. The results are summarized in S1 File. Fig 2 visualizes results for MSCV that showed the most significant genetic correlation (P value = 0.0055) and for MCV that showed the largest magnitude of genetic correlation (rg = 0.142) based on LDSC analyses presented above. We observed partial overlaps between the two groups of identified regions. Among the 8 regions identified between MCV and sepsis, and the 9 regions between MSCV and sepsis, four regions (chr1: 229,737,680–229,802,996, chr2: 8,718,430–8,775,344, chr3: 24,340,817–24,357,286, and chr11: 19,208,070–19,277,199) are shared, reflecting shared genetic underpinnings behind all three of the phenotypes. As a matter of fact, the chr11: 19,208,070–19,277,199 region was identified also for MCH, MRV, and RBC (S2 File).

thumbnail
Fig 2. Local genetic correlation results for selected red blood cell indices.

Local genetic correlation results for two selected red blood cell indices are visualized through mirrored Manhattan plots. There are two sub-figures, for MSCV (A) and MCV (B) respectively. For each sub-figure, we show a mirrored Manhattan plot where the top panel shows the -log10(p-value) of sepsis GWAS on the top panel and the -log10(p-value) of the red blood cell trait GWAS on the bottom panel.

https://doi.org/10.1371/journal.pone.0333675.g002

Fig 3 shows LOGODetect results between monocyte count and sepsis, and between neutrophil count and sepsis. Despite no significant global genetic correlation identified by LDSC between monocyte count and sepsis (rg = −0.0231, P value = 0.6702), the largest number of genomic regions (21 compared to ≤ 15 for other blood cell traits subjected to LOGODetect analysis) were revealed from local genetic correlation analysis. More interestingly, none of the 21 regions overlapped with the regions identified for the other six blood cell traits, suggesting that genetic contributions shared between monocyte count and sepsis are largely independent of the other blood cell indices analyzed.

thumbnail
Fig 3. Local genetic correlation results for white blood cell indices.

Local genetic correlation results for two white blood cell indices are visualized through mirrored Manhattan plots. There are two sub-figures, for neutrophil count (neut) and monocyte count (mono) respectively. For each sub-figure, we show a mirrored Manhattan plot where the top panel shows the -log10(p-value) of sepsis GWAS on the top panel and the -log10(p-value) of the white blood cell trait GWAS on the bottom panel.

https://doi.org/10.1371/journal.pone.0333675.g003

Discussion

Our study is the first to perform systematic genetic correlation analysis between sepsis and blood cell phenotypes. Our interrogations encompass both global genetic correlation estimates and local genetic correlation analysis, the latter of which enables pinpointing specific genomic regions that may account for the shared genetic underpinning of sepsis and the corresponding hematological traits that demonstrate significant local genetic correlation in these regions.

Despite accumulated evidence in the literature supporting the relevance to sepsis of leucocytes and its subtypes, particularly monocytes, neutrophils, and lymphocytes, global genetic correlation analyses revealed significant genetic correlations only with red blood cell indices. Interestingly, despite insignificant global genetic correlation between sepsis and monocyte count, we observed the largest number of genomic regions exhibiting significant local genetic correlation. This apparent discrepancy has been observed in the literature, including several examples (such as between type 1 diabetes and lupus, asthma and hypothyroidism) highlighted in Werme et al [25], where multiple genetic regions exhibited significant local correlation despite close to zero global genetic correlation estimates. Similarly, the LOGODetect publication identified more regions for schizophrenia and ID (53 regions, rg = −0.23) than for major depressive disorder and neuroticism (40 regions, rg = 0.78) despite the much larger absolute correlation of the latter. The main reason, as pointed out in Werme et al [25] is that global correlation fails to capture heterogeneous genetic relations across the genome.

The lack of significant global genetic correlation is also likely a power issue that is influenced by multiple factors including the heritability both the blood cell traits (as speculated earlier in the Results section) and sepsis, and the sample sizes of the underlying GWAS studies. We note that there are no GWAS findings exceeding the stringent genome-wide significance threshold of p-value 5 × 10−8. Despite the large sample size (total N = 682,462) and the relatively large number of sepsis cases (7,463 from FinnGen and 11,468 from UK Biobank) in the GWAS study [11], many factors can still lead to lack of genome-wide significant signals. These factors include the possible highly polygenic genetic architecture with many variants exerting only small effects, disease heterogeneity and differential effects of genetic variants on different subtypes (which combined can lead to power loss when lumping all cases together), and common issues for multi-biobank meta-analysis including population stratification, uncontrolled environmental effects, and differential disease definitions, recruitment strategies, and participant baseline characteristics. Therefore, we strongly advocate data sharing wheneve possible and establishment of global efforts to apply best practices starting at the design stage to ensure most powerful studies, similar to the Global Biobank Meta-analysis Initiative [26]. We have made our best efforts to use the largest sample size GWAS studies to maximally mitigate the issues. Future studies revisiting genetic correlations using updated GWAS results will be valuable to further investigate the shared genetic determinants between blood cell traits and sepsis.

In addition to the power and sample size, study populations can also impact our findings. For example, different disease subtypes (adult versus pediatric, sepsis with different clinical symptoms and/or co-morbidities), may have different genetic underpinnings behind. For another example, different ancestral populations may also have distinct genetic mechanisms. In this study, we focus on genetic correlation among European ancestry individuals due to various practical considerations including sample size; the availability of large-scale GWAS (limited, if any, comparable large-scale GWAS were conducted in non-European ancestry populations, particularly for sepsis), and matching of ancestry for sepsis and blood cell traits GWAS (it is much more challenging to match for non-European ancestry populations, e.g., due to multi-ancestry meta-analyses dominated by European ancestry samples rather than broken down by distinct ancestry groups). When GWAS studies from comparable sample sizes from other population groups become available, it is worthy to further investigate across various different populations.

Despite the limitations due to practical constraints, ours represent the first systematic genetic correlation study between sepsis and the whole battery of 29 blood cell traits. Our findings advance our general understanding of the shared genetics between sepsis and different categories of blood cell traits, and reveal biological targets that have the potential to be used as drug targets for the development pharmacotherapies, and that can be personalized based on the complete blood count lab tests readily and commonly performed in clinical practice.

Supporting information

S1 File. Supplementary Table.

Local genetic correlation analysis results between sepsis and 7 blood cell traits are tabulated. For each blood cell trait, all the genomic regions that exceed FDR 5% (qval < 0.05) are shown, tabulating chromosome (chr), beginning position (begin_pos), stopping position (stop_pos), LOGODetect test statistics (stat), raw p-value (pval), and multiple testing adjusted p-value (qval).

https://doi.org/10.1371/journal.pone.0333675.s001

(XLSX)

S2 File. Supplementary Figure.

Local genetic correlation results for three additional red blood cell indices are visualized through mirrored Manhattan plots. There are three sub-figures, in the order of MCH, MRV, and RBC respectively. For each sub-figure, we show a mirrored Manhattan plot where the top panel shows the -log10(p-value) of sepsis GWAS on the top panel and the -log10(p-value) of the blood cell trait GWAS on the bottom panel.

https://doi.org/10.1371/journal.pone.0333675.s002

(DOCX)

Acknowledgments

We thank the participants of the GWAS studies that enabled our study.

References

  1. 1. Arina P, Singer M. Pathophysiology of sepsis. Curr Opin Anaesthesiol. 2021;34(2):77–84. pmid:33652454
  2. 2. Torres LK, Pickkers P, van der Poll T. Sepsis-induced immunosuppression. Annu Rev Physiol. 2022;84:157–81. pmid:34705481
  3. 3. van der Poll T, Shankar-Hari M, Wiersinga WJ. The immunology of sepsis. Immunity. 2021;54(11):2450–64. pmid:34758337
  4. 4. Buchman TG, Simpson SQ, Sciarretta KL, Finne KP, Sowers N, Collier M, et al. Sepsis among medicare beneficiaries: 1. The Burdens of Sepsis, 2012-2018. Crit Care Med. 2020;48(3):276–88. pmid:32058366
  5. 5. Sciences NIoGM. Sepsis. sAccessed 2025 February 9. Available from: https://www.nigms.nih.gov/education/fact-sheets/Pages/sepsis.aspx
  6. 6. Hotchkiss RS, Moldawer LL, Opal SM, Reinhart K, Turnbull IR, Vincent J-L. Sepsis and septic shock. Nat Rev Dis Primers. 2016;2:16045. pmid:28117397
  7. 7. Kaukonen K-M, Bailey M, Suzuki S, Pilcher D, Bellomo R. Mortality related to severe sepsis and septic shock among critically ill patients in Australia and New Zealand, 2000-2012. JAMA. 2014;311(13):1308–16. pmid:24638143
  8. 8. Engoren M, Jewell ES, Douville N, Moser S, Maile MD, Bauer ME. Genetic variants associated with sepsis. PLoS One. 2022;17(3):e0265052. pmid:35275946
  9. 9. Man M, Close SL, Shaw AD, Bernard GR, Douglas IS, Kaner RJ, et al. Beyond single-marker analyses: mining whole genome scans for insights into treatment responses in severe sepsis. Pharmacogenomics J. 2013;13(3):218–26. pmid:22310353
  10. 10. Backman JD, Li AH, Marcketta A, Sun D, Mbatchou J, Kessler MD, et al. Exome sequencing and analysis of 454,787 UK Biobank participants. Nature. 2021;599(7886):628–34. pmid:34662886
  11. 11. Butler-Laporte G, Farjoun Y, Chen Y, Hultström M, Liang KYH, Nakanishi T, et al. Increasing serum iron levels and their role in the risk of infectious diseases: a Mendelian randomization approach. Int J Epidemiol. 2023;52(4):1163–74. pmid:36773317
  12. 12. Zhang Z, Chen L, Zhang H, Xiao W, Yang J, Huang J, et al. Genetic correlations and causal relationships between cardio-metabolic traits and sepsis. Sci Rep. 2024;14(1):5718. pmid:38459230
  13. 13. Zhou J, Zhang Y, Yang T, Zhang K, Li A, Li M, et al. Causal relationships between lung cancer and sepsis: a genetic correlation and multivariate mendelian randomization analysis. Front Genet. 2024;15:1381303. pmid:39005629
  14. 14. Chen M-H, Raffield LM, Mousas A, Sakaue S, Huffman JE, Moscati A, et al. Trans-ethnic and ancestry-specific blood-cell genetics in 746,667 individuals from 5 global populations. Cell. 2020;182(5):1198-1213.e14. pmid:32888493
  15. 15. Sun Y, Cheng Z, Guo Z, Dai G, Li Y, Chen Y, et al. Preliminary study of genome-wide association identified novel susceptibility genes for hemorheological indexes in a Chinese population. Transfus Med Hemother. 2022;49(6):346–57. pmid:36654975
  16. 16. Vuckovic D, Bao EL, Akbari P, Lareau CA, Mousas A, Jiang T, et al. The polygenic and monogenic basis of blood traits and diseases. Cell. 2020;182(5):1214-1231.e11. pmid:32888494
  17. 17. Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh P-R, et al. An atlas of genetic correlations across human diseases and traits. Nat Genet. 2015;47(11):1236–41. pmid:26414676
  18. 18. Jiang L, Zheng Z, Fang H, Yang J. A generalized linear mixed model association tool for biobank-scale data. Nat Genet. 2021;53(11):1616–21. pmid:34737426
  19. 19. Hernandez-Beeftink T, Guillen-Guio B, Lorenzo-Salazar JM, Corrales A, Suarez-Pajes E, Feng R, et al. A genome-wide association study of survival in patients with sepsis. Crit Care. 2022;26(1):341. pmid:36335405
  20. 20. Srinivasan L, Page G, Kirpalani H, Murray JC, Das A, Higgins RD, et al. Genome-wide association study of sepsis in extremely premature infants. Arch Dis Child Fetal Neonatal Ed. 2017;102(5):F439–45. pmid:28283553
  21. 21. Hu Y, Stilp AM, McHugh CP, Rao S, Jain D, Zheng X, et al. Whole-genome sequencing association analysis of quantitative red blood cell phenotypes: The NHLBI TOPMed program. Am J Hum Genet. 2021;108(5):874–93. pmid:33887194
  22. 22. Little A, Hu Y, Sun Q, Jain D, Broome J, Chen M-H, et al. Whole genome sequence analysis of platelet traits in the NHLBI Trans-Omics for Precision Medicine (TOPMed) initiative. Hum Mol Genet. 2022;31(3):347–61. pmid:34553764
  23. 23. Bulik-Sullivan BK, Loh P-R, Finucane HK, Ripke S, Yang J, Schizophrenia Working Group of the Psychiatric Genomics Consortium, et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. 2015;47(3):291–5. pmid:25642630
  24. 24. Guo H, Li JJ, Lu Q, Hou L. Detecting local genetic correlations with scan statistics. bioRxiv. 2019.
  25. 25. Werme J, van der Sluis S, Posthuma D, de Leeuw CA. An integrated framework for local genetic correlation analysis. Nat Genet. 2022;54(3):274–82. pmid:35288712
  26. 26. Zhou W, Kanai M, Wu K-HH, Rasheed H, Tsuo K, Hirbo JB, et al. Global Biobank Meta-analysis Initiative: Powering genetic discovery across human disease. Cell Genom. 2022;2(10):100192. pmid:36777996