Figures
Abstract
Objectives
This study aimed to explore the correlation between the sentiment of nursing notes and the one-year mortality of sepsis patients.
Methods
The box plot was used to compare the differences in sentiment polarity/sentiment subjectivity between different groups. Multivariate logistic regression was used to explore the correlation between sentiment polarity/sentiment subjectivity and one-year mortality of elderly sepsis patients. Ridge regression, XGBoost regression, and random forest were used to explore the importance of sentiment polarity and subjectivity in the one-year mortality of elderly sepsis patients. Restricted cubic spline (RCS) was used to explore whether there was a linear relationship between sentiment polarity, sentiment subjectivity and the one-year mortality of elderly sepsis patients. Kaplan-Meier (KM) curve was used to explore the relationship between the sentiment polarity (or sentiment subjectivity) and the 1-year death of the patient.
Results
Compared with the control group, the one-year mortality group year had lower sentiment polarity and higher sentiment subjectivity. Sentiment polarity and sentiment subjectivity were independently related to the one-year mortality of elderly sepsis patients. There was a linear relationship between sentiment polarity and the one-year mortality of elderly sepsis patients. At the same time, there was a nonlinear relationship between sentiment subjectivity and the one-year mortality of elderly sepsis patients.KM.
Citation: Sheng L, Zheng Q (2025) Relationship between the sentiment of nursing notes and one-year mortality of elderly sepsis patients. PLoS One 20(5): e0323686. https://doi.org/10.1371/journal.pone.0323686
Editor: Barry Kweh, National Trauma Research Institute, AUSTRALIA
Received: September 20, 2024; Accepted: April 11, 2025; Published: May 16, 2025
Copyright: © 2025 Sheng, Zheng. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
1 Introduction
Sepsis is a serious life-threatening disease, which is characterized by the imbalance of immune response caused by infection and life-threatening multiple organ dysfunction. At present, about 48.9 million people in the world suffer from sepsis, and 11.1 million people die of sepsis [1]. Sepsis cases mainly occur in low-income or middle-income countries. The prevalence rates of severe sepsis in surgical Intensive Care Unit (ICU) and general ICU in China were 8.7% and 37.3% respectively [2]. The prevalence of sepsis in the ICU in India was 33.2% [3]. The overall prevalence of sepsis in ICU is 24.5% in low-middle-income areas in Asia [4]. The overall prognosis of sepsis patients is still poor despite the improvement of medical technology. A systematic review indicated that the mortality rate of sepsis patients in China was 29.0% [5] The in-hospital mortality rate of sepsis in the American population is 20.52% [6]. Therefore, it is necessary and urgent to actively explore the factors related to the prognosis of sepsis.
Sentiment analysis is a process of analyzing, processing, inducing, and reasoning subjective texts with sentiment color, which is mainly based on text data and is the main content of natural language processing. The purpose of sentiment analysis is to understand the author’s critical attitude (support or opposition to likes or dislikes, etc.) or sentiment state (happiness, anger, sadness, fear, etc.) towards an entity (including products, services, people, organizations, events, and topics) in the text. In recent years, sentiment analysis has been gradually extended to the medical field. It is found that keyword extraction and sentiment analysis based on discharge reports have certain predictive performance for 30-day unplanned readmission [7]. Sentiment in nursing notes was correlated with 30-day mortality and survival rate of ICU patients [8].
At present, there are few studies on sepsis and the sentiment of nursing notes. The existing studies mainly focus on the short-term prognosis of sepsis patients, but no research has focused on the long-term prognosis. Therefore, this study explored the correlation between the sentiment of nursing notes and the one-year mortality of sepsis patients.
2 Methods
2.1 Data source and indicators
The patient data came from the Medical Information Mart for Intensive Care (MIMIC)-IV database, which was an open database containing the data of all ICU patients in internal medicine and surgery at Beth Israel Deaconess Medical Center of Harvard Medical School [9].
2.2 Inclusion and exclusion criteria
Inclusion criteria: diagnosed with sepsis, older than 60 years of age.
Exclusion criteria: (1) Missing time of death after discharge from hospital, hospital death. (2) Miss clinical records. (3) Note writing fuzzy cannot be identified; (4) Have been identified as wrong notes by the doctor; (5) Notes less than 12 hours before death.
2.3 Sentiment analysis
Python and Text Blob were used to extract sentiment polarity and sentiment subjectivity from nursing records [8]. Given a string of texts, the Pattern module of TextBlob was marked, retrieved, and calculated. Specifically, the module includes a dictionary of English adverbs and adjectives, which can be mapped to the polarity and subjectivity of sentiment scores [10].The TextBlob returns two scores: one indicating the sentiment polarity and the other indicating the sentiment subjectivity. The sentiment polarity score ranges from -1–1, with higher scores indicating a more positive sentiment. The sentiment subjectivity score ranges from 0 to 1, with higher scores indicating a more subjective sentiment. This study calculated the sentiment score by creating a TextBlob object initialized using a nursing record string and extracting the ‘ sentiment ‘ attribute from the object. Each individual nursing note of the patient was calculated to obtain an emotional polarity score and an emotional subjectivity score. Then the average score of each patient’s polarity and subjectivity was calculated as the patient’s two nursing emotional characteristic indicators.
2.4 Indicators
In this study, the primary outcome was one-year mortality.
Indicators with more than 30 missing data were excluded. Indicators included age, glucose, sentiment polarity, sentiment subjectivity, systolic blood pressure (SBP), diastolic blood pressure (DBP), platelets, anion gap, chloride, creatinine, red cell distribution width (RDW), white cell count (WBC), red blood cell count (RBC), Charlson Comorbidity Index (CCI), Glasgow coma scale (GCS), Sequential Organ Failure Assessment (SOFA), Simplified Acute Physiology Score II (SAPS II), Oxford acute severity of illness score (OASIS), systemic inflammatory response syndrome (SIRS), LOS (Length of stay), Length of stay in ICU, gender (male, female), tobacco (yes, no), alcohol (yes, no), surgery (yes, no), diabetes (yes, no), hypertension (yes, no), renal disease (yes, no), coronary artery disease (yes, no), mechanical ventilation (yes, no), sepsis shock (yes, no), acute kidney injury (AKI) stage (0,1,2,3).
One-year mortality: the patient dies within one year after discharge from the hospital due to illness or other reasons.
2.5 Statistical analysis
R language was used for data analysis. The median (P25-P75) was used to describe the quantitative data of non-normal distribution, and the Mann-Whitney U test was used to compare between groups. Qualitative data were described by composition comparison, and the chi-square test was used to compare between groups. The box plot was used to compare the differences in sentiment polarity/sentiment subjectivity between different groups. Multivariate logistic regression was used to explore the independent correlation between sentiment polarity/sentiment subjectivity and one-year mortality of elderly sepsis patients after controlling for confounding variables, with a significant difference between the two groups. Ridge regression, XGBoost regression, and random forest were used to explore the importance of sentiment polarity and subjectivity in the one-year mortality of elderly sepsis patients. Restricted cubic spline (RCS) was used to explore whether there was a linear relationship between sentiment polarity, sentiment subjectivity and the one-year mortality of elderly sepsis patients. Kaplan-Meier (KM) curve was used to explore the relationship between the sentiment polarity (or sentiment subjectivity) and the 1-year death of the patient. P < 0.05 is considered to be statistically significant.
3 Results
3.1 Patient baseline information
A total of 5305 patients with sepsis were enrolled in this study, of which 3632 patients died within one year. As can be seen from Table 1, there were differences in age, sentiment polarity, sentiment subjectivity, RDW, WBC, RBC, anion gap, chloride, CCI, GCS, SAPS II, OASIS, LOS, Length of stay in ICU, tobacco, alcohol, coronary artery disease, diabetes, hypertension, separate shock and AKI stage.
As shown in Fig 1A-1B, compared with the control group, the one-year mortality group year had lower sentiment polarity and higher sentiment subjectivity. However, there was no difference in sentiment polarity between men and women (P = 0.871, Fig 1C). The same result was found in sentiment subjectivity (P = 0.141, Fig 1D).
3.2 Correlation between sentiment polarity/sentiment subjectivity and one-year mortality of elderly sepsis patients
We used multivariate logistic regression to further explore whether sentiment polarity and sentiment subjectivity were independently related to the one-year mortality of elderly sepsis patients. The results of multivariate logistic regression (Table 2) showed that sentiment polarity and sentiment subjectivity were independently related to one-year mortality of elderly sepsis patients after adjusting for confounding variables, in which sentiment polarity was a protective factor and sentiment subjectivity was a risk factor.
Three machine learning models explored the independent factors of one-year mortality of elderly sepsis patients, and the results (Table 3) suggested that sentiment polarity and sentiment subjectivity were major factors.
We further explored whether there was a linear relationship between sentiment polarity, sentiment subjectivity and the one-year mortality of elderly sepsis patients. Fig 2A-2B showed that there was a linear relationship between sentiment polarity and the one-year mortality of elderly sepsis patients. At the same time, there was a nonlinear relationship between sentiment subjectivity and the one-year mortality of elderly sepsis patients. KM results showed that high subjectivity was beneficial to the 1-year survival of patients (Fig 3A, HR = 0.70 95% CI [0.66,0.75] p < 0.05), and low polarity was advantageous to the 1-year survival of patients (Fig 3B, HR = 0.89,95% CI [0.84, 0.96])(The high and low groups were divided according to the median.).
4 Discussion
4.1 Nursing sentiment and prognosis
This study was based on the MIMIC-IV database to explore the correlation between the sentiment of nursing notes and the one-year mortality of sepsis patients. We found that sentiment polarity and sentiment subjectivity were independently related to one-year mortality of elderly sepsis patients.
Emotional scores in nursing notes reflect the health care provider’s attitude toward the patient. This emotion or attitude generally arises when the health care provider communicates with the patient. Therefore, the emotional attitude of the health care provider not only reflects the physician’s assessment of the patient’s current physical status, but is also based to some extent on feedback about the efficacy of the patient’s treatment. This feedback largely influences the patient’s adherence to subsequent treatment [11,12]. Specifically, the positive mood of the healthcare provider towards the patient indicates that the patient’s treatment is working well. After receiving such feedback, the patient’s trust in the treatment is increased, which makes them more willing to follow the medical advice and cooperate with the treatment. A more positive attitude and compliance with treatment lead to better patient outcomes. This results in a positive cycle that accelerates patient healing. For example, a cohort study of diabetic patients showed that a highly empathetic physician’s positive attitude toward the patient significantly increased the patient’s adherence to treatment, which in turn increased the rate of glycohemoglobin control [13]. In addition, healthcare professionals’ attitudes affect patients’ psychological state and physiological responses. Positive emotions reduce patients’ anxiety and depression levels. A randomized controlled trial showed that nurses trained in communication built better trust with patients, which led to lower postoperative anxiety and pain [14]. Chronic stress due to illness activates the hypothalamic-pituitary-adrenal (HPA) axis, leading to elevated cortisol and suppressed immune function [15]. Physicians’ emotions toward patients may affect the patients’ HPA axis, influencing their levels of inflammatory markers and thus their prognosis [16,17].
In this study, we found that sentiment polarity and sentiment subjectivity were independently related to one-year mortality of elderly sepsis patients. Sentiment polarity was negatively related to one-year mortality of elderly sepsis patients and sentiment subjectivity was positively related to one-year mortality of elderly sepsis patients. A study based on the MIMIC-III database found that sentiment polarity and sentiment subjectivity were independently associated with the 28-day hospitalization mortality of sepsis patients, and both sentiment polarity and sentiment subjectivity were negatively related to the 28-day hospitalization mortality of sepsis patients [18]. A similar relationship was also found in another study that explored the emotion of nursing notes and the 30-day death of sepsis patients [19]. Our results were inconsistent with those of other people’s research, which may be mainly related to the inconsistency of the selected research objects and research nodes. Specifically, the dependent variables in Gao et al. ‘s [18] and Zhou et al. ‘s [19] studies were 28-day and 30-day in-hospital deaths, respectively, whereas the dependent variable in our study was 1-year mortality. Different criteria for exclusion, which in turn lead to heterogeneity in the study population. Their studies included 1851 and 1844 patients, respectively, whereas our study included 5305 patients. The sentiment of medical staff’s notes not only affected the prognosis of sepsis patients but also affected the prognosis of other diseases. McCoy and others conducted sentiment analysis on the discharge records of psychiatric inpatients and patients in general medical units from 2011 to 2014, and found that more positive sentiments were related to patients’ lower risk of readmission [20]. In another study of severe acute renal failure, it was found that compared with the survival group, the sentiment subjectivity score of the death group was higher, the sentiment polarity score was lower, and the prediction model with sentiment score had better prediction performance [21]. Another study found similar results [8]. Studies on the correlation between sentiment polarity and prognosis have found a negative correlation between emotional polarity and prognosis, indicating that the more positive the sentiment of medical staff, the better the prognosis of patients. This also suggested that medical staff need to maintain a positive attitude in the daily diagnosis and treatment process.
4.2 Other factors and prognosis in sepsis
In this study, we also found some influencing factors of one-year mortality of elderly sepsis patients, such as age, CCI, and RDW. Many studies have found that age is an influential factor in the death of sepsis patients. Compared with young people, elderly patients have weakened physical resistance and impaired physiological barriers. CCI is the most commonly used comorbidity assessment tool at present. The higher the score, the more serious the number and degree of underlying diseases are. Elderly patients often have more basic diseases (comorbidities), and many common comorbidities increase the risk of infection and death [22]. RDW is a parameter reflecting the volume heterogeneity of peripheral red blood cells, which is closely related to the prognosis of many diseases. At the present stage, most studies focus on the influence of patients themselves on the prognosis, and few studies pay attention to the influence of medical staff. This study suggests that we should also pay attention to the influence of medical staff’s behavior or sentiment on the prognosis of diseases in subsequent studies.
4.3 Strengths and limitations
This study explored the relationship between the emotional score of nursing notes and the death of elderly patients with sepsis within one year. We found its potential in predicting the death of patients within one year, which suggested the application value of nursing notes in clinical practice. However, some limitations also existed in this study. Firstly, due to the limited data in the database, this study lacked some information about biomarkers of sepsis patients, such as Uncoupling Protein 2. Secondly, this study only revealed the association and cannot determine the causal relationship between them. Thirdly, TextBlob has a weak ability to analyze complex sentences, and may have errors in the emotional direction of complex sentences in nursing notes and form bias for subjectivity and polarity scores, which affects the accuracy of the conclusion. Fourthly, the subjective nature of nursing notes may be influenced by the personality of the health care provider. Fifthly, the study was not validated with external data due to the lack of suitable external data at present. In the future, we will conduct data collection in our hospital to validate the findings. Sixthly, this study was a single-center retrospective study, which inevitably had some bias. For example, MIMIC comprises deidentified health-related data from patients who were admitted to the critical care units of the Beth Israel Deaconess Medical Center. In the United States, this center is a top 3 hospital with a suitable staff ratio and a high level of nurse experience. Therefore, the conclusion of this study may only be suitable for such large hospitals. For other small hospitals, the conclusion may not be suitable. In addition, patients from a single center are biased and heterogeneous with patients from other hospitals (such as disease severity, race, social status, family economics, education level, etc.). These results may result in the conclusions obtained in this study not being able to be verified in patients from other hospitals.
4.4 Future research directions
Future research should focus on exploring the causal relationship between sentiment scores in nursing notes and patient prognosis via well-designed clinical trials, detailing the mechanisms between the two. Then, develop a multimodal prediction model that integrates sentiment analysis of nursing texts with traditional clinical scoring systems, and enhance the generalization of the model through multicenter validation; at the same time, it is necessary to build a real-time automated monitoring tool, incorporate dynamic sentiment indicators into the clinical decision support system to achieve early warning of high-risk patients, and deeply analyze the intrinsic relationship between sentiment characteristics (such as negative/positive word combination patterns) and pathophysiological mechanisms. In addition, the impact of cross-language and cultural differences on the validity of sentiment analysis needs to be addressed urgently, and adaptive algorithms (such as Chinese subjective dictionary construction technology) need to be developed to improve the universality of application in different medical contexts.
5 Conclusions
The sentiment of nursing notes was correlated with the one-year mortality of elderly sepsis patients.
References
- 1. Rudd KE, Johnson SC, Agesa KM, Shackelford KA, Tsoi D, Kievlan DR, et al. Global, regional, and national sepsis incidence and mortality, 1990-2017: analysis for the Global Burden of Disease Study. Lancet. 2020;395(10219):200–11.
- 2. Jiang W. Current status of sepsis epidemiology in China. J Grad Med. 2019;31(1):5–8.
- 3. Hammond NE, Kumar A, Kaur P, Tirupakuzhi Vijayaraghavan BK, Ghosh A, Grattan S, et al. Estimates of sepsis prevalence and outcomes in adult patients in the ICU in India: a cross-sectional study. Chest. 2022;161(6):1543–54. pmid:35092747
- 4. Li A, Ling L, Qin H, Arabi YM, Myatra SN, Egi M, et al. Epidemiology, management, and outcomes of sepsis in ICUs among countries of differing National Wealth across Asia. Am J Respir Crit Care Med. 2022;206(9):1107–16. pmid:35763381
- 5. Liu Y-C, Yao Y, Yu M-M, Gao Y-L, Qi A-L, Jiang T-Y, et al. Frequency and mortality of sepsis and septic shock in China: a systematic review and meta-analysis. BMC Infect Dis. 2022;22(1):564. pmid:35729526
- 6. Xu F, Zhang L, Huang T, Yang R, Han D, Zheng S, et al. Influence of ambulatory blood pressure-related indicators within 24 h on in-hospital death in sepsis patients. Int J Med Sci. 2022;19(3):460–71. pmid:35370467
- 7. Zubillaga A, Laccourreye P, Kerexeta J, Larburu N, Alonso E, Gomez DJ. Hospital readmission prediction via keyword extraction and sentiment analysis on clinical notes. Stud Health Technol Inform. 2022;295:339–42.
- 8. Waudby-Smith IER, Tran N, Dubin JA, Lee J. Sentiment in nursing notes as an indicator of out-of-hospital mortality in intensive care patients. PLoS One. 2018;13(6):e0198687. pmid:29879201
- 9.
Johnson A, Bulgarelli L, Pollard T, Horng S, Celi LA, Mark R. MIMIC-IV (version 1.0): PhysioNet. 2021. https://doi.org/10.13026/s6n6-xd98
- 10. Zhang J, Jin H. Method of subjective lexicon creation for Chinese sentiment analysis. Appl Mechanics Material. 2010;34–35:801–5.
- 11. Zolnierek KBH, Dimatteo MR. Physician communication and patient adherence to treatment: a meta-analysis. Med Care. 2009;47(8):826–34. pmid:19584762
- 12. Bukstein DA. Patient adherence and effective communication. Ann Allergy Asthma Immunol. 2016;117(6):613–9. pmid:27979018
- 13. Hojat M, Louis DZ, Markham FW, Wender R, Rabinowitz C, Gonnella JS. Physicians’ empathy and clinical outcomes for diabetic patients. Acad Med. 2011;86(3):359–64. pmid:21248604
- 14. Wittenberg E, Goldsmith J, Buller H, Ragan SL, Ferrell B. Communication training: needs among oncology nurses across the cancer continuum. Clin J Oncol Nurs. 2019;23(1):82–91. pmid:30682007
- 15. Lwin MN, Serhal L, Holroyd C, Edwards CJ. Rheumatoid arthritis: the impact of mental health on disease: a narrative review. Rheumatol Ther. 2020;7(3):457–71. pmid:32535834
- 16. Dich N, Rozing MP, Kivimäki M, Doan SN. Life events, emotions, and immune function: evidence from Whitehall II Cohort Study. Behav Med. 2020;46(2):153–60. pmid:30908163
- 17. Wirtz PH, von Känel R. Psychological stress, inflammation, and coronary heart disease. Curr Cardiol Rep. 2017;19(11):111. pmid:28932967
- 18. Gao Q, Wang D, Sun P, Luan X, Wang W. Sentiment analysis based on the nursing notes on in-hospital 28-day mortality of sepsis patients utilizing the MIMIC-III database. Comput Math Methods Med. 2021;2021:3440778. pmid:34691236
- 19. Zou Y, Wang J, Lei Z, Zhang Y, Wang W. Sentiment analysis for necessary preview of 30-day mortality in sepsis patients and the control strategies. J Healthc Eng. 2021;2021:1713363. pmid:34733452
- 20. McCoy TH, Castro VM, Cagan A, Roberson AM, Kohane IS, Perlis RH. Sentiment measured in hospital discharge notes is associated with readmission and mortality risk: an electronic health record study. PLoS One. 2015;10(8):e0136341. pmid:26302085
- 21. Zha D, Yang X, Zhang H, Xu L, Jin Y, Li N, et al. Development and validation of prediction model using nursing notes on sentiment scores for prognosis of patients with severe acute kidney injury receiving continuous renal replacement therapy based on computational intelligence algorithms. Ann Transl Med. 2022;10(20):1110. pmid:36388821
- 22.
Zhang XWW, Dong S. Difficulties and challenges in the diagnosis and treatment of sepsis in the elderly. China Emergency Medicine. 2024;44(5):376–84.