
Understanding patterns of loneliness in older long-term care users using natural language processing with free text case notes

Abstract

Loneliness and social isolation are distressing for individuals and predictors of mortality, yet data on their impact on publicly funded long-term care is limited. Using recent advances in natural language processing (NLP), we analysed pseudonymised administrative records containing 1.1 million free-text case notes about 3,046 older adults recorded in a London council between 2008 and 2020. We applied three NLP methods—document-term matrices, pre-trained embeddings, and transformer-based models—to identify loneliness or social isolation. The best-performing model, a bidirectional transformer, achieved an F1 score of 0.92 on a test set of unseen sentences. Using this model, we generated predictions for the full dataset and assessed construct validity through comparison with survey data and the literature. Our measure is associated with expected characteristics, such as living alone and impaired memory, and is a strong predictor of social inclusion services. Approximately 43% of individuals had a sentence indicating loneliness or isolation in their case notes at their initial care assessment, comparable to survey-based estimates. Unlike surveys, our indicator is linked to other administrative data, enabling development of models of service use with loneliness or isolation as independent variables. An open-source version of the model is available in a GitHub repository.

Introduction

In 2021, public expenditure on long-term care was 1.98% of GDP in OECD countries [1]. In England, where the term adult social care describes long-term support to complete activities of daily living, public spending was £23.7 billion (USD $30.4 billion) in 2022/2023 [2]. By 2038, projections indicate a 55% increase from 2018 levels in the number of older people receiving care, with public expenditure approximately doubling [3]. As most participants in national surveys do not receive publicly funded care [4,5], administrative care records provide a rich alternative source of information about people using the social care system. In the UK, long-term care needs have been widely recorded in electronic databases since the 1990s [6], and similar systems exist internationally [7,8]. Recent papers use natural language processing to extract information from free-text electronic health records [9–14]. Few studies apply these methods to social care records [15–17], and none have focused on loneliness or social isolation.

This paper extracts an indicator of loneliness or social isolation from free-text administrative records. Needs assessment forms often lack structured indicators but include free text on social needs. Classified free-text data can be used to model care expenditure or service use data, such as care home entry, which are part of these records.

The impact of loneliness and social isolation

Loneliness and social isolation are as significant predictors of mortality as smoking, obesity, or hypertension [1820]. Social isolation is an “objective lack of relationships” [21], whereas loneliness is a “subjective, distressing feeling” when social relationships are inadequate [22].

Loneliness has been a longstanding priority for the WHO and governments, with its importance increasingly emphasised in recent years, particularly following the Covid-19 pandemic [23–28]. Yet, internationally, recent decades have seen many countries retrench community care services towards personal care, with social support reduced [29–32]. Evidence suggests that loneliness increases long-term care use [33–37]. However, surveys often include few publicly funded care users [38–40], and their records cannot be linked to detailed service use information, limiting their insights compared to administrative data.

Administrative records in England record eligibility-related social needs in structured and free text formats [41]. Distinguishing loneliness from social isolation in free text is challenging, as terminology often diverges from literature definitions. For example, “feels isolated” might refer to subjective loneliness or limited social contact. A 2024 paper by Patra et al. distinguished social support needs from psychiatric records, noting greater consistency and detail than typical social care notes [14]. Given the inconsistency in our dataset, we analyse loneliness and social isolation jointly. This combined approach is supported by findings showing both loneliness and isolation adversely affect older adults’ mortality and is common in public health and work extracting Social Determinants of Health (SDoH) from clinical notes [12,18–20,42–44].

Materials and methods

Data collection

In England, every person requesting publicly funded care must receive an assessment under the Care Act 2014. In this paper we attempt to identify social isolation or loneliness from the free text notes of a London borough. Adult social care records are written by individuals employed by a local authority to assess needs and commission care. This generally consists of social workers, occupational therapists or care managers. Workers complete an assessment form, which is a snapshot of needs at a certain time containing both structured data and free text. Recording systems also contain case notes, which are free text fields to record ongoing work on the case over time. In Fig 1 we show how the assessment form and case notes appear to caseworkers.

Fig 1. Example of format of structured and unstructured data.

https://doi.org/10.1371/journal.pone.0319745.g001

Ethics statement

This study uses secondary data from pseudonymised administrative social care records. We sought and were granted departmental ethics approval for the project on 30th May 2019 at the London School of Economics and Political Science (LSE), in line with LSE’s Research Ethics Policy and Procedure.

The data were pseudonymised prior to processing, including the removal or replacement of identifiable personal information such as names, addresses, email addresses, telephone numbers, unique identifiers (e.g., NHS numbers), financial information, and location details. A Data Processing Impact Assessment (DPIA) was carried out to ensure the protection of individuals’ data privacy, and no automated decision-making processes were involved. The Data Flow Diagram is set out in S1 Fig in the Data Flow Appendix.

Details of the project were made available in the local authority’s Privacy Notice and on a separate website informing individuals of and explaining the study, allowing individuals to opt out if desired. Individual consent for data use was not required, as the data were processed in line with the UK General Data Protection Regulation (GDPR) under the legal basis of legitimate interests. This legal basis allows processing of pseudonymised data for research purposes where it serves a social or public interest and individuals are informed and able to opt out. Permission for data processing was granted by the National Health Service NHS Confidentiality Advisory Group (CAG) in June 2020 (reference number 20/CAG/0043), which was renewed annually. CAG ensures that data processing complies with national regulations for handling confidential patient information in the UK.

Data extraction and characteristics

A query was written to identify all individuals aged 65 or over on August 1st 2020 who had been receiving services for at least one year since 1st January 2016. Administrative records for these individuals were then extracted from the local authority database. Identifiable free text data tokens were masked using the open-source text pseudonymisation software PSCleaner [45]. The data was then sent to an NHS Commissioning Support Unit, where identifiable structured data such as NHS numbers were removed. Finally, the data was transferred securely to the research team at the Care Policy and Evaluation Centre (CPEC) at the London School of Economics and Political Science (LSE).

The data includes all free text case notes recorded for individuals in the cohort between 2008 and 2020, as well as needs assessment and service receipt data. During this period, there were 3,046 individuals aged over 65 receiving long-term care. The data contains 10,821 assessment forms comprising 19.1 million words of free text, and 1.14 million case notes, containing 87.8 million words of free text. Case notes in the dataset encompass a wide range of updates related to the care and support of individuals. These include records of emails and telephone calls, descriptions of home visits, case screening and allocation, managerial direction, case summaries, allegations of abuse or neglect, as well as referrals to services such as occupational therapy, physiotherapy, and intermediate care. The volume of notes reflects the comprehensive documentation required in social care to capture various interactions and decisions throughout the course of care. The distribution is highly skewed: for example, the 50 individuals with the most text account for 8 million words (7.8% of the total), the same amount as the 850 individuals with the least. Summary statistics per person are presented in Table 1.

In addition to the free text case notes, structured data fields are routinely collected during the assessment process. These fields capture key demographic and personal information that is relevant for care planning and service provision. Structured data includes information such as gender, ethnicity, age, functional ability with activities of daily living (ADLs), and whether the individual lives alone. This information is collected directly by social care professionals during initial assessments and periodic reviews as part of standard care practices. These structured data fields provide important context for understanding the care needs of individuals and were used in conjunction with the free text data in our analysis. Of the 3,046 individuals, 61.2% were women, 47.8% White British, with a median age in 2020 of 81, and median of 3 years and 6 months of services received. These characteristics are set out in Table 2.

Table 2. Characteristics of individuals in the training and test set.

https://doi.org/10.1371/journal.pone.0313772.t002

Overview of model development and evaluation

This section outlines methods for model development and evaluation. We describe data pre-processing, manual text classification, and training machine learning algorithms. Model evaluation involves assessing performance metrics on a test set and examining construct validity by testing expected relationships with needs, demographics, and service use.

Model development

We endeavoured to use as parsimonious a model as possible, beginning with count-based vector representations of words such as document-term matrices [46] and Term Frequency Inverse Document Frequency (Tf-idf) [47]. We also used the SpaCy large pre-trained word embeddings [48], and transformer-based representations, specifically RoBERTa and DistilRoBERTa [49]. The overall process for training and comparing these models is set out in Fig 2. Unless otherwise stated, we used Python 3.9.7 in all the analysis [50].

For all approaches, we replaced the pseudonymised masks (e.g., ****, which had been used to mask identifiable information) with randomly generated names and locations to ensure that the language models could correctly tokenise and parse the sentences. Retaining the pseudonymisation masks could have led to issues with tokenisation, as the models may not have handled repeated placeholders effectively. For the count-based methods, we also lemmatised the text, converted it to lowercase, and removed stop words. We set out further details in Data pre-processing in S2 Fig in our Supporting Information document. We then divided the data into a training and test set, using stratified random sampling to ensure similar proportions of individuals in each set (see Table 2). Each set contained notes about 200 distinct individuals. We split each set by person to ensure that the test set did not contain sentences about individuals who are in the training set.
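The mask-replacement and person-level splitting steps described above can be sketched in a few lines. This is an illustrative re-implementation, not the authors' pipeline (which used PSCleaner's output and stratified sampling); the mask pattern, surrogate name lists, and function names are assumptions, and stratification by characteristics is omitted for brevity.

```python
import random
import re

# Illustrative pool of surrogate names; the real pipeline would draw from much
# larger lists so repeated placeholders do not bias tokenisation.
SURROGATE_NAMES = ["Alice Smith", "John Brown", "Priya Patel"]

def replace_masks(text: str, rng: random.Random) -> str:
    """Replace runs of asterisk masks (e.g. '****') with a random surrogate name."""
    return re.sub(r"\*{2,}(?:\s*\*+)*", lambda _: rng.choice(SURROGATE_NAMES), text)

def split_by_person(person_ids, test_fraction=0.5, seed=0):
    """Split person IDs (not sentences) so no individual spans both sets."""
    ids = sorted(set(person_ids))
    rng = random.Random(seed)
    rng.shuffle(ids)
    cut = int(len(ids) * (1 - test_fraction))
    return set(ids[:cut]), set(ids[cut:])

rng = random.Random(42)
clean = replace_masks("Spoke to **** about her care plan.", rng)
train_ids, test_ids = split_by_person([1, 2, 3, 4])
```

Splitting by person rather than by sentence is the key design choice: sentences about the same individual are highly correlated, so a sentence-level split would leak information into the test set.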

Human annotators manually classified 10,083 sentences in the training set and 3,573 sentences in the test set for model evaluation. These manually classified sentences represent a subset of the total number of sentences in the dataset, which exceeds 600,000. It would not have been feasible to classify every sentence manually; instead, we focused on classifying sentences most likely to inform the development of the model. We defined a set of rules for annotators to determine which sentences to classify, using binary classification (either indicative or not indicative of loneliness or social isolation). These rules covered statements such as when a person explicitly expressed feeling lonely, had little social contact, or received referrals to services like befriending. Conversely, sentences indicating practical support needs, support for safety or cognition, or day centre attendance for carer respite were classified as not indicative of loneliness or social isolation. The full set of rules is detailed in S1 Text. Our interrater reliability measures produced Cohen’s κ [51] of 0.89 (95% CI 0.84–0.94) and Krippendorff’s α [52] of 0.89 (95% CI 0.89–0.93). The maximum level of agreement in both cases is 1, and 0.89 represents excellent levels of agreement beyond chance [53,54]. The training dataset was imbalanced, with 9,383 sentences in the negative class (not indicative of loneliness or social isolation) and 700 in the positive class.
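Interrater agreement of the kind reported above compares observed agreement against agreement expected by chance from each annotator's label frequencies. A minimal pure-Python sketch of Cohen's κ (a schematic re-implementation, not the authors' code):

```python
from collections import Counter

def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items."""
    assert len(labels_a) == len(labels_b)
    n = len(labels_a)
    # Observed proportion of items where the annotators agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Expected chance agreement, from each annotator's marginal label frequencies.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / n ** 2
    return (observed - expected) / (1 - expected)
```

κ = 1 indicates perfect agreement and κ = 0 agreement no better than chance, so the reported 0.89 sits well within the range conventionally described as excellent.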

We implemented three approaches for the representation of words:

  1. Count-based approaches: We split each sentence into lemmatised, word-level tokens. Each sentence was represented by a raw count of the number of times a word appears in it (a document-term matrix) [46]. We also applied Term Frequency Inverse Document Frequency (Tf-idf) [55] to transform the count matrix to a weighted representation, reducing the weighting of higher frequency words across all documents.
  2. Pre-trained vectors: We used the Spacy large English model [48], which represents language through dense embeddings [56], where words which have similar semantic meanings are clustered together in vector space. We took the mean of each dimension to create a single 300-dimensional vector to represent each sentence.
  3. Transformer-based approaches: We used the RoBERTa base model, which has 12 hidden layers, 768 dimensions and 12 heads [49]. This was relatively computationally expensive to fine-tune, so for comparison we also used DistilRoBERTa, which has identical parameters except it has 6 hidden layers, and is around twice as fast to train. In both cases, we used the HuggingFace implementation of each model’s tokenizer to split each sentence into sub-word tokens [57,58].
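The count-based representations in approach (1) can be illustrated schematically. This is a simplified re-implementation using one common idf variant, idf(w) = log(N / df(w)); the paper's actual implementation and weighting scheme may differ.

```python
import math
from collections import Counter

def document_term_matrix(docs):
    """Rows: documents; columns: sorted vocabulary; cells: raw term counts."""
    vocab = sorted({w for d in docs for w in d.split()})
    rows = []
    for d in docs:
        counts = Counter(d.split())
        rows.append([counts.get(w, 0) for w in vocab])
    return vocab, rows

def tfidf(docs):
    """Weight raw counts by log(N / document frequency) to down-weight common words."""
    vocab, dtm = document_term_matrix(docs)
    n = len(docs)
    df = [sum(1 for row in dtm if row[j] > 0) for j in range(len(vocab))]
    weighted = [[row[j] * math.log(n / df[j]) for j in range(len(vocab))] for row in dtm]
    return vocab, weighted
```

Note how a word appearing in every document (here, "feels") receives a weight of zero, which is the intended effect: ubiquitous words carry little discriminative signal.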

We describe these approaches in more detail in S1 Text. After pre-processing, vectorising and labelling each sentence, the problem becomes a binary classification task. For both the count-based and pre-trained embedding approaches, we evaluated five classification algorithms. We used k-fold cross-validation to avoid overfitting on the training set, choosing k = 5 as a value which tends to elicit reasonably high accuracy [59] while reducing training time compared with higher values. The five classification algorithms were: class-weighted logistic regression, bootstrap aggregation, random forest, quadratic discriminant analysis and a feed-forward neural network. Again, we set these out in the Classification algorithms section of S1 Text. For the transformer approach, the HuggingFace implementation of both the RoBERTa and DistilRoBERTa models contains a classification head. We trained this final layer of the model on the labelled, tokenised sentences using the HuggingFace Transformers and PyTorch libraries [60,61]. Our final model had a training batch size of 16 sentences, with 500 warm-up steps and weight decay of 0.01. The weight decay parameter is bounded between 0 and 1, with 0.01 indicating relatively low L2 regularisation, which can help the model fit the smaller positive class more accurately, but risks overfitting. The final output layer produces a predicted probability for the negative and positive classes (not indicative or indicative of loneliness or social isolation, respectively). During training, the parameters of the classification head were optimised using binary cross-entropy loss, which measures the difference between the predicted probabilities and the true labels.
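The binary cross-entropy objective used to train the classification head is the mean negative log-likelihood of the true labels under the predicted probabilities. A pure-Python sketch (hypothetical inputs, not the paper's actual losses):

```python
import math

def binary_cross_entropy(y_true, p_pred, eps=1e-12):
    """Mean of -[y*log(p) + (1-y)*log(1-p)] over all examples."""
    total = 0.0
    for y, p in zip(y_true, p_pred):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / len(y_true)
```

The loss falls towards zero as predicted probabilities approach the true labels, and equals log 2 when the model is maximally uncertain (p = 0.5), which is what gradient descent on the classification head minimises.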

Model evaluation

We evaluate the model’s accuracy by comparing performance metrics (accuracy, precision, recall, F1) on a test set of 3,573 unseen sentences drawn from individuals not included in the training data (3026 in the negative class and 547 in the positive class). We assess construct validity of the indicator derived from the best-performing classification model by analysing associations between the NLP model output and demographic characteristics, and comparing this with associations in survey data. We also conduct logistic regression to assess whether the model’s loneliness or isolation predictions are associated with the use of services typically related to social support needs.
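The four performance metrics can be computed directly from the confusion-matrix counts. A sketch for the binary case, with the positive class being "indicative of loneliness or isolation" (a re-implementation for illustration, not the evaluation code used in the paper):

```python
def classification_metrics(y_true, y_pred):
    """Accuracy, precision, recall and F1 for binary labels (1 = positive class)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    accuracy = (tp + tn) / len(y_true)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return accuracy, precision, recall, f1
```

On an imbalanced test set like this one, a classifier that always predicts the majority class achieves high accuracy but zero recall and F1, which is why F1 rather than accuracy is the headline metric.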

Construct validity: Comparison with survey data.

Using the best-performing model, we assess construct validity by classifying free text from the initial assessments of 1,331 individuals at their first contact with statutory care services. We analyse text from assessment forms and case notes within 90 days of the assessment, with the indicator of loneliness or isolation treated as binary. We derive four metrics: individuals with no positive sentences (Neither), those with a positive sentence only in their assessment (Assessment), those with a positive sentence only in case notes (Case notes), and those with both (Both). This is set out in Table 3.

Table 3. Classification outcomes across assessments and case notes.

https://doi.org/10.1371/journal.pone.0313772.t003

We then compare the results of the model predictions with pooled data from waves 6–9 (2012–2019) of ELSA [40]. This secondary dataset, collected as part of a large national survey through structured interviews and self-reported questionnaires, provides a validated source of information on the characteristics of older adults in England. We use ELSA data for all older adults who stated that they had care needs and received publicly funded care (N = 995 unique individuals with 1361 total observations over the period). We pool the results due to the low number of responses in some groups. We tabulate responses to the ELSA Center for Epidemiological Studies Depression Scale (CES-D) loneliness question [62]. We also compare our results to the three UCLA loneliness scale questions within ELSA, converting a total score of 6 or more into a binary indicator of loneliness, as in Hanratty et al. [33]. As our model measures loneliness and social isolation, in the ELSA data we also establish which individuals are socially isolated according to the Social Network Index (SNI) defined in Minicuci et al. (2016) [63].

We compare the results with ELSA graphically, by examining the proportion of people in our data and in ELSA who appear lonely or socially isolated, broken down by demographic characteristics and care needs. We also conduct a Pearson’s χ2 test of independence [64] of each need or demographic factor with loneliness or isolation, to establish whether there are the same associations between our indicator of loneliness and those found in ELSA. Finally, we conduct a logistic regression of all these factors and loneliness or isolation, to establish which factors remain significant after controlling for characteristics such as living alone.
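For a binary indicator crossed with a binary factor (e.g. lonely × lives-alone), Pearson's χ² reduces to comparing observed cell counts with those expected under independence. A minimal sketch returning the test statistic (obtaining a p-value would additionally require the χ² distribution, e.g. via scipy; the table values below are invented for illustration):

```python
def chi2_2x2(table):
    """Pearson's chi-square statistic for a 2x2 contingency table [[a, b], [c, d]]."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = [a + b, c + d]
    col_totals = [a + c, b + d]
    stat = 0.0
    for i, obs_row in enumerate(table):
        for j, obs in enumerate(obs_row):
            # Expected count under independence: row total * column total / n.
            expected = row_totals[i] * col_totals[j] / n
            stat += (obs - expected) ** 2 / expected
    return stat

# Hypothetical counts: rows = lonely yes/no, columns = lives alone yes/no.
stat = chi2_2x2([[20, 10], [10, 20]])
```

With 1 degree of freedom, a statistic above 3.84 corresponds to significance at the 5% level.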

Construct validity: Predicting service receipt for loneliness or isolation.

Our data includes information on whether individuals are attending day centres, which are community-based services provided for older people at risk of loneliness or social isolation [30]. To assess whether, as we expect, our indicator is associated with day centre attendance, we generated predictions of loneliness or social isolation using our best-performing model, the RoBERTa-based language model. Next, we conducted a logistic regression to examine whether these predictions were associated with the receipt of day centre services within 90 days of the initial assessment, for the 1,331 individuals whose initial assessment could be identified. To ensure that the RoBERTa model was not simply picking up cases with more notes, or cases driven by demographic characteristics rather than actual loneliness or isolation, we included the number of notes and relevant demographic variables as controls in the logistic regression model. This allowed us to verify that the RoBERTa model’s predictions were not confounded by factors such as a greater volume of documentation or demographic differences, rather than genuine cases of loneliness or social isolation. The logistic regression model is specified in Eq (1).

log(p / (1 − p)) = β₀ + β₁SIL + β₂notes + β₃sex + β₄ethnicity + β₅age + β₆pc + β₇memory + β₈safety + β₉alone  (1)

Where p is the probability of receiving day services in the first 90 days, SIL is the binary prediction of social isolation or loneliness generated by our model, notes is the number of sentences written within 90 days of assessment, sex is a binary variable where 1 indicates male, ethnicity is a binary indicator of white or non-white, and age is the age of the person receiving care in years. Additionally, we include the following needs-severity scores as continuous variables, where higher values indicate greater care needs: pc is personal care needs (the sum of mobility, toileting and dressing), memory is the score for memory and cognition, and safety is the extent to which the person is aware of their own safety and risk. Finally, alone is a binary indicator of whether an individual lives alone. The demographic and needs-related scores are extracted from the structured data of the initial assessment.
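Under the specification in Eq (1), a fitted model maps covariates to a probability through the logistic function. A sketch of that mapping with hypothetical coefficients (the β values and variable subset here are invented for illustration and are not the paper's estimates):

```python
import math

def day_centre_probability(coefs, covariates):
    """Logistic model: p = 1 / (1 + exp(-(b0 + sum(b_k * x_k))))."""
    linear = coefs["intercept"] + sum(
        beta * covariates[name] for name, beta in coefs.items() if name != "intercept"
    )
    return 1 / (1 + math.exp(-linear))

# Hypothetical coefficients for illustration only.
coefs = {"intercept": -2.0, "SIL": 1.5, "notes": 0.001, "alone": 0.5}
person = {"SIL": 1, "notes": 200, "alone": 1}
p = day_centre_probability(coefs, person)
```

Holding the other covariates fixed, flipping SIL from 0 to 1 shifts the log-odds by its coefficient, which is how the association between the model's loneliness prediction and day centre receipt is read off the regression.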

Results

We present a set of results for each method of model evaluation. Firstly, we evaluate the performance of each model against the test set. Secondly, we run the best-performing model on text recorded within 90 days of every initial assessment and compare the significance of association with demographic characteristics with survey data from ELSA. Finally, we present the logistic regression of the results of the best-performing model on day centre attendance.

Model performance on the test set

In Table 4 we detail the accuracy, precision, recall and F1 score [65] of each model on the test set of 3,573 labelled sentences not seen during training. The transformer-based models considerably outperform all other models, with DistilRoBERTa achieving an F1 score of 0.86 and RoBERTa 0.92. The pre-trained Spacy embeddings outperformed all non-transformer-based approaches when classes were predicted using a feed-forward neural network, with an F1 score of 0.61. However, using the same embeddings, the neural network only slightly outperformed logistic regression, which had an F1 score of 0.58. The count-based approaches were not effective at prediction using any of the classification methods. High accuracy alongside low precision, recall, and F1 scores in some models reflects the imbalanced dataset, as models like random forest almost exclusively predict the majority class, inflating accuracy while failing to classify minority cases. In Fig 3, we present a confusion matrix comparing the predictions of the best-performing model against the classes defined by human annotation.

Construct validity: Comparison with survey data.

The overall proportion of individuals with at least one case note indicating loneliness or social isolation according to our model is 0.44 (95% CI 0.42–0.47), and the proportion with at least one sentence indicating the same in their needs assessment is 0.43 (95% CI 0.40–0.45). This compares with a proportion of 0.38 in ELSA (95% CI 0.32–0.43) who are lonely according to the CES-D measure or SNI isolated, and 0.45 (95% CI 0.39–0.51) who are lonely according to the UCLA measure, or SNI isolated. The overall proportions are similar to the UCLA loneliness measure, and this holds for many characteristics. We present in Fig 4 a breakdown of these proportions by demographic and needs-related factors. While these similarities are reassuring, there are differences between the results of our model and ELSA. For example, the difference in loneliness between individuals who live alone and live with others is wider in ELSA than in our data. We present the results in Fig 4 in tabular form in the S1 Text document.

Fig 4. Proportion of lonely/isolated by demographic characteristics: Administrative and survey data.

https://doi.org/10.1371/journal.pone.0319745.g004

We set out in Table 5 the results of the χ2 test of independence between loneliness or isolation and the needs-related factors in both our results and ELSA. We also present the results of the combined indicators, Either and Both. The χ2 tests reveal both similarities and differences in the associations between our RoBERTa-based indicator and the ELSA measures of loneliness and social isolation (CES-D and UCLA combined with SNI). Both our indicator and the survey data show a strong association between loneliness and living alone. However, our indicator also identifies a significant link between memory issues and loneliness, which is not found in the ELSA data. Additionally, ELSA data shows that people receiving unpaid care are more likely to be lonely, a pattern not reflected in our findings.

Table 5. Factors in structured data associated with loneliness and social isolation: Administrative data and ELSA.

https://doi.org/10.1371/journal.pone.0313772.t005

We set out the results of the χ2 test and regression of the association with needs and demographic factors in Table 5. We assessed multicollinearity using the generalised variance inflation factor (GVIF), with a maximum value of 1.3, well below the typical threshold of 4–10 [66,67]. The regression output indicates that in ELSA, living alone is by far the most significant predictor, though requiring support with shopping and presence of unpaid care are also significant. Across all four of our measures, living alone is also a very important predictor of loneliness or isolation. The coefficient is of a similar magnitude to that for memory, where individuals who have memory problems are more likely to be lonely or socially isolated.

This discrepancy between our results and ELSA may be due to differences in the cohorts or the nature of the data, as ELSA data is self-reported, while administrative assessments of functional ability are recorded by professionals. Although we have taken a subset of individuals from ELSA who are older people receiving local authority care, individuals in the administrative data have higher needs than those in ELSA (see Table 6). We do not consider this a barrier to comparing the datasets, but we do consider it when interpreting the results. We elaborate on this in the Discussion section.

Table 6. Comparison of demographic and ADL needs between ELSA waves 6–9 and administrative data.

https://doi.org/10.1371/journal.pone.0313772.t006

Construct validity: Predicting service receipt for loneliness or isolation.

We include the results of the day centre services regression in Table 7. Accounting for the number of notes and demographic factors, the model output remains a strong predictor of whether an individual is in receipt of day centre services. The maximum GVIF for any indicator is less than 1.4.

Table 7. Logistic regression: Association of loneliness extracted from free text with services received for loneliness.

https://doi.org/10.1371/journal.pone.0313772.t007

Discussion

The goal of this analysis was to extract an indicator of loneliness or social isolation from free text. Our key finding is that a RoBERTa-based transformer model can produce this indicator with high accuracy (F1 = 0.92), outperforming simpler methods like document-term matrices or pre-trained embeddings. Transformer models handle the complexity of adult social care records better, likely due to their attention mechanism, which captures context-dependent distinctions. Example sentences in Fig 5 illustrate cases where transformer models succeed while other methods do not, reflecting their ability to process the complex, unstructured data that is found in adult social care records.

Fig 5. Examples of polysemy in adult social care case notes.

https://doi.org/10.1371/journal.pone.0319745.g005

We validated the indicator by applying the model to initial assessments of 1,331 individuals and comparing its predictions to survey data and the literature. The indicator strongly predicts the receipt of social inclusion services and aligns with known associations, such as living alone. However, there are differences from survey findings: in ELSA, living alone shows a stronger link to loneliness, likely because marital status, a component of the ELSA SNI indicator [63], is strongly correlated with living alone. Conversely, our indicator identifies a significant association between loneliness and memory issues, which is absent in ELSA.

These discrepancies may stem from differences in datasets. Administrative records include higher-need individuals than ELSA, where only 12% report impaired memory compared to 60% in administrative data (Table 6). Survey attrition may exclude those with severe needs [68], while self-reports in ELSA could understate functional impairments due to social desirability bias or cognitive issues [69,70]. Prior research shows correlations between self-reported and actual ability can be as low as 0.2, with individuals often overstating their mobility [71,72].

Self-reports in ELSA may also explain other differences, such as the link between unpaid care and loneliness observed in ELSA but not in our data. Unpaid care in ELSA may act as a proxy for need, which may not be fully captured by ELSA’s functional questions. Despite small sample sizes in ELSA, we retained all older individuals receiving publicly funded care but interpret these findings cautiously. While challenges exist in comparing self-reported data to social care records, our indicator aligns well with measures where self-reporting is less likely to differ from professional assessment, like gender, where both sources show slightly higher loneliness among women, reinforcing its validity.

In the administrative data, loneliness or isolation appears less common among individuals with higher physical care needs. While no single physical ADL shows a consistent negative association across all measures, there is a general trend that requiring more physical support correlates with reduced loneliness. This could reflect a real effect, as suggested by the negative (though not significant) dressing coefficient in ELSA, or it may result from how workers prioritise recorded needs. For individuals with high physical care needs, workers may focus on immediate risks, such as falls or pressure ulcers, rather than loneliness, limiting the classifier’s ability to capture true prevalence. Additionally, unlike administrative data, ELSA shows no significant association between memory problems and loneliness, despite literature suggesting such a link [73,74].

The comparison with ELSA is challenging to interpret, owing to the apparent differences in the population and that both care needs and loneliness are self-reported in ELSA but not in administrative data. We are therefore reassured by the results in Table 7 of the probability of receipt of day centre services within 90 days of the first assessment. It is clear that the indicator of loneliness or isolation is a strong and significant predictor of whether an individual receives services for social inclusion. This holds when controlling for the number of notes and demographic factors, suggesting that our indicator is picking up a distinct phenomenon. It also leads to the reassuring conclusion that workers who record that a person is lonely or isolated are much likelier to put in place services for this need.

Interpreting the model’s output involves determining which of the four metrics (Assessment, Case notes, Either, or Both) is most appropriate. All metrics are associated with similar demographic and needs-related factors, likely due to the binary nature of the measurement. This binary approach oversimplifies loneliness, which varies in intensity, but this might be captured to an extent by combining the binary metrics. For instance, the prevalence of the Assessment and Case notes metrics (around 43% each) is close to the proportion of survey respondents who are SNI isolated or have a UCLA loneliness score of 6 or higher (45%). In contrast, the Both metric (26% prevalence) may identify individuals with more severe loneliness, comparable to those scoring 9 out of 9 on the UCLA scale (26%). The Either metric (62% prevalence) aligns with UCLA scores between 4 (71%) and 5 (58%). The choice of metric depends on the policy goal: a higher threshold might target those with the highest need, while a lower one could cast a wider net for preventive interventions. However, this is speculative, and whether these proportions are in fact indicative of intensity is an empirical question that requires further validation.
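The combined metrics are simple boolean functions of the two source-level flags. A minimal sketch, with hypothetical per-person flags (1 meaning the classifier found at least one loneliness/isolation sentence in that source):

```python
import pandas as pd

# Hypothetical classifier output, one row per individual
df = pd.DataFrame({
    "assessment": [1, 0, 1, 0],   # flagged in the assessment
    "case_notes": [1, 1, 0, 0],   # flagged in any case note
})

# Either = flagged in at least one source; Both = flagged in both
df["either"] = ((df["assessment"] == 1) | (df["case_notes"] == 1)).astype(int)
df["both"] = ((df["assessment"] == 1) & (df["case_notes"] == 1)).astype(int)
print(df[["either", "both"]].mean())  # prevalence of each combined metric
```

By construction, Both is the strictest metric and Either the most inclusive, which is why their prevalences bracket those of the single-source metrics.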

Limitations

Our findings have some limitations. For pre-trained embeddings, we used mean pooling to represent sentences, but methods that summarise spans of embeddings may improve performance [75]. Similarly, count-based approaches could have benefited from using n-gram co-occurrence matrices to capture contextual relationships more effectively. These enhancements might have increased the F1 score of simpler NLP methods. However, we ensured robust evaluation by testing a range of classifiers, including boosting, bagging, logistic regression, MLP, and random forests. Additionally, with transformer models, we achieved strong results using default parameters without hyperparameter tuning, suggesting potential for further optimisation in these approaches too.
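For clarity, mean pooling as used here is the elementwise average of a sentence's token embeddings, yielding one fixed-length vector per sentence. A minimal sketch with toy numbers (the embeddings are illustrative, not from the actual pipeline):

```python
import numpy as np

def mean_pool(token_vectors: np.ndarray) -> np.ndarray:
    """Collapse a (tokens x dims) embedding matrix into one sentence vector."""
    return token_vectors.mean(axis=0)

# Toy 4-token sentence with 3-dimensional embeddings
tokens = np.array([[1.0, 0.0, 2.0],
                   [3.0, 0.0, 0.0],
                   [0.0, 4.0, 2.0],
                   [0.0, 0.0, 0.0]])
sentence_vec = mean_pool(tokens)  # -> array([1., 1., 1.])
```

Because the average discards word order and position, span-aware summaries such as those in [75] can recover contextual signal that mean pooling loses.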

Our binary classification of loneliness or social isolation oversimplifies the concept, as it cannot capture variations in intensity. This limitation reflects the recorded data, necessitating a pragmatic approach. However, the different measures derived from the binary indicator may hint at intensity. Another limitation is combining loneliness and social isolation, though this is consistent with prior research [e.g. 12]. Distinguishing these concepts is important for targeted interventions; for instance, day centres reduce social isolation but may not address emotional loneliness [as conceptualised in e.g. 76]. While social care administrative records do not allow for such distinctions, insights from other datasets could better inform interventions [see e.g. 12,14,1820,42,43].

Another notable limitation relates to the dataset, which, although large in terms of sentence count, covers a relatively small geographic area. Although notes in the training and test sets are not about the same individual, they may have been written by the same worker. Similarly, organisational culture may lead individuals to use similar phrases that would not be seen elsewhere. We expect that the model will not perform quite as well on free text case notes from another area, although the magnitude of the drop-off, and how many new samples need to be labelled to improve performance, is an empirical question that we hope to answer in the future.
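Quantifying that drop-off amounts to re-computing the F1 score on a labelled sample from the new area. A minimal sketch, where the gold labels and predictions are invented for illustration:

```python
from sklearn.metrics import f1_score

# Hypothetical: gold labels vs. model predictions for a small sample of
# sentences annotated in a different local authority.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
external_f1 = f1_score(y_true, y_pred)
print(external_f1)  # compare against the in-area score of 0.92
```

Repeating this as labelled samples from the new area are added (and used for fine-tuning) would trace how quickly performance recovers.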

Studies using administrative data face inherent limitations when measuring phenomena like loneliness or isolation. While Table 7 shows that workers who record loneliness or isolation are more likely to arrange social inclusion services, it is unclear how often services are provided to individuals who are actually lonely or isolated. Some loneliness or isolation may remain unrecorded, and services may also be declined. However, the observed associations with characteristics like impaired memory and living alone suggest unobserved cases are not frequent enough to invalidate our results. Administrative data provides real-time information on service use and has been used to link care home admission to factors like age, gender, and disability [e.g. 77,78]. Including loneliness or isolation as a structured indicator could further enhance such models.

Conclusion

Our best-performing model achieves an F1 score of 0.92 on unseen test data, demonstrating its accuracy in identifying loneliness and social isolation in long-term care case notes. The measure of loneliness and isolation seems valid, as it aligns with expected associations, such as living alone and impaired memory, and strongly predicts the receipt of social inclusion services. Approximately 43% of individuals had an assessment sentence indicating loneliness or isolation, 44% had a case note, 62% had either, and 26% had both. These prevalence estimates are comparable to survey data but benefit from administrative data’s larger sample of statutory care users, inclusion of high-needs individuals, and availability of time-variant service cost data, enabling detailed subgroup analyses and associations with service use.

Future research could use predictive outputs from administrative free text in regression models to explore variations in long-term care usage, such as the risk of care home entry. Our model enables such analyses, and highlights methods for extracting other characteristics not captured in structured data, such as economic hardship or psychological wellbeing. We provide an open-source version of the model in S1 Text, offering a foundation for researchers to apply it to their own data.

Supporting information

S1 Fig. Data flow diagram. Diagram of data sharing agreements between data controller and data processors.

https://doi.org/10.1371/journal.pone.0319745.s001

(TIF)

S2 Fig. Data pre-processing. Pre-processing steps taken with the count-based, skip-gram, and transformers vectors.

https://doi.org/10.1371/journal.pone.0319745.s002

(TIF)

S1 Text. Supplementary appendices and results. This file contains: (1) Data Flow Appendix—Explanation of the pseudonymisation process, information governance, and data flow arrangements. (2) Methods Appendix—Details about the methods, including data pre-processing, labelling data, sentence vectors, model parameters, and classification rules. (3) Open-source model repository—Link to the model at GitHub. (4) Additional results—Tabular results corresponding to Fig 4.

https://doi.org/10.1371/journal.pone.0319745.s003

(PDF)

Acknowledgments

The authors extend their heartfelt gratitude to Uche Osuagwu for his tireless dedication to managing data extraction and quality, ensuring the data met the highest standards for academic research. We are deeply appreciative of William Wood and the Intelligence Solutions for London team for their vital contributions to Information Governance. Our sincere thanks go to Hannah Kendrick for her extraordinary generosity in dedicating her time and effort to establish Inter-Rater Reliability of human annotators. We are also immensely grateful to Explosion AI for providing us with a free research license for their proprietary annotation software, Prodigy.

References

1. OECD Stat. Health expenditure and financing; 2023. https://stats.oecd.org/Index.aspx?DataSetCode=SHA
2. NHS Digital. Adult social care activity and finance report, England, 2022–23. 2023.
3. Hu B, Hancock R, Wittenberg R. Projections of adult social care demand and expenditure 2018 to 2038. CPEC Working Paper 7. 2020.
4. Fernandez J, Malley J, Marczak J, Snell T, Wittenberg R, King D. Unmet social care needs in England. CPEC Working Paper 7. 2020.
5. King D, Wittenberg R. Data on adult social care. NIHR School for Social Care Research scoping review. 2015.
6. Challis D, Clarkson P, Davies S, Hughes J, Stewart K, Xie C. Resource allocation at the micro level in adult social care: A scoping review. 2016.
7. DHSC. People at the heart of care: Adult social care reform white paper. 2021.
8. Gillingham P. Practitioner perspectives on the implementation of an electronic information system to enforce practice standards in England. Eur J Soc Work. 2021;24(5):761–71.
9. Chaichulee S, Promchai C, Kaewkomon T, Kongkamol C, Ingviya T, Sangsupawanich P. Multi-label classification of symptom terms from free-text bilingual adverse drug reaction reports using natural language processing. PLoS One. 2022;17(8):e0270595. pmid:35925971
10. Blinov P, Avetisian M, Kokh V, Umerenkov D, Tuzhilin A. Predicting clinical diagnosis from patients electronic health records using BERT-based neural networks. In: Artificial intelligence in medicine: 18th international conference on artificial intelligence in medicine, AIME 2020. New York: Springer; 2020. p. 111–121.
11. Li Y, Rao S, Solares JRA, Hassaine A, Ramakrishnan R, Canoy D, et al. BEHRT: Transformer for electronic health records. Sci Rep. 2020;10(1):7155. pmid:32346050
12. Zhu V, Lenert L, Bunnell B, Obeid J, Jefferson M, Halbert C. Automatically identifying social isolation from clinical narratives for patients with prostate cancer. BMC Med Inform Decis Mak. 2019;19(1):1–9.
13. Yadav P, Steinbach M, Kumar V, Simon G. Mining electronic health records (EHRs): a survey. ACM Comput Surv (CSUR). 2018;50(6):1–40.
14. Patra BG, Lepow LA, Kumar PKRJ, Vekaria V, Sharma MM, Adekkanattu P. Extracting social support and social isolation information from clinical psychiatry notes: Comparing a rule-based NLP system and a large language model. arXiv Preprint. 2024.
15. Victor B, Perron B, Sokol R, Fedina L, Ryan J. Automated identification of domestic violence in written child welfare records: Leveraging text mining and machine learning to enhance social work research and evaluation. J Soc Soc Work Res. 2021;12(4):631–55.
16. Bako AT, Taylor HL, Wiley K Jr, Zheng J, Walter-McCabe H, Kasthurirathne SN, et al. Using natural language processing to classify social work interventions. Am J Manag Care. 2021;27(1):e24–31. pmid:33471465
17. Annapragada AV, Donaruma-Kwoh MM, Annapragada AV, Starosolski ZA. A natural language processing and deep learning approach to identify child abuse from pediatric electronic medical records. PLoS One. 2021;16(2):e0247404. pmid:33635890
18. Berkman LF, Syme SL. Social networks, host resistance, and mortality: a nine-year follow-up study of Alameda County residents. Am J Epidemiol. 1979;109(2):186–204. pmid:425958
19. Holt-Lunstad J, Smith TB, Baker M, Harris T, Stephenson D. Loneliness and social isolation as risk factors for mortality: a meta-analytic review. Perspect Psychol Sci. 2015;10(2):227–37. pmid:25910392
20. Pantell M, Rehkopf D, Jutte D, Syme SL, Balmes J, Adler N. Social isolation: a predictor of mortality comparable to traditional clinical risk factors. Am J Public Health. 2013;103(11):2056–62. pmid:24028260
21. Coyle CE, Dugan E. Social isolation, loneliness and health among older adults. J Aging Health. 2012;24(8):1346–63. pmid:23006425
22. de Jong-Gierveld J. Developing and testing a model of loneliness. J Pers Soc Psychol. 1987;53(1):119–28. pmid:3612484
23. WHO. Psychogeriatric care in the community. 1979.
24. Prohaska T, Burholt V, Burns A, Golden J, Hawkley L, Lawlor B, et al. Consensus statement: loneliness in older adults, the 21st century social determinant of health? BMJ Open. 2020;10(8):e034967. pmid:32788184
25. WHO. Social isolation and loneliness among older people: Advocacy brief. 2021.
26. DCMS. A connected society: a strategy for tackling loneliness–laying the foundations for change. London: Department for Digital, Culture, Media and Sport; 2018.
27. DCMS. Tackling loneliness evidence review: main report. London: Department for Digital, Culture, Media and Sport; 2023.
28. DHSC. Social care user surveys (ASCS and SACE data collections). 2024. Available from: https://digital.nhs.uk/data-and-information/data-collections-and-data-sets/data-collections/social-care-user-surveys
29. Rostgaard T, Jacobsen F, Kröger T, Peterson E. Revisiting the Nordic long-term care model for older people—still equal? Eur J Ageing. 2022;19(2):201–10. pmid:35528216
30. Orellana K, Manthorpe J, Tinker A. Day centres for older people: a systematically conducted scoping review of literature about their benefits, purposes and how they are perceived. Ageing Soc. 2020;40(1):73–104. pmid:31798195
31. Noone C. The changing role of the day centre for older people in addressing loneliness: a participatory action research study. 2023.
32. Grootegoed E, Van Dijk D. The return of the family? Welfare state retrenchment and client autonomy in long-term care. J Soc Policy. 2012;41(4):677–94.
33. Hanratty B, Stow D, Collingridge Moore D, Valtorta NK, Matthews F. Loneliness as a risk factor for care home admission in the English longitudinal study of ageing. Age Ageing. 2018;47(6):896–900. pmid:30007359
34. Bharucha AJ, Pandav R, Shen C, Dodge HH, Ganguli M. Predictors of nursing facility admission: a 12-year epidemiological study in the United States. J Am Geriatr Soc. 2004;52(3):434–9. pmid:14962161
35. Kersting RC. Impact of social support, diversity, and poverty on nursing home utilization in a nationally representative sample of older Americans. Soc Work Health Care. 2001;33(2):67–87. pmid:11760116
36. Cai Q, Salmon J, Rodgers M. Factors associated with long-stay nursing home admissions among the US elderly population: comparison of logistic regression. J Nurs Home Res. 2023;10(2):123–30.
37. Clarkson P, Brand C, Hughes J, Challis D. Integrating assessments of older people: examining evidence and impact from a randomised controlled trial. Age Ageing. 2011;40(3):388–91. pmid:21422011
38. Department for Work & Pensions. Family resources survey: financial year 2021 to 2022. 2024. Available from: https://web.archive.org/web/20240303153450/https://www.gov.uk/government/statistics/family-resources-survey-financial-year-2021-to-2022/family-resources-survey-financial-year-2021-to-2022
39. Understanding Society. Are institutional populations (e.g. students in university halls, individuals in care homes, people in prison) included in the Study? 2024. Available from: https://web.archive.org/web/20240828114359/https://www.understandingsociety.ac.uk/help/faqs/are-institutional-populations-e-g-students-in-university-halls-individuals-in-care-homes-people-in-prison-included-in-the-study/
40. Banks J, Batty G, Breedvelt J, Coughlin K, Crawford R, Marmot M. English longitudinal study of ageing: Waves 0–9, 1998–2019. UK Data Service. 2021; SN:5050.
41. UK Parliament. The Care and Support (Eligibility Criteria) Regulations 2015, SI 2015/313; 2015.
42. Office of the Surgeon General. Our epidemic of loneliness and isolation: The US Surgeon General’s Advisory on the healing effects of social connection and community. 2023.
43. Tilvis RS, Routasalo P, Karppinen H, Strandberg TE, Kautiainen H, Pitkala KH. Social isolation, social activity and loneliness as survival indicators in old age; a nationwide survey with a 7-year follow-up. Eur Geriatr Med. 2012;3(1):18–22.
44. Guevara M, Chen S, Thomas S, Chaunzwa TL, Franco I, Kann BH, et al. Large language models to identify social determinants of health in electronic health records. NPJ Digit Med. 2024;7(1):6. pmid:38200151
45. NELCSU. PSCleaner: Process CSV files by identifying and removing personal sensitive text. 2022. Available from: https://github.com/NELCSU/PSCleaner
46. Nguyen E. Text mining and network analysis of digital libraries in R. Data mining applications with R. 2013. p. 95–115.
47. Jurafsky D, Martin J. Speech and language processing. 3rd ed. 2019.
48. Honnibal M, Montani I. English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute ruler, lemmatizer. 2021. Available from: https://github.com/explosion/spacy-models/releases/tag/en_core_web_lg-3.0.0
49. Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D. RoBERTa: A robustly optimized BERT pretraining approach. arXiv Preprint. 2019.
50. Van Rossum G, Drake F. Python 3 reference manual. Scotts Valley, CA: CreateSpace; 2009.
51. McHugh ML. Interrater reliability: the kappa statistic. Biochem Med (Zagreb). 2012;22(3):276–82. pmid:23092060
52. Krippendorff K. Measuring the reliability of qualitative text analysis data. Qual Quant. 2004;38(4):787–800.
53. Fleiss J, Levin B, Paik M. Statistical methods for rates and proportions. 1981. p. 2212–2236.
54. De Swert K. Calculating inter-coder reliability in media content analysis using Krippendorff’s Alpha. Centre for Politics and Communication. 2012.
55. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O. Scikit-learn: Machine learning in Python. J Mach Learn Res. 2011;12:2825–30.
56. Mikolov T, Chen K, Corrado G, Dean J. Efficient estimation of word representations in vector space. In: Proceedings of the International conference on learning representations (ICLR). 2013.
57. Huggingface. distilroberta-base. 2022. Available from: https://huggingface.co/distilroberta-base
58. Huggingface. roberta-base. 2022. Available from: https://huggingface.co/roberta-base
59. Nti I, Nyarko-Boateng O, Aning J. Performance of machine learning algorithms with different k values in k-fold cross-validation. Int J Inf Technol Comput Sci. 2021;13(6):61–71.
60. Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G. PyTorch: An imperative style, high-performance deep learning library. Adv Neural Inf Process Syst. 2019;32:8026–37.
61. Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A. Huggingface’s transformers: State-of-the-art natural language processing. arXiv Preprint. 2019.
62. Radloff LS. The CES-D scale: A self-report depression scale for research in the general population. Appl Psychol Meas. 1977;1(3):385–401.
63. Minicuci N, Naidoo N, Chatterji S, Kowal P. Data resource profile: cross-national and cross-study sociodemographic and health-related harmonized domains from SAGE plus ELSA, HRS and SHARE (SAGE+, Wave 1). Int J Epidemiol. 2016;45(5):1403–1403j. pmid:27794522
64. Agresti A. Categorical data analysis. Vol. 792. New Jersey: John Wiley & Sons; 2012.
65. Raschka S, Mirjalili V. Python machine learning: Machine learning and deep learning with Python, scikit-learn, and TensorFlow 2. Packt Publishing Ltd; 2019.
66. Fox J, Weisberg S. An R companion to applied regression. 3rd ed. Thousand Oaks CA: Sage; 2019. Available from: https://socialsciences.mcmaster.ca/jfox/Books/Companion/
67. O’Brien RM. A caution regarding rules of thumb for variance inflation factors. Qual Quant. 2007;41:673–90.
68. Green I, Stow D, Matthews FE, Hanratty B. Changes over time in the health and functioning of older people moving into care homes: analysis of data from the English longitudinal study of ageing. Age Ageing. 2017;46(4):693–6. pmid:28402421
69. Grimm P. Social desirability bias. Wiley international encyclopedia of marketing. 2010.
70. Stoye G, Zaranko B. How accurate are self-reported diagnoses? Comparing self-reported health events in the English Longitudinal Study of Ageing with administrative hospital records. IFS Working Papers. 2020.
71. Hoeymans N, Feskens EJ, van den Bos GA, Kromhout D. Measuring functional status: cross-sectional and longitudinal associations between performance and self-report (Zutphen Elderly Study 1990–1993). J Clin Epidemiol. 1996;49(10):1103–10. pmid:8826989
72. Angel R, Ostir G, Frisco M, Markides K. Comparison of a self-reported and a performance-based assessment of mobility in the Hispanic established population for epidemiological studies of the elderly. Res Aging. 2000;22(6):715–37.
73. Ayalon L, Shiovitz-Ezra S, Roziner I. A cross-lagged model of the reciprocal associations of loneliness and memory functioning. Psychol Aging. 2016;31(3):255–61. pmid:26974589
74. Yin J, Lassale C, Steptoe A, Cadar D. Exploring the bidirectional associations between loneliness and cognitive functioning over 10 years: the English longitudinal study of ageing. Int J Epidemiol. 2019;48(6):1937–48. pmid:31056641
75. Kenter T, Borisov A, De Rijke M. Siamese CBOW: Optimizing word embeddings for sentence representations. arXiv Preprint. 2016.
76. Van Baarsen B, Snijders TA, Smit JH, Van Duijn MA. Lonely but not alone: Emotional isolation and social isolation as two distinct dimensions of loneliness in older people. Educ Psychol Meas. 2001;61(1):119–135.
77. Knapp M, Chua K-C, Broadbent M, Chang C-K, Fernandez J-L, Milea D, et al. Predictors of care home and hospital admissions and their costs for older people with Alzheimer’s disease: findings from a large London case register. BMJ Open. 2016;6(11):e013591. pmid:27864252
78. McCann M, Donnelly M, O’Reilly D. Gender differences in care home admission risk: partner’s age explains the higher risk for women. Age Ageing. 2012;41(3):416–9. pmid:22510517