Cookies?
Library Header Image
LSE Research Online LSE Library Services

Understanding patterns of loneliness in older long-term care users using natural language processing with free text case notes

Rickman, Samuel ORCID: 0000-0003-1921-5258, Fernández, José-Luis ORCID: 0000-0002-4190-7341 and Malley, Juliette ORCID: 0000-0001-5759-1647 (2025) Understanding patterns of loneliness in older long-term care users using natural language processing with free text case notes. PLOS ONE. ISSN 1932-6203 (In Press)

[img] Text (manuscript (003)) - Accepted Version
Pending embargo until 1 January 2100.
Available under License Creative Commons Attribution.

Download (478kB)
[img] Text (supporting-info) - Accepted Version
Pending embargo until 1 January 2100.

Download (226kB)
Identification Number: 10.1371/journal.pone.0319745

Abstract

Loneliness and social isolation are distressing for individuals and predictors of mortality, yet data on their impact on publicly funded long-term care is limited. Using recent advances in natural language processing (NLP), we analysed pseudonymised administrative records containing 1.1 million free-text case notes about 3,046 older adults recorded in a London council between 2008 and 2020. We applied three NLP methods — document-term matrices, pre-trained embeddings, and transformer-based models — to identify loneliness or social isolation. The best-performing model, a bidirectional transformer, achieved an F1 score of 0.92 on a test set of unseen sentences. Using this model, we generated predictions for the full dataset and assessed construct validity through comparison with survey data and the literature. Our measure is associated with expected characteristics, such as living alone and impaired memory, and is a strong predictor of social inclusion services. Approximately 43% of individuals had a sentence indicating loneliness or isolation in their case notes at their initial care assessment, comparable to survey-based estimates. Unlike surveys, our indicator is linked to other administrative data, enabling development of models of service use with loneliness or isolation as independent variables. An open-source version of the model is available in a GitHub repository: https://github.com/samrickman/lonelinessmodel

Item Type: Article
Additional Information: © 2025 The Author(s)
Divisions: Care Policy and Evaluation Centre
Subjects: H Social Sciences
R Medicine > RA Public aspects of medicine > RA0421 Public health. Hygiene. Preventive Medicine
Date Deposited: 19 Feb 2025 11:00
Last Modified: 21 Feb 2025 08:42
URI: http://eprints.lse.ac.uk/id/eprint/127374

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics