Cookies?
Library Header Image
LSE Research Online LSE Library Services

DiversityOne: a multi-country smartphone sensor dataset for everyday life behavior modeling

Busso, Matteo, Bontempelli, Andrea, Malcotti, Leonardo Javier, Meegahapola, Lakmal, Kun, Peter, Diwakar, Shyam, Nutakki, Chaitanya, Britez, Marcelo Dario Rodas, Xu, Hao, Song, Donglei, Correa, Salvador Ruiz, Mendoza-Lara, Andrea-Rebeca, Gaskell, George ORCID: 0000-0001-6135-9496, Stares, Sally ORCID: 0000-0003-4697-0347, Bidoglia, Miriam, Ganbold, Amarsanaa, Chagnaa, Altangerel, Cernuzzi, Luca, Hume, Alethia, Chenu-Abente, Ronald, Asiku, Roy Alia, Gatica-Perez, Daniel, de Götzen, Amalia, Bison, Ivano and Giunchiglia, Fausto (2025) DiversityOne: a multi-country smartphone sensor dataset for everyday life behavior modeling. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 9 (1). ISSN 2474-9567

[img] Text (3712289) - Published Version
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (5MB)

Identification Number: 10.1145/3712289

Abstract

Understanding everyday life behavior of young adults through personal devices, e.g., smartphones and smartwatches, is key for various applications, from enhancing the user experience in mobile apps to enabling appropriate interventions in digital health apps. Towards this goal, previous studies have relied on datasets combining passive sensor data with human-provided annotations or self-reports. However, many existing datasets are limited in scope, often focusing on specific countries primarily in the Global North, involving a small number of participants, or using a limited range of pre-processed sensors. These limitations restrict the ability to capture cross-country variations of human behavior, including the possibility of studying model generalization, and robustness. To address this gap, we introduce DiversityOne, a dataset which spans eight countries (China, Denmark, India, Italy, Mexico, Mongolia, Paraguay, and the United Kingdom) and includes data from 782 college students over four weeks. DiversityOne contains data from 26 smartphone sensor modalities and 350K+ self-reports. As of today, it is one of the largest and most diverse publicly available datasets, while featuring extensive demographic and psychosocial survey data. DiversityOne opens the possibility of studying important research problems in ubiquitous computing, particularly in domain adaptation and generalization across countries, all research areas so far largely underexplored because of the lack of adequate datasets.

Item Type: Article
Additional Information: © 2025 Copyright held by the owner/author(s)
Divisions: Methodology
LSE
Subjects: H Social Sciences
Q Science > Q Science (General)
Date Deposited: 17 Mar 2025 15:03
Last Modified: 25 Mar 2025 17:05
URI: http://eprints.lse.ac.uk/id/eprint/127571

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

Loading...

View more statistics