UCL Discovery

Classification of lapses in smokers attempting to stop: A supervised machine learning approach using data from a popular smoking cessation smartphone app

Perski, Olga; Li, Kezhi; Pontikos, Nikolas; Simons, David; Goldstein, Stephanie P; Naughton, Felix; Brown, Jamie (2023) Classification of lapses in smokers attempting to stop: A supervised machine learning approach using data from a popular smoking cessation smartphone app. Nicotine & Tobacco Research, 25 (7), pp. 1330-1339. 10.1093/ntr/ntad051. Green open access.

Abstract

Introduction: Smoking lapses after the quit date often lead to full relapse. To inform the development of real-time, tailored lapse prevention support, we used observational data from a popular smoking cessation app to develop supervised machine learning algorithms to distinguish lapse from non-lapse reports.

Methods: We used data from app users with ≥20 unprompted data entries, which included information about craving severity, mood, activity, social context, and lapse incidence. A series of group-level supervised machine learning algorithms (e.g., Random Forest, XGBoost) were trained and tested. Their ability to classify lapses for out-of-sample (i) observations and (ii) individuals was evaluated. Next, a series of individual-level and hybrid algorithms were trained and tested.

Results: Participants (N = 791) provided 37,002 data entries (7.6% lapses). The best-performing group-level algorithm had an area under the receiver operating characteristic curve (AUC) of 0.969 (95% CI = 0.961-0.978). Its ability to classify lapses for out-of-sample individuals ranged from poor to excellent (AUC = 0.482-1.000). Individual-level algorithms could be constructed for 39/791 participants with sufficient data, with a median AUC of 0.938 (range: 0.518-1.000). Hybrid algorithms could be constructed for 184/791 participants and had a median AUC of 0.825 (range: 0.375-1.000).

Discussion: Using unprompted app data appeared feasible for constructing a high-performing group-level lapse classification algorithm, but its performance was variable when applied to unseen individuals. Algorithms trained on each individual's dataset, and hybrid algorithms trained on the group plus a proportion of each individual's data, had improved performance but could only be constructed for a minority of participants.
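
The distinction between out-of-sample observations and out-of-sample individuals can be illustrated with a minimal sketch. This is not the authors' code: the field names (`user_id`, `craving`, `lapse`), the toy data, and the 20% test fraction are all assumptions made for illustration.

```python
import random

def split_by_observation(entries, test_fraction=0.2, seed=0):
    """Random split over entries: the same user can land in both train and test."""
    rng = random.Random(seed)
    shuffled = entries[:]
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

def split_by_individual(entries, test_fraction=0.2, seed=0):
    """Hold out whole users: no held-out user contributes any training entry."""
    rng = random.Random(seed)
    users = sorted({e["user_id"] for e in entries})
    rng.shuffle(users)
    n_test = max(1, int(len(users) * test_fraction))
    test_users = set(users[:n_test])
    train = [e for e in entries if e["user_id"] not in test_users]
    test = [e for e in entries if e["user_id"] in test_users]
    return train, test

# Toy data (hypothetical): 10 users with 20 entries each.
entries = [
    {"user_id": u, "craving": (u + i) % 5, "lapse": int((u + i) % 7 == 0)}
    for u in range(10)
    for i in range(20)
]

obs_train, obs_test = split_by_observation(entries)
ind_train, ind_test = split_by_individual(entries)
```

A model scored on `obs_test` has usually already seen other entries from the same users, whereas one scored on `ind_test` has not, which is one plausible reason the two evaluations in the abstract can diverge.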
Implications: This study used routinely collected data from a popular smartphone app to train and test a series of supervised machine learning algorithms to distinguish lapse from non-lapse events. Although a high-performing group-level algorithm was developed, it had variable performance when applied to new, unseen individuals. Individual-level and hybrid algorithms had somewhat better performance but could not be constructed for all participants due to lack of variability in the outcome measure. Triangulation of results with those from a prompted study design is recommended prior to intervention development, with real-world lapse prediction likely requiring a balance between unprompted and prompted app data.
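
To see why a lack of variability in the outcome prevents per-individual evaluation, consider a minimal sketch of the AUC computed from pairwise comparisons. It is an illustration under stated assumptions, not the study's implementation: the function name `auc`, the user labels, and the scores are hypothetical.

```python
def auc(labels, scores):
    """AUC as the probability that a randomly chosen lapse entry scores higher
    than a randomly chosen non-lapse entry (ties count as 0.5).
    Returns None when only one outcome class is present."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        return None  # undefined: there are no lapse/non-lapse pairs to compare
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))

# Hypothetical per-user labels (1 = lapse) and classifier scores.
per_user = {
    "user_a": ([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]),  # both classes present
    "user_b": ([0, 0, 0], [0.2, 0.1, 0.3]),           # never reported a lapse
}
per_user_auc = {u: auc(y, s) for u, (y, s) in per_user.items()}
```

Here `user_a` yields an AUC of 0.75, while `user_b` yields `None` because no lapse entries exist to rank against the non-lapse entries.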

Type: Article
Title: Classification of lapses in smokers attempting to stop: A supervised machine learning approach using data from a popular smoking cessation smartphone app
Open access status: An open access version is available from UCL Discovery
DOI: 10.1093/ntr/ntad051
Publisher version: https://doi.org/10.1093/ntr/ntad051
Language: English
Additional information: © The Author(s) 2023. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Institute of Ophthalmology
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Epidemiology and Health
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Epidemiology and Health > Behavioural Science and Health
URI: https://discovery.ucl.ac.uk/id/eprint/10167466