UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Evaluating the Feasibility and Acceptability of a GPT-Based Chatbot for Depression Screening: A Mixed-Methods Study

Guo, Z; Lai, A; Deng, Z; Li, K; (2024) Evaluating the Feasibility and Acceptability of a GPT-Based Chatbot for Depression Screening: A Mixed-Methods Study. In: Xie, X and Styles, I and Powathil, G and Ceccarelli, M, (eds.) Artificial Intelligence in Healthcare. AIiH 2024. (pp. pp. 249-263). Springer: Cham, Switzerland.

[thumbnail of Evaluating the Feasibility and Acceptability of a GPT-based Chatbot for Depression Screening A Mixed-Methods Study.pdf] Text
Evaluating the Feasibility and Acceptability of a GPT-based Chatbot for Depression Screening A Mixed-Methods Study.pdf - Accepted Version
Access restricted to UCL open access staff until 15 August 2025.

Download (449kB)

Abstract

Depression is a significant challenge to public health, exhibiting escalating prevalence and socio-economic burdens. Traditional screening methods are often hampered by inaccessibility and expense. Large language models (LLMs), like Generative Pre-trained Transformers (GPT), offer a novel solution. This study assesses a GPT-powered chatbot’s effectiveness for mental health interactions and its potential for initial depression screening. Using OpenAI’s GPT-3.5 Turbo, we developed ’HopeBot,’ a chatbot based on the Patient Health Questionnaire-9 (PHQ-9). It was engineered to conduct voice-based interactions for initial depression screening through prompt design and audio processing techniques. 10 simulated diverse personas engaged with HopeBot, and 20 participants analyzed these sessions and provided structured feedback. We used statistical analysis, including the Mann–Whitney U test, to evaluate the chatbot’s performance. The study’s findings indicate that GPT-based chatbots, exemplified by HopeBot, provide cost-effective support for managing depression, garnering substantial user satisfaction with an impressive average rating of 3.8 out of 5. Users highly praised HopeBot’s skills in comprehension, adaptability, and role clarity. However, the study identified challenges in HopeBot’s ability to handle extreme emotional responses. Furthermore, there was a significant gap between user-reported satisfaction and their actual willingness to embrace artificial intelligence (AI)-driven chatbots for personal mental health assessments or recommendations to their surroundings, highlighting widespread reservations about incorporating AI into healthcare practices.

Type: Proceedings paper
Title: Evaluating the Feasibility and Acceptability of a GPT-Based Chatbot for Depression Screening: A Mixed-Methods Study
Event: AIiH 2024
ISBN-13: 9783031672774
DOI: 10.1007/978-3-031-67278-1_20
Publisher version: http://dx.doi.org/10.1007/978-3-031-67278-1_20
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Large language models, Depression, Artificial intelligence, Mental health, PHQ-9, ChatGPT
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Health Informatics
URI: https://discovery.ucl.ac.uk/id/eprint/10198053
Downloads since deposit
1Download
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item