UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Automated tabulation of clinical trial results: A joint entity and relation extraction approach with transformer-based language representations

Whitton, J; Hunter, A; (2023) Automated tabulation of clinical trial results: A joint entity and relation extraction approach with transformer-based language representations. Artificial Intelligence in Medicine , 144 , Article 102661. 10.1016/j.artmed.2023.102661. Green open access

[thumbnail of AIM20023.pdf]
Preview
Text
AIM20023.pdf - Accepted Version

Download (1MB) | Preview

Abstract

Evidence-based medicine, the practice in which healthcare professionals refer to the best available evidence when making decisions, forms the foundation of modern healthcare. However, it relies on labour-intensive systematic reviews, where domain specialists must aggregate and extract information from thousands of publications, primarily of randomised controlled trial (RCT) results, into evidence tables. This paper investigates automating evidence table generation by decomposing the problem across two language processing tasks: named entity recognition, which identifies key entities within text, such as drug names, and relation extraction, which maps their relationships for separating them into ordered tuples. We focus on the automatic tabulation of sentences from published RCT abstracts that report the results of the study outcomes. Two deep neural net models were developed as part of a joint extraction pipeline, using the principles of transfer learning and transformer-based language representations. To train and test these models, a new gold-standard corpus was developed, comprising over 550 result sentences from six disease areas. This approach demonstrated significant advantages, with our system performing well across multiple natural language processing tasks and disease areas, as well as in generalising to disease domains unseen during training. Furthermore, we show these results were achievable through training our models on as few as 170 example sentences. The final system is a proof of concept that the generation of evidence tables can be semi-automated, representing a step towards fully automating systematic reviews.

Type: Article
Title: Automated tabulation of clinical trial results: A joint entity and relation extraction approach with transformer-based language representations
Location: Netherlands
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.artmed.2023.102661
Publisher version: https://doi.org/10.1016/j.artmed.2023.102661
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: BERT, Evidence table, Information extraction, Natural language processing, Randomised controlled trial, Systematic review, Transformer, Humans, Natural Language Processing, Evidence-Based Medicine, Randomized Controlled Trials as Topic, Data Analysis
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10179010
Downloads since deposit
3Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item