UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Multiprobabilistic prediction in early medical diagnoses

Nouretdinov, I; Devetyarov, D; Vovk, V; Burford, B; Camuzeaux, S; Gentry-Maharaj, A; Tiss, A; ... Gammerman, A; + view all (2015) Multiprobabilistic prediction in early medical diagnoses. Annals of Mathematics and Artificial Intelligence , 74 (1) pp. 203-222. 10.1007/s10472-013-9367-5. Green open access

[thumbnail of Timms_Multiprobabilistic prediction in early medical diagnoses accepted version.pdf]
Preview
Text
Timms_Multiprobabilistic prediction in early medical diagnoses accepted version.pdf - Accepted Version

Download (873kB) | Preview

Abstract

This paper describes the methodology of providing multiprobability predictions for proteomic mass spectrometry data. The methodology is based on a newly developed machine learning framework called Venn machines. Is allows to output a valid probability interval. The methodology is designed for mass spectrometry data. For demonstrative purposes, we applied this methodology to MALDI-TOF data sets in order to predict the diagnosis of heart disease and early diagnoses of ovarian cancer and breast cancer. The experiments showed that probability intervals are narrow, that is, the output of the multiprobability predictor is similar to a single probability distribution. In addition, probability intervals produced for heart disease and ovarian cancer data were more accurate than the output of corresponding probability predictor. When Venn machines were forced to make point predictions, the accuracy of such predictions is for the most data better than the accuracy of the underlying algorithm that outputs single probability distribution of a label. Application of this methodology to MALDI-TOF data sets empirically demonstrates the validity. The accuracy of the proposed method on ovarian cancer data rises from 66.7 % 11 months in advance of the moment of diagnosis to up to 90.2 % at the moment of diagnosis. The same approach has been applied to heart disease data without time dependency, although the achieved accuracy was not as high (up to 69.9 %). The methodology allowed us to confirm mass spectrometry peaks previously identified as carrying statistically significant information for discrimination between controls and cases.

Type: Article
Title: Multiprobabilistic prediction in early medical diagnoses
Open access status: An open access version is available from UCL Discovery
DOI: 10.1007/s10472-013-9367-5
Publisher version: http://dx.doi.org/10.1007/s10472-013-9367-5
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Confident prediction, probabilistic prediction, risk, diagnostic
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Inst of Clinical Trials and Methodology
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Inst of Clinical Trials and Methodology > MRC Clinical Trials Unit at UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL EGA Institute for Womens Health
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL EGA Institute for Womens Health > Womens Cancer
URI: https://discovery.ucl.ac.uk/id/eprint/1495660
Downloads since deposit
170Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item