UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Bayesian Performance Comparison of Text Classifiers

Zhang, D; Wang, J; Yilmaz, E; Wang, X; Zhou, Y; (2016) Bayesian Performance Comparison of Text Classifiers. In: Perego, R and Sebastiani, F and Lucchese, C, (eds.) Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval. (pp. pp. 15-24). Association for Computing Machinery (ACM): New York, NY, USA. Green open access

[thumbnail of dellzhang_sigir2016.pdf]
Preview
Text
dellzhang_sigir2016.pdf - Published Version

Download (314kB) | Preview

Abstract

How can we know whether one classifier is really better than the other? In the area of text classification, since the publication of Yang and Liu's seminal SIGIR-1999 paper, it has become a standard practice for researchers to apply null-hypothesis significance testing (NHST) on their experimental results in order to establish the superiority of a classifier. However, such a frequentist approach has a number of inherent deficiencies and limitations, e.g., the inability to accept the null hypothesis (that the two classifiers perform equally well), the difficulty to compare commonly-used multivariate performance measures like F1 scores instead of accuracy, and so on. In this paper, we propose a novel Bayesian approach to the performance comparison of text classifiers, and argue its advantages over the traditional frequentist approach based on t-test etc. In contrast to the existing probabilistic model for F1 scores which is unpaired, our proposed model takes the correlation between classifiers into account and thus achieves greater statistical power. Using several typical text classification algorithms and a benchmark dataset, we demonstrate that the our approach provides rich information about the difference between two classifiers' performances.

Type: Proceedings paper
Title: Bayesian Performance Comparison of Text Classifiers
Event: SIGIR ’16: 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, 17-21 July 2016, Pisa, Italy
ISBN-13: 9781450340694
Open access status: An open access version is available from UCL Discovery
DOI: 10.1145/2911451.2911547
Publisher version: http://dx.doi.org/10.1145/2911451.2911547
Language: English
Additional information: Copyright © 2016 The author(s). Publication rights licensed to Association for Computing Machinery (ACM).
Keywords: Bayesian inference; hypothesis testing; performance evaluation; text classification
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/1503911
Downloads since deposit
131Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item