UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Validation of probabilistic classifiers

Hillel, Tim; Bierlaire, Michel; Elshafie, Mohammed; Jin, Ying; (2018) Validation of probabilistic classifiers. In: Scherer, Patrick, (ed.) 18th Swiss Transport Research Conference. IVT | Institute for Transport Planning and Systems, ETH Zurich: Zurich, Switzerland. Green open access

[thumbnail of Version of Record]
Preview
Text (Version of Record)
Hillel_EtAl.pdf - Submitted Version

Download (289kB) | Preview

Abstract

Non-parametric probabilistic classification models are increasingly being investigated as an alternative to Discrete Choice Models (DCMs), e.g. for predicting mode choice. There exist many strategies within the literature for model selection between DCMs, either through the testing of a null hypothesis, e.g. likelihood ratio, Wald, Lagrange Multiplier tests, or through the comparison of information criteria, e.g. Bayesian and Aikaike information criteria. However, these tests are only valid for parametric models, and cannot be applied to non-parametric classifiers. Typically, the performance of Machine Learning classifiers is validated by computing a performance metric on out-of-sample test data, either through cross validation or hold-out testing. Whilst bootstrapping can be used to investigate whether differences between test scores are stable under resampling, there are few studies within the literature investigating whether these differences are significant for non-parametric models. To address this, in this paper we introduce three statistical tests which can be applied to both parametric and non-parametric probabilistic classification models. The first test considers the analytical distribution of the expected likelihood of a model given the true model. The second test uses similar anaylsis to determine the distribution of the Kullback-Leibler divergence between two models. The final test considers the convex combination of two classifiers under comparison. These tests allow ML classifiers to be compared directly, including with DCMs.

Type: Proceedings paper
Title: Validation of probabilistic classifiers
Event: 18th Swiss Transport Research Conference
Location: Ascona, Switzerland
Dates: 16 May 2018 - 18 May 2018
Open access status: An open access version is available from UCL Discovery
Publisher version: https://www.strc.ch/2018.php
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Discrete choice models, machine learning, significance testing
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Civil, Environ and Geomatic Eng
URI: https://discovery.ucl.ac.uk/id/eprint/10174126
Downloads since deposit
0Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item