UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Model validation using mutated training labels: An exploratory study

Zhang, JM; Harman, M; Guedj, B; Barr, ET; Shawe-Taylor, J; (2023) Model validation using mutated training labels: An exploratory study. Neurocomputing , 539 , Article 126116. 10.1016/j.neucom.2023.02.042. Green open access

[thumbnail of MV_Neurocomputing (2).pdf]
Preview
Text
MV_Neurocomputing (2).pdf - Other

Download (4MB) | Preview

Abstract

We introduce an exploratory study on Mutation Validation (MV), a model validation method using mutated training labels for supervised learning. MV mutates training data labels, retrains the model against the mutated data, and then uses the metamorphic relation that captures the consequent training performance changes to assess model fit. It does not use a validation set or test set. The intuition underpinning MV is that overfitting models tend to fit noise in the training data. MV does not aim to replace out-of-sample validation. Instead, we provide the first exploratory study on the possibility of using MV as a complement of out-of-sample validation. We explore 8 different learning algorithms, 18 datasets, and 5 types of hyperparameter tuning tasks. Our results demonstrate that MV complements well cross-validation and test accuracy in model selection and hyperparameter tuning tasks. MV deserves more attention from developers when simplicity, sustainaiblity, security (e.g., defending training data attack), and interpretability of the built models are required.

Type: Article
Title: Model validation using mutated training labels: An exploratory study
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.neucom.2023.02.042
Publisher version: https://doi.org/10.1016/j.neucom.2023.02.042
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher's terms and conditions.
Keywords: Model validation, Model complexity, Model overfitting
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10170853
Downloads since deposit
5Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item