UCL Discovery

Semi-parametric analysis of multi-rater data

Rogers, S; Girolami, M; Polajnar, T (2010) Semi-parametric analysis of multi-rater data. Statistics and Computing, 20 (3) 317-334. doi:10.1007/s11222-009-9125-z.

Full text not available from this repository.

Abstract

Datasets that are subjectively labeled by a number of experts are becoming more common in tasks such as biological text annotation, where class definitions are necessarily somewhat subjective. Standard classification and regression models are not suited to multiple labels per instance, so a pre-processing step (normally assigning the majority class) is typically performed first. We propose Bayesian models for classification and ordinal regression that naturally incorporate multiple expert opinions in defining predictive distributions. The models make use of Gaussian process priors, which makes them highly flexible and particularly suited to text-based problems, where the number of covariates can be far greater than the number of data instances. We show that using all labels rather than just the majority improves performance on a recent biological dataset.
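
As an illustration of the pre-processing step the abstract mentions, the following minimal Python sketch contrasts collapsing each item's rater votes to the majority class with keeping the full distribution of votes. It is not the authors' model (no Gaussian process is involved); the toy ratings, the tie-breaking rule, and the function names `majority_label` and `label_proportions` are assumptions made purely for illustration.

```python
from collections import Counter

def majority_label(votes):
    """Collapse one item's rater votes to a single label.

    Ties are broken by taking the smallest label (an arbitrary,
    illustrative choice).
    """
    counts = Counter(votes)
    top = max(counts.values())
    return min(label for label, c in counts.items() if c == top)

def label_proportions(votes, classes):
    """Keep every vote: fraction of raters choosing each class."""
    counts = Counter(votes)
    return [counts[c] / len(votes) for c in classes]

# Hypothetical toy data: three items, each labelled by five raters
# (1 = positive class, 0 = negative class).
ratings = [
    [1, 1, 1, 1, 1],  # unanimous
    [1, 1, 1, 0, 0],  # narrow majority
    [0, 0, 0, 0, 1],  # one dissenting rater
]
classes = [0, 1]

majority = [majority_label(v) for v in ratings]
proportions = [label_proportions(v, classes) for v in ratings]

print(majority)
print(proportions)
```

In this toy data the first two items receive the same majority label even though the raters agree unanimously on one and split 3 to 2 on the other; the per-class proportions retain that difference, which is the kind of information a model that uses all labels can draw on.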

Type: Article
Title: Semi-parametric analysis of multi-rater data
DOI: 10.1007/s11222-009-9125-z
Keywords: Semi-parametric, Gaussian processes, Machine learning, Multi-rater, Classification, BAYESIAN-ANALYSIS
UCL classification: UCL > School of BEAMS > Faculty of Maths and Physical Sciences > Statistical Science
