Szabo, Z;
Sriperumbudur, B;
Poczos, B;
Gretton, A;
(2015)
Vector-valued Distribution Regression - Keep It Simple and Consistent.
Presented at: CSML reading group, Department of Statistics, University of Oxford, Oxford, United Kingdom.
Preview |
Text
Zoltan_Szabo_invited_talk_University_of_Warwick_PPT.pdf Download (1MB) | Preview |
Abstract
We tackle the distribution regression problem (DRP): regressing from probability measures to vector-valued outputs in the two-stage sampled case, where the input distributions are only available through samples. Numerous important and challenging machine learning and statistical tasks fit into the studied problem family such as multi-instance learning or point estimation tasks. Although there is a vast number of heuristics in the literature to address the DRP problem, to the best of our knowledge the only existing technique with performance guarantees requires density estimation (which often scales poorly in practice) and the distributions to have densities on compact Euclidean domains. In my talk, I am going to present a simple alternative to solve the DRP problem by embedding the input distributions to a reproducing kernel Hilbert space, followed by ridge regression from the embeddings to the outputs. We prove that the proposed approach is consistent: we derive finite sample excess risk bounds which hold with high probability and establish explicit convergence rates as a function of the problem difficulty and sample numbers. Specifically, we justify the applicability of set kernels in regression, which was a 15-year-old open question, and construct alternative kernels on the embedded distributions. The studied scheme is viable under mild conditions, on separable topological domains endowed with kernels. We demonstrate the efficiency of the method in two applications, supervised entropy learning and aerosol optical depth prediction based on multispectral satellite images.
Type: | Conference item (Presentation) |
---|---|
Title: | Vector-valued Distribution Regression - Keep It Simple and Consistent |
Event: | CSML reading group, Department of Statistics, University of Oxford |
Location: | Oxford, United Kingdom |
Dates: | 01 May 2015 |
Open access status: | An open access version is available from UCL Discovery |
Publisher version: | http://www.gatsby.ucl.ac.uk/~szabo/talks/invited_t... |
Language: | English |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Gatsby Computational Neurosci Unit |
URI: | https://discovery.ucl.ac.uk/id/eprint/1468353 |
Archive Staff Only
View Item |